Research Papers - Athina AI Hub

research-papers

LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression

Original Paper: https://arxiv.org/abs/2310.06839 By: Huiqiang Jiang, Qianhui Wu, Xufang Luo, Dongsheng Li, Chin-Yew Lin, Yuqing Yang, Lili Qiu Abstract: In long context scenarios, large language models (LLMs) face three main challenges: higher computational/financial cost, longer latency, and inferior performance. Some studies reveal that the

research-papers

Training Language Models to Self-Correct via Reinforcement Learning

Original Paper: https://arxiv.org/abs/2409.12917 By: Aviral Kumar, Vincent Zhuang, Rishabh Agarwal, Yi Su, John D Co-Reyes, Avi Singh, Kate Baumli, Shariq Iqbal, Colton Bishop, Rebecca Roelofs, Lei M Zhang, Kay McKinney, Disha Shrivastava, Cosmin Paduraru, George Tucker, Doina Precup, Feryal Behbahani, Aleksandra Faust Abstract: Self-correction is

research-papers

Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model Reasoning

Original Paper: https://arxiv.org/abs/2409.12618 By: Santosh Kumar Radha, Yasamin Nouri Jelyani, Ara Ghukasyan, Oktay Goktas Abstract: Iterative human engagement is a common and effective means of leveraging the advanced language processing power of large language models (LLMs). Using well-structured prompts in a conversational manner, human users

research-papers

To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Original Paper: https://arxiv.org/abs/2409.12183 By: Zayne Sprague, Fangcong Yin, Juan Diego Rodriguez, Dongwei Jiang, Manya Wadhwa, Prasann Singhal, Xinyu Zhao, Xi Ye, Kyle Mahowald, Greg Durrett Abstract: Chain-of-thought (CoT) via prompting is the de facto method for eliciting reasoning capabilities from large language models (LLMs). But

research-papers

A Comprehensive Evaluation of Quantized Instruction-Tuned Large Language Models: An Experimental Analysis up to 405B

Original Paper: https://arxiv.org/abs/2409.11055 By: Jemin Lee, Sihyeong Park, Jinse Kwon, Jihun Oh, Yongin Kwon Abstract: Prior research works have evaluated quantized LLMs using limited metrics such as perplexity or a few basic knowledge tasks and old datasets. Additionally, recent large-scale models such as Llama 3.

research-papers

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

Original Paper: https://arxiv.org/abs/2407.21787 By: Bradley Brown, Jordan Juravsky, Ryan Ehrlich, Ronald Clark, Quoc V. Le, Christopher Ré, Azalia Mirhoseini Abstract: Scaling the amount of compute used to train language models has dramatically improved their capabilities. However, when it comes to inference, we often limit the

research-papers

What is the Role of Small Models in the LLM Era: A Survey

Original Paper: https://arxiv.org/abs/2409.06857 By: Lihu Chen, Gaël Varoquaux Abstract: Large Language Models (LLMs) have made significant progress in advancing artificial general intelligence (AGI), leading to the development of increasingly large models such as GPT-4 and LLaMA-405B. However, scaling up model sizes results in exponentially higher

research-papers

Agent Workflow Memory

Original Paper: https://arxiv.org/abs/2409.07429 By: Zora Zhiruo Wang, Jiayuan Mao, Daniel Fried, Graham Neubig Abstract: Despite the potential of language model-based agents to solve real-world tasks such as web navigation, current methods still struggle with long-horizon tasks with complex action trajectories. In contrast, humans can flexibly

research-papers

Achieving Peak Performance for Large Language Models: A Systematic Review

Published Date: 7 Sep 2024 Original Paper: https://arxiv.org/abs/2409.04833 By: Zhyar Rzgar K Rostam, Sándor Szénási, Gábor Kertész Abstract: In recent years, large language models (LLMs) have achieved remarkable success in natural language processing (NLP). LLMs require an extreme amount of parameters to attain high performance.

research-papers

Strategic Chain-of-Thought: Guiding Accurate Reasoning in LLMs through Strategy Elicitation

Original Paper: https://arxiv.org/abs/2409.03271v1 By: Yu Wang, Shiwan Zhao, Zhihu Wang, Heyuan Huang, Ming Fan, Yubo Zhang, Zhixing Wang, Haijun Wang, Ting Liu Abstract: The Chain-of-Thought (CoT) paradigm has emerged as a critical approach for enhancing the reasoning capabilities of large language models (LLMs). However, despite

research-papers

Large Language Model-Based Agents for Software Engineering: A Survey

Original Paper: https://arxiv.org/abs/2409.02977 By: Junwei Liu, Kaixin Wang, Yixuan Chen, Xin Peng, Zhenpeng Chen, Lingming Zhang, Yiling Lou Abstract: The recent advance in Large Language Models (LLMs) has shaped a new paradigm of AI agents, i.e., LLM-based agents. Compared to standalone LLMs, LLM-based agents

research-papers

Beyond Preferences in AI Alignment

Original Paper: https://arxiv.org/abs/2408.16984 By: Tan Zhi-Xuan, Micah Carroll, Matija Franklin, Hal Ashton Abstract: The dominant practice of AI alignment assumes (1) that preferences are an adequate representation of human values (2) that human rationality can be understood in terms of maximizing the satisfaction of preferences