Athina AI - Athina AI Hub (Page 6)

research-papers

What is the Role of Small Models in the LLM Era: A Survey

Original Paper: https://arxiv.org/abs/2409.06857 By: Lihu Chen, Gaël Varoquaux Abstract: Large Language Models (LLMs) have made significant progress in advancing artificial general intelligence (AGI), leading to the development of increasingly large models such as GPT-4 and LLaMA-405B. However, scaling up model sizes results in exponentially higher

research-papers

Agent Workflow Memory

Original Paper: https://arxiv.org/abs/2409.07429 By: Zora Zhiruo Wang, Jiayuan Mao, Daniel Fried, Graham Neubig Abstract: Despite the potential of language model-based agents to solve real-world tasks such as web navigation, current methods still struggle with long-horizon tasks with complex action trajectories. In contrast, humans can flexibly

research-papers

Achieving Peak Performance for Large Language Models: A Systematic Review

Published Date: 7 Sep 2024 Original Paper: https://arxiv.org/abs/2409.04833 By: Zhyar Rzgar K Rostam, Sándor Szénási, Gábor Kertész Abstract: In recent years, large language models (LLMs) have achieved remarkable success in natural language processing (NLP). LLMs require an extreme amount of parameters to attain high performance.

research-papers

Strategic Chain-of-Thought: Guiding Accurate Reasoning in LLMs through Strategy Elicitation

Original Paper: https://arxiv.org/abs/2409.03271v1 By: Yu Wang, Shiwan Zhao, Zhihu Wang, Heyuan Huang, Ming Fan, Yubo Zhang, Zhixing Wang, Haijun Wang, Ting Liu Abstract: The Chain-of-Thought (CoT) paradigm has emerged as a critical approach for enhancing the reasoning capabilities of large language models (LLMs). However, despite

research-papers

Large Language Model-Based Agents for Software Engineering: A Survey

Original Paper: https://arxiv.org/abs/2409.02977 By: Junwei Liu, Kaixin Wang, Yixuan Chen, Xin Peng, Zhenpeng Chen, Lingming Zhang, Yiling Lou Abstract: The recent advance in Large Language Models (LLMs) has shaped a new paradigm of AI agents, i.e., LLM-based agents. Compared to standalone LLMs, LLM-based agents

research-papers

Beyond Preferences in AI Alignment

Original Paper: https://arxiv.org/abs/2408.16984 By: Tan Zhi-Xuan, Micah Carroll, Matija Franklin, Hal Ashton Abstract: The dominant practice of AI alignment assumes (1) that preferences are an adequate representation of human values (2) that human rationality can be understood in terms of maximizing the satisfaction of preferences

research-papers

RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework

Original Paper: https://arxiv.org/abs/2408.01262 By: Kunlun Zhu, Yifan Luo, Dingling Xu, Ruobing Wang, Shi Yu, Shuo Wang, Yukun Yan, Zhenghao Liu, Xu Han, Zhiyuan Liu, Maosong Sun Abstract: Retrieval-Augmented Generation (RAG) systems have demonstrated their advantages in alleviating the hallucination of Large Language Models (LLMs). Existing

research-papers

LLM Pruning and Distillation in Practice: The Minitron Approach

Original Paper: https://arxiv.org/pdf/2408.11796 By: Sharath Turuvekere Sreenivas, Saurav Muralidharan, Raviraj Joshi, Marcin Chochowski, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro, Jan Kautz, Pavlo Molchanov Abstract We present a comprehensive report on compressing the Llama 3.1 8B and Mistral NeMo 12B models to 4B and 8B

research-papers

Controllable Text Generation for Large Language Models: A Survey

Original Paper: https://arxiv.org/abs/2408.12599 By: Xun Liang, Hanyu Wang, Yezhaohui Wang, Shichao Song, Jiawei Yang, Simin Niu, Jie Hu, Dan Liu, Shunyu Yao, Feiyu Xiong, Zhiyu Li Abstract: In Natural Language Processing (NLP), Large Language Models (LLMs) have demonstrated high text generation quality. However, in real-world

research-papers

RAG-Fusion (Fusion Retrieval RAG)

Original Paper: https://arxiv.org/pdf/2402.03367 Code Sample: https://github.com/NirDiamant/RAG_Techniques/blob/main/all_rag_techniques/fusion_retrieval.ipynb RAG-Fusion or Fusion-retrieval RAG, is an advanced technique that enhances the traditional Retrieval Augmented Generation (RAG) approach used in AI and natural language processing. This method

research-papers

Query Transformations: Rewriting, Step-back Prompting, and Sub-query Decomposition

Original Papers: * Query Rewriting: https://arxiv.org/pdf/2305.14283 * Step-back Prompting: https://arxiv.org/abs/2310.06117 * Sub-query Decomposition: https://arxiv.org/pdf/2404.00610 Code Sample: https://github.com/NirDiamant/RAG_Techniques/blob/main/all_rag_techniques/query_transformations.ipynb Query transformations are advanced techniques used to enhance

research-papers

Re-ranking methods

Code Sample: https://github.com/NirDiamant/RAG_Techniques/blob/main/all_rag_techniques/reranking.ipynb Reranking is a powerful technique used in Retrieval-Augmented Generation (RAG) systems to refine and improve the relevance of retrieved documents. Here's a detailed explanation of reranking methods in RAG systems, along with their