Athina AI Hub

Athina AI

What is the Role of Small Models in the LLM Era: A Survey

research-papers

Original Paper: https://arxiv.org/abs/2409.06857
By: Lihu Chen, Gaël Varoquaux
Abstract: Large Language Models (LLMs) have made significant progress in advancing artificial general intelligence (AGI), leading to the development of increasingly large models such as GPT-4 and LLaMA-405B. However, scaling up model sizes results in exponentially higher ...

By Athina AI 12 Sep 2024
Agent Workflow Memory

research-papers

Original Paper: https://arxiv.org/abs/2409.07429
By: Zora Zhiruo Wang, Jiayuan Mao, Daniel Fried, Graham Neubig
Abstract: Despite the potential of language model-based agents to solve real-world tasks such as web navigation, current methods still struggle with long-horizon tasks with complex action trajectories. In contrast, humans can flexibly ...

By Athina AI 11 Sep 2024
Achieving Peak Performance for Large Language Models: A Systematic Review

research-papers

Published Date: 7 Sep 2024
Original Paper: https://arxiv.org/abs/2409.04833
By: Zhyar Rzgar K Rostam, Sándor Szénási, Gábor Kertész
Abstract: In recent years, large language models (LLMs) have achieved remarkable success in natural language processing (NLP). LLMs require an extreme amount of parameters to attain high performance. ...

By Athina AI 07 Sep 2024
Strategic Chain-of-Thought: Guiding Accurate Reasoning in LLMs through Strategy Elicitation

research-papers

Original Paper: https://arxiv.org/abs/2409.03271v1
By: Yu Wang, Shiwan Zhao, Zhihu Wang, Heyuan Huang, Ming Fan, Yubo Zhang, Zhixing Wang, Haijun Wang, Ting Liu
Abstract: The Chain-of-Thought (CoT) paradigm has emerged as a critical approach for enhancing the reasoning capabilities of large language models (LLMs). However, despite ...

By Athina AI 05 Sep 2024
Large Language Model-Based Agents for Software Engineering: A Survey

research-papers

Original Paper: https://arxiv.org/abs/2409.02977
By: Junwei Liu, Kaixin Wang, Yixuan Chen, Xin Peng, Zhenpeng Chen, Lingming Zhang, Yiling Lou
Abstract: The recent advance in Large Language Models (LLMs) has shaped a new paradigm of AI agents, i.e., LLM-based agents. Compared to standalone LLMs, LLM-based agents ...

By Athina AI 04 Sep 2024
Beyond Preferences in AI Alignment

research-papers

Original Paper: https://arxiv.org/abs/2408.16984
By: Tan Zhi-Xuan, Micah Carroll, Matija Franklin, Hal Ashton
Abstract: The dominant practice of AI alignment assumes (1) that preferences are an adequate representation of human values, (2) that human rationality can be understood in terms of maximizing the satisfaction of preferences ...

By Athina AI 30 Aug 2024
RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework

research-papers

Original Paper: https://arxiv.org/abs/2408.01262
By: Kunlun Zhu, Yifan Luo, Dingling Xu, Ruobing Wang, Shi Yu, Shuo Wang, Yukun Yan, Zhenghao Liu, Xu Han, Zhiyuan Liu, Maosong Sun
Abstract: Retrieval-Augmented Generation (RAG) systems have demonstrated their advantages in alleviating the hallucination of Large Language Models (LLMs). Existing ...

By Athina AI 27 Aug 2024
LLM Pruning and Distillation in Practice: The Minitron Approach

research-papers

Original Paper: https://arxiv.org/pdf/2408.11796
By: Sharath Turuvekere Sreenivas, Saurav Muralidharan, Raviraj Joshi, Marcin Chochowski, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro, Jan Kautz, Pavlo Molchanov
Abstract: We present a comprehensive report on compressing the Llama 3.1 8B and Mistral NeMo 12B models to 4B and 8B ...

By Athina AI 26 Aug 2024
Controllable Text Generation for Large Language Models: A Survey

research-papers

Original Paper: https://arxiv.org/abs/2408.12599
By: Xun Liang, Hanyu Wang, Yezhaohui Wang, Shichao Song, Jiawei Yang, Simin Niu, Jie Hu, Dan Liu, Shunyu Yao, Feiyu Xiong, Zhiyu Li
Abstract: In Natural Language Processing (NLP), Large Language Models (LLMs) have demonstrated high text generation quality. However, in real-world ...

By Athina AI 22 Aug 2024
RAG-Fusion (Fusion Retrieval RAG)

research-papers

Original Paper: https://arxiv.org/pdf/2402.03367
Code Sample: https://github.com/NirDiamant/RAG_Techniques/blob/main/all_rag_techniques/fusion_retrieval.ipynb
RAG-Fusion, or Fusion-retrieval RAG, is an advanced technique that enhances the traditional Retrieval-Augmented Generation (RAG) approach used in AI and natural language processing. This method ...

By Athina AI 21 Aug 2024
Query Transformations: Rewriting, Step-back Prompting, and Sub-query Decomposition

research-papers

Original Papers:
  • Query Rewriting: https://arxiv.org/pdf/2305.14283
  • Step-back Prompting: https://arxiv.org/abs/2310.06117
  • Sub-query Decomposition: https://arxiv.org/pdf/2404.00610
Code Sample: https://github.com/NirDiamant/RAG_Techniques/blob/main/all_rag_techniques/query_transformations.ipynb
Query transformations are advanced techniques used to enhance ...

By Athina AI 21 Aug 2024
Re-ranking methods

research-papers

Code Sample: https://github.com/NirDiamant/RAG_Techniques/blob/main/all_rag_techniques/reranking.ipynb
Reranking is a powerful technique used in Retrieval-Augmented Generation (RAG) systems to refine and improve the relevance of retrieved documents. Here's a detailed explanation of reranking methods in RAG systems, along with their ...

By Athina AI 21 Aug 2024