Athina AI Hub
  • Home
  • Blogs
  • Athina Originals
  • Trending
  • Write for Us
  • Athina AI IDE
  • AI Workflows
Sign in Subscribe

Athina AI

Athina AI
Prover-Verifier Games improve legibility of LLM outputs

research-papers

Prover-Verifier Games improve legibility of LLM outputs

Original Paper: https://arxiv.org/abs/2407.13692 By: Jan Hendrik Kirchner, Yining Chen, Harri Edwards, Jan Leike, Nat McAleese, Yuri Burda Abstract: One way to increase confidence in the outputs of Large Language Models (LLMs) is to support them with reasoning that is clear and easy to check – a

By Athina AI 01 Aug 2024
Adaptive Retrieval-Augmented Generation for Conversational Systems

research-papers

Adaptive Retrieval-Augmented Generation for Conversational Systems

Original Paper: https://arxiv.org/abs/2407.21712 By: Xi Wang, Procheta Sen, Ruizhe Li, Emine Yilmaz Abstract: Despite the success of integrating large language models into the development of conversational systems, many studies have shown the effectiveness of retrieving and augmenting external knowledge for informative responses. Hence, many existing

By Athina AI 31 Jul 2024
Machine Unlearning in Generative AI: A Survey

research-papers

Machine Unlearning in Generative AI: A Survey

Original Paper: https://arxiv.org/abs/2407.20516 By: Zheyuan Liu, Guangyao Dou, Zhaoxuan Tan, Yijun Tian, Meng Jiang Abstract:  Generative AI technologies have been deployed in many places, such as (multimodal) large language models and vision generative models. Their remarkable performance should be attributed to massive training data and

By Athina AI 30 Jul 2024
Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

research-papers

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Original Paper: https://arxiv.org/abs/2407.19594 By: Tianhao Wu, Weizhe Yuan, Olga Golovneva, Jing Xu, Yuandong Tian, Jiantao Jiao, Jason Weston, Sainbayar Sukhbaatar Abstract: Large Language Models (LLMs) are rapidly surpassing human knowledge in many domains. While improving these models traditionally relies on costly human data, recent self-rewarding

By Athina AI 30 Jul 2024
ThinK: Thinner Key Cache by Query-Driven Pruning

research-papers

ThinK: Thinner Key Cache by Query-Driven Pruning

Original Paper: https://arxiv.org/abs/2407.21018 By: Yuhui Xu, Zhanming Jie, Hanze Dong, Lei Wang, Xudong Lu, Aojun Zhou, Amrita Saha, Caiming Xiong, Doyen Sahoo Abstract: Large Language Models (LLMs) have revolutionized the field of natural language processing, achieving unprecedented performance across a variety of applications by leveraging

By Athina AI 30 Jul 2024
Recursive Introspection: Teaching Language Model Agents How to Self-Improve

research-papers

Recursive Introspection: Teaching Language Model Agents How to Self-Improve

Original Paper: https://arxiv.org/abs/2407.18219 By: Yuxiao Qu, Tianjun Zhang, Naman Garg, Aviral Kumar Abstract:  A central piece in enabling intelligent agentic behavior in foundation models is to make them capable of introspecting upon their behavior, and reasoning, and correcting their mistakes as more computation or interaction

By Athina AI 29 Jul 2024
Concise Thoughts: Impact of Output Length on LLM Reasoning and Cost

research-papers

Concise Thoughts: Impact of Output Length on LLM Reasoning and Cost

Original Paper: https://arxiv.org/abs/2407.19825 By: Sania Nayab, Giulio Rossolini, Giorgio Buttazzo, Nicolamaria Manes, Fabrizio Giacomelli Abstract: Today's large language models (LLMs) can solve challenging question-answering tasks, and prompt engineering techniques, such as chain-of-thought (CoT), have gained attention for enhancing the explanation and correctness of

By Athina AI 29 Jul 2024
PersonaGym: Evaluating Persona Agents and LLMs

research-papers

PersonaGym: Evaluating Persona Agents and LLMs

Original Paper: https://arxiv.org/abs/2407.18416 By: Vinay Samuel, Henry Peng Zou, Yue Zhou, Shreyas Chaudhari, Ashwin Kalyan, Tanmay Rajpurohit, Ameet Deshpande, Karthik Narasimhan, Vishvak Murahari Abstract: Persona agents, LLMs designed to act according to assigned personas, show impressive contextual responses across various sectors like education, healthcare, and

By Athina AI 29 Jul 2024
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher

research-papers

MindSearch: Mimicking Human Minds Elicits Deep AI Searcher

Original Paper: https://arxiv.org/pdf/2407.20183 By: Zehui Chen, Kuikun Liu, Qiuchen Wang, Jiangning Liu, Wenwei Zhang, Kai Chen, Feng Zhao Abstract:  Information seeking and integration is a complex cognitive task that consumes enormous time and effort. Inspired by the remarkable progress of Large Language Models, recent works

By Athina AI 29 Jul 2024
Retrieval Augmented Generation or Long-Context LLMs? A Comprehensive Study and Hybrid Approach

research-papers

Retrieval Augmented Generation or Long-Context LLMs? A Comprehensive Study and Hybrid Approach

Original Paper: https://arxiv.org/abs/2407.16833 By: Zhuowan Li, Cheng Li, Mingyang Zhang, Qiaozhu Mei, Michael Bendersky Abstract: Retrieval Augmented Generation (RAG) has been a powerful tool for Large Language Models (LLMs) to efficiently process overly lengthy contexts. However, recent LLMs like Gemini-1.5 and GPT-4 show exceptional

By Athina AI 23 Jul 2024
Context Embeddings for Efficient Answer Generation in RAG

research-papers

Context Embeddings for Efficient Answer Generation in RAG

Original Paper: https://arxiv.org/abs/2407.09252 By: David Rau, Shuai Wang, Hervé Déjean, Stéphane Clinchant Abstract Retrieval-Augmented Generation (RAG) allows for overcoming the limited knowledge of LLMs by extending the input with external information. As a consequence, the contextual inputs to the model become much longer which slows

By Athina AI 23 Jul 2024
Generation Constraint Scaling Can Mitigate Hallucination

research-papers

Generation Constraint Scaling Can Mitigate Hallucination

Original Paper: https://arxiv.org/abs/2407.16908 By: Georgios Kollias, Payel Das, Subhajit Chaudhury Abstract:  Addressing the issue of hallucinations in large language models (LLMs) is a critical challenge. As the cognitive mechanisms of hallucination have been related to memory, here we explore hallucination for LLM that is enabled

By Athina AI 23 Jul 2024
See all
Athina AI Hub
  • Sign up
  • GitHub
  • LinkedIn
  • X
  • YouTube
Powered by Ghost

Athina AI Hub

The ultimate resource designed for AI development teams 🔥

Built with ❤️ by Athina AI

Product

  • Observe
  • Develop
  • Evaluate
  • Pricing

Resources

  • Athina AI Hub
  • Athina AI Documentation
  • Company Blog
  • Privacy Policy

AI Hub Sections

  • AI Development Blogs
  • AI Research Papers
  • Athina AI Originals
  • Top Performers

About

  • About Athina AI
  • About Athina AI Hub
  • Write for the AI Hub