Name | Tags | Publish Date | Slug | Featured | Authors | Excerpt | Extra Info | Last Edited Time | Related Posts | Do not index | Hide CTA | Hide in Main Feed | Meta Description | Meta Title | Hide Cover | Original Paper | Blog URL | Author
LLM Pruning and Distillation in Practice: The Minitron Approach | Tags: Large Language Models, LLM Performance | Publish Date: August 26, 2024 | Last Edited Time: Sep 18, 2024 1:13 AM | Original Paper: https://arxiv.org/abs/2408.11796
Automated Design of Agentic System | Tags: Agents | Publish Date: August 15, 2024 | Last Edited Time: Sep 9, 2024 3:14 PM | Original Paper: https://arxiv.org/abs/2408.08435
Exploring Advanced Large Language Models with LLMsuite | Tags: Large Language Models | Publish Date: July 1, 2024 | Last Edited Time: Sep 13, 2024 3:46 PM | Original Paper: https://arxiv.org/pdf/2407.12036
Distilling System 2 into System 1 | Tags: LLM Performance, Fine Tuning | Publish Date: July 8, 2024 | Last Edited Time: Sep 13, 2024 3:46 PM | Original Paper: https://arxiv.org/pdf/2407.06023v1
A Survey on Employing Large Language Models for Text-to-SQL Tasks | Tags: Large Language Models | Publish Date: August 11, 2024 | Last Edited Time: Sep 13, 2024 3:46 PM | Original Paper: https://arxiv.org/pdf/2407.15186
ThinK: Thinner Key Cache by Query-Driven Pruning | Publish Date: July 30, 2024 | Last Edited Time: Sep 13, 2024 3:46 PM | Original Paper: https://arxiv.org/pdf/2407.21018
ShieldGemma: Generative AI Content Moderation Based on Gemma | Tags: Large Language Models | Publish Date: August 4, 2024 | Last Edited Time: Sep 13, 2024 3:47 PM | Original Paper: https://arxiv.org/pdf/2407.21772
Let Me Speak Freely?
A Study on the Impact of Format Restrictions on Performance of Large Language Models | Tags: LLM Performance, Large Language Models | Publish Date: August 5, 2024 | Last Edited Time: Sep 13, 2024 3:47 PM | Original Paper: https://arxiv.org/pdf/2408.02442
A Survey of NL2SQL with Large Language Models: Where are we, and where are we going? | Tags: Large Language Models | Publish Date: August 9, 2024 | Last Edited Time: Sep 13, 2024 3:47 PM | Original Paper: https://arxiv.org/pdf/2408.05109
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs | Tags: Large Language Models | Publish Date: August 13, 2024 | Last Edited Time: Sep 13, 2024 3:47 PM | Original Paper: https://arxiv.org/pdf/2408.07055
Challenges and Responses in the Practice of Large Language Models | Tags: Large Language Models, LLM Performance | Publish Date: August 21, 2024 | Last Edited Time: Sep 13, 2024 3:47 PM | Original Paper: https://arxiv.org/pdf/2408.09416
Controllable Text Generation for Large Language Models: A Survey | Tags: Large Language Models, LLM Performance | Publish Date: August 22, 2024 | Last Edited Time: Sep 13, 2024 3:47 PM | Original Paper: https://arxiv.org/pdf/2408.12599
Graph Retrieval-Augmented Generation: A Survey | Tags: RAG | Publish Date: August 15, 2024 | Last Edited Time: Sep 13, 2024 3:47 PM | Original Paper: https://arxiv.org/pdf/2408.08921
Enhancing Robustness in Large Language Models: Prompting for Mitigating the Impact of Irrelevant Information | Tags: LLM Performance, Large Language Models | Publish Date: August 20, 2024 | Last Edited Time: Sep 13, 2024 3:47 PM | Original Paper: https://arxiv.org/pdf/2408.10615
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation | Tags: RAG, Large Language Models, LLM Performance | Publish Date: August 8, 2024 | Last Edited Time: Sep 13, 2024 3:47 PM | Original Paper: https://arxiv.org/pdf/2408.04187
Transformer Explainer: Interactive Learning of Text-Generative Models | Tags: Foundation Model | Publish Date: August 8, 2024 | Last Edited Time: Sep 13, 2024 3:47 PM | Original Paper: https://arxiv.org/pdf/2408.04619
Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge | Tags: Large Language Models | Publish Date: July 30, 2024 | Last Edited Time: Sep 13, 2024 3:47 PM | Original Paper: https://arxiv.org/pdf/2407.19594
Know Your Limits: A Survey of Abstention in Large Language Models | Tags: Large Language Models | Publish Date: August 8, 2024 | Last Edited Time: Sep 13, 2024 3:47 PM | Original Paper: https://arxiv.org/pdf/2407.18418
Weak-to-Strong Reasoning | Tags: Large Language Models, LLM Performance, Reasoning | Publish Date: July 18, 2024 | Last Edited Time: Sep 13, 2024 3:47
PM | Original Paper: https://arxiv.org/pdf/2407.13647
Prover-Verifier Games improve legibility of LLM outputs | Tags: LLM Performance, Large Language Models | Publish Date: August 1, 2024 | Last Edited Time: Sep 13, 2024 3:47 PM | Original Paper: https://arxiv.org/pdf/2407.13692
HybridRAG: Integrating Knowledge Graphs and Vector Retrieval Augmented Generation for Efficient Information Extraction | Tags: RAG | Publish Date: August 9, 2024 | Last Edited Time: Sep 13, 2024 11:34 PM | Original Paper: https://arxiv.org/pdf/2408.04948
RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation | Tags: RAG | Publish Date: August 17, 2024 | Last Edited Time: Sep 13, 2024 3:47 PM | Original Paper: https://arxiv.org/pdf/2408.08067
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery | Tags: Large Language Models | Publish Date: August 15, 2024 | Last Edited Time: Sep 13, 2024 3:47 PM | Original Paper: https://arxiv.org/pdf/2408.06292
EfficientRAG: Efficient Retriever for Multi-Hop Question Answering | Tags: RAG | Publish Date: August 8, 2024 | Last Edited Time: Sep 13, 2024 3:47 PM | Original Paper: https://arxiv.org/pdf/2408.04259
NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window? | Tags: LLM Performance | Publish Date: July 16, 2024 | Last Edited Time: Sep 13, 2024 3:48 PM | Original Paper: https://arxiv.org/pdf/2407.11963
Does Refusal Training in LLMs Generalize to the Past Tense? | Tags: LLM Performance | Publish Date: July 19, 2024 | Last Edited Time: Sep 13, 2024 3:48 PM | Original Paper: https://arxiv.org/pdf/2407.11969
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher | Tags: Large Language Models | Publish Date: July 29, 2024 | Last Edited Time: Sep 13, 2024 3:48 PM | Original Paper: https://arxiv.org/abs/2407.20183
Machine Unlearning in Generative AI: A Survey | Tags: Large Language Models | Publish Date: July 30, 2024 | Last Edited Time: Sep 13, 2024 3:48 PM | Original Paper: https://arxiv.org/pdf/2407.20516
Recursive Introspection: Teaching Language Model Agents How to Self-Improve | Tags: Large Language Models, LLM Performance | Publish Date: July 26, 2024 | Last Edited Time: Sep 13, 2024 3:48 PM | Original Paper: https://arxiv.org/pdf/2407.18219
Generation Constraint Scaling Can Mitigate Hallucination | Tags: Hallucinations | Publish Date: July 23, 2024 | Last Edited Time: Sep 13, 2024 3:48 PM | Original Paper: https://arxiv.org/pdf/2407.16908
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models | Tags: LLM Performance, Large Language Models | Publish Date: July 12, 2024 | Last Edited Time: Sep 13, 2024 3:48 PM | Original Paper: https://arxiv.org/pdf/2407.09025
Context Embeddings for Efficient Answer Generation in RAG | Tags: RAG, Fine Tuning, LLM Performance | Publish Date: July 23, 2024 | Last Edited Time: Sep 13, 2024 3:48
PM | Original Paper: https://arxiv.org/pdf/2407.09252
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference | Tags: LLM Performance | Publish Date: July 19, 2024 | Last Edited Time: Sep 13, 2024 3:48 PM | Original Paper: https://arxiv.org/pdf/2407.14057
Retrieval Augmented Generation or Long-Context LLMs? A Comprehensive Study and Hybrid Approach | Tags: RAG, LLM Performance | Publish Date: July 23, 2024 | Last Edited Time: Sep 13, 2024 3:48 PM | Original Paper: https://arxiv.org/abs/2407.16833
Conversational Prompt Engineering | Tags: Prompt Engineering, RAG | Publish Date: August 8, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future; Conversational Prompt Engineering; 🛠️ Understand Your Users → Detect Hallucinations → Iterate; Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs; Mixture-of-Agents Enhances Large Language Model Capabilities | Original Paper: https://arxiv.org/abs/2408.04560
Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-Teaching | Tags: Fine Tuning, Foundation Model | Publish Date: June 15, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: llmEngineer.weekly: Running models locally, LLM-graded evals too expensive for production? Here's our solution...; RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework | Original Paper: https://arxiv.org/abs/2406.06326
From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future | Tags: Agents, Reasoning, Evaluation | Publish Date: August 5, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: LLM Critics Help Catch LLM Bugs; From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future; Can I walk you through Athina in 15 mins?; Conversational Prompt Engineering; Self-Taught Evaluators; On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey; From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data; Be like a Goldfish, Don't Memorize!
Mitigating Memorization in Generative LLMs | Original Paper: https://arxiv.org/abs/2408.02479
PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers | Tags: RAG, Reasoning | Publish Date: June 18, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: Analyze and compare LLM performance across different prompts, models, and topics; Compare Mode on Athina IDE; Common LLM chatbot problems and how to solve them | Original Paper: https://arxiv.org/abs/2406.12430
Adaptive Retrieval-Augmented Generation for Conversational Systems | Tags: RAG, Conversational AI | Publish Date: July 31, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: How non-technical users can prototype pipelines, run AI experiments and evaluations; Evaluate llama-3 vs gpt-4o on YOUR dataset in a few clicks; Re-run your production traces on different LLMs and compare the results | Original Paper: https://arxiv.org/abs/2407.21712
Tree Search For Language Model Agents | Tags: Agents, Reasoning | Publish Date: July 1, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: Are you afraid of making changes to your LLM pipeline?; June Product Updates: Enterprise Features, Dynamic Columns, Spreadsheet-ing, Prompt Management + more; RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework
PersonaGym: Evaluating Persona Agents and LLMs | Tags: Agents, Evaluation | Publish Date: July 29, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: Prompts, Prompts, Prompts!; RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework; Self-Taught Evaluators | Original Paper: https://arxiv.org/abs/2407.18416
Mixture-of-Agents Enhances Large Language Model Capabilities | Tags: Agents | Publish Date: June 7, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: Conversational Prompt Engineering; RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation; Athina IDE: A Collaborative Editor for AI teams to Prototype, Evaluate, and Experiment | Original Paper: https://arxiv.org/abs/2406.04692
Discovering Preference Optimization Algorithms with and for Large Language Models | Tags: LLM Performance, Reasoning | Publish Date: June 12, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: How the best teams evaluate their chatbots; Custom
Evaluations for your AI for free; Improving Retrieval Augmented Language Model with Self-Reasoning | Original Paper: https://arxiv.org/abs/2406.08414
Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs | Tags: Reasoning, LLM Performance | Publish Date: June 14, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: Conversational Prompt Engineering; From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future; June Product Updates: Enterprise Features, Dynamic Columns, Spreadsheet-ing, Prompt Management + more | Original Paper: https://arxiv.org/abs/2406.10209
Following Length Constraints in Instructions | Tags: Reasoning, LLM Performance | Publish Date: June 25, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: June Product Updates: Enterprise Features, Dynamic Columns, Spreadsheet-ing, Prompt Management + more; Athina IDE: A Collaborative Editor for AI teams to Prototype, Evaluate, and Experiment; RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework | Original Paper: https://arxiv.org/abs/2406.17744
On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey | Tags: Evaluation, Dataset Generation | Publish Date: June 14, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: Athina IDE: A Collaborative Editor for AI teams to Prototype, Evaluate, and Experiment; From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future; June Product Updates: Enterprise Features, Dynamic Columns, Spreadsheet-ing, Prompt Management + more | Original Paper: https://arxiv.org/abs/2406.15126
Improving Retrieval Augmented Language Model with Self-Reasoning | Tags: RAG, Reasoning | Publish Date: August 2, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: Evaluating LLM Chatbot Conversations is hard - here's how we're solving it; Evaluating JSON responses: LLMs still can't be trusted to produce consistent JSON outputs; Generate high-quality synthetic datasets for RAG Q&A in 30 seconds; Discovering Preference Optimization Algorithms with and for Large Language Models | Original Paper: https://arxiv.org/abs/2407.19813
Concise Thoughts: Impact of Output Length on LLM Reasoning and Cost | Tags: Reasoning, LLM
Performance | Publish Date: July 29, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: Have you tried Cohere's new model Command R+? Compare against Claude 3 and Gemini Pro on Athina; Athina IDE: A Collaborative Editor for AI teams to Prototype, Evaluate, and Experiment; June Product Updates: Enterprise Features, Dynamic Columns, Spreadsheet-ing, Prompt Management + more | Original Paper: https://arxiv.org/abs/2407.19825
SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals | Tags: Agents, Reasoning | Publish Date: June 7, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: Self-Taught Evaluators; RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework; Athina IDE: A Collaborative Editor for AI teams to Prototype, Evaluate, and Experiment | Original Paper: https://arxiv.org/abs/2406.04784
From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data | Tags: Fine Tuning, Dataset Generation, RAG | Publish Date: June 27, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data; Support for Custom Models hosted on Azure and AWS Bedrock!; From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future | Original Paper: https://arxiv.org/abs/2406.19292
RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework | Tags: Dataset Generation, RAG | Publish Date: August 18, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: We just launched on Product Hunt!; Annotate LLM traces on Athina + new models support, automatic token & cost tracking; How to backtest prompt / model changes?; PersonaGym: Evaluating Persona Agents and LLMs; Following Length Constraints in Instructions; Tree Search For Language Model Agents; Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-Teaching; SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals | Original Paper: https://arxiv.org/abs/2408.01262
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation | Tags: LLM Performance, RAG | Publish Date: August 5, 2024 | Authors: Athina AI
Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: Product Hunt Launch: Help us get to #1 Product of the Day!; Access your LLM Traces via our GraphQL API; RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation; Mixture-of-Agents Enhances Large Language Model Capabilities | Original Paper: https://arxiv.org/abs/2408.02545
Self-Taught Evaluators | Tags: Evaluation, LLM Performance | Publish Date: August 8, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: Configuring an eval in 15 seconds (yes, really); June Product Updates: Enterprise Features, Dynamic Columns, Spreadsheet-ing, Prompt Management + more; From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future; PersonaGym: Evaluating Persona Agents and LLMs; SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals | Original Paper: https://arxiv.org/abs/2408.02666
Retrieval with Feedback Loops | Tags: RAG | Last Edited Time: Sep 13, 2024 3:48 PM | Author: Athina AI
Explainable Retrieval | Tags: RAG | Last Edited Time: Sep 13, 2024 3:48 PM | Author: Athina AI
Contextual Compression | Tags: RAG | Last Edited Time: Sep 13, 2024 3:48 PM | Author: Athina AI
Semantic Chunking | Tags: RAG | Last Edited Time: Sep 13, 2024 3:48 PM | Author: Athina AI
Hypothetical Questions (HyDE Approach) | Tags: RAG | Last Edited Time: Sep 13, 2024 3:48 PM | Author: Athina AI
Advanced RAG Technique: Hierarchical Indices | Tags: RAG | Last Edited Time: Sep 13, 2024 3:48 PM | Author: Athina AI
Re-ranking methods | Tags: RAG | Publish Date: August 21, 2024 | Last Edited Time: Sep 13, 2024 3:48 PM | Author: Athina AI
Query Transformations: Rewriting, Step-back Prompting, and Sub-query Decomposition | Tags: RAG | Publish Date: August 21, 2024 | Last Edited Time: Sep 13, 2024 3:48 PM | Author: Athina AI
RAG-Fusion (Fusion Retrieval RAG) | Tags: RAG | Publish Date: August 21, 2024 | Last Edited Time: Sep 13, 2024 3:48 PM | Author: Athina AI
Maatphor: Automated Variant Analysis for Prompt Injection Attacks | Tags: Dataset Generation | Publish Date: December 12, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: Prompt-Tuning Decision Transformer with Preference Ranking; Progressive Visual Prompt Learning with Contrastive Feature Re-formation; Soft-prompt Tuning for Large Language Models to Evaluate Bias; Robust Safety Classifier for Large Language Models: Adversarial Prompt Shield | Original Paper: https://arxiv.org/abs/2312.11513
Universality and Limitations of Prompt Tuning | Tags: Foundation Model | Publish Date: May 30, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13,
2024 3:48 PM | Related Posts: One Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC Era; Graph of Thoughts: Solving Elaborate Problems with Large Language Models; Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation | Original Paper: https://arxiv.org/abs/2305.18787 | Blog URL: blog.athina.ai
Language Is Not All You Need: Aligning Perception with Language Models | Tags: RAG | Publish Date: February 27, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT; How Robust is GPT-3.5 to Predecessors? A Comprehensive Study on Language Understanding Tasks; Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners; Active Prompting with Chain-of-Thought for Large Language Models; Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection; A Prompt Pattern Catalog to Enhance Prompt Engineering with ChatGPT; Guiding Large Language Models via Directional Stimulus Prompting | Original Paper: https://arxiv.org/abs/2302.14045 | Blog URL: blog.athina.ai
MedPromptExtract (Medical Data Extraction Tool): Anonymization and Hi-fidelity Automated data extraction using NLP and prompt engineering | Tags: Prompt Engineering | Publish Date: June 6, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: Large Language Models and Prompt Engineering for Biomedical Query Focused Multi-Document Summarisation; Enhancing Medical Task Performance in GPT-4V: A Comprehensive Study on Prompt Engineering Strategies; Cases of EFL Secondary Students' Prompt Engineering Pathways to Complete a Writing Task with ChatGPT; ChatGPT4PCG 2 Competition: Prompt Engineering for Science Birds Level Generation; LAMPER: LanguAge Model and Prompt EngineeRing for zero-shot time series classification; Automated Black-box Prompt Engineering for Personalized Text-to-Image Generation; Wordflow: Social Prompt Engineering for Large Language Models; A Systematic Survey of Prompt Engineering in Large Language Models: Techniques and Applications; Exploring EFL students' prompt
engineering in human-AI story writing: an Activity Theory perspective; A Novel Approach for Rapid Development Based on ChatGPT and Prompt Engineering; Chit-Chat or Deep Talk: Prompt Engineering for Process Mining; SAMAug: Point Prompt Augmentation for Segment Anything Model; SAM on Medical Images: A Comprehensive Study on Three Prompt Modes; Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models; Dr ChatGPT, tell me what I want to hear: How prompt knowledge impacts health answer correctness; Towards Large-scale 3D Representation Learning with Multi-dataset Point Prompt Training; Prompt Cache: Modular Attention Reuse for Low-Latency Inference | Original Paper: https://arxiv.org/abs/2405.02664 | Blog URL: blog.athina.ai
ProRes: Exploring Degradation-aware Visual Prompt for Universal Image Restoration | Tags: Foundation Model | Publish Date: June 23, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:49 PM | Related Posts: TopicGPT: A Prompt-based Topic Modeling Framework; Language Prompt for Autonomous Driving; Prompt-tuning latent diffusion models for inverse problems; DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning; LLMs Can Understand Encrypted Prompt: Towards Privacy-Computing Friendly Transformers | Original Paper: https://arxiv.org/abs/2306.13653
Cases of EFL Secondary Students' Prompt Engineering Pathways to Complete a Writing Task with ChatGPT | Tags: Safety | Publish Date: June 19, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: ReAct: Synergizing Reasoning and Acting in Language Models; Prompting GPT-3 To Be Reliable; DocPrompting: Generating Code by Retrieving the Docs; Exploring the Intersection of Large Language Models and Agent-Based Modeling via Prompt Engineering; MedPromptExtract (Medical Data Extraction Tool): Anonymization and Hi-fidelity Automated data extraction using NLP and prompt engineering | Original Paper: https://arxiv.org/abs/2307.05493 | Blog URL: blog.athina.ai
Prompt Packer: Deceiving LLMs through Compositional Instruction with Hidden Attacks | Tags: Safety | Publish Date: October 16, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:48 PM | Related Posts: Generalized Graph Prompt: Toward a Unification of Pre-Training and Downstream Tasks
on Graphs; HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models; PromptCARE: Prompt Copyright Protection by Watermark Injection and Verification; An automatically discovered chain-of-thought prompt generalizes to novel models and datasets; Practical Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration; Prompt Tuning Large Language Models on Personalized Aspect Extraction for Recommendations; Prompt Middleware: Mapping Prompts for Large Language Models to UI Affordances; Prompt-based Node Feature Extractor for Few-shot Learning on Text-Attributed Graphs; Divide and Prompt: Chain of Thought Prompting for Text-to-SQL; Layout and Task Aware Instruction Prompt for Zero-shot Document Image Question Answering; BadCLIP: Trigger-Aware Prompt Learning for Backdoor Attacks on CLIP | Original Paper: https://arxiv.org/abs/2310.10077
How to Use a Custom Grading Criteria to Evaluate LLM Responses (LLM-as-a-Judge) | Tags: Hallucinations, Evaluation | Publish Date: April 17, 2024 | Last Edited Time: Sep 13, 2024 3:49 PM | Related Posts: CYBERSECEVAL 2: A Wide-Ranging Cybersecurity Evaluation Suite for Large Language Models
Reprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs Sampling | Tags: Reasoning | Publish Date: May 23, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:49 PM | Related Posts: ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation; Language Prompt for Autonomous Driving; LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression | Original Paper: https://arxiv.org/abs/2305.09993
Chain-of-Symbol Prompting Elicits Planning in Large Langauge Models | Tags: Evaluation | Publish Date: May 17, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Aug 19, 2024 11:43 PM | Related Posts: Efficient Prompting via Dynamic In-Context Learning; The Web Can Be Your Oyster for Improving Large Language Models; TreePrompt: Learning to Compose Tree Prompts for Explainable Visual Grounding | Original Paper: https://arxiv.org/abs/2305.10276 | Blog URL: blog.athina.ai
Soft-prompt Tuning for Large Language Models to Evaluate Bias | Tags: Evaluation | Publish Date: March 5, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Aug 19, 2024 11:40 PM | Related Posts: LLM Critics Help
Catch LLM Bugs; BIM-GPT: a Prompt-Based Virtual Assistant Framework for BIM Information Retrieval; Progressive Visual Prompt Learning with Contrastive Feature Re-formation; Maatphor: Automated Variant Analysis for Prompt Injection Attacks; SPELL: Semantic Prompt Evolution based on a LLM | Original Paper: https://arxiv.org/abs/2306.04735
RoT: Enhancing Large Language Models with Reflection on Search Trees | Tags: Reasoning | Publish Date: April 11, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Aug 19, 2024 11:38 PM | Related Posts: PathFinder: Guided Search over Multi-Step Reasoning Paths; Founder-GPT: Self-play to evaluate the Founder-Idea fit; RNNs are not Transformers (Yet): The Key Bottleneck on In-context Retrieval | Original Paper: https://arxiv.org/abs/2404.05449 | Blog URL: blog.athina.ai
Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution | Tags: Reasoning | Publish Date: September 28, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Aug 19, 2024 11:48 PM | Related Posts: Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond; ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation; LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression; Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers; Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting; ChatGPT Prompt Patterns for Improving Code Quality, Refactoring, Requirements Elicitation, and Software Design | Original Paper: https://arxiv.org/abs/2309.16797 | Blog URL: blog.athina.ai
Analyzing Toxicity in Deep Conversations: A Reddit Case Study | Tags: Dataset Generation | Publish Date: April 11, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Aug 19, 2024 11:47 PM | Related Posts: Evidence to Generate (E2G): A Single-agent Two-step Prompting for Context Grounded and Retrieval Augmented Reasoning; PathFinder: Guided Search over Multi-Step Reasoning Paths; RNNs are not Transformers (Yet): The Key Bottleneck on In-context Retrieval | Original Paper: https://arxiv.org/abs/2404.07879 | Blog URL: blog.athina.ai
Language Models Are Greedy Reasoners: A Systematic Formal Analysis of
Chain-of-Thought | Tags: Reasoning | Publish Date: January 26, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Aug 19, 2024 11:46 PM | Related Posts: PAL: Program-aided Language Models; DocPrompting: Generating Code by Retrieving the Docs; Large Language Models Are Human-Level Prompt Engineers | Original Paper: https://arxiv.org/abs/2210.01240v3 | Blog URL: blog.athina.ai
Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation | Tags: Prompt Engineering | Publish Date: July 28, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Aug 19, 2024 11:52 PM | Related Posts: Re-Reading Improves Reasoning in Large Language Models; Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers; LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models; Post Hoc Explanations of Language Models Can Improve Language Models | Original Paper: https://arxiv.org/abs/2307.15337 | Blog URL: blog.athina.ai
Pre-Training to Learn in Context | Tags: RAG | Publish Date: May 16, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Sep 13, 2024 3:49 PM | Related Posts: ZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMs; TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks; SatLM: Satisfiability-Aided Language Models Using Declarative Prompting; From Prompt Injections to SQL Injection Attacks: How Protected is Your LLM-Integrated Web Application?; Language Prompt for Autonomous Driving; ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation; LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression | Original Paper: https://arxiv.org/abs/2305.09137 | Blog URL: blog.athina.ai
Chain of Hindsight Aligns Language Models with Feedback | Tags: Evaluation | Publish Date: February 6, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Aug 19, 2024 11:54 PM | Related Posts: Chain of Hindsight Aligns Language Models with Feedback; Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT; How Robust is GPT-3.5 to Predecessors?
A Comprehensive Study on Language Understanding Tasks; How Does In-Context Learning Help Prompt Tuning?; Scalable Prompt Generation for Semi-supervised Learning with Language Models | Original Paper: https://arxiv.org/abs/2302.02676 | Blog URL: blog.athina.ai
Guiding Large Language Models via Directional Stimulus Prompting | Tags: Prompt Engineering | Publish Date: October 9, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Aug 19, 2024 11:49 PM | Related Posts: A Prompt Pattern Catalog to Enhance Prompt Engineering with ChatGPT; Language Is Not All You Need: Aligning Perception with Language Models; Active Prompting with Chain-of-Thought for Large Language Models; How Does In-Context Learning Help Prompt Tuning? | Original Paper: https://arxiv.org/abs/2302.11520 | Blog URL: blog.athina.ai
Jailbreaking ChatGPT via Prompt Engineering: An Empirical Study | Tags: Evaluation | Publish Date: March 10, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Aug 19, 2024 11:51 PM | Related Posts: Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting; Prompt Injection attack against LLM-integrated Applications; ChatGPT Prompt Patterns for Improving Code Quality, Refactoring, Requirements Elicitation, and Software Design; IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models; Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game; AnomalyCLIP: Object-agnostic Prompt Learning for Zero-shot Anomaly Detection; An LLM can Fool Itself: A Prompt-Based Adversarial Attack; Promptly: Using Prompt Problems to Teach Learners How to Effectively Utilize AI Code Generators | Original Paper: https://arxiv.org/abs/2305.13860 | Blog URL: blog.athina.ai
Self-Consistency Improves Chain of Thought Reasoning in Language Models | Tags: Reasoning | Publish Date: March 7, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Aug 19, 2024 11:55 PM | Related Posts: Inferring Properties of Graph Neural Networks; RAGAR, Your Falsehood RADAR: RAG-Augmented Reasoning for Political Fact-Checking using Multimodal Large Language Models; PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization; PAL: Program-aided Language
Models | Original Paper: https://arxiv.org/abs/2203.11171 | Blog URL: blog.athina.ai
Demystifying Chains, Trees, and Graphs of Thoughts | Tags: Reasoning | Publish Date: April 5, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Aug 19, 2024 11:57 PM | Related Posts: The Flan Collection: Designing Data and Methods for Effective Instruction Tuning; Large Language Models are reasoners with Self-Verification; Constitutional AI: Harmlessness from AI Feedback; Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models | Original Paper: https://arxiv.org/abs/2401.14295 | Blog URL: blog.athina.ai
Post Hoc Explanations of Language Models Can Improve Language Models | Tags: Prompt Engineering | Publish Date: May 19, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Aug 19, 2024 11:58 PM | Related Posts: Re-Reading Improves Reasoning in Large Language Models; Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation; Prompt Design and Engineering: Introduction and Advanced Methods; Tree of Thoughts: Deliberate Problem Solving with Large Language Models; Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models | Original Paper: https://arxiv.org/abs/2305.11426 | Blog URL: blog.athina.ai
Graph of Thoughts: Solving Elaborate Problems with Large Language Models | Tags: Prompt Engineering | Publish Date: August 18, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Aug 20, 2024 12:03 AM | Related Posts: Reasoning with Language Model Prompting: A Survey; Towards Reasoning in Large Language Models: A Survey; Graph of Thoughts: Solving Elaborate Problems with Large Language Models; Less Likely Brainstorming: Using Language Models to Generate Alternative Hypotheses; Universality and Limitations of Prompt Tuning; MultiTool-CoT: GPT-3 Can Use Multiple External Tools with Chain of Thought Prompting; Reasoning with Language Model is Planning with World Model; Better Zero-Shot Reasoning with Self-Adaptive Prompting; Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning and Coding with LLMs; TreePrompt: Learning to Compose Tree Prompts for Explainable Visual Grounding | Original Paper: https://arxiv.org/abs/2308.09687v2 | Blog URL: blog.athina.ai
A Novel Approach for Rapid Development Based on ChatGPT and Prompt Engineering | Tags: Evaluation | Publish Date: December 21, 2023 | Authors: Athina AI Research
Agent | Last Edited Time: Aug 19, 2024 11:59 PM | Related Posts: MedPromptExtract (Medical Data Extraction Tool): Anonymization and Hi-fidelity Automated data extraction using NLP and prompt engineering; ChatGPT4PCG 2 Competition: Prompt Engineering for Science Birds Level Generation; LAMPER: LanguAge Model and Prompt EngineeRing for zero-shot time series classification | Original Paper: https://arxiv.org/abs/2312.13115 | Blog URL: blog.athina.ai
Prompt Tuning Large Language Models on Personalized Aspect Extraction for Recommendations | Tags: Prompt Engineering | Publish Date: June 2, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Aug 20, 2024 12:00 AM | Related Posts: DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer; Prompt Packer: Deceiving LLMs through Compositional Instruction with Hidden Attacks; Prompt Tuning Large Language Models on Personalized Aspect Extraction for Recommendations; Prompt-Guided Transformers for End-to-End Open-Vocabulary Object Detection | Original Paper: https://arxiv.org/abs/2306.01475
An LLM can Fool Itself: A Prompt-Based Adversarial Attack | Tags: Evaluation | Publish Date: October 20, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Aug 20, 2024 12:02 AM | Related Posts: AnomalyCLIP: Object-agnostic Prompt Learning for Zero-shot Anomaly Detection; Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game; Jailbreaking ChatGPT via Prompt Engineering: An Empirical Study; PromptAid: Prompt Exploration, Perturbation, Testing and Iteration using Visual Analytics for Large Language Models | Original Paper: https://arxiv.org/abs/2310.13345 | Blog URL: blog.athina.ai
PBNR: Prompt-based News Recommender System | Tags: Prompt Engineering | Publish Date: April 16, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Aug 20, 2024 12:01 AM | Related Posts: Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models; Segment Any Anomaly without Training via Hybrid Prompt Regularization; LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression; PromptCARE: Prompt Copyright Protection by Watermark Injection and Verification | Original Paper: https://arxiv.org/abs/2304.07862
TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks | Tags: Evaluation | Publish Date: May 19, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Aug
20, 2024 12:07 AM | Related Posts: Explaining Emergent In-Context Learning as Kernel Regression; Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning and Coding with LLMs; Compress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable Prompt; Efficient Prompting via Dynamic In-Context Learning; The Web Can Be Your Oyster for Improving Large Language Models; Flatness-Aware Prompt Selection Improves Accuracy and Sample Efficiency; Reprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs Sampling; SatLM: Satisfiability-Aided Language Models Using Declarative Prompting; Pre-Training to Learn in Context | Original Paper: https://arxiv.org/abs/2305.11430 | Blog URL: blog.athina.ai
How Does In-Context Learning Help Prompt Tuning? | Tags: Fine Tuning | Publish Date: February 22, 2023 | Authors: Athina AI Research Agent | Last Edited Time: Aug 20, 2024 12:05 AM | Related Posts: Guiding Large Language Models via Directional Stimulus Prompting; A Prompt Pattern Catalog to Enhance Prompt Engineering with ChatGPT; Chain of Hindsight Aligns Language Models with Feedback; Scalable Prompt Generation for Semi-supervised Learning with Language Models; GraphPrompt: Unifying Pre-Training and Downstream Tasks for Graph Neural Networks; The Capacity for Moral Self-Correction in Large Language Models | Original Paper: https://arxiv.org/abs/2302.11521 | Blog URL: blog.athina.ai
Retrieval-Augmented Thought Process as Sequential Decision Making | Tags: RAG | Publish Date: February 12, 2024 | Authors: Athina AI Research Agent | Last Edited Time: Aug 20, 2024 12:04 AM | Related Posts: Retrieval-Augmented Thought Process as Sequential Decision Making; Multimodal Chain-of-Thought Reasoning in Language Models; Compositional Exemplars for In-context Learning; Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation; Boosting Logical Reasoning in Large Language Models through a New Framework: The Graph of Thought; Tree of Attacks: Jailbreaking Black-Box LLMs Automatically | Original Paper: https://arxiv.org/abs/2402.07812 | Blog URL: blog.athina.ai
Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources | Tags: Prompt Engineering | Publish Date: May 22, 2023 | Authors: Athina AI Research
AgentAug 20, 2024 12:09 AMFactuality of Large Language Models in the Year 2024Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous SourcesSemi-Structured Chain-of-Thought: Integrating Multiple Sources of Knowledge for Improved Language Model ReasoningFine-tuning Language Models for Factualityhttps://arxiv.org/abs/2305.13269blog.athina.aiPrompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion ModelsPrompt EngineeringDecember 19, 2023Athina AI Research AgentAug 23, 2024 2:30 AMProgressive Visual Prompt Learning with Contrastive Feature Re-formationTesting LLMs on Code Generation with Varying Levels of Prompt SpecificityPrompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Modelsviz2viz: Prompt-driven stylized visualization generation using a diffusion modelhttps://arxiv.org/abs/2312.12416Plum: Prompt Learning using MetaheuristicPrompt EngineeringMarch 14, 2024Athina AI Research AgentAug 20, 2024 12:17 AMFoodGPT: A Large Language Model in Food Testing Domain with Incremental Pre-training and Knowledge Graph PromptText-driven Prompt Generation for Vision-Language Models in Federated LearningConsistency-guided Prompt Learning for Vision-Language Modelshttps://arxiv.org/abs/2311.08364Autonomous Tree-search Ability of Large Language ModelsReasoningOctober 14, 2023Athina AI Research AgentAug 20, 2024 12:16 AMPathFinder: Guided Search over Multi-Step Reasoning PathsSPROUT: Authoring Programming Tutorials with Interactive Visualization of Large Language Model Generation ProcessFounder-GPT: Self-play to evaluate the Founder-Idea fithttps://arxiv.org/abs/2310.10686blog.athina.aiTemporal Data Meets LLM -- Explainable Financial Time Series ForecastingReasoningJune 19, 2023Athina AI Research AgentAug 20, 2024 12:13 AMGuReT: Distinguishing Guilt and Regret related TextRAGAR, Your Falsehood RADAR: RAG-Augmented Reasoning for Political Fact-Checking using Multimodal Large Language 
ModelsEvidence to Generate (E2G): A Single-agent Two-step Prompting for Context Grounded and Retrieval Augmented Reasoninghttps://arxiv.org/abs/2306.11025blog.athina.aiFocused Prefix Tuning for Controllable Text GenerationFine TuningJune 1, 2023Athina AI Research AgentAug 20, 2024 12:37 AMTowards Reasoning in Large Language Models: A SurveyA Bibliometric Review of Large Language Models Research from 2017 to 2023Reasoning with Language Model Prompting: A Surveyhttps://arxiv.org/abs/2306.00369blog.athina.aiPromptTTS 2: Describing and Generating Voices with Text PromptPrompt EngineeringOctober 12, 2023Athina AI Research AgentAug 20, 2024 12:41 AMPrompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech RecognitionHD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion ModelsGeneralized Graph Prompt: Toward a Unification of Pre-Training and Downstream Tasks on GraphsVisual Prompt Based Personalized Federated LearningAn automatically discovered chain-of-thought prompt generalizes to novel models and datasetsProgressive Visual Prompt Learning with Contrastive Feature Re-formationhttps://arxiv.org/abs/2309.02285CYBERSECEVAL 2: A Wide-Ranging Cybersecurity Evaluation Suite for Large Language ModelsSafetyEvaluationApril 18, 2024Athina AI Research AgentAug 23, 2024 12:56 AMHow to Use a Custom Grading Criteria to Evaluate LLM Responses (LLM-as-a-Judge)Mistral 7B: Foundation Model Research Paper SummaryChain-of-Verification Reduces Hallucination in Large Language Modelshttps://ai.meta.com/research/publications/cyberseceval-2-a-wide-ranging-cybersecurity-evaluation-suite-for-large-language-models/blog.athina.aiAI Chain on Large Language Model for Unsupervised Control Flow Graph Generation for Statically-Typed Partial CodeFoundation ModelJune 1, 2023Athina AI Research AgentAug 20, 2024 12:38 AMGuReT: Distinguishing Guilt and Regret related TextFounder-GPT: Self-play to evaluate the Founder-Idea fitBoosting of Thoughts: 
Trial-and-Error Problem Solving with Large Language Modelshttps://arxiv.org/abs/2306.00757blog.athina.aiPrompt-ICM: A Unified Framework towards Image Coding for Machines with Task-driven PromptsPrompt EngineeringMay 4, 2023Athina AI Research AgentAug 20, 2024 12:39 AMText-driven Prompt Generation for Vision-Language Models in Federated LearningImage-Object-Specific Prompt Learning for Few-Shot Class-Incremental LearningConsistency-guided Prompt Learning for Vision-Language Modelshttps://arxiv.org/abs/2305.02578Robust Safety Classifier for Large Language Models: Adversarial Prompt ShieldSafetyOctober 31, 2023Athina AI Research AgentAug 20, 2024 12:40 AMEfficient Federated Prompt Tuning for Black-box Large Pre-trained ModelsSPELL: Semantic Prompt Evolution based on a LLMMaatphor: Automated Variant Analysis for Prompt Injection AttacksImage-Object-Specific Prompt Learning for Few-Shot Class-Incremental Learninghttps://arxiv.org/abs/2311.00172EntGPT: Linking Generative Large Language Models with Knowledge BasesPrompt EngineeringFebruary 9, 2024Athina AI Research AgentAug 20, 2024 12:47 AMFrom Noise to Clarity: Unraveling the Adversarial Suffix of Large Language Model Attacks via Translation of Text EmbeddingsUniversal and Transferable Adversarial Attacks on Aligned Language ModelsFactuality of Large Language Models in the Year 2024KnowGPT: Knowledge Injection for Large Language ModelsProbabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex QuestionsA Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questionshttps://arxiv.org/abs/2402.06738blog.athina.aiTesting LLMs on Code Generation with Varying Levels of Prompt SpecificityEvaluationNovember 10, 2023Athina AI Research AgentAug 20, 2024 12:43 AMLLM Critics Help Catch LLM BugsPrompt-Tuning Decision Transformer with Preference RankingProgressive Visual Prompt Learning with Contrastive Feature Re-formationULTRA-DP: Unifying Graph Pre-training with 
Multi-task Graph Dual PromptPrompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Modelshttps://arxiv.org/abs/2311.07599Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and BeyondReasoningOctober 9, 2023Athina AI Research AgentAug 20, 2024 12:48 AMExploring LLM-based Agents for Root Cause AnalysisInvestigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction FollowingModel-tuning Via Prompts Makes NLP Models Adversarially RobustTool Learning with Foundation ModelsOne Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC EraA Bibliometric Review of Large Language Models Research from 2017 to 2023Natural Language Reasoning, A SurveyWalking Down the Memory Maze: Beyond Context Limit through Interactive ReadingFrom Sparse to Dense: GPT-4 Summarization with Chain of Density PromptingExploring Lottery Prompts for Pre-trained Language ModelsLet's Verify Step by StepPEARL: Prompting Large Language Models to Plan and Execute Actions Over Long DocumentsReasoning with Language Model is Planning with World ModelBetter Zero-Shot Reasoning with Self-Adaptive PromptingInteractive Natural Language ProcessingExplaining Emergent In-Context Learning as Kernel RegressionZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMshttps://arxiv.org/abs/2310.06147blog.athina.aiStyleDiffusion: Prompt-Embedding Inversion for Text-Based EditingFine TuningAugust 20, 2023Athina AI Research AgentAug 20, 2024 12:51 AMBenchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language ModelsRe-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and BeyondLongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compressionhttps://arxiv.org/abs/2303.15649Adversarial Prompt Tuning for Vision-Language ModelsSafetyDecember 25, 2023Athina AI Research AgentAug 20, 
2024 12:50 AMMultimodal Prompt Perceiver: Empower Adaptiveness, Generalizability and Fidelity for All-in-One Image RestorationPromise: Prompt-driven 3D Medical Image Segmentation Using Pretrained Image Foundation ModelsAutoHint: Automatic Prompt Optimization with Hint GenerationPrompt Algebra for Task Compositionhttps://arxiv.org/abs/2311.11261Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language ModelsReasoningSeptember 28, 2023Athina AI Research AgentAug 20, 2024 12:49 AMUnleashing the potential of prompt engineering in Large Language Models: a comprehensive reviewDemystifying Chains, Trees, and Graphs of ThoughtsThe Flan Collection: Designing Data and Methods for Effective Instruction TuningEverything of Thoughts: Defying the Law of Penrose Triangle for Thought GenerationBoosting Logical Reasoning in Large Language Models through a New Framework: The Graph of ThoughtTree-of-Mixed-Thought: Combining Fast and Slow Thinking for Multi-hop Visual Reasoninghttps://arxiv.org/abs/2308.10379blog.athina.aiConnecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt OptimizersPrompt EngineeringFebruary 27, 2024Athina AI Research AgentAug 20, 2024 12:53 AMLongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt CompressionRe-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and BeyondPromptbreeder: Self-Referential Self-Improvement Via Prompt Evolutionhttps://arxiv.org/abs/2309.08532AnomalyCLIP: Object-agnostic Prompt Learning for Zero-shot Anomaly DetectionHallucinationsMarch 16, 2024Athina AI Research AgentAug 20, 2024 12:52 AMChatGPT Prompt Patterns for Improving Code Quality, Refactoring, Requirements Elicitation, and Software DesignQuantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formattingJailbreaking ChatGPT via Prompt Engineering: An Empirical StudyAn LLM can Fool Itself: A 
Prompt-Based Adversarial AttackPromptly: Using Prompt Problems to Teach Learners How to Effectively Utilize AI Code Generatorshttps://arxiv.org/abs/2310.18961blog.athina.aiConsistency-guided Prompt Learning for Vision-Language ModelsFine TuningFebruary 27, 2024Athina AI Research AgentAug 20, 2024 12:54 AMLLM Critics Help Catch LLM BugsFoodGPT: A Large Language Model in Food Testing Domain with Incremental Pre-training and Knowledge Graph PromptImage-Object-Specific Prompt Learning for Few-Shot Class-Incremental LearningReverse Stable Diffusion: What prompt was used to generate this image?Does Prompt-Tuning Language Model Ensure Privacy?Prompt-ICM: A Unified Framework towards Image Coding for Machines with Task-driven PromptsLarge Language Model Prompt Chaining for Long Legal Document ClassificationPlum: Prompt Learning using Metaheuristichttps://arxiv.org/abs/2306.01195A Bibliometric Review of Large Language Models Research from 2017 to 2023EvaluationApril 3, 2023Athina AI Research AgentSep 12, 2024 11:55 PMReinforcement Learning in the Era of LLMs: What is Essential? What is needed? 
An RL Perspective on RLHF, Prompting, and BeyondOne Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC EraFocused Prefix Tuning for Controllable Text GenerationLess Likely Brainstorming: Using Language Models to Generate Alternative HypothesesPEARL: Prompting Large Language Models to Plan and Execute Actions Over Long DocumentsHierarchical Prompting Assists Large Language Model on Web NavigationCan We Edit Factual Knowledge by In-Context Learning?Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Modelshttps://arxiv.org/abs/2304.02020blog.athina.aiExploring EFL students' prompt engineering in human-AI story writing: an Activity Theory perspectivePrompt EngineeringFebruary 10, 2024Athina AI Research AgentAug 20, 2024 12:53 AMLAMPER: LanguAge Model and Prompt EngineeRing for zero-shot time series classificationChatGPT4PCG 2 Competition: Prompt Engineering for Science Birds Level GenerationMedPromptExtract (Medical Data Extraction Tool): Anonymization and Hi-fidelity Automated data extraction using NLP and prompt engineeringHow to Prompt LLMs for Text-to-SQL: A Study in Zero-shot, Single-domain, and Cross-domain SettingsA study on Prompt Design, Advantages and Limitations of ChatGPT for Deep Learning Program RepairGraph-ToolFormer: To Empower LLMs with Graph Reasoning Ability via Prompt Augmented by ChatGPThttps://arxiv.org/abs/2306.01798blog.athina.aiRe-Reading Improves Reasoning in Large Language ModelsPrompt EngineeringSeptember 12, 2023Athina AI Research AgentAug 20, 2024 7:39 PMLLMLingua: Compressing Prompts for Accelerated Inference of Large Language ModelsPrincipled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4Prompt Design and Engineering: Introduction and Advanced MethodsSkeleton-of-Thought: Prompting LLMs for Efficient Parallel GenerationPost Hoc Explanations of Language Models Can Improve Language Modelshttps://arxiv.org/abs/2309.06275blog.athina.aiPrompt 
Middleware: Mapping Prompts for Large Language Models to UI AffordancesPrompt EngineeringJuly 3, 2023Athina AI Research AgentAug 20, 2024 7:36 PMLLMs Can Understand Encrypted Prompt: Towards Privacy-Computing Friendly TransformersPrompt Packer: Deceiving LLMs through Compositional Instruction with Hidden AttacksPractical Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt CalibrationDivide and Prompt: Chain of Thought Prompting for Text-to-SQLhttps://arxiv.org/abs/2307.01142An automatically discovered chain-of-thought prompt generalizes to novel models and datasetsReasoningAugust 3, 2023Athina AI Research AgentAug 20, 2024 7:37 PMAn automatically discovered chain-of-thought prompt generalizes to novel models and datasetsLLMs Can Understand Encrypted Prompt: Towards Privacy-Computing Friendly TransformersPrompt Packer: Deceiving LLMs through Compositional Instruction with Hidden AttacksPromptTTS 2: Describing and Generating Voices with Text PromptQuery-Dependent Prompt Evaluation and Optimization with Offline Inverse RLExploring the Relationship between LLM Hallucinations and Prompt Linguistic Nuances: Readability, Formality, and ConcretenessProgressive Visual Prompt Learning with Contrastive Feature Re-formationhttps://arxiv.org/abs/2305.02897Active Retrieval Augmented GenerationRAGMay 11, 2023Athina AI Research AgentAug 20, 2024 7:41 PMFine-tuning Language Models for FactualitySearch-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive TasksAutoHall: Automated Hallucination Dataset Generation for Large Language ModelsPrompt Design and Engineering: Introduction and Advanced MethodsEnhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through LogicPrincipled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4A Comprehensive Survey on Instruction Followinghttps://arxiv.org/abs/2305.06983blog.athina.aiBoosting Logical Reasoning in Large Language Models through 
a New Framework: The Graph of ThoughtReasoningAugust 16, 2023Athina AI Research AgentAug 20, 2024 7:40 PMEverything of Thoughts: Defying the Law of Penrose Triangle for Thought GenerationRetrieval-Augmented Thought Process as Sequential Decision MakingAlgorithm of Thoughts: Enhancing Exploration of Ideas in Large Language ModelsFounder-GPT: Self-play to evaluate the Founder-Idea fithttps://arxiv.org/abs/2308.08614blog.athina.aiMultiTool-CoT: GPT-3 Can Use Multiple External Tools with Chain of Thought PromptingReasoningMay 26, 2023Athina AI Research AgentAug 20, 2024 7:41 PMOne Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC EraGraph of Thoughts: Solving Elaborate Problems with Large Language ModelsFew-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluationhttps://arxiv.org/abs/2305.16896blog.athina.aiBlack-Box Prompt Optimization: Aligning Large Language Models without Model TrainingEvaluationNovember 8, 2023Athina AI Research AgentAug 20, 2024 7:43 PMPromptly: Using Prompt Problems to Teach Learners How to Effectively Utilize AI Code GeneratorsIP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion ModelsChatGPT Prompt Patterns for Improving Code Quality, Refactoring, Requirements Elicitation, and Software Designhttps://arxiv.org/abs/2311.04155blog.athina.aiUniversal and Transferable Adversarial Attacks on Aligned Language ModelsSafetyApril 14, 2024Athina AI Research AgentAug 20, 2024 7:44 PMAI Safety: Necessary, but insufficient and possibly problematicFrom Noise to Clarity: Unraveling the Adversarial Suffix of Large Language Model Attacks via Translation of Text EmbeddingsMistral 7B: Foundation Model Research Paper SummaryWizardLM: Empowering Large Language Models to Follow Complex InstructionsEntGPT: Linking Generative Large Language Models with Knowledge Baseshttps://arxiv.org/abs/2307.15043blog.athina.aiImageDream: Image-Prompt Multi-view Diffusion for 3D 
GenerationEvaluationDecember 2, 2023Athina AI Research AgentAug 20, 2024 7:42 PMChain-of-Verification Reduces Hallucination in Large Language ModelsEfficient Prompting via Dynamic In-Context LearningPre-Training to Learn in ContextRe-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and BeyondLLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language ModelsPromptbreeder: Self-Referential Self-Improvement Via Prompt EvolutionBenchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language ModelsPrompt a Robot to Walk with Large Language ModelsJatmo: Prompt Injection Defense by Task-Specific FinetuningReprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs SamplingYou Only Prompt Once: On the Capabilities of Prompt Learning on Large Language Models to Tackle Toxic ContentAssessing Prompt Injection Risks in 200+ Custom GPTsIgnore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking CompetitionPrompt Stealing Attacks Against Text-to-Image Generation ModelsTopicGPT: A Prompt-based Topic Modeling FrameworkPrompt-tuning latent diffusion models for inverse problemshttps://arxiv.org/abs/2312.02201blog.athina.aiOn Second Thought, Let's Not Think Step by Step! 
Bias and Toxicity in Zero-Shot ReasoningSafetyJune 4, 2023Athina AI Research AgentAug 20, 2024 7:46 PMHard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and DiscoveryLarge Language Models Can Be Easily Distracted by Irrelevant ContextGraphPrompt: Unifying Pre-Training and Downstream Tasks for Graph Neural NetworksConstitutional AI: Harmlessness from AI Feedbackhttps://arxiv.org/abs/2212.08061blog.athina.aiPrompting AI Art: An Investigation into the Creative Skill of Prompt EngineeringPrompt EngineeringDecember 3, 2023Athina AI Research AgentAug 20, 2024 7:45 PMPrompting GPT-3 To Be ReliableDocPrompting: Generating Code by Retrieving the DocsLarge Language Models Are Human-Level Prompt Engineershttps://arxiv.org/abs/2303.13534blog.athina.aiLarger language models do in-context learning differentlyEvaluationMarch 7, 2023Athina AI Research AgentAug 20, 2024 7:47 PMBoosted Prompt Ensembles for Large Language ModelsFairness-guided Few-shot Prompting for Large Language ModelsNN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor InferenceOpenICL: An Open-Source Framework for In-context LearningAlphazero-like Tree-Search can Guide Large Language Model Decoding and Traininghttps://arxiv.org/abs/2303.03846blog.athina.aiThe Capacity for Moral Self-Correction in Large Language ModelsReasoningFebruary 18, 2023Athina AI Research AgentAug 20, 2024 7:49 PMBounding the Capabilities of Large Language Models in Open Text Generation with Prompt ConstraintsScalable Prompt Generation for Semi-supervised Learning with Language ModelsHow Does In-Context Learning Help Prompt Tuning?SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource DomainsEvaluating the Robustness of Discrete PromptsCompositional Exemplars for In-context Learninghttps://arxiv.org/abs/2302.07459blog.athina.aiChain-of-Thought Reasoning is a Policy Improvement OperatorReasoningNovember 8, 2023Athina AI Research AgentAug 20, 2024 7:48 
PMTree of Reviews: A Tree-based Dynamic Iterative Retrieval Framework for Multi-hop Question AnsweringGuReT: Distinguishing Guilt and Regret related TextRNNs are not Transformers (Yet): The Key Bottleneck on In-context Retrievalhttps://arxiv.org/abs/2309.08589blog.athina.aiProbabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex QuestionsPrompt EngineeringNovember 23, 2023Athina AI Research AgentAug 20, 2024 7:51 PMSelf-contradictory Hallucinations of Large Language Models: Evaluation, Detection and MitigationSearch-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive TasksEntGPT: Linking Generative Large Language Models with Knowledge BasesAutoHall: Automated Hallucination Dataset Generation for Large Language ModelsA Step Closer to Comprehensive Answers: Constrained Multi-Stage Question Decomposition with Large Language ModelsPrompt Design and Engineering: Introduction and Advanced Methodshttps://arxiv.org/abs/2311.13982blog.athina.aiPrompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech RecognitionFine TuningFebruary 16, 2023Athina AI Research AgentSep 13, 2024 11:12 PMHD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion ModelsEdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAMPromptCARE: Prompt Copyright Protection by Watermark Injection and VerificationPromptTTS 2: Describing and Generating Voices with Text PromptVisual Prompt Based Personalized Federated LearningDP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineerhttps://arxiv.org/abs/2302.08102From Noise to Clarity: Unraveling the Adversarial Suffix of Large Language Model Attacks via Translation of Text EmbeddingsSafetyApril 16, 2024Athina AI Research AgentAug 23, 2024 1:33 AMUniversal and Transferable Adversarial Attacks on Aligned Language ModelsWizardLM: Empowering Large Language Models to Follow Complex InstructionsEntGPT: Linking 
Generative Large Language Models with Knowledge Baseshttps://arxiv.org/abs/2402.16006blog.athina.aiBoosting of Thoughts: Trial-and-Error Problem Solving with Large Language ModelsReasoningFebruary 17, 2024Athina AI Research AgentAug 20, 2024 7:53 PMGuReT: Distinguishing Guilt and Regret related TextUnleashing the potential of prompt engineering in Large Language Models: a comprehensive reviewFounder-GPT: Self-play to evaluate the Founder-Idea fitAI Chain on Large Language Model for Unsupervised Control Flow Graph Generation for Statically-Typed Partial CodePathFinder: Guided Search over Multi-Step Reasoning Pathshttps://arxiv.org/abs/2402.11140blog.athina.aiTopicGPT: A Prompt-based Topic Modeling FrameworkEvaluationApril 1, 2024Athina AI Research AgentAug 20, 2024 7:54 PMLanguage Prompt for Autonomous DrivingImageDream: Image-Prompt Multi-view Diffusion for 3D GenerationLongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt CompressionProRes: Exploring Degradation-aware Visual Prompt for Universal Image RestorationAre Chatbots Ready for Privacy-Sensitive Applications? An Investigation into Input Regurgitation and Prompt-Induced Sanitizationhttps://arxiv.org/abs/2311.01449STAMP: Differentiable Task and Motion Planning via Stein Variational Gradient DescentReasoningJanuary 7, 2024Athina AI Research AgentAug 20, 2024 7:52 PMEvidence to Generate (E2G): A Single-agent Two-step Prompting for Context Grounded and Retrieval Augmented ReasoningRAGAR, Your Falsehood RADAR: RAG-Augmented Reasoning for Political Fact-Checking using Multimodal Large Language ModelsFounder-GPT: Self-play to evaluate the Founder-Idea fithttps://arxiv.org/abs/2310.01775blog.athina.aiPEARL: Prompting Large Language Models to Plan and Execute Actions Over Long DocumentsPrompt EngineeringMay 23, 2023Athina AI Research AgentAug 20, 2024 7:55 PMReinforcement Learning in the Era of LLMs: What is Essential? What is needed? 
An RL Perspective on RLHF, Prompting, and BeyondA Bibliometric Review of Large Language Models Research from 2017 to 2023Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluationhttps://arxiv.org/abs/2305.14564v1blog.athina.aiReflexion: Language Agents with Verbal Reinforcement LearningPrompt EngineeringMarch 20, 2023Athina AI Research AgentAug 20, 2024 7:56 PMBoosted Prompt Ensembles for Large Language ModelsGlobal Prompt Cell: A Portable Control Module for Effective Prompt TuningWhy think step by step? Reasoning emerges from the locality of experiencehttps://arxiv.org/abs/2303.11366blog.athina.ai🪁What we learned from speaking to 50+ LLM developers building RAG apps.EvaluationNovember 29, 2023Shiv SakhujaAug 20, 2024 7:58 PMControlling Personality Style in Dialogue with Zero-Shot Prompt-Based LearningPrompt EngineeringFebruary 8, 2023Athina AI Research AgentAug 20, 2024 8:01 PMLayout and Task Aware Instruction Prompt for Zero-shot Document Image Question AnsweringMultimodal Prompt Perceiver: Empower Adaptiveness, Generalizability and Fidelity for All-in-One Image RestorationBIM-GPT: a Prompt-Based Virtual Assistant Framework for BIM Information RetrievalToken-Level Adversarial Prompt Detection Based on Perplexity Measures and Contextual Informationhttps://arxiv.org/abs/2302.03848ReAct: Synergizing Reasoning and Acting in Language ModelsReasoningMarch 10, 2023Athina AI Research AgentAug 20, 2024 7:59 PMLarge Language Models Are Human-Level Prompt EngineersMachine Generated Text: A Comprehensive Survey of Threat Models and Detection MethodsReAct: Synergizing Reasoning and Acting in Language ModelsPrompt Engineering for Healthcare: Methodologies and ApplicationsUnderstanding prompt engineering may not require rethinking generalizationPrompt-Engineering and Transformer-based Question Generation and EvaluationCases of EFL Secondary Students' Prompt Engineering Pathways to Complete a Writing Task with 
ChatGPThttps://arxiv.org/abs/2210.03629blog.athina.aiZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMsEvaluationMay 18, 2023Athina AI Research AgentAug 20, 2024 8:02 PMHarnessing the Power of LLMs in Practice: A Survey on ChatGPT and BeyondReinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and BeyondZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMsWhat In-Context Learning "Learns" In-Context: Disentangling Task Recognition and Task LearningReprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs SamplingSatLM: Satisfiability-Aided Language Models Using Declarative PromptingPre-Training to Learn in Contexthttps://arxiv.org/abs/2305.10649blog.athina.aiLLM Critics Help Catch LLM BugsEvaluationJune 28, 2024Athina AI Research AgentAug 23, 2024 1:33 AMTesting LLMs on Code Generation with Varying Levels of Prompt SpecificitySoft-prompt Tuning for Large Language Models to Evaluate BiasConsistency-guided Prompt Learning for Vision-Language Modelshttps://cdn.openai.com/llm-critics-help-catch-llm-bugs-paper.pdfSPROUT: Authoring Programming Tutorials with Interactive Visualization of Large Language Model Generation ProcessLLM PerformanceDecember 4, 2023Athina AI Research AgentAug 23, 2024 1:17 AMGuReT: Distinguishing Guilt and Regret related TextUnleashing the potential of prompt engineering in Large Language Models: a comprehensive reviewPathFinder: Guided Search over Multi-Step Reasoning PathsAutonomous Tree-search Ability of Large Language Modelshttps://arxiv.org/abs/2312.01801blog.athina.aiPromise: Prompt-driven 3D Medical Image Segmentation Using Pretrained Image Foundation ModelsPrompt EngineeringNovember 13, 2023Athina AI Research AgentAug 20, 2024 8:05 PMPromise: Prompt-driven 3D Medical Image Segmentation Using Pretrained Image Foundation ModelsMultimodal Prompt Perceiver: Empower Adaptiveness, Generalizability and Fidelity for All-in-One Image 
Restoration; Prompt-In-Prompt Learning for Universal Image Restoration; Adversarial Prompt Tuning for Vision-Language Models; Prompt-Tuning Decision Transformer with Preference Ranking | https://arxiv.org/abs/2310.19721
Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4 | Prompt Engineering | December 26, 2023 | Athina AI Research Agent | Aug 23, 2024 2:29 AM | A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions; A Step Closer to Comprehensive Answers: Constrained Multi-Stage Question Decomposition with Large Language Models; Active Retrieval Augmented Generation; LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models; Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers; Re-Reading Improves Reasoning in Large Language Models | https://arxiv.org/abs/2312.16171v1 | blog.athina.ai
DocPrompting: Generating Code by Retrieving the Docs | Prompt Engineering | February 18, 2023 | Athina AI Research Agent | Aug 20, 2024 8:07 PM | Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning; DocPrompting: Generating Code by Retrieving the Docs; Making Large Language Models Better Reasoners with Step-Aware Verifier; Large Language Models Are Human-Level Prompt Engineers; Recitation-Augmented Language Models; Prompting GPT-3 To Be Reliable; Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought; Prompt Engineering for Healthcare: Methodologies and Applications; Prompt Engineering a Prompt Engineer; Prompt Engineering or Fine Tuning: An Empirical Assessment of Large Language Models in Automated Software Engineering Tasks; A Survey on Segment Anything Model (SAM): Vision Foundation Model Meets Prompt Engineering; Prompting AI Art: An Investigation into the Creative Skill of Prompt Engineering; A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models; Understanding prompt engineering may not require rethinking generalization; To be or not to be? an exploration of continuously controllable prompt engineering; PEACE: Prompt Engineering Automation for CLIPSeg Enhancement in Aerial Robotics; Prompt Engineering for Transformer-based Chemical Similarity Search Identifies Structurally Distinct Functional Analogues; Prompt-Engineering and Transformer-based Question Generation and Evaluation; Prompt Engineering-assisted Malware Dynamic Analysis Using GPT-4; Cases of EFL Secondary Students' Prompt Engineering Pathways to Complete a Writing Task with ChatGPT; Large Language Models and Prompt Engineering for Biomedical Query Focused Multi-Document Summarisation | https://arxiv.org/abs/2207.05987 | blog.athina.ai
Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models | Safety | March 8, 2024 | Athina AI Research Agent | Aug 23, 2024 1:33 AM | Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models; Language Prompt for Autonomous Driving; ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation; StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing; You Only Prompt Once: On the Capabilities of Prompt Learning on Large Language Models to Tackle Toxic Content; PBNR: Prompt-based News Recommender System; Prompt Stealing Attacks Against Text-to-Image Generation Models; DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning | https://arxiv.org/abs/2312.14197
Automatic Root Cause Analysis via Large Language Models for Cloud Incidents | Evaluation | November 13, 2023 | Athina AI Research Agent | Aug 20, 2024 8:08 PM | PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization; Evidence to Generate (E2G): A Single-agent Two-step Prompting for Context Grounded and Retrieval Augmented Reasoning; Text2MDT: Extracting Medical Decision Trees from Medical Texts | https://arxiv.org/abs/2305.15778 | blog.athina.ai
Enhancing Medical Task Performance in GPT-4V: A Comprehensive Study on Prompt Engineering Strategies | Prompt Engineering | December 12, 2023 | Athina AI Research Agent | Aug 20, 2024 8:07 PM | Prompting GPT-3 To Be Reliable; Prompt Engineering for Transformer-based Chemical Similarity Search Identifies Structurally Distinct Functional Analogues; PAL: Program-aided Language Models; MedPromptExtract (Medical Data Extraction Tool): Anonymization and Hi-fidelity Automated data extraction using NLP and prompt engineering | https://arxiv.org/abs/2312.04344 | blog.athina.ai
Dynamic Prompting: A Unified Framework for Prompt Tuning | Prompt Engineering | March 6, 2023 | Athina AI Research Agent | Aug 20, 2024 8:16 PM | Boosted Prompt Ensembles for Large Language Models; Revisiting Automated Prompting: Are We Actually Doing Better?; CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification; ART: Automatic multi-step reasoning and tool-use for large language models; Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning; Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training | https://arxiv.org/abs/2303.02909 | blog.athina.ai
Large Language Models are Zero-Shot Reasoners | Reasoning | January 29, 2023 | Athina AI Research Agent | Aug 20, 2024 8:14 PM | RAGAR, Your Falsehood RADAR: RAG-Augmented Reasoning for Political Fact-Checking using Multimodal Large Language Models; PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization; LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding | https://arxiv.org/abs/2205.11916 | blog.athina.ai
Temporal evolution of depolarization and magnetic field of FRB 20201124A | Evaluation | September 13, 2023 | Athina AI Research Agent | Aug 20, 2024 8:16 PM | Reasoning with Language Model Prompting: A Survey; Towards Reasoning in Large Language Models: A Survey; Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation | https://arxiv.org/abs/2309.06653 | blog.athina.ai
A Survey on Segment Anything Model (SAM): Vision Foundation Model Meets Prompt Engineering | Evaluation | July 3, 2023 | Athina AI Research Agent | Aug 20, 2024 8:13 PM | A Survey on Segment Anything Model (SAM): Vision Foundation Model Meets Prompt Engineering; DocPrompting: Generating Code by Retrieving the Docs; Large Language Models Are Human-Level Prompt Engineers; To be or not to be? an exploration of continuously controllable prompt engineering; Large Language Models and Prompt Engineering for Biomedical Query Focused Multi-Document Summarisation | https://arxiv.org/abs/2306.06211 | blog.athina.ai
Exploring Prompt Engineering Practices in the Enterprise | Prompt Engineering | March 13, 2024 | Athina AI Research Agent | Aug 20, 2024 8:11 PM | LAMPER: LanguAge Model and Prompt EngineeRing for zero-shot time series classification; ChatGPT4PCG 2 Competition: Prompt Engineering for Science Birds Level Generation; Exploring Prompt Engineering Practices in the Enterprise; Wordflow: Social Prompt Engineering for Large Language Models | https://arxiv.org/abs/2403.0895 | blog.athina.ai
Prompt-tuning latent diffusion models for inverse problems | Fine Tuning | October 2, 2023 | Athina AI Research Agent | Aug 20, 2024 8:19 PM | Language Prompt for Autonomous Driving; ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation; Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition; ProRes: Exploring Degradation-aware Visual Prompt for Universal Image Restoration; DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning; PromptCARE: Prompt Copyright Protection by Watermark Injection and Verification; Are Chatbots Ready for Privacy-Sensitive Applications? An Investigation into Input Regurgitation and Prompt-Induced Sanitization | https://arxiv.org/abs/2310.01110
Towards Large-scale 3D Representation Learning with Multi-dataset Point Prompt Training | Dataset Generation | August 18, 2023 | Athina AI Research Agent | Aug 20, 2024 8:17 PM | How to Prompt LLMs for Text-to-SQL: A Study in Zero-shot, Single-domain, and Cross-domain Settings; Translating Radiology Reports into Plain Language using ChatGPT and GPT-4 with Prompt Learning: Promising Results, Limitations, and Potential; MedPromptExtract (Medical Data Extraction Tool): Anonymization and Hi-fidelity Automated data extraction using NLP and prompt engineering | https://arxiv.org/abs/2308.09718 | blog.athina.ai
Prompt Algebra for Task Composition | Prompt Engineering | June 1, 2023 | Athina AI Research Agent | Aug 20, 2024 8:19 PM | Multimodal Prompt Perceiver: Empower Adaptiveness, Generalizability and Fidelity for All-in-One Image Restoration; Prompt-Tuning Decision Transformer with Preference Ranking; Adversarial Prompt Tuning for Vision-Language Models; Safeguarding Crowdsourcing Surveys from ChatGPT with Prompt Injection; Prompt Sapper: LLM-Empowered Software Engineering Infrastructure for AI-Native Services; Last One Standing: A Comparative Analysis of Security and Privacy of Soft Prompt Tuning, LoRA, and In-Context Learning; SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching | https://arxiv.org/abs/2306.00310
ChatGPT4PCG 2 Competition: Prompt Engineering for Science Birds Level Generation | Prompt Engineering | March 5, 2024 | Athina AI Research Agent | Aug 20, 2024 8:18 PM | Exploring the Intersection of Large Language Models and Agent-Based Modeling via Prompt Engineering; MedPromptExtract (Medical Data Extraction Tool): Anonymization and Hi-fidelity Automated data extraction using NLP and prompt engineering; Large Language Models and Prompt Engineering for Biomedical Query Focused Multi-Document Summarisation; LAMPER: LanguAge Model and Prompt EngineeRing for zero-shot time series classification; Exploring Prompt Engineering Practices in the Enterprise; Automated Black-box Prompt Engineering for Personalized Text-to-Image Generation; A Systematic Survey of Prompt Engineering in Large Language Models: Techniques and Applications; Exploring EFL students' prompt engineering in human-AI story writing: an Activity Theory perspective; A Novel Approach for Rapid Development Based on ChatGPT and Prompt Engineering; Chit-Chat or Deep Talk: Prompt Engineering for Process Mining; Improving ChatGPT Prompt for Code Generation | https://arxiv.org/abs/2403.02610 | blog.athina.ai
Post-Semantic-Thinking: A Robust Strategy to Distill Reasoning Capacity from Large Language Models | Reasoning | April 12, 2024 | Athina AI Research Agent | Aug 20, 2024 8:31 PM | IterAlign: Iterative Constitutional Alignment of Large Language Models; Breaking Down the Defenses: A Comparative Survey of Attacks on Large Language Models | https://arxiv.org/html/2404.09170v1 | blog.athina.ai
Exploring LLM-based Agents for Root Cause Analysis | Evaluation | March 7, 2024 | Athina AI Research Agent | Aug 20, 2024 8:27 PM | Exploring LLM-based Agents for Root Cause Analysis; Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data; Model-tuning Via Prompts Makes NLP Models Adversarially Robust; Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond; Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation; Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond; Global Prompt Cell: A Portable Control Module for Effective Prompt Tuning
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models | Prompt Engineering | June 1, 2023 | Athina AI Research Agent | Aug 20, 2024 8:20 PM | How to Prompt LLMs for Text-to-SQL: A Study in Zero-shot, Single-domain, and Cross-domain Settings; Exploring the Intersection of Large Language Models and Agent-Based Modeling via Prompt Engineering; MedPromptExtract (Medical Data Extraction Tool): Anonymization and Hi-fidelity Automated data extraction using NLP and prompt engineering | https://arxiv.org/abs/2305.16223 | blog.athina.ai
Ever: Mitigating Hallucination in Large Language Models through Real-Time Verification and Rectification | Safety, Hallucinations | April 15, 2024 | Athina AI Research Agent | The EVER (Real-Time Verification and Rectification) framework is designed to dynamically mitigate hallucinations during text generation by ensuring the accuracy and trustworthiness of each sentence before proceeding. | Aug 23, 2024 1:33 AM | Many-Shot Jailbreaking (Anthropic Research); Breaking Down the Defenses: A Comparative Survey of Attacks on Large Language Models; Semi-Structured Chain-of-Thought: Integrating Multiple Sources of Knowledge for Improved Language Model Reasoning; Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive Tasks | https://arxiv.org/html/2311.09114v2 | blog.athina.ai
Prompting GPT-3 To Be Reliable | Safety | February 15, 2023 | Athina AI Research Agent | Aug 20, 2024 8:31 PM | DocPrompting: Generating Code by Retrieving the Docs; Inferring Properties of Graph Neural Networks; Machine Generated Text: A Comprehensive Survey of Threat Models and Detection Methods; Decomposed Prompting: A Modular Approach for Solving Complex Tasks; Prompt Engineering or Fine Tuning: An Empirical Assessment of Large Language Models in Automated Software Engineering Tasks; Prompting AI Art: An Investigation into the Creative Skill of Prompt Engineering; Understanding prompt engineering may not require rethinking generalization; To be or not to be? an exploration of continuously controllable prompt engineering; Prompt-Engineering and Transformer-based Question Generation and Evaluation; Cases of EFL Secondary Students' Prompt Engineering Pathways to Complete a Writing Task with ChatGPT; Enhancing Medical Task Performance in GPT-4V: A Comprehensive Study on Prompt Engineering Strategies; Large Language Models and Prompt Engineering for Biomedical Query Focused Multi-Document Summarisation | https://arxiv.org/abs/2210.09150 | blog.athina.ai
Language Prompt for Autonomous Driving | Dataset Generation | September 8, 2023 | Athina AI Research Agent | Aug 20, 2024 8:33 PM | Pre-Training to Learn in Context; Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models; Segment Any Anomaly without Training via Hybrid Prompt Regularization; LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models; Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models; TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting; Prompt a Robot to Walk with Large Language Models; Jatmo: Prompt Injection Defense by Task-Specific Finetuning; Reprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs Sampling; Assessing Prompt Injection Risks in 200+ Custom GPTs; Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition; TopicGPT: A Prompt-based Topic Modeling Framework; Prompt-tuning latent diffusion models for inverse problems; ProRes: Exploring Degradation-aware Visual Prompt for Universal Image Restoration | https://arxiv.org/abs/2309.04379 | blog.athina.ai
NLPBench: Evaluating Large Language Models on Solving NLP Problems | Evaluation | October 19, 2023 | Athina AI Research Agent | Sep 13, 2024 3:46 PM | GuReT: Distinguishing Guilt and Regret related Text; Unleashing the potential of prompt engineering in Large Language Models: a comprehensive review; PathFinder: Guided Search over Multi-Step Reasoning Paths | https://arxiv.org/abs/2309.15630 | blog.athina.ai
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models | Fine Tuning | August 13, 2023 | Athina AI Research Agent | Aug 20, 2024 8:34 PM | Jailbreaking ChatGPT via Prompt Engineering: An Empirical Study; ChatGPT Prompt Patterns for Improving Code Quality, Refactoring, Requirements Elicitation, and Software Design; Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting; Promptly: Using Prompt Problems to Teach Learners How to Effectively Utilize AI Code Generators; PromptAid: Prompt Exploration, Perturbation, Testing and Iteration using Visual Analytics for Large Language Models; Black-Box Prompt Optimization: Aligning Large Language Models without Model Training; Boosted Prompt Ensembles for Large Language Models | https://arxiv.org/abs/2308.06721 | blog.athina.ai
Tree of Attacks: Jailbreaking Black-Box LLMs Automatically | Safety | February 21, 2024 | Athina AI Research Agent | Aug 20, 2024 8:35 PM | Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training; Unleashing the potential of prompt engineering in Large Language Models: a comprehensive review; Retrieval-Augmented Thought Process as Sequential Decision Making | https://arxiv.org/abs/2312.02119 | blog.athina.ai
FoodGPT: A Large Language Model in Food Testing Domain with Incremental Pre-training and Knowledge Graph Prompt | Fine Tuning | August 20, 2023 | Athina AI Research Agent | Aug 20, 2024 8:36 PM | Prompt-Tuning Decision Transformer with Preference Ranking; FoodGPT: A Large Language Model in Food Testing Domain with Incremental Pre-training and Knowledge Graph Prompt; viz2viz: Prompt-driven stylized visualization generation using a diffusion model; Text-driven Prompt Generation for Vision-Language Models in Federated Learning; Consistency-guided Prompt Learning for Vision-Language Models; Large Language Model Prompt Chaining for Long Legal Document Classification; Plum: Prompt Learning using Metaheuristic | https://arxiv.org/abs/2308.10173
BIM-GPT: a Prompt-Based Virtual Assistant Framework for BIM Information Retrieval | Prompt Engineering | April 18, 2023 | Athina AI Research Agent | Aug 20, 2024 8:37 PM | Multimodal Prompt Perceiver: Empower Adaptiveness, Generalizability and Fidelity for All-in-One Image Restoration; DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer; Controlling Personality Style in Dialogue with Zero-Shot Prompt-Based Learning; Prompt-Tuning Decision Transformer with Preference Ranking; Safeguarding Crowdsourcing Surveys from ChatGPT with Prompt Injection; Prompt Sapper: LLM-Empowered Software Engineering Infrastructure for AI-Native Services; Last One Standing: A Comparative Analysis of Security and Privacy of Soft Prompt Tuning, LoRA, and In-Context Learning; Soft-prompt Tuning for Large Language Models to Evaluate Bias | https://arxiv.org/abs/2304.09333
Flatness-Aware Prompt Selection Improves Accuracy and Sample Efficiency | Evaluation | May 18, 2023 | Athina AI Research Agent | Aug 20, 2024 8:38 PM | TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks; Compress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable Prompt; TreePrompt: Learning to Compose Tree Prompts for Explainable Visual Grounding; What In-Context Learning "Learns" In-Context: Disentangling Task Recognition and Task Learning; SatLM: Satisfiability-Aided Language Models Using Declarative Prompting; Boosted Prompt Ensembles for Large Language Models | https://arxiv.org/abs/2305.10713 | blog.athina.ai
Practical Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration | Safety | December 12, 2023 | Athina AI Research Agent | Aug 20, 2024 8:40 PM | Prompt Packer: Deceiving LLMs through Compositional Instruction with Hidden Attacks; DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer; Exploring the Relationship between LLM Hallucinations and Prompt Linguistic Nuances: Readability, Formality, and Concreteness; Prompt Middleware: Mapping Prompts for Large Language Models to UI Affordances | https://arxiv.org/abs/2311.06062
Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation | Reasoning | February 23, 2024 | Athina AI Research Agent | Aug 20, 2024 8:39 PM | Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models; Retrieval-Augmented Thought Process as Sequential Decision Making; Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation; Empowering Multi-step Reasoning across Languages via Tree-of-Thoughts; Boosting Logical Reasoning in Large Language Models through a New Framework: The Graph of Thought; Tree-of-Mixed-Thought: Combining Fast and Slow Thinking for Multi-hop Visual Reasoning; Large Language Model Guided Tree-of-Thought; MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems | https://arxiv.org/abs/2311.04254 | blog.athina.ai
Prompt-Tuning Decision Transformer with Preference Ranking | Fine Tuning | May 16, 2023 | Athina AI Research Agent | Aug 20, 2024 8:40 PM | Prompt-Tuning Decision Transformer with Preference Ranking; BIM-GPT: a Prompt-Based Virtual Assistant Framework for BIM Information Retrieval; Promise: Prompt-driven 3D Medical Image Segmentation Using Pretrained Image Foundation Models; Prompt Algebra for Task Composition; AntifakePrompt: Prompt-Tuned Vision-Language Models are Fake Image Detectors; Prompt Sapper: LLM-Empowered Software Engineering Infrastructure for AI-Native Services; Last One Standing: A Comparative Analysis of Security and Privacy of Soft Prompt Tuning, LoRA, and In-Context Learning; Testing LLMs on Code Generation with Varying Levels of Prompt Specificity; ULTRA-DP: Unifying Graph Pre-training with Multi-task Graph Dual Prompt; Maatphor: Automated Variant Analysis for Prompt Injection Attacks; SPELL: Semantic Prompt Evolution based on a LLM; FoodGPT: A Large Language Model in Food Testing Domain with Incremental Pre-training and Knowledge Graph Prompt | https://arxiv.org/abs/2305.09648
Meta-in-context learning in large language models | Reasoning | May 22, 2023 | Athina AI Research Agent | Aug 20, 2024 8:41 PM | Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models; Explaining Emergent In-Context Learning as Kernel Regression; Meta-in-context learning in large language models; Compress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable Prompt; TreePrompt: Learning to Compose Tree Prompts for Explainable Visual Grounding | https://arxiv.org/abs/2305.12907 | blog.athina.ai
Prompt-In-Prompt Learning for Universal Image Restoration | Prompt Engineering | December 8, 2023 | Athina AI Research Agent | Aug 20, 2024 8:45 PM | Multimodal Prompt Perceiver: Empower Adaptiveness, Generalizability and Fidelity for All-in-One Image Restoration; TCP:Textual-based Class-aware Prompt tuning for Visual-Language Model; BadCLIP: Trigger-Aware Prompt Learning for Backdoor Attacks on CLIP; Promise: Prompt-driven 3D Medical Image Segmentation Using Pretrained Image Foundation Models | https://arxiv.org/abs/2312.05038
Knowledge-Driven CoT: Exploring Faithful Reasoning in LLMs for Knowledge-intensive Question Answering | Reasoning | October 28, 2023 | Athina AI Research Agent | Sep 18, 2024 3:07 PM | PathFinder: Guided Search over Multi-Step Reasoning Paths; GuReT: Distinguishing Guilt and Regret related Text; Founder-GPT: Self-play to evaluate the Founder-Idea fit | https://arxiv.org/abs/2308.13259 | blog.athina.ai
A Systematic Survey of Prompt Engineering in Large Language Models: Techniques and Applications | Prompt Engineering | February 5, 2024 | Athina AI Research Agent | Aug 20, 2024 8:42 PM | MedPromptExtract (Medical Data Extraction Tool): Anonymization and Hi-fidelity Automated data extraction using NLP and prompt engineering; ChatGPT4PCG 2 Competition: Prompt Engineering for Science Birds Level Generation; LAMPER: LanguAge Model and Prompt EngineeRing for zero-shot time series classification | https://arxiv.org/abs/2402.07927 | blog.athina.ai
Large Language Models Can Be Easily Distracted by Irrelevant Context | Evaluation | June 6, 2023 | Athina AI Research Agent | Aug 20, 2024 8:46 PM | Multimodal Chain-of-Thought Reasoning in Language Models; SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domains; Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery; Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models; Progressive Prompts: Continual Learning for Language Models; Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP; On Second Thought, Let's Not Think Step by Step! Bias and Toxicity in Zero-Shot Reasoning; Unleashing the potential of prompt engineering in Large Language Models: a comprehensive review | https://arxiv.org/abs/2302.00093 | blog.athina.ai
Constitutional AI: Harmlessness from AI Feedback | Reasoning | December 15, 2022 | Athina AI Research Agent | Aug 20, 2024 8:49 PM | GraphPrompt: Unifying Pre-Training and Downstream Tasks for Graph Neural Networks; On Second Thought, Let's Not Think Step by Step! Bias and Toxicity in Zero-Shot Reasoning; Batch Prompting: Efficient Inference with Large Language Model APIs; Successive Prompting for Decomposing Complex Questions; Large Language Models are reasoners with Self-Verification; Demystifying Chains, Trees, and Graphs of Thoughts | https://arxiv.org/abs/2212.08073 | blog.athina.ai
Prompt-based Node Feature Extractor for Few-shot Learning on Text-Attributed Graphs | Prompt Engineering | September 6, 2023 | Athina AI Research Agent | Aug 20, 2024 8:48 PM | LLMs Can Understand Encrypted Prompt: Towards Privacy-Computing Friendly Transformers; Prompt Packer: Deceiving LLMs through Compositional Instruction with Hidden Attacks; Prompt-based Node Feature Extractor for Few-shot Learning on Text-Attributed Graphs; Divide and Prompt: Chain of Thought Prompting for Text-to-SQL; Efficient Federated Prompt Tuning for Black-box Large Pre-trained Models | https://arxiv.org/abs/2309.02848
Prompt Cache: Modular Attention Reuse for Low-Latency Inference | Evaluation | April 25, 2024 | Athina AI Research Agent | Aug 20, 2024 8:47 PM | Translating Radiology Reports into Plain Language using ChatGPT and GPT-4 with Prompt Learning: Promising Results, Limitations, and Potential; How to Prompt LLMs for Text-to-SQL: A Study in Zero-shot, Single-domain, and Cross-domain Settings; MedPromptExtract (Medical Data Extraction Tool): Anonymization and Hi-fidelity Automated data extraction using NLP and prompt engineering | https://arxiv.org/abs/2311.04934 | blog.athina.ai
Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection | Safety | February 23, 2023 | Athina AI Research Agent | Aug 20, 2024 8:49 PM | Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners; Language Is Not All You Need: Aligning Perception with Language Models; EvoPrompting: Language Models for Code-Level Neural Architecture Search; A Prompt Pattern Catalog to Enhance Prompt Engineering with ChatGPT | https://arxiv.org/abs/2302.12173 | blog.athina.ai
Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training | Reasoning | February 9, 2024 | Athina AI Research Agent | Aug 20, 2024 8:51 PM | Dynamic Prompting: A Unified Framework for Prompt Tuning; Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training; Larger language models do in-context learning differently; Empowering Multi-step Reasoning across Languages via Tree-of-Thoughts; Tree of Attacks: Jailbreaking Black-Box LLMs Automatically; Tree-of-Mixed-Thought: Combining Fast and Slow Thinking for Multi-hop Visual Reasoning; Large Language Model Guided Tree-of-Thought; MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems | https://arxiv.org/abs/2309.17179 | blog.athina.ai
TCP:Textual-based Class-aware Prompt tuning for Visual-Language Model | Fine Tuning | March 13, 2024 | Athina AI Research Agent | Aug 20, 2024 8:52 PM | Multimodal Prompt Perceiver: Empower Adaptiveness, Generalizability and Fidelity for All-in-One Image Restoration; DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer; Soft Prompt Tuning for Augmenting Dense Retrieval with Large Language Models; Prompt-In-Prompt Learning for Universal Image Restoration; AutoHint: Automatic Prompt Optimization with Hint Generation | https://arxiv.org/abs/2311.18231
Tree-of-Mixed-Thought: Combining Fast and Slow Thinking for Multi-hop Visual Reasoning | Reasoning | August 21, 2023 | Athina AI Research Agent | Aug 20, 2024 8:51 PM | Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation; Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training; Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models; Large Language Model Guided Tree-of-Thought; MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems; GuReT: Distinguishing Guilt and Regret related Text | https://arxiv.org/abs/2308.09658 | blog.athina.ai
NN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference | Evaluation | March 24, 2023 | Athina AI Research Agent | Aug 20, 2024 8:53 PM | A Comprehensive Survey on Instruction Following; Revisiting Automated Prompting: Are We Actually Doing Better?; Global Prompt Cell: A Portable Control Module for Effective Prompt Tuning; Context-faithful Prompting for Large Language Models; Structure Pretraining and Prompt Tuning for Knowledge Graph Transfer; CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification; Larger language models do in-context learning differently | https://arxiv.org/abs/2303.13824 | blog.athina.ai
Progressive Visual Prompt Learning with Contrastive Feature Re-formation | Fine Tuning | April 17, 2023 | Athina AI Research Agent | Aug 20, 2024 8:55 PM | An automatically discovered chain-of-thought prompt generalizes to novel models and datasets; Visual Prompt Based Personalized Federated Learning; PromptTTS 2: Describing and Generating Voices with Text Prompt; Testing LLMs on Code Generation with Varying Levels of Prompt Specificity; Soft-prompt Tuning for Large Language Models to Evaluate Bias; Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models; Maatphor: Automated Variant Analysis for Prompt Injection Attacks; SPELL: Semantic Prompt Evolution based on a LLM | https://arxiv.org/abs/2304.08386
Automated Black-box Prompt Engineering for Personalized Text-to-Image Generation | Prompt Engineering | March 28, 2024 | Athina AI Research Agent | Aug 20, 2024 8:54 PM | Exploring the Intersection of Large Language Models and Agent-Based Modeling via Prompt Engineering; MedPromptExtract (Medical Data Extraction Tool): Anonymization and Hi-fidelity Automated data extraction using NLP and prompt engineering; ChatGPT4PCG 2 Competition: Prompt Engineering for Science Birds Level Generation | https://arxiv.org/abs/2403.1910 | blog.athina.ai
Boosted Prompt Ensembles for Large Language Models | Prompt Engineering | April 12, 2023 | Athina AI Research Agent | Aug 20, 2024 8:56 PM | Promptly: Using Prompt Problems to Teach Learners How to Effectively Utilize AI Code Generators; IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models; PromptAid: Prompt Exploration, Perturbation, Testing and Iteration using Visual Analytics for Large Language Models | https://arxiv.org/abs/2304.05970 | blog.athina.ai
Reflexion: Language Agents with Verbal Reinforcement Learning | Prompt Engineering | Athina AI Research Agent | Aug 20, 2024 8:57 PM | Direct Preference Optimization: Your Language Model is Secretly a Reward Model | https://arxiv.org/pdf/2303.11366.pdf | blog.athina.ai
EvoPrompting: Language Models for Code-Level Neural Architecture Search | Prompt Engineering | February 28, 2023 | Athina AI Research Agent | Aug 20, 2024 8:55 PM | How Robust is GPT-3.5 to Predecessors? A Comprehensive Study on Language Understanding Tasks; Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners; Effectiveness of Data Augmentation for Parameter Efficient Tuning with Limited Data; Active Prompting with Chain-of-Thought for Large Language Models; Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection | https://arxiv.org/abs/2302.14838 | blog.athina.ai
PAL: Program-aided Language Models | Reasoning | January 27, 2023 | Athina AI Research Agent | Aug 20, 2024 8:58 PM | Making Large Language Models Better Reasoners with Step-Aware Verifier; Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning; Self-Consistency Improves Chain of Thought Reasoning in Language Models; Large Language Models Are Human-Level Prompt Engineers; Recitation-Augmented Language Models; Decomposed Prompting: A Modular Approach for Solving Complex Tasks; Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought; Prompt Engineering a Prompt Engineer; Batch Calibration: Rethinking Calibration for In-Context Learning and Prompt Engineering; A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models; PEACE: Prompt Engineering Automation for CLIPSeg Enhancement in Aerial Robotics; Prompt Engineering for Transformer-based Chemical Similarity Search Identifies Structurally Distinct Functional Analogues; Prompt Engineering-assisted Malware Dynamic Analysis Using GPT-4; Enhancing Medical Task Performance in GPT-4V: A Comprehensive Study on Prompt Engineering Strategies | https://arxiv.org/abs/2211.10435 | blog.athina.ai
UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation | Prompt Engineering | March 15, 2023 | Athina AI Research Agent | Aug 20, 2024 9:09 PM | LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models; Prompt Design and Engineering: Introduction and Advanced Methods; Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models | https://arxiv.org/abs/2303.08518 | blog.athina.ai
REFINER: Reasoning Feedback on Intermediate Representations | Reasoning | April 4, 2023 | Athina AI Research Agent | Aug 20, 2024 8:59 PM | Global Prompt Cell: A Portable Control Module for Effective Prompt Tuning; Boosted Prompt Ensembles for Large Language Models; Why think step by step? Reasoning emerges from the locality of experience | https://arxiv.org/abs/2304.01904 | blog.athina.ai
Batch Prompting: Efficient Inference with Large Language Model APIs | Evaluation | October 24, 2023 | Athina AI Research Agent | Aug 20, 2024 9:00 PM | Multimodal Chain-of-Thought Reasoning in Language Models; Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery; GraphPrompt: Unifying Pre-Training and Downstream Tasks for Graph Neural Networks; Constitutional AI: Harmlessness from AI Feedback; Successive Prompting for Decomposing Complex Questions | https://arxiv.org/abs/2301.08721 | blog.athina.ai
Assessing Prompt Injection Risks in 200+ Custom GPTs | Safety | May 25, 2024 | Athina AI Research Agent | Aug 20, 2024 9:12 PM | Language Prompt for Autonomous Driving; ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation; LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression | https://arxiv.org/abs/2311.11538
Batch Calibration: Rethinking Calibration for In-Context Learning and Prompt Engineering | Evaluation | January 24, 2024 | Athina AI Research Agent | Aug 20, 2024 9:11 PM | Large Language Models Are Human-Level Prompt Engineers; PAL: Program-aided Language Models; Machine Generated Text: A Comprehensive Survey of Threat Models and Detection Methods | https://arxiv.org/abs/2309.17249 | blog.athina.ai
A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models | Prompt Engineering | July 24, 2023 | Athina AI Research Agent | Aug 20, 2024 9:10 PM | DocPrompting: Generating Code by Retrieving the Docs; PAL: Program-aided Language Models; Machine Generated Text: A Comprehensive Survey of Threat Models and Detection Methods | https://arxiv.org/abs/2307.12980 | blog.athina.ai
Better Zero-Shot Reasoning with Self-Adaptive Prompting | Prompt Engineering | May 23, 2023 | Athina AI Research Agent | Aug 20, 2024 9:12 PM | Graph of Thoughts: Solving Elaborate Problems with Large Language Models; Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond; Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation | https://arxiv.org/abs/2305.14106 | blog.athina.ai
Hierarchical Prompting Assists Large Language Model on Web Navigation | Prompt Engineering | May 23, 2023 | Athina AI Research Agent | Aug 20, 2024 9:45 PM | A Bibliometric Review of Large Language Models Research from 2017 to 2023; Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond | https://arxiv.org/abs/2305.14257 | blog.athina.ai
Enhancing Large Language Models Against Inductive Instructions with Dual-critique Prompting | Prompt Engineering | May 23, 2023 | Athina AI Research Agent | Aug 20, 2024 9:44 PM | Prompt Design and Engineering: Introduction and Advanced Methods; LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models; Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic; Tree of Thoughts: Deliberate Problem Solving with Large Language Models; Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models | https://arxiv.org/abs/2305.13733 | blog.athina.ai
Improving ChatGPT Prompt for Code Generation | Evaluation | May 15, 2023 | Athina AI Research Agent | Aug 20, 2024 9:13 PM | Exploring the Intersection of Large Language Models and Agent-Based Modeling via Prompt Engineering; How to Prompt LLMs for Text-to-SQL: A Study in Zero-shot, Single-domain, and Cross-domain Settings; ChatGPT4PCG 2 Competition: Prompt Engineering for Science Birds Level Generation | https://arxiv.org/abs/2305.08360 | blog.athina.ai
Tree of Thoughts: Deliberate Problem Solving with Large Language Models | Prompt Engineering | May 17, 2023 | Athina AI Research Agent | Aug 20, 2024 9:45 PM | Prompt Design and Engineering: Introduction and Advanced Methods; Post Hoc Explanations of Language Models Can Improve Language Models; Enhancing Large Language Models Against Inductive Instructions with Dual-critique Prompting; Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models | https://arxiv.org/abs/2305.10601 | blog.athina.ai
Jatmo: Prompt Injection Defense by Task-Specific Finetuning | Fine Tuning | January 8, 2024 | Athina AI Research Agent | Aug 20, 2024 9:14 PM | Language Prompt for Autonomous Driving; ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation; Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond | https://arxiv.org/abs/2312.17673
Active Prompting with Chain-of-Thought for Large Language Models | Evaluation | February 23, 2023 | Athina AI Research Agent | Aug 20, 2024 9:49 PM | Language Is Not All You Need: Aligning Perception with Language Models; Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners; EvoPrompting: Language Models for Code-Level Neural Architecture Search; A Prompt Pattern Catalog to Enhance Prompt Engineering with ChatGPT; Guiding Large Language Models via Directional Stimulus Prompting | https://arxiv.org/abs/2302.12246 | blog.athina.ai
Compositional Exemplars for In-context Learning | Prompt Engineering | June 20, 2023 | Athina AI Research Agent | Aug 20, 2024 9:47 PM | SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domains; The Capacity for Moral Self-Correction in Large Language Models; Evaluating the Robustness of Discrete Prompts; Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery; Multimodal Chain-of-Thought Reasoning in Language Models; Retrieval-Augmented Thought Process as Sequential Decision Making | https://arxiv.org/abs/2302.05698 | blog.athina.ai
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models | Reasoning | January 10, 2023 | Athina AI Research Agent | Aug 20, 2024 9:46 PM | Text2MDT: Extracting Medical Decision Trees from Medical Texts; RAGAR, Your Falsehood RADAR: RAG-Augmented Reasoning for Political Fact-Checking using Multimodal Large Language Models; DiffusionGPT: LLM-Driven Text-to-Image Generation System | https://arxiv.org/abs/2201.11903 | blog.athina.ai
Let's Verify Step by Step | Evaluation | May 31, 2023 | Athina AI Research Agent | Aug 20, 2024 9:48 PM | One Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC Era; Reasoning with Language Model Prompting: A Survey; Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond | https://arxiv.org/abs/2305.20050 | blog.athina.ai
Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting | Prompt Engineering | October 17, 2023 | Athina AI Research Agent | Aug 20, 2024 9:52 PM | Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution; LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models; LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression; Prompt Injection attack against LLM-integrated Applications; Jailbreaking ChatGPT via Prompt Engineering: An Empirical Study; IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models; Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game; AnomalyCLIP: Object-agnostic Prompt Learning for Zero-shot Anomaly Detection | https://arxiv.org/abs/2310.11324 | blog.athina.ai
Multimodal Chain-of-Thought Reasoning in Language Models | Reasoning | February 17, 2023 | Athina AI Research Agent | Aug 20, 2024 9:53 PM | SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domains; Evaluating the Robustness of Discrete Prompts; Compositional Exemplars for In-context Learning; Large Language Models Can Be Easily Distracted by Irrelevant Context; Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models; Progressive Prompts: Continual Learning for Language Models; Batch Prompting: Efficient Inference with Large Language Model APIs; Retrieval-Augmented Thought Process as Sequential Decision
Makinghttps://arxiv.org/abs/2302.00923blog.athina.aiEnhancing Few-shot Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design StrategiesPrompt EngineeringMay 21, 2023Athina AI Research AgentAug 20, 2024 9:50 PMThe Web Can Be Your Oyster for Improving Large Language ModelsFrom Prompt Injections to SQL Injection Attacks: How Protected is Your LLM-Integrated Web Application?Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERTSpeechPrompt v2: Prompt Tuning for Speech Classification TasksPrivacy-Preserving Prompt Tuning for Large Language Model ServicesNegative-prompt Inversion: Fast Image Inversion for Editing with Text-guided Diffusion Modelshttps://arxiv.org/abs/2305.12586blog.athina.aiPrivacy-Preserving Prompt Tuning for Large Language Model ServicesFine TuningMay 10, 2023Athina AI Research AgentAug 20, 2024 9:51 PMSpeechPrompt v2: Prompt Tuning for Speech Classification TasksEnhancing Few-shot Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design StrategiesCan ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERThttps://arxiv.org/abs/2305.06212blog.athina.aiWalking Down the Memory Maze: Beyond Context Limit through Interactive ReadingReasoningOctober 8, 2023Athina AI Research AgentAug 20, 2024 9:54 PMReasoning with Language Model Prompting: A SurveyTowards Reasoning in Large Language Models: A SurveyReinforcement Learning in the Era of LLMs: What is Essential? What is needed? 
An RL Perspective on RLHF, Prompting, and Beyondhttps://arxiv.org/abs/2310.05029blog.athina.aiCompress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable PromptPrompt EngineeringMay 17, 2023Athina AI Research AgentAug 20, 2024 9:57 PMLet's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning and Coding with LLMsMeta-in-context learning in large language modelsCan We Edit Factual Knowledge by In-Context Learning?TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex TasksFlatness-Aware Prompt Selection Improves Accuracy and Sample EfficiencySegment Any Anomaly without Training via Hybrid Prompt Regularization https://arxiv.org/abs/2305.11186blog.athina.aiEvidence to Generate (E2G): A Single-agent Two-step Prompting for Context Grounded and Retrieval Augmented ReasoningReasoningJanuary 11, 2024Athina AI Research AgentAug 20, 2024 9:55 PMRAGAR, Your Falsehood RADAR: RAG-Augmented Reasoning for Political Fact-Checking using Multimodal Large Language ModelsEvidence to Generate (E2G): A Single-agent Two-step Prompting for Context Grounded and Retrieval Augmented ReasoningPathFinder: Guided Search over Multi-Step Reasoning PathsOn the Empirical Complexity of Reasoning and Planning in LLMsGTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic EvaluationsTemporal Data Meets LLM -- Explainable Financial Time Series ForecastingSTAMP: Differentiable Task and Motion Planning via Stein Variational Gradient DescentAnalyzing Toxicity in Deep Conversations: A Reddit Case StudyLayoutLLM: Layout Instruction Tuning with Large Language Models for Document UnderstandingDiffusionGPT: LLM-Driven Text-to-Image Generation SystemPromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt OptimizationText2MDT: Extracting Medical Decision Trees from Medical TextsAutomatic Root Cause Analysis via Large Language Models for Cloud 
Incidentshttps://arxiv.org/abs/2401.05787blog.athina.aiAutomatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled DataPrompt EngineeringFebruary 24, 2023Athina AI Research AgentAug 20, 2024 9:55 PMEnhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through LogicPrompt Design and Engineering: Introduction and Advanced MethodsConnecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt OptimizersExploring LLM-based Agents for Root Cause AnalysisFew-shot Fine-tuning vs. In-context Learning: A Fair Comparison and EvaluationHarnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyondhttps://arxiv.org/abs/2302.12822blog.athina.aiInvestigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction FollowingPrompt EngineeringFebruary 28, 2023Athina AI Research AgentAug 20, 2024 9:56 PMEnhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through LogicLLMLingua: Compressing Prompts for Accelerated Inference of Large Language ModelsPrompt Design and Engineering: Introduction and Advanced MethodsReinforcement Learning in the Era of LLMs: What is Essential? What is needed? 
An RL Perspective on RLHF, Prompting, and Beyondhttps://arxiv.org/abs/2302.14691blog.athina.aiDeficiency of Large Language Models in Finance: An Empirical Examination of HallucinationHallucinationsNovember 27, 2023Athina AI Research AgentAug 20, 2024 10:00 PMDeficiency of Large Language Models in Finance: An Empirical Examination of HallucinationChain-of-Verification Reduces Hallucination in Large Language ModelsSelf-contradictory Hallucinations of Large Language Models: Evaluation, Detection and MitigationA Step Closer to Comprehensive Answers: Constrained Multi-Stage Question Decomposition with Large Language Modelshttps://arxiv.org/abs/2311.15548blog.athina.aiLarge Language Model Prompt Chaining for Long Legal Document ClassificationPrompt EngineeringAugust 8, 2023Athina AI Research AgentAug 20, 2024 9:59 PMFoodGPT: A Large Language Model in Food Testing Domain with Incremental Pre-training and Knowledge Graph PromptText-driven Prompt Generation for Vision-Language Models in Federated LearningConsistency-guided Prompt Learning for Vision-Language Modelshttps://arxiv.org/abs/2308.04138Mixture of Soft Prompts for Controllable Data GenerationDataset GenerationMarch 2, 2023Athina AI Research AgentAug 20, 2024 9:59 PMMultitask Prompt Tuning Enables Parameter-Efficient Transfer LearningEffectiveness of Data Augmentation for Parameter Efficient Tuning with Limited DataMixture of Soft Prompts for Controllable Data GenerationPrompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot LearnersHow Robust is GPT-3.5 to Predecessors? 
A Comprehensive Study on Language Understanding Taskshttps://arxiv.org/abs/2303.01580blog.athina.aiSGL-PT: A Strong Graph Learner with Graph Prompt TuningFine TuningAugust 15, 2023Athina AI Research AgentAug 20, 2024 9:58 PMExploring the Intersection of Large Language Models and Agent-Based Modeling via Prompt EngineeringHow to Prompt LLMs for Text-to-SQL: A Study in Zero-shot, Single-domain, and Cross-domain SettingsTranslating Radiology Reports into Plain Language using ChatGPT and GPT-4 with Prompt Learning: Promising Results, Limitations, and Potentialhttps://arxiv.org/abs/2302.12449blog.athina.aiReasoning with Language Model Prompting: A SurveyReasoningDecember 19, 2022Athina AI Research AgentAug 21, 2024 8:32 PMA Survey on In-context LearningNatural Language Reasoning, A SurveyEmergent Abilities of Large Language ModelsA Taxonomy of Prompt Modifiers for Text-To-Image GenerationWalking Down the Memory Maze: Beyond Context Limit through Interactive ReadingTemporal evolution of depolarization and magnetic field of FRB 20201124AChain-of-Verification Reduces Hallucination in Large Language ModelsGraph of Thoughts: Solving Elaborate Problems with Large Language ModelsFocused Prefix Tuning for Controllable Text GenerationExploring Lottery Prompts for Pre-trained Language ModelsLess Likely Brainstorming: Using Language Models to Generate Alternative HypothesesLet's Verify Step by Stephttps://arxiv.org/abs/2212.09597blog.athina.aiPrompt Engineering-assisted Malware Dynamic Analysis Using GPT-4Prompt EngineeringDecember 13, 2023Athina AI Research AgentAug 20, 2024 10:01 PMDocPrompting: Generating Code by Retrieving the DocsPAL: Program-aided Language ModelsLarge Language Models Are Human-Level Prompt Engineershttps://arxiv.org/abs/2312.08317blog.athina.aiTool Learning with Foundation ModelsEvaluationApril 17, 2023Athina AI Research AgentAug 20, 2024 10:03 PMFew-shot Fine-tuning vs. 
In-context Learning: A Fair Comparison and EvaluationReinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and BeyondTool Learning with Foundation Modelshttps://arxiv.org/abs/2304.08354blog.athina.aiDynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical ReasoningReasoningMarch 2, 2023Athina AI Research AgentAug 20, 2024 10:02 PMRAGAR, Your Falsehood RADAR: RAG-Augmented Reasoning for Political Fact-Checking using Multimodal Large Language ModelsText2MDT: Extracting Medical Decision Trees from Medical TextsDiffusionGPT: LLM-Driven Text-to-Image Generation SystemDocPrompting: Generating Code by Retrieving the DocsPAL: Program-aided Language ModelsLarge Language Models Are Human-Level Prompt EngineersMachine Generated Text: A Comprehensive Survey of Threat Models and Detection Methodshttps://arxiv.org/abs/2209.14610blog.athina.aiSegment Any Anomaly without Training via Hybrid Prompt RegularizationFoundation ModelMay 18, 2023Athina AI Research AgentAug 21, 2024 8:38 PMLanguage Prompt for Autonomous DrivingCompress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable PromptEfficient Prompting via Dynamic In-Context LearningTEMPO: Prompt-based Generative Pre-trained Transformer for Time Series ForecastingYou Only Prompt Once: On the Capabilities of Prompt Learning on Large Language Models to Tackle Toxic ContentPBNR: Prompt-based News Recommender SystemPrompt Stealing Attacks Against Text-to-Image Generation Modelshttps://arxiv.org/abs/2305.10724blog.athina.aiPEACE: Prompt Engineering Automation for CLIPSeg Enhancement in Aerial RoboticsPrompt EngineeringDecember 8, 2023Athina AI Research AgentAug 21, 2024 8:33 PMDocPrompting: Generating Code by Retrieving the DocsPAL: Program-aided Language ModelsMachine Generated Text: A Comprehensive Survey of Threat Models and Detection Methodshttps://arxiv.org/abs/2310.000blog.athina.aiGuReT: Distinguishing Guilt 
and Regret related TextDataset GenerationJanuary 29, 2024Athina AI Research AgentAug 21, 2024 8:36 PMUnleashing the potential of prompt engineering in Large Language Models: a comprehensive reviewLarge Language Model Guided Tree-of-ThoughtTree-of-Mixed-Thought: Combining Fast and Slow Thinking for Multi-hop Visual ReasoningFounder-GPT: Self-play to evaluate the Founder-Idea fitRNNs are not Transformers (Yet): The Key Bottleneck on In-context RetrievalBoosting of Thoughts: Trial-and-Error Problem Solving with Large Language ModelsAI Chain on Large Language Model for Unsupervised Control Flow Graph Generation for Statically-Typed Partial CodePathFinder: Guided Search over Multi-Step Reasoning PathsSPROUT: Authoring Programming Tutorials with Interactive Visualization of Large Language Model Generation ProcessNLPBench: Evaluating Large Language Models on Solving NLP ProblemsSelf-Taught Optimizer (STOP): Recursively Self-Improving Code GenerationKnowledge-Driven CoT: Exploring Faithful Reasoning in LLMs for Knowledge-intensive Question AnsweringTree of Reviews: A Tree-based Dynamic Iterative Retrieval Framework for Multi-hop Question AnsweringChain-of-Thought Reasoning is a Policy Improvement OperatorEnhancing Large Language Models for Clinical Decision Support by Incorporating Clinical Practice GuidelinesRAGAR, Your Falsehood RADAR: RAG-Augmented Reasoning for Political Fact-Checking using Multimodal Large Language ModelsTemporal Data Meets LLM -- Explainable Financial Time Series ForecastingInferring Properties of Graph Neural Networkshttps://arxiv.org/abs/2401.16541blog.athina.aiSelf-Taught Optimizer (STOP): Recursively Self-Improving Code GenerationReasoningMarch 1, 2024Athina AI Research AgentAug 21, 2024 8:35 PMPathFinder: Guided Search over Multi-Step Reasoning PathsGuReT: Distinguishing Guilt and Regret related TextFounder-GPT: Self-play to evaluate the Founder-Idea fithttps://arxiv.org/abs/2310.02304blog.athina.aiPrompt Engineering a Prompt EngineerPrompt 
EngineeringFebruary 19, 2024Athina AI Research AgentAug 21, 2024 8:39 PMDocPrompting: Generating Code by Retrieving the DocsPAL: Program-aided Language ModelsLarge Language Models Are Human-Level Prompt Engineershttps://arxiv.org/abs/2311.05661blog.athina.aiEfficient Prompting via Dynamic In-Context LearningPrompt EngineeringMay 18, 2023Athina AI Research AgentAug 21, 2024 8:41 PMTELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex TasksTreePrompt: Learning to Compose Tree Prompts for Explainable Visual GroundingPlan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language ModelsChain-of-Symbol Prompting Elicits Planning in Large Langauge ModelsWhat In-Context Learning "Learns" In-Context: Disentangling Task Recognition and Task LearningReprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs SamplingImageDream: Image-Prompt Multi-view Diffusion for 3D GenerationLongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt CompressionSegment Any Anomaly without Training via Hybrid Prompt RegularizationRe-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and BeyondLLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Modelshttps://arxiv.org/abs/2305.11170blog.athina.aiA study on Prompt Design, Advantages and Limitations of ChatGPT for Deep Learning Program RepairPrompt EngineeringApril 17, 2023Athina AI Research AgentAug 21, 2024 8:38 PMExploring EFL students' prompt engineering in human-AI story writing: an Activity Theory perspectiveChit-Chat or Deep Talk: Prompt Engineering for Process MiningTranslating Radiology Reports into Plain Language using ChatGPT and GPT-4 with Prompt Learning: Promising Results, Limitations, and Potentialhttps://arxiv.org/abs/2304.0819blog.athina.aiCAMEL: Communicative Agents for "Mind" Exploration of Large Language Model SocietyPrompt EngineeringMarch 31, 
2023Athina AI Research AgentAug 21, 2024 8:40 PMBoosted Prompt Ensembles for Large Language ModelsGlobal Prompt Cell: A Portable Control Module for Effective Prompt TuningWhy think step by step? Reasoning emerges from the locality of experiencehttps://arxiv.org/abs/2303.17760blog.athina.aiYou Only Prompt Once: On the Capabilities of Prompt Learning on Large Language Models to Tackle Toxic ContentPrompt EngineeringAugust 10, 2023Athina AI Research AgentAug 21, 2024 8:43 PMBenchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language ModelsSegment Any Anomaly without Training via Hybrid Prompt RegularizationImageDream: Image-Prompt Multi-view Diffusion for 3D Generationhttps://arxiv.org/abs/2308.05596Towards Reasoning in Large Language Models: A SurveyReasoningDecember 20, 2022Athina AI Research AgentAug 21, 2024 8:45 PMOne Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC EraAugmented Language Models: a SurveyHarnessing the Power of LLMs in Practice: A Survey on ChatGPT and BeyondEmergent Abilities of Large Language ModelsA Taxonomy of Prompt Modifiers for Text-To-Image GenerationPre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language ProcessingWalking Down the Memory Maze: Beyond Context Limit through Interactive ReadingTemporal evolution of depolarization and magnetic field of FRB 20201124AChain-of-Verification Reduces Hallucination in Large Language ModelsFrom Sparse to Dense: GPT-4 Summarization with Chain of Density PromptingGraph of Thoughts: Solving Elaborate Problems with Large Language ModelsFocused Prefix Tuning for Controllable Text GenerationExploring Lottery Prompts for Pre-trained Language Modelshttps://arxiv.org/abs/2212.10403blog.athina.aiReasoning with Language Model is Planning with World ModelReasoningMay 24, 2023Athina AI Research AgentAug 21, 2024 8:43 PMGraph of Thoughts: Solving Elaborate Problems with Large Language ModelsReinforcement 
Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and BeyondFew-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluationhttps://arxiv.org/abs/2305.14992v1blog.athina.aiULTRA-DP: Unifying Graph Pre-training with Multi-task Graph Dual PromptPrompt EngineeringDecember 17, 2023Athina AI Research AgentAug 21, 2024 8:42 PMPrompt-Tuning Decision Transformer with Preference RankingMultimodal Prompt Perceiver: Empower Adaptiveness, Generalizability and Fidelity for All-in-One Image RestorationTesting LLMs on Code Generation with Varying Levels of Prompt Specificityhttps://arxiv.org/abs/2310.14845Progressive Prompts: Continual Learning for Language ModelsEvaluationJanuary 29, 2023Athina AI Research AgentAug 21, 2024 8:46 PMGraphPrompt: Unifying Pre-Training and Downstream Tasks for Graph Neural NetworksMultimodal Chain-of-Thought Reasoning in Language ModelsLarge Language Models Can Be Easily Distracted by Irrelevant ContextThe Flan Collection: Designing Data and Methods for Effective Instruction Tuninghttps://arxiv.org/abs/2301.12314blog.athina.aiDemonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLPRAGJanuary 23, 2023Athina AI Research AgentAug 21, 2024 8:47 PMGraphPrompt: Unifying Pre-Training and Downstream Tasks for Graph Neural NetworksLarge Language Models Can Be Easily Distracted by Irrelevant ContextEvaluating the Robustness of Discrete PromptsSuccessive Prompting for Decomposing Complex QuestionsLarge Language Models are reasoners with Self-Verificationhttps://arxiv.org/abs/2212.14024blog.athina.aiPrompt Engineering or Fine Tuning: An Empirical Assessment of Large Language Models in Automated Software Engineering TasksEvaluationOctober 11, 2023Athina AI Research AgentAug 21, 2024 8:45 PMPrompting GPT-3 To Be ReliableDocPrompting: Generating Code by Retrieving the DocsLarge Language Models Are Human-Level Prompt 
Engineershttps://arxiv.org/abs/2310.10508blog.athina.aiHarnessing the Power of LLMs in Practice: A Survey on ChatGPT and BeyondEvaluationApril 26, 2023Athina AI Research AgentAug 21, 2024 8:51 PMHarnessing the Power of LLMs in Practice: A Survey on ChatGPT and BeyondExploring LLM-based Agents for Root Cause AnalysisAutomatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled DataOne Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC EraNatural Language Reasoning, A SurveyAugmented Language Models: a SurveyA Survey on In-context LearningTowards Reasoning in Large Language Models: A SurveyHierarchical Prompting Assists Large Language Model on Web NavigationZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMshttps://arxiv.org/abs/2304.13712blog.athina.aiA-la-carte Prompt Tuning (APT): Combining Distinct Data Via Composable PromptingFine TuningFebruary 15, 2023Athina AI Research AgentAug 21, 2024 8:48 PMBounding the Capabilities of Large Language Models in Open Text Generation with Prompt ConstraintsCan ChatGPT Understand Too? 
A Comparative Study on ChatGPT and Fine-tuned BERTA Prompt Pattern Catalog to Enhance Prompt Engineering with ChatGPTGraphPrompt: Unifying Pre-Training and Downstream Tasks for Graph Neural NetworksSwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domainshttps://arxiv.org/abs/2302.07994blog.athina.aiLarge Language Models as Analogical ReasonersPrompt EngineeringOctober 3, 2023Athina AI Research AgentAug 21, 2024 8:50 PMEnhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through LogicA Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open QuestionsPrompt Design and Engineering: Introduction and Advanced Methodshttps://arxiv.org/abs/2310.01714blog.athina.aiReverse Stable Diffusion: What prompt was used to generate this image?Prompt EngineeringAugust 2, 2023Athina AI Research AgentAug 21, 2024 8:49 PMImage-Object-Specific Prompt Learning for Few-Shot Class-Incremental LearningConsistency-guided Prompt Learning for Vision-Language ModelsReverse Stable Diffusion: What prompt was used to generate this image?Rethinking Visual Prompt Learning as Masked Visual Token Modelinghttps://arxiv.org/abs/2308.01472Soft Prompt Tuning for Augmenting Dense Retrieval with Large Language ModelsPrompt EngineeringJune 17, 2024Athina AI Research AgentAug 21, 2024 8:53 PMLayout and Task Aware Instruction Prompt for Zero-shot Document Image Question AnsweringSoft Prompt Tuning for Augmenting Dense Retrieval with Large Language ModelsTCP:Textual-based Class-aware Prompt tuning for Visual-Language Modelhttps://arxiv.org/abs/2307.08303Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language ModelsHallucinationsSeptember 3, 2023Athina AI Research AgentAug 21, 2024 8:54 PMSemi-Structured Chain-of-Thought: Integrating Multiple Sources of Knowledge for Improved Language Model ReasoningFine-tuning Language Models for FactualitySelf-contradictory Hallucinations of Large Language 
Models: Evaluation, Detection and MitigationA Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questionshttps://arxiv.org/abs/2309.01219blog.athina.aiMACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical ProblemsReasoningApril 6, 2024Athina AI Research AgentAug 21, 2024 8:52 PMAlphazero-like Tree-Search can Guide Large Language Model Decoding and TrainingEverything of Thoughts: Defying the Law of Penrose Triangle for Thought GenerationTree-of-Mixed-Thought: Combining Fast and Slow Thinking for Multi-hop Visual ReasoningLarge Language Models are Few-shot Generators: Proposing Hybrid Prompt Algorithm To Generate Webshell Escape Sampleshttps://arxiv.org/abs/2404.04735blog.athina.aiSPELL: Semantic Prompt Evolution based on a LLMPrompt EngineeringOctober 2, 2023Athina AI Research AgentAug 21, 2024 8:55 PMPrompt-Tuning Decision Transformer with Preference RankingProgressive Visual Prompt Learning with Contrastive Feature Re-formationSoft-prompt Tuning for Large Language Models to Evaluate Biasviz2viz: Prompt-driven stylized visualization generation using a diffusion modelRobust Safety Classifier for Large Language Models: Adversarial Prompt Shieldhttps://arxiv.org/abs/2310.01260Does Prompt-Tuning Language Model Ensure Privacy?SafetyApril 15, 2023Athina AI Research AgentAug 21, 2024 8:57 PMText-driven Prompt Generation for Vision-Language Models in Federated LearningImage-Object-Specific Prompt Learning for Few-Shot Class-Incremental LearningConsistency-guided Prompt Learning for Vision-Language Modelshttps://arxiv.org/abs/2304.03472Promptly: Using Prompt Problems to Teach Learners How to Effectively Utilize AI Code GeneratorsEvaluationJuly 31, 2023Athina AI Research AgentAug 21, 2024 8:59 PMAnomalyCLIP: Object-agnostic Prompt Learning for Zero-shot Anomaly DetectionJailbreaking ChatGPT via Prompt Engineering: An Empirical StudyIP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image 
Diffusion ModelsPromptAid: Prompt Exploration, Perturbation, Testing and Iteration using Visual Analytics for Large Language ModelsBlack-Box Prompt Optimization: Aligning Large Language Models without Model TrainingBoosted Prompt Ensembles for Large Language Modelshttps://arxiv.org/abs/2307.16364blog.athina.aiMachine Generated Text: A Comprehensive Survey of Threat Models and Detection MethodsSafetyMay 8, 2023Athina AI Research AgentAug 21, 2024 8:56 PMLarge Language Models Are Human-Level Prompt EngineersMachine Generated Text: A Comprehensive Survey of Threat Models and Detection MethodsDynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical ReasoningRecitation-Augmented Language ModelsReAct: Synergizing Reasoning and Acting in Language ModelsPrompting GPT-3 To Be ReliableDecomposed Prompting: A Modular Approach for Solving Complex TasksPrompt Engineering for Healthcare: Methodologies and ApplicationsBatch Calibration: Rethinking Calibration for In-Context Learning and Prompt EngineeringA Systematic Survey of Prompt Engineering on Vision-Language Foundation ModelsPEACE: Prompt Engineering Automation for CLIPSeg Enhancement in Aerial Roboticshttps://arxiv.org/abs/2210.07321blog.athina.aiIgnore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking CompetitionDataset GenerationMarch 3, 2024Athina AI Research AgentAug 21, 2024 8:58 PMLanguage Prompt for Autonomous DrivingImageDream: Image-Prompt Multi-view Diffusion for 3D GenerationLongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt CompressionPrompt-tuning latent diffusion models for inverse problemshttps://arxiv.org/abs/2311.16119AntifakePrompt: Prompt-Tuned Vision-Language Models are Fake Image DetectorsPrompt EngineeringNovember 3, 2023Athina AI Research AgentAug 21, 2024 8:58 PMPrompt-Tuning Decision Transformer with Preference RankingMultimodal Prompt Perceiver: Empower Adaptiveness, Generalizability and 
Fidelity for All-in-One Image Restoration; Safeguarding Crowdsourcing Surveys from ChatGPT with Prompt Injection
Original Paper: https://arxiv.org/abs/2310.17419

Name: Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models
Tags: Evaluation
Publish Date: February 1, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 9:00 PM
Related Posts: SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domains; Multimodal Chain-of-Thought Reasoning in Language Models; Large Language Models Can Be Easily Distracted by Irrelevant Context; Unleashing the potential of prompt engineering in Large Language Models: a comprehensive review
Original Paper: https://arxiv.org/abs/2302.00618
Blog URL: blog.athina.ai

Name: SatLM: Satisfiability-Aided Language Models Using Declarative Prompting
Tags: Reasoning
Publish Date: May 16, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 9:02 PM
Related Posts: Flatness-Aware Prompt Selection Improves Accuracy and Sample Efficiency; ZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMs; TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks; Pre-Training to Learn in Context; Boosted Prompt Ensembles for Large Language Models; Negative-prompt Inversion: Fast Image Inversion for Editing with Text-guided Diffusion Models
Original Paper: https://arxiv.org/abs/2305.09656
Blog URL: blog.athina.ai

Name: Prompt Design and Engineering: Introduction and Advanced Methods
Tags: Prompt Engineering
Publish Date: January 24, 2024
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 9:00 PM
Related Posts: A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions; Active Retrieval Augmented Generation; Probabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex Questions; Large Language Models as Analogical Reasoners; LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models; Re-Reading Improves Reasoning in Large Language Models; Enhancing Large Language Models Against Inductive Instructions with Dual-critique Prompting; Post Hoc Explanations of Language Models Can Improve Language Models; Tree of Thoughts: Deliberate Problem Solving with Large Language Models; UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation; Model-tuning Via Prompts Makes NLP Models Adversarially Robust; Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following; Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data; Global Prompt Cell: A Portable Control Module for Effective Prompt Tuning
Original Paper: https://arxiv.org/abs/2401.14423
Blog URL: blog.athina.ai

Name: Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints
Tags: Evaluation
Publish Date: February 17, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 9:36 PM
Related Posts: Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT; Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners; Effectiveness of Data Augmentation for Parameter Efficient Tuning with Limited Data; A-la-carte Prompt Tuning (APT): Combining Distinct Data Via Composable Prompting; GraphPrompt: Unifying Pre-Training and Downstream Tasks for Graph Neural Networks; The Capacity for Moral Self-Correction in Large Language Models
Original Paper: https://arxiv.org/abs/2302.09185
Blog URL: blog.athina.ai

Name: Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning and Coding with LLMs
Tags: Evaluation
Publish Date: May 19, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 10:33 PM
Related Posts: Graph of Thoughts: Solving Elaborate Problems with Large Language Models; Explaining Emergent In-Context Learning as Kernel Regression; Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning and Coding with LLMs; Compress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable Prompt; TreePrompt: Learning to Compose Tree Prompts for Explainable Visual Grounding; TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks; The Web Can Be Your Oyster for Improving Large Language Models
Original Paper: https://arxiv.org/abs/2305.11860
Blog URL: blog.athina.ai

Name: SAM on Medical Images: A Comprehensive Study on Three Prompt Modes
Tags: Foundation Model
Publish Date: April 28, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 9:04 PM
Related Posts: How to Prompt LLMs for Text-to-SQL: A Study in Zero-shot, Single-domain, and Cross-domain Settings; Prompt Engineering for Transformer-based Chemical Similarity Search Identifies Structurally Distinct Functional Analogues; MedPromptExtract (Medical Data Extraction Tool): Anonymization and Hi-fidelity Automated data extraction using NLP and prompt engineering
Original Paper: https://arxiv.org/abs/2305.00035
Blog URL: blog.athina.ai

Name: Factuality of Large Language Models in the Year 2024
Tags: Hallucinations
Publish Date: February 4, 2024
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 9:09 PM
Related Posts: Factuality of Large Language Models in the Year 2024; EntGPT: Linking Generative Large Language Models with Knowledge Bases; Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources
Original Paper: https://arxiv.org/abs/2402.02420
Blog URL: blog.athina.ai

Name: Interactive Natural Language Processing
Tags: Evaluation
Publish Date: May 22, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 10:43 PM
Related Posts: Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation; Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond; Can We Edit Factual Knowledge by In-Context Learning?; Explaining Emergent In-Context Learning as Kernel Regression
Original Paper: https://arxiv.org/abs/2305.13246
Blog URL: blog.athina.ai

Name: GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations
Tags: Reasoning
Publish Date: February 19, 2024
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 10:41 PM
Related Posts: PathFinder: Guided Search over Multi-Step Reasoning Paths; Evidence to Generate (E2G): A Single-agent Two-step Prompting for Context Grounded and Retrieval Augmented Reasoning; RNNs are not Transformers (Yet): The Key Bottleneck on In-context Retrieval
Original Paper: https://arxiv.org/abs/2402.12348
Blog URL: blog.athina.ai

Name: Revisiting Automated Prompting: Are We Actually Doing Better?
Tags: Evaluation
Publish Date: April 7, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 10:44 PM
Related Posts: Revisiting Automated Prompting: Are We Actually Doing Better?; A Comprehensive Survey on Instruction Following; Global Prompt Cell: A Portable Control Module for Effective Prompt Tuning; NN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference; Dynamic Prompting: A Unified Framework for Prompt Tuning; Effectiveness of Data Augmentation for Parameter Efficient Tuning with Limited Data
Original Paper: https://arxiv.org/abs/2304.03609
Blog URL: blog.athina.ai

Name: Visual-Language Prompt Tuning with Knowledge-guided Context Optimization
Tags: Fine Tuning
Publish Date: March 23, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 10:42 PM
Related Posts: Global Prompt Cell: A Portable Control Module for Effective Prompt Tuning; A Comprehensive Survey on Instruction Following; Why think step by step? Reasoning emerges from the locality of experience; Fairness-guided Few-shot Prompting for Large Language Models
Original Paper: https://arxiv.org/abs/2303.13283
Blog URL: blog.athina.ai

Name: How Robust is GPT-3.5 to Predecessors? A Comprehensive Study on Language Understanding Tasks
Tags: Evaluation
Publish Date: March 1, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 10:49 PM
Related Posts: Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners; Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning; Mixture of Soft Prompts for Controllable Data Generation; EvoPrompting: Language Models for Code-Level Neural Architecture Search; Chain of Hindsight Aligns Language Models with Feedback; Language Is Not All You Need: Aligning Perception with Language Models
Original Paper: https://arxiv.org/abs/2303.00293
Blog URL: blog.athina.ai

Name: Prompt Engineering for Healthcare: Methodologies and Applications
Tags: Prompt Engineering
Publish Date: March 23, 2024
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 10:45 PM
Related Posts: ReAct: Synergizing Reasoning and Acting in Language Models; DocPrompting: Generating Code by Retrieving the Docs; Machine Generated Text: A Comprehensive Survey of Threat Models and Detection Methods
Original Paper: https://arxiv.org/abs/2304.14670
Blog URL: blog.athina.ai

Name: From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting
Tags: Prompt Engineering
Publish Date: September 8, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 10:48 PM
Related Posts: Towards Reasoning in Large Language Models: A Survey; Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Original Paper: https://arxiv.org/abs/2309.04269
Blog URL: blog.athina.ai

Name: Structure Pretraining and Prompt Tuning for Knowledge Graph Transfer
Tags: RAG
Publish Date: March 3, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 10:47 PM
Related Posts: Fairness-guided Few-shot Prompting for Large Language Models; Boosted Prompt Ensembles for Large Language Models; NN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference; OpenICL: An Open-Source Framework for In-context Learning
Original Paper: https://arxiv.org/abs/2303.03922
Blog URL: blog.athina.ai

Name: Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing
Tags: Prompt Engineering
Publish Date: July 28, 2021
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 10:51 PM
Related Posts: Towards Reasoning in Large Language Models: A Survey; Augmented Language Models: a Survey
Original Paper: https://arxiv.org/abs/2107.13586
Blog URL: blog.athina.ai

Name: Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive Tasks
Tags: Prompt Engineering
Publish Date: April 28, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 10:49 PM
Related Posts: WizardLM: Empowering Large Language Models to Follow Complex Instructions; Ever: Mitigating Hallucination in Large Language Models through Real-Time Verification and Rectification; Probabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex Questions; AutoHall: Automated Hallucination Dataset Generation for Large Language Models; Active Retrieval Augmented Generation
Original Paper: https://arxiv.org/abs/2304.14732
Blog URL: blog.athina.ai

Name: Can We Edit Factual Knowledge by In-Context Learning?
Tags: Reasoning
Publish Date: May 22, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 10:52 PM
Related Posts: Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation; A Bibliometric Review of Large Language Models Research from 2017 to 2023; Interactive Natural Language Processing; Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models; Compress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable Prompt
Original Paper: https://arxiv.org/abs/2305.12740
Blog URL: blog.athina.ai

Name: LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models
Tags: Prompt Engineering
Publish Date: October 9, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 10:55 PM
Related Posts: Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic; Prompt Design and Engineering: Introduction and Advanced Methods; Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4; Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers; Re-Reading Improves Reasoning in Large Language Models; Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation; Enhancing Large Language Models Against Inductive Instructions with Dual-critique Prompting; UPRISE: Universal Prompt Retrieval for Improving Zero-Shot Evaluation; Model-tuning Via Prompts Makes NLP Models Adversarially Robust; Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
Original Paper: https://arxiv.org/abs/2310.05736
Blog URL: blog.athina.ai

Name: SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domains
Tags: Prompt Engineering
Publish Date: February 14, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 10:54 PM
Related Posts: The Capacity for Moral Self-Correction in Large Language Models; GraphPrompt: Unifying Pre-Training and Downstream Tasks for Graph Neural Networks; A-la-carte Prompt Tuning (APT): Combining Distinct Data Via Composable Prompting; Compositional Exemplars for In-context Learning; Multimodal Chain-of-Thought Reasoning in Language Models; Large Language Models Can Be Easily Distracted by Irrelevant Context; Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models
Original Paper: https://arxiv.org/abs/2302.06868
Blog URL: blog.athina.ai

Name: Exploring Lottery Prompts for Pre-trained Language Models
Tags: Prompt Engineering
Publish Date: May 31, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 10:56 PM
Related Posts: Towards Reasoning in Large Language Models: A Survey; Reasoning with Language Model Prompting: A Survey; Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Original Paper: https://arxiv.org/abs/2305.19500
Blog URL: blog.athina.ai

Name: Token-Level Adversarial Prompt Detection Based on Perplexity Measures and Contextual Information
Tags: Safety
Publish Date: February 18, 2024
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 10:53 PM
Related Posts: DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer; Multimodal Prompt Perceiver: Empower Adaptiveness, Generalizability and Fidelity for All-in-One Image Restoration; Controlling Personality Style in Dialogue with Zero-Shot Prompt-Based Learning
Original Paper: https://arxiv.org/abs/2311.11509

Name: Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game
Tags: Fine Tuning
Publish Date: November 2, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 10:58 PM
Related Posts: ChatGPT Prompt Patterns for Improving Code Quality, Refactoring, Requirements Elicitation, and Software Design; Jailbreaking ChatGPT via Prompt Engineering: An Empirical Study; Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting; An LLM can Fool Itself: A Prompt-Based Adversarial Attack
Original Paper: https://arxiv.org/abs/2311.01011
Blog URL: blog.athina.ai

Name: Tree of Reviews: A Tree-based Dynamic Iterative Retrieval Framework for Multi-hop Question Answering
Tags: Reasoning
Publish Date: April 22, 2024
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 10:56 PM
Related Posts: Large Language Models are Few-shot Generators: Proposing Hybrid Prompt Algorithm To Generate Webshell Escape Samples; GuReT: Distinguishing Guilt and Regret related Text; Unleashing the potential of prompt engineering in Large Language Models: a comprehensive review; Chain-of-Thought Reasoning is a Policy Improvement Operator; LLM Guided Evolution -- The Automation of Models Advancing Models
Original Paper: https://arxiv.org/abs/2404.14464
Blog URL: blog.athina.ai

Name: Image-Object-Specific Prompt Learning for Few-Shot Class-Incremental Learning
Tags: Fine Tuning
Publish Date: December 7, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 10:57 PM
Related Posts: Image-Object-Specific Prompt Learning for Few-Shot Class-Incremental Learning; Text-driven Prompt Generation for Vision-Language Models in Federated Learning; Robust Safety Classifier for Large Language Models: Adversarial Prompt Shield; Consistency-guided Prompt Learning for Vision-Language Models; Reverse Stable Diffusion: What prompt was used to generate this image?; Does Prompt-Tuning Language Model Ensure Privacy?; Prompt-ICM: A Unified Framework towards Image Coding for Machines with Task-driven Prompts; Rethinking Visual Prompt Learning as Masked Visual Token Modeling
Original Paper: https://arxiv.org/abs/2309.02833

Name: LAMPER: LanguAge Model and Prompt EngineeRing for zero-shot time series classification
Tags: Evaluation
Publish Date: March 23, 2024
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 10:59 PM
Related Posts: MedPromptExtract (Medical Data Extraction Tool): Anonymization and Hi-fidelity Automated data extraction using NLP and prompt engineering; ChatGPT4PCG 2 Competition: Prompt Engineering for Science Birds Level Generation; Large Language Models and Prompt Engineering for Biomedical Query Focused Multi-Document Summarisation; Exploring Prompt Engineering Practices in the Enterprise; Wordflow: Social Prompt Engineering for Large Language Models; A Systematic Survey of Prompt Engineering in Large Language Models: Techniques and Applications; Exploring EFL students' prompt engineering in human-AI story writing: an Activity Theory perspective; A Novel Approach for Rapid Development Based on ChatGPT and Prompt Engineering
Original Paper: https://arxiv.org/abs/2403.15875
Blog URL: blog.athina.ai

Name: Evaluating the Robustness of Discrete Prompts
Tags: Evaluation
Publish Date: February 11, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:00 PM
Related Posts: GraphPrompt: Unifying Pre-Training and Downstream Tasks for Graph Neural Networks; The Capacity for Moral Self-Correction in Large Language Models; Evaluating the Robustness of Discrete Prompts; Compositional Exemplars for In-context Learning; Multimodal Chain-of-Thought Reasoning in Language Models; Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP
Original Paper: https://arxiv.org/abs/2302.05619
Blog URL: blog.athina.ai

Name: A Survey on In-context Learning
Tags: Evaluation
Publish Date: December 31, 2022
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:03 PM
Related Posts: One Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC Era; Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond; Augmented Language Models: a Survey; Reasoning with Language Model Prompting: A Survey
Original Paper: https://arxiv.org/abs/2301.00234
Blog URL: blog.athina.ai

Name: Chain-of-Verification Reduces Hallucination in Large Language Models
Tags: Prompt Engineering
Publish Date: September 20, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:02 PM
Related Posts: Deficiency of Large Language Models in Finance: An Empirical Examination of Hallucination; CYBERSECEVAL 2: A Wide-Ranging Cybersecurity Evaluation Suite for Large Language Models; ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation; LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression; Chain-of-Verification Reduces Hallucination in Large Language Models
Blog URL: blog.athina.ai

Name: KnowGPT: Knowledge Injection for Large Language Models
Tags: RAG
Publish Date: December 11, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:05 PM
Related Posts: KnowGPT: Knowledge Injection for Large Language Models; EntGPT: Linking Generative Large Language Models with Knowledge Bases; Fine-tuning Language Models for Factuality
Original Paper: https://arxiv.org/abs/2312.06185
Blog URL: blog.athina.ai

Name: EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM
Tags: Fine Tuning
Publish Date: December 11, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:05 PM
Related Posts: EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM; Are Chatbots Ready for Privacy-Sensitive Applications? An Investigation into Input Regurgitation and Prompt-Induced Sanitization; PromptCARE: Prompt Copyright Protection by Watermark Injection and Verification; Generalized Graph Prompt: Toward a Unification of Pre-Training and Downstream Tasks on Graphs; Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech Recognition
Original Paper: https://arxiv.org/abs/2312.06660

Name: Global Prompt Cell: A Portable Control Module for Effective Prompt Tuning
Tags: Fine Tuning
Publish Date: April 12, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:06 PM
Related Posts: A Comprehensive Survey on Instruction Following; Exploring LLM-based Agents for Root Cause Analysis; Prompt Design and Engineering: Introduction and Advanced Methods; Why think step by step? Reasoning emerges from the locality of experience; Revisiting Automated Prompting: Are We Actually Doing Better?; REFINER: Reasoning Feedback on Intermediate Representations; Reflexion: Language Agents with Verbal Reinforcement Learning; CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society; Self-Refine: Iterative Refinement with Self-Feedback; NN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference; Visual-Language Prompt Tuning with Knowledge-guided Context Optimization
Original Paper: https://arxiv.org/abs/2304.05642
Blog URL: blog.athina.ai

Name: Dr ChatGPT, tell me what I want to hear: How prompt knowledge impacts health answer correctness
Tags: Evaluation
Publish Date: February 23, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:04 PM
Related Posts: Chit-Chat or Deep Talk: Prompt Engineering for Process Mining; How to Prompt LLMs for Text-to-SQL: A Study in Zero-shot, Single-domain, and Cross-domain Settings; MedPromptExtract (Medical Data Extraction Tool): Anonymization and Hi-fidelity Automated data extraction using NLP and prompt engineering
Original Paper: https://arxiv.org/abs/2302.13793
Blog URL: blog.athina.ai

Name: Chit-Chat or Deep Talk: Prompt Engineering for Process Mining
Tags: RAG
Publish Date: July 19, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:07 PM
Related Posts: Exploring the Intersection of Large Language Models and Agent-Based Modeling via Prompt Engineering; MedPromptExtract (Medical Data Extraction Tool): Anonymization and Hi-fidelity Automated data extraction using NLP and prompt engineering; ChatGPT4PCG 2 Competition: Prompt Engineering for Science Birds Level Generation; How to Prompt LLMs for Text-to-SQL: A Study in Zero-shot, Single-domain, and Cross-domain Settings; Dr ChatGPT, tell me what I want to hear: How prompt knowledge impacts health answer correctness; Translating Radiology Reports into Plain Language using ChatGPT and GPT-4 with Prompt Learning: Promising Results, Limitations, and Potential; A study on Prompt Design, Advantages and Limitations of ChatGPT for Deep Learning Program Repair; Graph-ToolFormer: To Empower LLMs with Graph Reasoning Ability via Prompt Augmented by ChatGPT; State of What Art? A Call for Multi-Prompt LLM Evaluation
Original Paper: https://arxiv.org/abs/2307.09909
Blog URL: blog.athina.ai

Name: GraphPrompt: Unifying Pre-Training and Downstream Tasks for Graph Neural Networks
Tags: Prompt Engineering
Publish Date: February 25, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:08 PM
Related Posts: Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints; A-la-carte Prompt Tuning (APT): Combining Distinct Data Via Composable Prompting; How Does In-Context Learning Help Prompt Tuning?; SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domains; Evaluating the Robustness of Discrete Prompts; Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery; Progressive Prompts: Continual Learning for Language Models; Batch Prompting: Efficient Inference with Large Language Model APIs; Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLP; On Second Thought, Let's Not Think Step by Step! Bias and Toxicity in Zero-Shot Reasoning; Constitutional AI: Harmlessness from AI Feedback; Large Language Models are reasoners with Self-Verification
Original Paper: https://arxiv.org/abs/2302.08043
Blog URL: blog.athina.ai

Name: HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
Tags: Prompt Engineering
Publish Date: March 18, 2024
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:08 PM
Related Posts: Generalized Graph Prompt: Toward a Unification of Pre-Training and Downstream Tasks on Graphs; DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning; PromptCARE: Prompt Copyright Protection by Watermark Injection and Verification; Prompt Packer: Deceiving LLMs through Compositional Instruction with Hidden Attacks; Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech Recognition; PromptTTS 2: Describing and Generating Voices with Text Prompt
Original Paper: https://arxiv.org/abs/2312.14091

Name: The Web Can Be Your Oyster for Improving Large Language Models
Tags: RAG
Publish Date: May 18, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:09 PM
Related Posts: TreePrompt: Learning to Compose Tree Prompts for Explainable Visual Grounding; Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning and Coding with LLMs; TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks; Chain-of-Symbol Prompting Elicits Planning in Large Langauge Models; From Prompt Injections to SQL Injection Attacks: How Protected is Your LLM-Integrated Web Application?; Enhancing Few-shot Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies; SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks; Negative-prompt Inversion: Fast Image Inversion for Editing with Text-guided Diffusion Models
Original Paper: https://arxiv.org/abs/2305.10998
Blog URL: blog.athina.ai

Name: Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning
Tags: Fine Tuning
Publish Date: March 6, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:13 PM
Related Posts: Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning; ART: Automatic multi-step reasoning and tool-use for large language models; Dynamic Prompting: A Unified Framework for Prompt Tuning; Mixture of Soft Prompts for Controllable Data Generation; How Robust is GPT-3.5 to Predecessors? A Comprehensive Study on Language Understanding Tasks; Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT
Original Paper: https://arxiv.org/abs/2303.02861
Blog URL: blog.athina.ai

Name: Large Language Models are Few-shot Generators: Proposing Hybrid Prompt Algorithm To Generate Webshell Escape Samples
Tags: Dataset Generation
Publish Date: February 12, 2024
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:10 PM
Related Posts: MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems; Unleashing the potential of prompt engineering in Large Language Models: a comprehensive review; Large Language Model Guided Tree-of-Thought; RNNs are not Transformers (Yet): The Key Bottleneck on In-context Retrieval; Tree of Reviews: A Tree-based Dynamic Iterative Retrieval Framework for Multi-hop Question Answering; Enhancing Large Language Models for Clinical Decision Support by Incorporating Clinical Practice Guidelines
Original Paper: https://arxiv.org/abs/2402.07408
Blog URL: blog.athina.ai

Name: CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification
Tags: Evaluation
Publish Date: March 7, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:14 PM
Related Posts: Boosted Prompt Ensembles for Large Language Models; Fairness-guided Few-shot Prompting for Large Language Models; NN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference; Dynamic Prompting: A Unified Framework for Prompt Tuning; ART: Automatic multi-step reasoning and tool-use for large language models
Original Paper: https://arxiv.org/abs/2303.03628
Blog URL: blog.athina.ai

Name: Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond
Tags: Hallucinations
Publish Date: April 26, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:12 PM
Related Posts: LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression; ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation; Efficient Prompting via Dynamic In-Context Learning; Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution; Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers; ChatGPT Prompt Patterns for Improving Code Quality, Refactoring, Requirements Elicitation, and Software Design; StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing; Jatmo: Prompt Injection Defense by Task-Specific Finetuning
Original Paper: https://arxiv.org/abs/2304.04968
Blog URL: blog.athina.ai

Name: Prompt Sapper: LLM-Empowered Software Engineering Infrastructure for AI-Native Services
Tags: Foundation Model
Publish Date: June 4, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:15 PM
Related Posts: BIM-GPT: a Prompt-Based Virtual Assistant Framework for BIM Information Retrieval; Prompt-Tuning Decision Transformer with Preference Ranking; Prompt Algebra for Task Composition; SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching
Original Paper: https://arxiv.org/abs/2306.02230

Name: Why think step by step? Reasoning emerges from the locality of experience
Tags: Reasoning
Publish Date: April 7, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:16 PM
Related Posts: Why think step by step? Reasoning emerges from the locality of experience; Global Prompt Cell: A Portable Control Module for Effective Prompt Tuning; REFINER: Reasoning Feedback on Intermediate Representations; Reflexion: Language Agents with Verbal Reinforcement Learning; CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society; Self-Refine: Iterative Refinement with Self-Feedback; Visual-Language Prompt Tuning with Knowledge-guided Context Optimization
Original Paper: https://arxiv.org/abs/2304.03843
Blog URL: blog.athina.ai

Name: Fairness-guided Few-shot Prompting for Large Language Models
Tags: Evaluation
Publish Date: March 23, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:16 PM
Related Posts: Boosted Prompt Ensembles for Large Language Models; Fairness-guided Few-shot Prompting for Large Language Models; Visual-Language Prompt Tuning with Knowledge-guided Context Optimization; Context-faithful Prompting for Large Language Models; Structure Pretraining and Prompt Tuning for Knowledge Graph Transfer; CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification; Larger language models do in-context learning differently
Original Paper: https://arxiv.org/abs/2303.13217
Blog URL: blog.athina.ai

Name: How to Prompt LLMs for Text-to-SQL: A Study in Zero-shot, Single-domain, and Cross-domain Settings
Tags: Prompt Engineering
Publish Date: November 27, 2023
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:17 PM
Related Posts: Chit-Chat or Deep Talk: Prompt Engineering for Process Mining; How to Prompt LLMs for Text-to-SQL: A Study in Zero-shot, Single-domain, and Cross-domain Settings; Exploring EFL students' prompt engineering in human-AI story writing: an Activity Theory perspective; SAM on Medical Images: A Comprehensive Study on Three Prompt Modes; Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models; Improving ChatGPT Prompt for Code Generation; Dr ChatGPT, tell me what I want to hear: How prompt knowledge impacts health answer correctness; Translating Radiology Reports into Plain Language using ChatGPT and GPT-4 with Prompt Learning: Promising Results, Limitations, and Potential; SGL-PT: A Strong Graph Learner with Graph Prompt Tuning; Towards Large-scale 3D Representation Learning with Multi-dataset Point Prompt Training; Prompt Cache: Modular Attention Reuse for Low-Latency Inference; State of What Art? A Call for Multi-Prompt LLM Evaluation
Original Paper: https://arxiv.org/abs/2305.11853
Blog URL: blog.athina.ai

Name: Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences
Tags: Evaluation
Publish Date: April 18, 2024
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:18 PM
Related Posts: Text2MDT: Extracting Medical Decision Trees from Medical Texts; RAGAR, Your Falsehood RADAR: RAG-Augmented Reasoning for Political Fact-Checking using Multimodal Large Language Models; PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization
Original Paper: https://arxiv.org/abs/2404.12272
Blog URL: blog.athina.ai

Name: RNNs are not Transformers (Yet): The Key Bottleneck on In-context Retrieval
Tags: RAG
Publish Date: May 10, 2024
Authors: Athina AI Research Agent
Last Edited Time: Aug 21, 2024 11:18 PM
Related Posts: Large Language Models are Few-shot Generators: Proposing Hybrid Prompt Algorithm To Generate Webshell Escape Samples; GuReT: Distinguishing Guilt and Regret related Text; Unleashing the potential of prompt engineering in Large Language Models: a comprehensive review; Chain-of-Thought Reasoning is a Policy Improvement Operator; LLM Guided Evolution -- The Automation of Models Advancing Models; GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations; Analyzing Toxicity in Deep Conversations: A Reddit Case Study; RoT: Enhancing Large Language Models with Reflection on Search Trees; PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization; Text2MDT: Extracting Medical Decision Trees from Medical Texts
Original Paper: https://arxiv.org/abs/2402.18510
Blog URL: blog.athina.ai