2024
- January 2 - Improving Text Embeddings with Large Language Models
- January 9 - How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs
- January 15 - Long-Context Retrieval Models with Monarch Mixer
- January 19 - Fine-grained Hallucination Detection and Editing for Language Models
- January 20 - Using a 7B Model + RAG to Identify and Edit Word-level Hallucinations
- January 24 - A Simple Overview of the LLM Training Steps 🔡
- January 29 - Matryoshka Representation Learning
- February 13 - Retrieval-Augmented Generation for Large Language Models: A Survey
- February 19 - Spotting LLMs With Binoculars: Zero-Shot Detection Of Machine-Generated Text
- April 25 - Retrieval-Augmented Dual Instruction Tuning (RA-DIT)
- April 28 - Visual Instruction Tuning
- June 18 - Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
- June 22 - GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer
- June 29 - Mixture-of-Agents Enhances Large Language Model Capabilities
- June 30 - Token Pooling to Scale Multi-Vector Retrieval Systems
- July 5 - Many-Shot In-Context Learning
- July 7 - Adaptive Retrieval and Scalable Indexing for k-NN Search with Cross-Encoders
- July 14 - RouteLLM: Learning to Route LLMs with Preference Data
- July 22 - Prover-verifier Games Improve Legibility of LLM Outputs
- July 28 - LoRA: Low-Rank Adaptation of Large Language Models
- August 8 - Language Model Distillation
- September 1 - Distillation Experiments
2023
- November 18 - Retrieval meets Long Context Large Language Models
- November 19 - Lost in the Middle: How Language Models Use Long Contexts
- November 22 - Can Large Language Models Infer Causation from Correlation?
- November 23 - The Curse of Recursion: Training on Generated Data Makes Models Forget
- November 25 - A Watermark for Large Language Model
- December 4 - Retrieval-Augmented Multimodal Language Modeling
- December 6 - Long context prompting for Claude 2.1
- December 6 - Who’s Harry Potter? Approximate Unlearning in LLMs
- December 8 - SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
- December 17 - Dense X Retrieval: What Retrieval Granularity Should We Use?
- December 19 - Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs
- December 25 - Discovering the Hidden Vocabulary of DALLE-2