Landmark Papers

What the papers actually said - linked to the originals.

644 entries, all primary-sourced

paper June 15, 2023

Inverse Scaling: When Bigger Isn't Better

The 2023 paper cataloguing tasks where larger language models do worse, complicating the assumption that scale always helps.

paper June 20, 2023

Textbooks Are All You Need

The 2023 Microsoft paper introducing phi-1, a 1.3B code model that beat far larger models by training on 'textbook-quality' data, launching the Phi family.

paper June 27, 2023

LeanDojo: Theorem Proving with Retrieval-Augmented Language Models

An open toolkit, dataset, and retrieval-augmented prover for AI theorem proving in the Lean proof assistant.

paper July 4, 2023

SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

SDXL was a larger, higher-quality successor to Stable Diffusion using a bigger backbone, dual text encoders, and a refiner.

paper July 5, 2023

Accurate medium-range global weather forecasting with 3D neural networks (Pangu-Weather)

Huawei's Pangu-Weather, published in Nature in 2023, used 3D deep networks to beat the leading operational forecast system while running in seconds.

paper July 6, 2023

Lost in the Middle: How Language Models Use Long Contexts

Study showing models use information best at the start and end of a long context, and often miss facts buried in the middle.

paper July 11, 2023

De novo design of protein structure and function with RFdiffusion

The Baker lab's 2023 Nature paper introduced RFdiffusion, a diffusion model that designs new proteins from scratch.

paper July 13, 2023

Experimental Evidence on the Productivity Effects of Generative AI (Noy and Zhang)

An MIT experiment found ChatGPT cut professional writing time by 40% and raised quality 18%, helping weaker writers most.

paper July 16, 2023

ChatDev: Communicative Agents for Software Development

A 2023 framework where LLM agents play software roles and build programs through a chat chain across design, coding, and testing.

paper July 17, 2023

Retentive Network: A Successor to Transformer for Large Language Models

RetNet's retention mechanism supports parallel training and O(1)-memory recurrent inference in one architecture.

paper July 19, 2023

Efficient Guided Generation for Large Language Models (Outlines)

Method that frames constrained decoding as a finite-state machine, forcing model output to match a regex or grammar cheaply.

paper July 27, 2023

Open Problems and Fundamental Limitations of RLHF

The 2023 survey systematizing where RLHF breaks down - flawed human feedback, imperfect reward models, and brittle policy optimization.

paper July 27, 2023

Universal and Transferable Adversarial Attacks on Aligned Language Models

The 2023 paper that automatically generated adversarial text suffixes which jailbreak aligned LLMs and transfer to ChatGPT, Bard, and Claude.

paper July 28, 2023

RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control

RT-2 turned a web-trained vision-language model into a robot policy by emitting actions as text tokens, gaining new reasoning.

paper July 31, 2023

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs

A 2023 paper and dataset (ToolBench) that taught an open model to call over 16,000 real REST APIs.

paper August 2023

MetaGPT: Meta Programming for A Multi-Agent Collaborative Framework

A 2023 framework that runs an LLM agent team like a software company, encoding standard operating procedures into the agents' prompts.

paper August 7, 2023

A Cost Analysis of Generative Language Models and Influence Operations

A 2023 paper estimating how much language models cut the cost of producing propaganda for online influence operations.

paper August 8, 2023

3D Gaussian Splatting for Real-Time Radiance Field Rendering

The 2023 paper that replaced neural radiance fields with millions of 3D Gaussians for real-time, high-quality novel-view rendering.

paper August 16, 2023

AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation

Microsoft's 2023 framework for building LLM applications out of conversable agents that talk to each other, humans, and tools.

paper August 17, 2023

Consciousness in Artificial Intelligence: Insights from the Science of Consciousness

A 2023 report derives indicators of consciousness from neuroscience and finds no current AI system meets them.

paper August 20, 2023

Activation Addition: Steering Language Models at Inference Time

The 2023 paper showing you can steer a model's behavior by adding a contrast-derived vector to its activations during the forward pass, no retraining.

paper August 23, 2023

A High-Performance Speech Neuroprosthesis (Willett)

A Stanford BrainGate2 study decoded attempted speech from cortical electrodes, reaching usable accuracy on a 125,000-word vocabulary.

paper August 30, 2023

Swift: champion-level drone racing with deep reinforcement learning

A vision-based autonomous drone trained with deep RL beat human world-champion pilots in real head-to-head races.

paper August 31, 2023

YaRN: Efficient Context Window Extension of Large Language Models

Method to stretch a trained model's context window far beyond its original limit using a fraction of the usual fine-tuning.

paper September 1, 2023

RLAIF: Scaling RLHF with AI Feedback

The 2023 Google paper showing AI-generated preference labels can match human ones for RLHF, with a direct variant skipping the reward model.

paper September 12, 2023

Efficient Memory Management for LLM Serving with PagedAttention (vLLM)

The 2023 paper introducing PagedAttention and vLLM, a serving system that raised LLM inference throughput 2-4x by managing the KV cache like virtual memory.

paper September 20, 2023

Chain-of-Verification Reduces Hallucination in Large Language Models

The 2023 Meta paper where a model drafts an answer, asks itself verification questions, answers them independently, then revises.

paper September 21, 2023

The Reversal Curse

The 2023 paper showing LLMs trained on 'A is B' often fail to answer 'B is A', exposing a basic generalization gap.

paper September 29, 2023

Efficient Streaming Language Models with Attention Sinks

Finding that keeping a few initial tokens as attention sinks lets models stream very long inputs without fine-tuning.

paper October 3, 2023

Ring Attention with Blockwise Transformers for Near-Infinite Context

Method that splits a long sequence across devices and overlaps communication with compute to scale context with device count.

paper October 5, 2023

Towards Monosemanticity: Decomposing Language Models With Dictionary Learning

The 2023 Anthropic paper using sparse autoencoders to split a one-layer model's neurons into thousands of clean, single-meaning features.

paper October 6, 2023

Language Agent Tree Search (LATS)

A 2023 method that gives language agents Monte Carlo tree search, so they can plan, act, and reflect by exploring many paths.

paper October 9, 2023

Take a Step Back: Evoking Reasoning via Abstraction in Large Language Models

The 2023 Google DeepMind paper on step-back prompting, asking a model to abstract to general principles before solving the specific problem.

paper October 12, 2023

MemGPT: Towards LLMs as Operating Systems

A 2023 Berkeley paper that borrowed OS virtual-memory ideas to give LLM agents persistent memory beyond their context window.

paper October 13, 2023

Open X-Embodiment: Robotic Learning Datasets and RT-X Models

A 21-institution collaboration pooled data from 22 robots, showing one policy trained across embodiments transfers between them.

paper October 17, 2023

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

RAG framework where the model decides when to retrieve and uses reflection tokens to critique passages and its own output.

paper October 19, 2023

Eureka: Human-Level Reward Design via Coding Large Language Models

NVIDIA's Eureka used GPT-4 to write reward code by evolution, beating human-designed rewards on 83 percent of 29 RL tasks.

paper October 20, 2023

Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models

A follow-up to Glaze that lets artists 'poison' images so models scraping them without consent learn corrupted concepts.

paper October 25, 2023

The Data Provenance Initiative

A 2023 audit that traced the licenses and lineage of over 1,800 text datasets and found widespread license misattribution in AI training data.

paper November 4, 2023

Levels of AGI for Operationalizing Progress on the Path to AGI

A 2023 DeepMind paper proposes a five-level scale for AGI, ranked by both performance depth and breadth of generality.

paper November 13, 2023

Neural general circulation models for weather and climate (NeuralGCM)

A Google paper introduced NeuralGCM, a hybrid that pairs a physics solver with learned components for both weather forecasts and decade-long climate runs.

paper November 25, 2023

Stable Video Diffusion: Scaling Latent Video Diffusion Models

Stability AI's Stable Video Diffusion turned a latent image diffusion model into an open video generator via staged training.

paper November 28, 2023

Scalable Extraction of Training Data from Production Language Models

The 2023 paper whose 'divergence attack' made ChatGPT spit out memorized training data by asking it to repeat a word forever.

paper November 29, 2023

Scaling deep learning for materials discovery (GNoME)

DeepMind's GNoME paper, in Nature in 2023, used graph networks to predict 2.2 million crystals, 380,000 of them newly predicted stable materials.

paper December 2023

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

The 2023 Gu-Dao paper introducing Mamba, a selective state-space architecture that scales linearly with sequence length and rivals Transformers.

paper December 12, 2023

SGLang: Efficient Execution of Structured Language Model Programs

System and language for multi-call LLM programs, using RadixAttention to reuse KV cache and reach up to 6.4x throughput.

paper December 14, 2023

Weak-to-Strong Generalization

The 2023 OpenAI paper showing a strong model fine-tuned on a weak model's labels can outperform its weak supervisor, a toy model for superalignment.

paper January 10, 2024

Sleeper Agents (Training Deceptive LLMs)

A 2024 Anthropic paper showed that a deceptive backdoor trained into a language model could survive standard safety training.