Landmark Papers

What the papers actually said - linked to the originals.

644 entries, all primary-sourced
paper June 20, 2023

Textbooks Are All You Need

The 2023 Microsoft paper introducing phi-1, a 1.3B code model that beat far larger models by training on 'textbook-quality' data, launching the Phi family.

paper September 1, 2023

RLAIF: Scaling RLHF with AI Feedback

The 2023 Google paper showing AI-generated preference labels can match human ones for RLHF, with a direct variant skipping the reward model.

paper September 21, 2023

The Reversal Curse

The 2023 paper showing LLMs trained on 'A is B' often fail to answer 'B is A', exposing a basic generalization gap.

paper October 6, 2023

Language Agent Tree Search (LATS)

A 2023 method that gives language agents Monte Carlo tree search, so they can plan, act, and reflect by exploring many paths.

paper October 25, 2023

The Data Provenance Initiative

A 2023 audit that traced the licenses and lineage of over 1,800 text datasets and found widespread license misattribution in AI training data.

paper December 14, 2023

Weak-to-Strong Generalization

The 2023 OpenAI paper showing a strong model fine-tuned on a weak model's labels can outperform its weak supervisor, a toy model for superalignment.