Landmark Papers

What the papers actually said - linked to the originals.

644 entries, all primary-sourced
paper January 26, 2023

MusicLM: Generating Music From Text

Google's 2023 MusicLM generated minutes of coherent music from a text caption and released MusicCaps, a dataset of about 5,500 music clips paired with expert-written captions.

paper March 2, 2023

Consistency Models

Consistency models generate images in a single step by mapping noise directly to data, cutting diffusion's many sampling steps.

paper April 5, 2023

Segment Anything (SAM)

Meta's 2023 Segment Anything introduced a promptable segmentation model and SA-1B, a dataset of over 1 billion masks on 11 million images.

paper April 17, 2023

Visual Instruction Tuning (LLaVA)

LLaVA connected a CLIP vision encoder to an LLM and trained it on GPT-4-generated visual instruction data to build an open visual assistant.

paper April 24, 2023

Generative AI at Work

The first large field study of generative AI at work found a 14% average productivity gain for support agents, concentrated among novices.

paper May 29, 2023

Direct Preference Optimization (DPO)

This 2023 Stanford paper showed that preference alignment can be done with a simple classification loss, skipping the reward model and RL of classic RLHF.

paper June 2023

The RefinedWeb Dataset for Falcon LLM

This 2023 TII paper argued that filtered, deduplicated web data alone can train models that beat those trained on curated corpora like The Pile.