Landmark Papers

What the papers actually said - linked to the originals.

644 entries, all primary-sourced
paper January 11, 2024

AMIE: towards conversational diagnostic AI

Google's 2024 AMIE paper described an LLM tuned for diagnostic dialogue that matched or beat primary care doctors in simulated text consultations.

paper March 21, 2024

AI and Memory Wall

Berkeley paper arguing memory bandwidth, not compute, is now the binding constraint on serving large AI models.

paper April 17, 2024

Many-Shot In-Context Learning

The 2024 Agarwal et al. paper showing hundreds or thousands of in-context examples in long contexts beat few-shot prompting.

paper April 30, 2024

KAN: Kolmogorov-Arnold Networks

KANs put learnable activation functions on the edges of a network, aiming for more accurate and interpretable models than MLPs.

paper May 13, 2024

The Platonic Representation Hypothesis

Argues that as models scale, their internal representations converge toward a shared statistical model of reality across modalities.

paper July 31, 2024

The Llama 3 Herd of Models

Meta's 2024 paper describing the Llama 3 family, including a 405B open-weight model that rivals leading closed models.

paper December 27, 2024

DeepSeek-V3 Technical Report

DeepSeek's 2024 report on a 671B-parameter MoE model trained for under 2.8 million GPU hours that rivals top closed models.