Landmark Papers

What the papers actually said - linked to the originals.

644 entries, all primary-sourced

paper June 12, 2017

Deep Reinforcement Learning from Human Preferences

The 2017 paper that trained RL agents from human comparisons of trajectory segments instead of a hand-coded reward function - the seed of RLHF.

paper June 19, 2017

Towards Deep Learning Models Resistant to Adversarial Attacks

The 2017 Madry paper that framed robustness as min-max optimization and made PGD adversarial training the standard defense.

paper June 28, 2017

CatBoost: Unbiased Boosting with Categorical Features

Yandex researchers' 2017 paper introducing CatBoost, a gradient-boosting library that handles categorical features and reduces a subtle target-leakage bias.

paper July 20, 2017

Proximal Policy Optimization (PPO)

OpenAI's 2017 PPO paper introduced a simple, stable policy-gradient method that became the default RL algorithm, including for RLHF.

paper July 21, 2017

A Distributional Perspective on RL (C51)

The 2017 C51 paper proposed learning the full distribution of returns, not just their average, improving Atari agents.

paper July 27, 2017

Robust Physical-World Attacks on Deep Learning Models (Stop Sign Attack)

The 2017 paper that fooled a vision system into misreading a real stop sign as a speed-limit sign using only black-and-white stickers.

paper August 16, 2017

Neural Collaborative Filtering

Replaced matrix factorization's fixed dot product with a neural network that learns user-item interactions.

paper August 17, 2017

Deep & Cross Network for Ad Click Predictions

A network that learns explicit feature combinations automatically, built for predicting ad clicks.

paper August 22, 2017

BadNets: Identifying Vulnerabilities in the Machine Learning Model Supply Chain

The 2017 paper that introduced backdoor attacks, showing a model can behave normally yet misfire on inputs carrying a hidden trigger.

paper September 5, 2017

Squeeze-and-Excitation Networks (SENet)

The 2017 paper introducing a lightweight block that lets a network reweight its feature channels by importance, winning that year's ImageNet contest.

paper October 6, 2017

Rainbow: Combining Improvements in Deep RL

The 2017 Rainbow paper combined six separate DQN improvements into one agent that set a new Atari benchmark record.

paper October 10, 2017

Mixed Precision Training

NVIDIA and Baidu paper showing deep networks can train in 16-bit floating point at full accuracy, halving memory use.

paper October 24, 2017

One Pixel Attack for Fooling Deep Neural Networks

A 2017 paper showing that changing a single pixel can make an image classifier confidently give the wrong answer.

paper October 25, 2017

mixup: Beyond Empirical Risk Minimization

The 2017 paper introducing mixup, which trains on blended pairs of examples and labels to improve generalization and robustness.

paper October 27, 2017

Progressive Growing of GANs (ProGAN)

NVIDIA's ProGAN grew generator and discriminator layer by layer to produce stable, high-resolution synthetic faces.

paper October 30, 2017

Graph Attention Networks (GAT)

GAT brings self-attention to graphs, letting each node weigh its neighbors instead of treating them all equally.

paper October 30, 2017

Practical Secure Aggregation for Privacy-Preserving Machine Learning

A cryptographic protocol that lets a server sum thousands of devices' model updates without seeing any single device's update.

paper October 31, 2017

Unsupervised Machine Translation Using Monolingual Corpora Only

A 2017 paper showed a model could learn to translate between two languages using only unpaired text, no parallel sentences at all.

paper November 2017

AI and the Modern Productivity Paradox

Brynjolfsson, Rock and Syverson explain why AI's promise and flat productivity statistics can coexist: implementation lags.

paper November 2, 2017

Neural Discrete Representation Learning (VQ-VAE)

VQ-VAE learned discrete latent codes via vector quantization, a building block for later image and audio generation.

paper November 6, 2017

A Survey on Dialogue Systems: Recent Advances and New Frontiers

A survey of deep-learning dialogue systems that frames the field's main split between task-oriented and open-domain chatbots.

paper November 11, 2017

Software 2.0 (Andrej Karpathy, 2017)

Andrej Karpathy's 2017 essay arguing neural network weights are a new kind of software that replaces hand-written code.

paper November 14, 2017

CheXNet: radiologist-level pneumonia detection on chest X-rays

Stanford's 2017 CheXNet paper trained a 121-layer CNN on 100,000-plus chest X-rays and reported exceeding radiologists at detecting pneumonia.

paper November 14, 2017

Decoupled Weight Decay Regularization (AdamW)

The 2017 paper showing weight decay and L2 regularization differ for Adam, and introducing AdamW, now standard for training large models.

paper November 27, 2017

Population Based Training of Neural Networks

DeepMind's 2017 method that jointly trains a population of models and evolves their hyperparameters into adaptive schedules during training.

paper November 28, 2017

Parallel WaveNet: Fast High-Fidelity Speech Synthesis

DeepMind's 2017 Parallel WaveNet distilled the slow WaveNet into a parallel network fast enough to ship in Google Assistant.

paper November 28, 2017

Snorkel: Rapid Training Data Creation with Weak Supervision

The Stanford system that builds training labels from noisy heuristics ('labeling functions') instead of hand-labeling, then denoises them automatically.

paper November 30, 2017

TCAV (Testing with Concept Activation Vectors)

The 2018 Kim paper that measures how much a human-defined concept like 'stripes' influences a neural network's prediction.

paper December 4, 2017

LightGBM: A Highly Efficient Gradient Boosting Decision Tree

Microsoft's 2017 NeurIPS paper introducing LightGBM, a gradient-boosting library that trains far faster on large datasets using GOSS and EFB.

paper December 13, 2017

Identifying Exoplanets with Deep Learning (Shallue and Vanderburg)

The 2017 paper that used a convolutional neural network to find Kepler-90i and Kepler-80g in Kepler light curves, published in The Astronomical Journal.

paper December 16, 2017

Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions (Tacotron 2)

Google's 2017 Tacotron 2 reached a 4.53 naturalness score, near the 4.58 of professionally recorded human speech.

paper December 16, 2017

Ray: A Distributed Framework for Emerging AI Applications

The 2017 paper introducing Ray, a distributed execution engine for AI workloads that scaled past 1.8 million tasks per second on a single unified interface.

paper December 27, 2017

Adversarial Patch

A 2017 paper introducing a printable sticker that, placed in any scene, makes image classifiers report an attacker-chosen object.

paper 2018

Artificial Intelligence, Automation and Work

Acemoglu and Restrepo's task framework showing automation displaces labor but new labor-intensive tasks can reinstate it.

paper 2018

Prophet: Forecasting at Scale

Taylor and Letham's Prophet, a Facebook forecasting tool that fits trend, seasonality, and holidays so analysts can produce decent forecasts without expertise.

paper January 2, 2018

Deep Learning: A Critical Appraisal

Gary Marcus's 2018 paper lists ten limits of deep learning and argues it must be combined with symbolic methods.

paper January 4, 2018

Soft Actor-Critic (SAC)

The 2018 SAC paper introduced a stable, sample-efficient off-policy RL method that maximizes both reward and action entropy.

paper January 18, 2018

Universal Language Model Fine-tuning for Text Classification (ULMFiT)

The 2018 fast.ai paper showing a pre-trained language model could be fine-tuned to any NLP task, bringing transfer learning to language.

paper January 21, 2018

Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification

The study that measured how badly commercial face-analysis tools misread darker-skinned women compared with lighter-skinned men.

paper February 5, 2018

IMPALA: Scalable Distributed Deep-RL

The 2018 IMPALA paper introduced a distributed RL architecture with V-trace correction for high-throughput, multi-task training.

paper February 9, 2018

UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction

McInnes, Healy, and Melville's 2018 paper introducing UMAP, a fast dimensionality-reduction method that rivals t-SNE and keeps more global structure.

paper February 15, 2018

Deep Contextualized Word Representations (ELMo)

The 2018 NAACL paper introducing ELMo, word vectors that change with context, ending the one-fixed-vector-per-word era of word2vec and GloVe.

paper February 15, 2018

Horovod: fast and easy distributed deep learning in TensorFlow

Uber's 2018 paper introducing Horovod, which uses ring-allreduce to scale data-parallel training across many GPUs with only a few lines of code change.

paper February 18, 2018

Trojaning Attack on Neural Networks

A 2018 NDSS paper showing an attacker can implant a trojan trigger into a trained network without access to its original training data.

paper February 19, 2018

Predicting cardiovascular risk from retinal photographs via deep learning

Google's 2018 study showed deep learning could read age, sex, smoking status, and blood pressure from retinal photos, hinting at non-invasive risk screening.

paper February 23, 2018

WaveRNN: Efficient Neural Audio Synthesis

DeepMind's 2018 WaveRNN compressed neural vocoding into a small recurrent network fast enough to synthesize speech on a mobile CPU.

paper February 26, 2018

Twin Delayed DDPG (TD3)

The 2018 TD3 paper fixed the overestimation bias that made DDPG unstable, using twin critics and delayed policy updates.

paper March 9, 2018

The Lottery Ticket Hypothesis

The 2018 Frankle-Carbin paper proposing that dense networks contain small 'winning ticket' subnetworks that train to full accuracy on their own.