Landmark Papers

What the papers actually said - linked to the originals.

644 entries, all primary-sourced
paper April 30, 2015

Fast R-CNN

The 2015 paper that sped up region-based object detection by running the convolutional network once per image and sharing features across regions.

paper May 3, 2015

Highway Networks

The 2015 Schmidhuber-lab paper that used learned gates to let gradients flow through very deep networks, a direct precursor to ResNet's skip connections.

paper May 3, 2015

VQA: Visual Question Answering

The 2015 VQA paper defined free-form visual question answering and shipped a dataset of 0.25M images with 0.76M questions.

paper June 8, 2015

You Only Look Once (YOLO)

The 2015 YOLO paper reframed object detection as a single regression pass, hitting 45 frames per second and making real-time detection mainstream.

paper June 22, 2015

BookCorpus and Aligning Books and Movies

The 2015 paper that introduced BookCorpus, a collection of free e-books that later quietly trained BERT, GPT, and many early language models.

paper August 5, 2015

Listen, Attend and Spell

A 2015 Google paper that transcribed speech to characters with an attention-based encoder-decoder, no separate phoneme or HMM stage.

paper September 22, 2015

Double DQN

The 2015 Double DQN paper showed standard deep Q-learning overestimates values and fixed it with a simple decoupling trick.

paper November 18, 2015

Prioritized Experience Replay

The 2015 prioritized replay paper showed RL agents learn faster by replaying important past experiences more often.

paper December 8, 2015

SSD: Single Shot MultiBox Detector

The 2015 paper that detected objects in one network pass using default boxes across multiple feature scales, trading a little accuracy for big speed.

paper March 9, 2016

XGBoost: A Scalable Tree Boosting System

Chen and Guestrin's 2016 paper introducing XGBoost, the scalable gradient-boosting system that dominated tabular machine learning and Kaggle.

paper June 21, 2016

Concrete Problems in AI Safety

A 2016 paper that reframed AI safety around five concrete engineering problems in present-day machine learning systems rather than far-off speculation.