Explaining and Harnessing Adversarial Examples
The 2014 paper behind the famous panda example: it explained adversarial examples as a result of neural networks' linearity and introduced FGSM.
What the papers actually said - linked to the originals.
The 2014 paper behind the famous panda example: it explained adversarial examples as a result of neural networks' linearity and introduced FGSM.
The 2014 paper introducing Adam, the adaptive optimizer that became the default for training deep neural networks.
SRCNN, the 2014 paper that first applied a deep convolutional network end-to-end to single-image super-resolution.
Show, Attend and Tell added visual attention to captioning, letting the model fix its gaze on the object it was naming word by word.
The 2015 paper introducing batch normalization, which let much deeper networks train far faster and more reliably.
The 2015 TRPO paper made policy-gradient training stable with a trust region, paving the way for PPO and modern deep RL.
The 2015 Hinton-Vinyals-Dean paper that formalized knowledge distillation, training a small student model to mimic a large model's soft outputs.
A 2015 paper showed a rotation-invariant CNN could classify galaxy shapes from Galaxy Zoo crowdsourced labels at over 99% agreement.
The 2015 paper that sped up region-based object detection by running the convolutional network once per image and sharing features across regions.
The 2015 Schmidhuber-lab paper that used learned gates to let gradients flow through very deep networks, a direct precursor to ResNet's skip connections.
The 2015 VQA paper defined free-form visual question answering and shipped a dataset of 0.25M images with 0.76M questions.
The 2015 U-Net paper introduced an encoder-decoder with skip connections for image segmentation, now a backbone of diffusion image generators.
The 2015 paper that built region proposals into the network itself, making accurate object detection nearly real-time and fully end-to-end.
The 2015 GAE paper introduced a variance-reduction technique for policy gradients that underpins methods like PPO.
The 2015 YOLO paper reframed object detection as a single regression pass, hitting 45 frames per second and making real-time detection mainstream.
Google's 2015 FaceNet learned a face embedding with a triplet loss and reached 99.63% on the LFW benchmark.
The 2015 paper that introduced BookCorpus, a collection of free e-books that later quietly trained BERT, GPT, and many early language models.
David Autor's argument that automation both substitutes for and complements labor, which is why employment persists despite automation.
A 2015 Google paper that transcribed speech to characters with an attention-based encoder-decoder, no separate phoneme or HMM stage.
Introduces global and local attention for translation, with simpler scoring that became widely adopted.
Adapts a compression algorithm to split rare words into subword units, solving open-vocabulary translation.
chrF is a 2015 translation metric that compares text at the character level, working better than BLEU for morphologically rich languages.
DeepMind's 2015 DDPG paper extended deep Q-learning to continuous action spaces, enabling RL control of simulated robots from pixels.
The 2015 Double DQN paper showed standard deep Q-learning overestimates values and fixed it with a simple decoupling trick.
A 2015 paper showing that an ML model's confidence scores can be used to reconstruct sensitive training inputs, even faces.
The 2015 prioritized replay paper showed RL agents learn faster by replaying important past experiences more often.
DCGAN gave GANs a stable convolutional architecture, making adversarial image generation practical and reproducible.
The 2015 dueling network paper split a Q-network into separate value and advantage streams, improving Atari performance.
Used recurrent neural networks to recommend the next item from a short browsing session alone.
Feurer et al.'s 2015 NeurIPS paper introducing auto-sklearn, which automatically selects, configures, and ensembles ML pipelines.
Google's influential 2015 NeurIPS paper arguing that the model is a small part of a real ML system and most cost is hidden maintenance debt.
The 2015 paper that detected objects in one network pass using default boxes across multiple feature scales, trading a little accuracy for big speed.
The 2015 Microsoft paper that introduced residual connections, letting networks grow to hundreds of layers and winning that year's ImageNet contest.
DeepMind's 2016 A3C paper trained deep RL agents in parallel on CPUs, beating the state of the art on Atari in half the time.
A 2016 paper showing adversarial examples transfer: a local substitute model can fool a remote classifier you cannot see inside.
The 2016 Ribeiro paper that explains any classifier's individual predictions by fitting a simple model in a local neighborhood.
The original federated learning paper, introducing the FedAvg algorithm that trains a shared model without moving raw data off devices.
Cohen and Welling extended convolution to respect rotations and reflections, not just translations, improving sample efficiency.
Chen and Guestrin's 2016 paper introducing XGBoost, the scalable gradient-boosting system that dominated tabular machine learning and Kaggle.
Li et al.'s 2016 method that speeds up tuning by giving more compute to promising configurations and killing weak ones early.
The graph-based nearest-neighbor algorithm that powers fast vector search in most vector databases.
The 2016 paper documenting Theano, the pioneering Python math compiler from MILA that seeded the modern deep learning framework era.
Google's 2016 systems paper describing TensorFlow's dataflow-graph architecture for running machine learning across CPUs, GPUs, and TPUs at cluster scale.
The 2016 CHIRP paper introduced a computational-imaging method for reconstructing black hole images from sparse VLBI data.
Robin Hanson's 2016 book forecasts an economy run by emulated human brains, copied and sped up at will.
Showed a neural network can run on homomorphically encrypted inputs, making predictions a cloud server cannot read.
A 2016 paper that reframed AI safety around five concrete engineering problems in present-day machine learning systems rather than far-off speculation.
Google's recommender that combines a memorizing linear model with a generalizing neural network.