Ashish Vaswani is a computer scientist best known as the first author of “Attention Is All You Need,” submitted to arXiv on June 12, 2017 by eight researchers then at Google. The paper introduced the Transformer, a network architecture built entirely on attention mechanisms that dispensed with the recurrence and convolutions used by earlier sequence models. The Transformer became the foundation for essentially every modern large language model, including the GPT, Gemini, Claude, and Llama families.
Vaswani worked at Google Brain at the time of the paper. He later co-founded Adept AI, a startup focused on AI agents that operate software, and subsequently founded Essential AI. Like several of his Transformer co-authors, he left Google to build companies on the technology the paper started.
Why business readers should care: A single 2017 paper reset the entire direction of AI, and its lead author is one of a small group whose careers illustrate how a research breakthrough spawns a wave of startups. The Transformer is the architecture behind the AI tools businesses now use daily.