Previously, he worked as a research scientist at Google Brain and Information Sciences Institute.
Vaswani is best known for his pioneering contributions in the field of deep learning, most notably the development of the Transformer neural network.
This breakthrough work fundamentally changed the landscape of artificial intelligence and laid the foundation for GPT, BERT, ChatGPT, and their successors.
Vaswani completed his engineering in Computer Science from BIT Mesra in 2002.
[6] The paper introduced the Transformer model, which eschews the use of recurrence in sequence-to-sequence tasks and relies entirely on self-attention mechanisms.