The paper that started all this: Attention is All You Need | Google Research

aimecesaire25 · April 5, 2023, 10:00pm

This links to the original Google research paper that started the current revolution in Large language models. It puts forward the transformer as a better alternative to sequence transduction models. It is a bit dated, but relevant if you want to dive deeper. It is also a bit technical but still informative for a curious beginner.
Read it here:

Topic	Replies	Views
Watch an A.I. Learn to Write by Reading Nothing but Shakespeare/Austen etc (Nytimes) News in AI ai-in-writing	1254	April 27, 2023
When AI’s Large Language Models Shrink \| IEEE Spectrum (03/31/2023) News in AI	313	April 14, 2023
AI is teaching us how creatures communicate by Christopher Mims (WSJ, Mar 19, 2022) News in AI	488	April 26, 2022
Researchers Build AI That Builds AI (Quanta Magazine, Jan 25, 2022) News in AI	366	April 11, 2022
Teaching what A.I. is in classrooms \| The New York Times (3/27/23) News in AI ai-news , classroom , chatbots	287	March 28, 2023

The paper that started all this: Attention is All You Need | Google Research

Related topics