
Let's reproduce GPT-2 (124M)

The Book Of Clarity: Building Your Dream Start-Up Using…
Paras Chopra
Designing Machine Learning Systems
Chip Huyen
Natural Language Processing with Transformers, Revised Edition
Lewis Tunstall, Leandro von Werra, Thomas Wolf