0:00 Intro
0:27 AI’s amnesia problem
1:50 Design of deep AI models
3:19 Residual connections
6:50 The genius of current language models
9:03 Applying attention to residuals
13:05 Wondercraft
15:22 Infra problems
17:46 Compute results
18:50 Performance results
20:45 Wider or deeper
22:22 From static to adaptive
Comments (0)