🔗 How to scale your Large Language Model: Efficient use of hardware is important to get more out of computing (duh!), and the way to do that is to increase the number of chips used for training or inference while achieving a proportional, linear increase in throughput.

Multiple people on X recommended this online book.