Writing an LLM from scratch, part 8 – trainable self-attention

Writing an LLM from scratch, part 10 – dropout

It’s still worth blogging in the age of AI

The benefits of learning in public