Writing an LLM from scratch, part 8 – trainable self-attention

It’s still worth blogging in the age of AI

The benefits of learning in public