X is justifiably slow (2022)

Condvars and atomics do not mix

LLM Inference Speed of Light

#include <rules> (2010)

Efficient Jagged Arrays

Fine-grained backface culling

zeux.io - On Proebsting's Law ("Compiler Advances Double Computing Power Every 18 Years")

VPEXPANDB on NEON with Z3

On Proebsting's Law (Compiler Advances Double Computing Power Every 18 Years)

Writing an efficient Vulkan renderer (2020)

AABB from OBB with component-wise abs

Eight Years at Roblox

Writing an Efficient Vulkan Renderer

Three Years of Metal (2019)

Writing an efficient Vulkan renderer

Learning from data

Qgrep Internals