Fast(er) binary search in Rust

Performance Roulette: The Luck of Code Alignment

Compress-a-Palooza: Unpacking 5B Varints in Only 4B CPU Cycles

Paired benchmarking: how to measure performance

Faster binary search in Rust using the Eytzinger layout, branchless code and memory prefetch

How fast can you count to 16 in Rust?

Why Clean Code was (and still is) important?