The fastest Linux timestamps

How much linear memory access is enough?

How Much Linear Memory Access Is Enough?

How to make a fast dynamic language interpreter

DRAM has a design flaw from 1966. I bypassed it [video]

I don't care that it's X times faster

Tracking down a 25% Regression on LLVM RISC-V

Investigating Split Locks on x86-64

How we built Elasticsearch simdvec to make vector search one of the fastest in the world

Tailslayer: A technique for reducing tail latency in DRAM operations

Optimizing Ruby Path Methods

Serving the For You Feed

Speeding up access to OpenStreetMap vector tiles

High-performance 2D graphics rendering on the CPU using sparse strips

Reflections on 30 years of HPC programming

Comparing compression tools

Hyper-DERP: C++/io_uring DERP relay - Same throughput as Tailscale's derper, half the cores

Using Perfetto in ZJIT

Idiomatic Lisp and the nbody benchmark

The fastest way to match characters on ARM processors?

Journey in optimising Elixir application

the value of a performance oracle

HT-NRC: A high-throughput and noise-resilient lossless image compression architecture for deep-space CMOS cameras

Introducing profiling-explorer

How to make Firefox builds 17% faster

Why Crystal, 10 Years Later: Performance and Joy

Who even uses jemalloc in 2026 anyway? (many major projects)

Optimising a Pipelined RISC-V Core: From Naive Pipeline to Near-Superscalar Performance

The quest for the perfect 2D sprite pipeline

Topics in High-Performance Messaging
