Single-kernel fusion: fusing sequential GPU dispatches into one yields 159x over PyTorch on the same hardware

Intel, NVIDIA, AMD GPU Drivers Finally Play Nice With ReactOS

Right-sizes LLM models to your system's RAM, CPU, and GPU

I pushed an AMD GPU to its limits for ZKPs: 18ms NTT and 2.5s FRI Proving via Zero-Copy and Algorithmic Dimensionality Reduction

Where are the places I can rent GPU?

Show HN: Horizon – GPU-accelerated infinite-canvas terminal in Rust

Autoresearch: Agents researching on single-GPU nanochat training automatically

AXIOM: Built a sparse dynamic routing architecture for LLM inference entirely in Rust. No ML frameworks, no GPU, 1.2M parameters

Track real-time GPU and LLM pricing across all cloud and inference providers

Track real-time GPU and LLM pricing across all cloud and inference providers