Async/Await on the GPU

How NVIDIA's CuTe replaces GPU index arithmetic with composable layout algebra

Testing "Raw" GPU Cache Latency

Tiny-gpu-compiler: An educational MLIR-based compiler targeting open-source GPU hardware

Parakeet.cpp – Parakeet ASR inference in pure C++ with Metal GPU acceleration

Show HN: A physically-based GPU ray tracer written in Julia

mdpt: Markdown TUI slides with GPU rendering (not terminal-dependent) — Rust

Numr: A high-performance numerical computing library with GPU acceleration

The Future for Tyr, a Rust GPU Driver for Arm Mali Hardware

Attyx – tiny and fast GPU-accelerated terminal emulator written in Zig