Initial CUDA Performance Lessons

Related Stories

I’m Open-Sourcing my Custom Benchmark GUI

DPS8M Performance

Lessons From Cursor's System Prompt

Lightweight High-Performance Declarative API Gateway Management with middlewares

GitHub - observ33r/object-equals: A high-performance and engine-aware deep equality utility.

Faster route propagation by rewriting our Traefik gateway in Rust

Technical Guide To System Calls: Implementation And Signal Handling In Modern Operating systems

Using TLA+ in the Real World to Understand a Glibc Bug (2020)

Kid needing lessons

Lessons from Harlem

Performance Profile Visualization Challenge

A Performance Investigation Challenge

Initial USA Unemployment Claims

Comma 3X: Initial Impressions

We Made CUDA Optimization Suck Less

Rust CUDA May 2025 project update

Redesigning the Initial Bootstrap Sequence

Faster sorting with SIMD CUDA intrinsics (2024)

Good Performance for Bad Days

Some Life Lessons from VAX/VMS (2013)

Acronis True Image Costs Performance When Not Used

Gateway Books: The lessons of a defunct canon

Lessons Learned from 12 Years of Programming Experience

My Initial Impressions of Go + A From-Scratch Project

Improving performance of rav1d video decoder

Jetrelay: A high-performance ATproto relay in 500 LOC

Microbes in Gowanus teach lessons on fighting industrial pollution

Solving physics-based initial value problems with unsupervised machine learning

Lessons from Mixing Rust and Java: Fast, Safe, and Practical

[Project Share] Whisper for Windows - GPU-accelerated speech recognition with NVIDIA CUDA support