Another Large Black Hole In 'Our' Galaxy

Optimal Bounds for Open Addressing Without Reordering

Parameter-free KV cache compression for memory-efficient long-context LLMs

Electric power generation from Earth's rotation through its own magnetic field

How to Secure Existing C and C++ Software Without Memory Safety [pdf]

Exploring Hidden Reasoning Process of Large Language Models by Misleading Them

SplitQuantV2: Enhancing Low-Bit Quantization of LLMs Without GPUs

Optimizing ML training with metagradient descent

Every Flop Counts: Scaling a 300B LLM Without Premium GPUs

Matrix Calculus (For Machine Learning and Beyond)