Beating the bookies with their own numbers

Hunyuan-Large: An Open-Source Moe Model with 52B Activated Parameters

WebRL: Training LLM Web Agents via Self-Evolving Online Reinforcement Learning

Ring-Based Mid-Air Gesture Typing System Using Deep Learning Word Prediction

An embarrassingly simple approach to recover unlearned knowledge for LLMs

GenXD: Generating Any 3D and 4D Scenes

Creating Interactive and Embedded Physics Simulations from Static Textbooks

Consistently faster and smaller compressed bitmaps with Roaring (2016)

TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters

Designing a Home Radio Telescope for 21 Cm Emission

Spann: Highly-Efficient Billion-Scale Approximate Nearest Neighbor Search (2021)

1-Bit AI Infrastructure

QUIC is not quick enough over fast internet

Guide to Fine-Tuning LLMs

Chain-of-thought can hurt performance on tasks where thinking makes humans worse

Scheduling Languages: A Past, Present, and Future Taxonomy

LLMs know more than they show: On the intrinsic representation of hallucinations

Representing web applications as knowledge graphs

Representing Knowledge and Querying Data using Double-Functorial Semantics

Breaking Bad: How Compilers Break Constant-Time~Implementations

LLM-Assisted Static Analysis for Detecting Security Vulnerabilities

MM1.5: Methods, Analysis and Insights from Multimodal LLM Fine-Tuning

Identifying factors contributing to "bad days" for software developers

Universal optimality of Dijkstra via beyond-worst-case heaps

State-space models can learn in-context by gradient descent

MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering

Building a simple oscillator based Ising machine for research and education

Transformers Utilization in Chart Understanding: A Review of Advances and Future

Machine Learning to Computational Plasma Physics Reduced-Order Plasma Modeling

Crux, a Precise Verifier for Rust and Other Languages

More →