Voyager: Real-Time Splatting City-Scale 3D Gaussians on Your Phone

Spherical CNNs (2018)

Large language models often know when they are being evaluated

AbsenceBench: Language models can't tell what's missing

Reinforcement Learning to Train Large Language Models to Explain Human Decisions

Breaking Quadratic Barriers: A Non-Attention LLM for Ultra-Long Context Horizons

Reinforcement Pre-Training

Reasoning by Superposition: A Perspective on Chain of Continuous Thought

Towards Understanding Sycophancy in Language Models

Rethinking Losses for Diffusion Bridge Samplers

SmartAttack: Air-Gap Attack via Smartwatches

Log-Linear Attention

LayerPeeler: Autoregressive Peeling for Layer-Wise Image Vectorization

Simulating Time with Square-Root Space

What do software developers need to know to succeed in an age of AI?

JavelinGuard: Low-Cost Transformer Architectures for LLM Security

Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents

GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents

LLMs replacing human participants harmfully misportray, flatten identity groups

Beyond the Black Box: Interpretability of LLMs in Finance

Self-Adapting Language Models

How to Grow an LSM-tree? Towards Bridging the Gap Between Theory and Practice

Modern Minimal Perfect Hashing: A Survey

Not all tokens are meant to be forgotten

Oh fuck! How do people feel about robots that leverage profanity?

ReasoningGym: Reasoning Environments for RL with Verifiable Rewards

Geometry from Quantum Temporal Correlations

Institutional Books: A 242B token dataset from Harvard Library's collections

Extreme Super-Resolution via Scale Autoregression and Preference Alignment

3D CAD from Images, Text, and Point Clouds with RLVR

More →