Outcome-Based Reinforcement Learning to Predict the Future

Atlas: Learning to Optimally Memorize the Context at Test Time

LLMs are more persuasive than incentivized human persuaders

Comparing Parallel Functional Array Languages: Programming and Performance

The anomalous magnetic moment of the muon in the Standard Model: an update

It is time to stop teaching frequentism to non-statisticians (2012)

SUS backprop: linear backpropagation algorithm for long inputs in transformers

Base Models Beat Aligned Models at Randomness and Creativity

Beyond Semantics: Unreasonable Effectiveness of Reasonless Intermediate Tokens

Robin: A multi-agent system for automating scientific discovery