Institutional Books: A 242B token dataset from Harvard Library's collections

Extreme Super-Resolution via Scale Autoregression and Preference Alignment

3D CAD from Images, Text, and Point Clouds with RLVR

Why Understanding Software Cycle Time Is Messy, Not Magic

TradeExpert, a trading framework that employs Mixture of Expert LLMs

A Family of Non-Periodic Tilings, Describable Using Elementary Tools

How much do language models memorize?

Beyond Attention: Toward Machines with Intrinsic Higher Mental States

Open-Source RISC-V: Energy Efficiency of Superscalar, Out-of-Order Execution

TLOB: Dual Attention Transformer Predicts Price Trends from Order Book Data

From tokens to thoughts: How LLMs and humans trade compression for meaning

AI Persona Groupthink Makes Group Talk More Realistic

LongCodeBench: Evaluating Coding LLMs at 1M Context Windows

Towards Bug-Free Distributed Go Programs

ZjsComponent: A Pragmatic Approach to Reusable UI Fragments for Web Development

TPDE: A Fast Adaptable Compiler Back-End Framework

New algorithm beats Dijkstra's time for shortest paths in directed graphs

AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation

Vending-Bench: A Benchmark for Long-Term Coherence of Autonomous Agents

Using Large Language Models for Commit Message Generation: A Preliminary Study

Why it is (nearly) impossible that we live in a simulation

X X^t can be faster

YOLO-World: Real-Time Open-Vocabulary Object Detection

Byte latent transformer: Patches scale better than tokens

Sharp Knives Reduce Onion-Induced Tears By Limiting Droplet Spray, Study Finds

Gradient-Based Program Repair: Fixing Bugs in Continuous Program Spaces

FlowTSE: Target Speaker Extraction with Flow Matching

Outcome-Based Reinforcement Learning to Predict the Future

Atlas: Learning to Optimally Memorize the Context at Test Time

LLMs are more persuasive than incentivized human persuaders

More →