progscrape: arxiv.org

Institutional Books: A 242B token dataset from Harvard Library's collections

22 days ago arxiv.org

Extreme Super-Resolution via Scale Autoregression and Preference Alignment

28 days ago arxiv.org

3D CAD from Images, Text, and Point Clouds with RLVR

31 days ago arxiv.org 3d

Why Understanding Software Cycle Time Is Messy, Not Magic

26 days ago arxiv.org

TradeExpert, a trading framework that employs Mixture of Expert LLMs

31 days ago arxiv.org llm

A Family of Non-Periodic Tilings, Describable Using Elementary Tools

23 days ago arxiv.org

How much do language models memorize?

30 days ago arxiv.org

Beyond Attention: Toward Machines with Intrinsic Higher Mental States

32 days ago arxiv.org

Open-Source RISC-V: Energy Efficiency of Superscalar, Out-of-Order Execution

17 days ago arxiv.org risc

TLOB: Dual Attention Transformer Predicts Price Trends from Order Book Data

30 days ago arxiv.org

From tokens to thoughts: How LLMs and humans trade compression for meaning

28 days ago arxiv.org compression llm

AI Persona Groupthink Makes Group Talk More Realistic

31 days ago arxiv.org ai

LongCodeBench: Evaluating Coding LLMs at 1M Context Windows

27 days ago arxiv.org llm windows

Towards Bug-Free Distributed Go Programs

13 days ago arxiv.org go

ZjsComponent: A Pragmatic Approach to Reusable UI Fragments for Web Development

17 days ago arxiv.org web

TPDE: A Fast Adaptable Compiler Back-End Framework

35 days ago arxiv.org compiler pdf rust

New algorithm beats Dijkstra's time for shortest paths in directed graphs

36 days ago arxiv.org algorithm

AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation

36 days ago arxiv.org

Vending-Bench: A Benchmark for Long-Term Coherence of Autonomous Agents

37 days ago arxiv.org ai pdf science

Using Large Language Models for Commit Message Generation: A Preliminary Study

43 days ago arxiv.org vibecoding

Why it is (nearly) impossible that we live in a simulation

59 days ago arxiv.org

X X^t can be faster

48 days ago arxiv.org math

YOLO-World: Real-Time Open-Vocabulary Object Detection

33 days ago arxiv.org

Byte latent transformer: Patches scale better than tokens

52 days ago arxiv.org

Sharp Knives Reduce Onion-Induced Tears By Limiting Droplet Spray, Study Finds

41 days ago arxiv.org science

Gradient-Based Program Repair: Fixing Bugs in Continuous Program Spaces

37 days ago arxiv.org

FlowTSE: Target Speaker Extraction with Flow Matching