

Why it is (nearly) impossible that we live in a simulation

Byte latent transformer: Patches scale better than tokens

X X^t can be faster

Base Models Beat Aligned Models at Randomness and Creativity

Stop treating 'AGI' as the north-star goal of AI research

Type-constrained code generation with language models

Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions

SPFresh: Incremental In-Place Update for Billion-Scale Vector Search

LLMs get lost in multi-turn conversation

TransMLA: Multi-head latent attention is all you need

Scoring the European Citizen in the AI Era

Toward a Sparse and Interpretable Audio Codec

A Survey of AI Agent Protocols

Backslash: Rate Constrained Optimized Training of Large Language Models

LLMs for Materials and Chemistry: 34 Real-World Examples

Self Rewarding Self Improving: Autonomous LLM Improvement

Matrix-vector multiplication implemented in off-the-shelf DRAM for Low-Bit LLMs

eqsat: An Equality Saturation Dialect for Non-destructive Rewriting

Structuring Competency-Based Courses Through Skill Trees

Human-Like Episodic Memory for Infinite Context LLMs

Absolute Zero: Reinforced Self-Play Reasoning with Zero Data

DoomArena: A Framework for Testing AI Agents Against Evolving Security Threats

The Algebra of Patterns (Extended Version)

Analyzing Modern Nvidia GPU Cores

CMU TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

RVSDG: An Intermediate Representation for Optimizing Compilers (2019)

Non-control-Data Attacks and Defenses: A review

Should We Respect LLMs? A Study on Influence of Prompt Politeness on Performance

My prediction after GPT-4o image generation

arXiv moving from Cornell servers to Google Cloud
