Loading...

Tag trends are in beta. Feedback? Thoughts? Email me at [email protected]

LLM-Deflate: Extracting LLMs into Datasets

LLMs easily exploited using run-on sentences, bad grammar, image scaling

These psychological tricks can get LLMs to respond to “forbidden” prompts

Defeating Nondeterminism in LLM Inference

VaultGemma: The most capable differentially private LLM

R-Zero: Self-Evolving Reasoning LLM from Zero Data

RustGPT: A pure-Rust transformer LLM built from scratch

I replaced Animal Crossing's dialogue with a live LLM by hacking GameCube memory

Llama-Factory: Unified, Efficient Fine-Tuning for 100 Open LLMs

DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning

You're not using LLMs enough

LLM Visualization

Tiny LLM - LLM Serving in a Week

Apertus 8B and 70B – a new open multilingual LLM from Switzerland

Creating larger projects with LLM (as a coder)

Inside vLLM: Anatomy of a High-Throughput LLM Inference System

Apertus 70B: Truly Open - Swiss LLM by ETH, EPFL and CSCS

A Software Development Methodology for Disciplined LLM Collaboration

The Landscape of Agentic Reinforcement Learning for LLMs

The maths you need to start understanding LLMs

Indirect Prompt Injection Attacks Against LLM Assistants

Experimenting with Local LLMs on macOS

Adaptive LLM routing under budget constraints

Network and Storage Benchmarks for LLM Training on the Cloud

ChatGPT is NOT a LLM – GPT is

SparseLoCo: Communication-Efficient LLM Training

I built an LLM from Scratch in Rust (Just ndarray and rand)

ATC/OSDI '25 Joint Keynote: Accelerating Software Dev: The LLM (R)Evolution [video]

Visual representations in the human brain are aligned with LLMs

Lessons from building an LLM-discoverable data marketplace

More →