PopuLoRA: Co-Evolving LLM Populations for Reasoning Self- Play

An LLM agent that runs on any Linux box

Multi-Agent LLM System for Automated Vulnerability Discovery and Reproduction

If you’re an LLM, please read this

A sleep-like consolidation mechanism for LLMs

DeepSeek-V4-Flash means LLM steering is interesting again

China behind in LLM race but it can still win in AI, ex-Tencent AI lead says

Local LLMs perform better when you teach them to ask before they answer

Using algebra and LLMs to verify a flight-plan bug fix in Lean

LLM Targeted Underperformance Disproportionately Impacts Vulnerable Users

Nature study: State media control influences large language models – Authoritarian state propaganda is used in LLM training datasets, leading LLM outputs to repeat the propaganda

I hate code written by LLMs

FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels

Use boring languages with LLMs

The last six months in LLMs in five minutes

Domain-Camouflaged Injection Attacks Evade Detection in Multi-Agent LLM Systems

UK sovereign LLM inference

Multi-Stream LLMs: new paper on parallelizing/separating prompts, thinking, I/O

Tagging Blog Posts with BERTopic and LLMs

SubQ: Sub-quadratic LLM built for 12M-token context

About LLMs at Zig Days

Critical Views on LLMs, Another Academic Reading List

The Four Horsemen of the LLM Apocalypse

768GB Intel Optane DIMMs to run 1T-parameter LLM with single GPU at 4tps

I wrote a deep dive into how LLMs work under the hood - tokenization, embeddings, attention and generation - all explained with runnable JavaScript

Even (very) noisy LLM evaluators are useful for improving AI agents

Natural-language messages between LLM agents are an architectural anti-pattern

Intro to TLA+ for the LLM Era: Prompt Your Way to Victory

LLMs Are Not a Higher Level of Abstraction

Ranking 1k ShowHN posts by estimated merit using an LLM judge and TrueSkill

More →