progscrape: transformer-circuits.pub

Emotion Concepts and Their Function in a Large Language Model

58 days ago transformer-circuits.pub

When models manipulate manifolds: The geometry of a counting task

7 months ago transformer-circuits.pub

Emergent Introspective Awareness in Large Language Models

7 months ago transformer-circuits.pub

Visual Features Across Modalities: SVG and ASCII Art Cross-Modal Understanding

7 months ago transformer-circuits.pub art svg

The Biology of a Large Language Model

14 months ago transformer-circuits.pub ai biology

Circuit Tracing: Revealing Computational Graphs in Language Models (Anthropic)

14 months ago transformer-circuits.pub

Toy Models of Superposition (2022)

19 months ago transformer-circuits.pub

Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet

24 months ago transformer-circuits.pub

Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet

24 months ago transformer-circuits.pub

Towards Monosemanticity: Decomposing Language Models with Dictionary Learning

2 years ago transformer-circuits.pub

Toy Models of Superposition (2022)

2 years ago transformer-circuits.pub

Toy Models of Superposition

3 years ago transformer-circuits.pub ai

Superposition, Memorization, and Double Descent

3 years ago transformer-circuits.pub