Rakaar's Notes

Search

❯

interp - papers to read

interp - papers to read

Jun 17, 20261 min read

anthropic

Circuit Tracing- Revealing Computational Graphs in Language Models
On the Biology of a Large Language Model

goodfire

Under the Hood of a Reasoning Model
Stochastic Parameter Decomposition

david bau

Locating and Editing Factual Associations in GPT / ROME
MEMIT: Mass Editing Memory in a Transformer
- Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models

geiger

Causal Abstraction: A Theoretical Foundation for Mechanistic Interpretability

practical review

https://arxiv.org/html/2407.02646v2#S8

Graph View

anthropic
goodfire
david bau
geiger
practical review

Backlinks

Daily Paper - 2026-05-29
Daily Paper - 2026-05-30
Daily Paper - 2026-05-31
Rakaar's Notes

Created with Quartz v4.2.3 © 2026

GitHub
Discord Community