An Extremely Opinionated Annotated List of My Favourite Mechanistic Interpretability Papers v2 (AI Alignment Forum)
Distill: Latest articles about machine learning
Mapping the Mind of a Large Language Model (Anthropic)