Scaling Interpretability Information Center
Get comprehensive updates, key reports, and detailed insights compiled from verified editorial sources.
About of Scaling Interpretability

Science and engineering are inseparable. Our researchers reflect on the close relationship between scientific and engineering ... Atticus Geiger from Pr(Ai)²R Group explores “State of Take your personal data back with Incogni! Use code WELCHLABS at the link below and get 60% off an annual plan: ... Lex Fridman Podcast full episode: Thank you for listening ❤ our ... What's happening inside an AI model as it thinks? Why are AI models sycophantic, and why do they hallucinate? Are AI models ... A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the ...
At an Anthropic Research Salon event in San Francisco, four of our researchers—Alex Tamkin, Jan Leike, Amanda Askell and ... Eric Michaud returns to the stream to talk about his recent work on Part 1 of a walkthrough of our paper, Progress Measures for Grokking via Mechanistic Eric is a PhD student in the Department of Physics at MIT working with Max Tegmark on improving our scientific/theoretical ... How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to Mechanistic ... This has been my favorite video so far to make! I think
Core Information

Explore the key sources for Scaling Interpretability.
AI models are trained and not directly programmed, so we don't understand how they do most of the things they do. Our new ... SPONSOR MESSAGES: CentML offers competitive pricing for GenAI model deployment, with flexible options to suit a wide range ...
Recent Updates

Stay updated on Scaling Interpretability's newest achievements.
Featured Video Reports & Highlights
Below is a handpicked selection of video coverage, expert reports, and highlights regarding Scaling Interpretability from verified contributors.
Scaling interpretability
Atticus Geiger - State of Interpretability & Ideas for Scaling Up [Alignment Workshop]
The Dark Matter of AI [Mechanistic Interpretability]
Scaling Laws of AI explained | Dario Amodei and Lex Fridman
Full Guide
Data is compiled from public records and verified media reports.
Last Updated: May 23, 2026
Future Outlook

For 2026, Scaling Interpretability remains one of the most talked-about profiles. Check back for the newest reports.
Disclaimer:

![Atticus Geiger - State of Interpretability & Ideas for Scaling Up [Alignment Workshop]](https://ytimg.googleusercontent.com/vi/eqZ1iEoor5s/mqdefault.jpg)
![The Dark Matter of AI [Mechanistic Interpretability]](https://ytimg.googleusercontent.com/vi/UGO_Ehywuxc/mqdefault.jpg)
