What is Gemma Scope 2?
Gemma Scope 2 is an open-source interpretability suite from Google DeepMind, released December 19, 2025. It provides sparse autoencoders and transcoders for analyzing the internal activations and behaviors of Gemma 3 models, from 270M to 27B parameters.
When was Gemma Scope 2 released?
It was officially released on December 19, 2025, with weights on Hugging Face, a technical paper, a blog post, and interactive demos available shortly after.
Is Gemma Scope 2 free to use?
Yes. It is free and open source: all weights, code, tutorials, and demos are publicly available under permissive licenses for research and safety work.
What models does Gemma Scope 2 support?
It covers the full Gemma 3 family, from 270M to 27B parameters, including pre-trained and instruction-tuned variants, with sparse autoencoders (SAEs) and transcoders for every layer.
How does Gemma Scope 2 help AI safety?
It helps researchers trace risks such as jailbreaks, hallucinations, sycophancy, and bias by decomposing activations into interpretable features and analyzing reasoning paths.
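To make "decomposing activations into interpretable features" concrete, here is a minimal sketch of the forward pass of a generic sparse autoencoder in plain Python. It is an illustration under stated assumptions, not the Gemma Scope 2 implementation: the ReLU activation, the tiny dimensions, the random weights, and every name (sae_encode, sae_decode, W_enc, W_dec) are hypothetical, and no real Gemma Scope 2 weights or loading APIs are used.

```python
import random


def matvec(x, W, b):
    """Return x @ W + b for a vector x, a row-major matrix W, and a bias b."""
    return [sum(xi * W[i][j] for i, xi in enumerate(x)) + b[j]
            for j in range(len(b))]


def sae_encode(x, W_enc, b_enc):
    """Map one activation vector to non-negative feature activations (ReLU)."""
    return [max(0.0, v) for v in matvec(x, W_enc, b_enc)]


def sae_decode(features, W_dec, b_dec):
    """Reconstruct the activation vector as a weighted sum of decoder rows."""
    return matvec(features, W_dec, b_dec)


def random_matrix(rows, cols, rng, scale=0.1):
    """Random stand-in weights; a trained SAE would supply real ones."""
    return [[rng.gauss(0.0, scale) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
d_model, d_sae = 8, 32  # toy sizes; the feature dictionary is wider than the model
W_enc = random_matrix(d_model, d_sae, rng)
b_enc = [-0.1] * d_sae  # a negative bias pushes many features to exactly zero
W_dec = random_matrix(d_sae, d_model, rng)
b_dec = [0.0] * d_model

x = [rng.gauss(0.0, 1.0) for _ in range(d_model)]  # stand-in for a model activation
features = sae_encode(x, W_enc, b_enc)
reconstruction = sae_decode(features, W_dec, b_dec)

# The indices of the non-zero features are what a researcher would inspect.
active = [j for j, v in enumerate(features) if v > 0.0]
print(f"{len(active)} of {d_sae} features active")
```

In a trained SAE, each active index would correspond to a learned feature that can be inspected and labeled; with the random weights used here the features carry no meaning.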
Where can I try Gemma Scope 2?
There is an interactive demo at neuronpedia.org/gemma-scope-2, Colab notebooks with tutorials, and weights on Hugging Face for local use.
What is new in Gemma Scope 2 compared to the original?
It adds coverage for Gemma 3 models, retrained SAEs/transcoders, skip-transcoders, cross-layer support, and broader safety-focused analysis capabilities.
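For readers unfamiliar with the terms, the sketch below contrasts a transcoder with a skip-transcoder as they are generally described in the interpretability literature: a transcoder reads the input of a layer (typically an MLP block) and predicts that layer's output through sparse features, and a skip-transcoder adds a direct linear path from input to output. The exact parameterization used in Gemma Scope 2 is not given here, so the ReLU activation, the function names, and the hand-picked weights are all assumptions for illustration.

```python
def linear(x, W, b):
    """Return x @ W + b for a vector x, a row-major matrix W, and a bias b."""
    return [sum(xi * W[i][j] for i, xi in enumerate(x)) + b[j]
            for j in range(len(b))]


def transcoder(mlp_in, W_enc, b_enc, W_dec, b_dec):
    """Predict the layer's output from its input via sparse features."""
    features = [max(0.0, v) for v in linear(mlp_in, W_enc, b_enc)]
    return linear(features, W_dec, b_dec), features


def skip_transcoder(mlp_in, W_enc, b_enc, W_dec, b_dec, W_skip):
    """Same as transcoder, plus a linear skip term computed from the input."""
    out, features = transcoder(mlp_in, W_enc, b_enc, W_dec, b_dec)
    skip = linear(mlp_in, W_skip, [0.0] * len(out))
    return [o + s for o, s in zip(out, skip)], features


# Hand-picked toy weights (hypothetical): 2-dimensional input, 3 features.
W_enc = [[1.0, 0.0, -1.0],
         [0.0, 1.0, 1.0]]
b_enc = [0.0, 0.0, 0.0]
W_dec = [[0.5, 0.5],
         [1.0, 0.0],
         [0.0, 1.0]]
b_dec = [0.0, 0.0]
W_skip = [[1.0, 0.0],
          [0.0, 1.0]]  # identity skip, chosen only to make the effect visible

mlp_in = [1.0, -2.0]
plain_out, plain_features = transcoder(mlp_in, W_enc, b_enc, W_dec, b_dec)
skip_out, skip_features = skip_transcoder(mlp_in, W_enc, b_enc, W_dec, b_dec, W_skip)
print("transcoder output:     ", plain_out)
print("skip-transcoder output:", skip_out)
```

The sparse features are identical in both variants; only the output differs, because the skip term handles the linear part of the input-to-output map and leaves the features to explain the remainder.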
Who should use Gemma Scope 2?
It is aimed primarily at AI safety researchers, mechanistic interpretability experts, and teams auditing or aligning large language models such as Gemma 3.




