By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: Google Releases Gemma Scope 2 to Deepen Understanding of LLM Behavior
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > News > Google Releases Gemma Scope 2 to Deepen Understanding of LLM Behavior
News

Google Releases Gemma Scope 2 to Deepen Understanding of LLM Behavior

News Room
Last updated: 2026/01/12 at 5:02 AM
News Room Published 12 January 2026
Share
Google Releases Gemma Scope 2 to Deepen Understanding of LLM Behavior
SHARE

Gemma Scope 2 is a suite of tools designed to interpret the behavior of Gemini 3 models, enabling researchers to analyze emergent model behaviors, audit and debug AI agents, and devise mitigation strategies against security issues like jailbreaks, hallucinations and sycophancy.

Interpretability research aims to understand the internal workings and learned algorithms of AI models. As AI becomes increasingly more capable and complex, interpretability is crucial for building AI that is safe and reliable.

Google describes Gemma Scope as a microscope for its LLMs. It combines sparse autoencoders (SAEs) and transcoders to let researchers inspect a model’s internal representation, examine what it “thinks” and understand how those internal states shape its behavior. One key use case is inspecting discrepancies between a model’s output and its internal state, which Google says could help surface safety risks.

Gemma Scope 2 extends the original Gemma Scope, which targeted the Gemma 2 family, in several ways. Most notably, it retrained its SAEs and transcoders across every layer of Gemma 3 models, including skip-transcoders and cross-layer transcoders, which are designed to make multi-step computations and distributed algorithms easier to interpret.

Increasing the number of layers, Google explains, directly increases compute and memory requirements, which required to design specialized sparse kernels to keep complexity scaling linearly with the number of layers.

In addition, Google applied a more advanced training technique to improve Gemma Scope 2’s ability to identify more useful concepts, while also addressing several known flaws in the first implementation. Finally, Gemma Scope 2 introduces tools specifically tailored for chatbot analysis, enabling the study of complex, multi-step behaviors, such as jailbreaks, refusal mechanisms, and chain-of-thought faithfulness.

Sparse autoencoders use a pair of encoder and decoder functions to decompose and reconstruct all LLM inputs. Transcoders, on the other hand, are trained to sparsely reconstruct the computations of a multi-layer perceptron (MLP) sublayer, that is to learn how to approximate their output for a given input. This makes them useful for identifying which parts of each layer and sublayer, or more exactly which patterns of activations, are triggered by individual input tokens and or sequences of tokens.

Besides the application to security issues, redditor Mescalian foresees that this research could:

also help inform best practices in other domains, and in the future this technique probably will be used to monitor more intelligent AIs internal reasoning. Right now though it’s most useful for steering capabilities through fine-tuning and other modification of weights.

Similarly to Google, Anthropic and OpenAI also released their own “AI microscopes” tailored for their own models.

Google has released the weights of Gemma Scope 2 on Hugging Face.

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article Disney+ Confirms TikTok-Style Vertical Videos Are Coming to the Platform Later This Year to Attract Gen Z Viewers Disney+ Confirms TikTok-Style Vertical Videos Are Coming to the Platform Later This Year to Attract Gen Z Viewers
Next Article How to Choose the Right Tech Stack for White-Label Web Development How to Choose the Right Tech Stack for White-Label Web Development
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

How to make YouTube load faster on Firefox and Edge
How to make YouTube load faster on Firefox and Edge
Gadget
Apple confirms Google’s Gemini will power new Siri features – 9to5Mac
Apple confirms Google’s Gemini will power new Siri features – 9to5Mac
News
Spec Driven Development: When Architecture Becomes Executable
Spec Driven Development: When Architecture Becomes Executable
News
Semicon China: an expert’s takeaways · TechNode
Semicon China: an expert’s takeaways · TechNode
Computing

You Might also Like

Apple confirms Google’s Gemini will power new Siri features – 9to5Mac
News

Apple confirms Google’s Gemini will power new Siri features – 9to5Mac

2 Min Read
Spec Driven Development: When Architecture Becomes Executable
News

Spec Driven Development: When Architecture Becomes Executable

33 Min Read
Elon Musk's Grok Faces Scrutiny Over Nonconsensual AI-Altered 'Undressed' Images
News

Elon Musk's Grok Faces Scrutiny Over Nonconsensual AI-Altered 'Undressed' Images

12 Min Read
Andy Burnham backs calls for under-16 social media ban – UKTN
News

Andy Burnham backs calls for under-16 social media ban – UKTN

2 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?