Google DeepMind releases DiffusionGemma, a model that runs local AI 4x faster
Diffusion AI is most common in image generation, but it can make text outputs much faster.
Another day, another AI model from Google. This time, Google DeepMind has released a new member of the Gemma 4 open model family, but it's fundamentally different from the rest of the lineup. DiffusionGemma doesn't generate outputs linearly like most AI models. Instead, it can produce an entire block of text in parallel. Google says this makes it faster and more efficient when running on local hardware like an Nvidia DGX or a humble gaming GPU.

Most AI models are designed to be autoregressive: they generate text left to right, one token at a time. DiffusionGemma has more in common with image generation models, which start with static and then denoise it to create the desired content. This model starts with a field of placeholder tokens and runs over the canvas multiple times, generating likely tokens and using those to improve its estimates of the others. At the end of the process, the model finalizes its token outputs in one large block, the "denoised" text canvas.

DiffusionGemma is fairly large in the realm of Google's open models. It's a Mixture of Experts (MoE) model with a total of 26 billion parameters, but only 3.8 billion are activated during inference. That means it should fit in the 18GB RAM allotment of a high-end GPU. In testing with an RTX 5090, DiffusionGemma spits out around 700 tokens per second. With a single Nvidia H100 AI accelerator, DiffusionGemma can produce 1,000+ tokens per second. That's about four times the output of the similarly sized autoregressive Gemma models.
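To make the multi-pass denoising idea concrete, here is a minimal toy sketch in Python. It is not DiffusionGemma's actual algorithm, which the article does not detail. The names `MASK`, `toy_predict`, `denoise`, and `tokens_per_step` are all hypothetical, and the "model" simply reads from a known target sentence, with a made-up confidence score that rises as neighboring tokens get filled in. The only point is to show how committing several tokens per pass takes fewer passes than generating one token at a time.

```python
MASK = "<mask>"  # hypothetical placeholder token


def toy_predict(canvas, target):
    """Stand-in for a real model (assumption: it just reads the target).

    Returns {position: (token, confidence)} for every masked position.
    Confidence rises when adjacent positions are already filled in.
    """
    proposals = {}
    for i, tok in enumerate(canvas):
        if tok != MASK:
            continue
        neighbors = [canvas[j] for j in (i - 1, i + 1) if 0 <= j < len(canvas)]
        known = sum(1 for n in neighbors if n != MASK)
        proposals[i] = (target[i], 0.5 + 0.25 * known)
    return proposals


def denoise(target, tokens_per_step=3):
    """Fill a fully masked canvas over several passes.

    Each pass scores all masked positions at once, then commits only the
    most confident proposals (ties broken by position).
    """
    canvas = [MASK] * len(target)
    steps = 0
    while MASK in canvas:
        proposals = toy_predict(canvas, target)
        ranked = sorted(proposals.items(), key=lambda kv: (-kv[1][1], kv[0]))
        for position, (token, _confidence) in ranked[:tokens_per_step]:
            canvas[position] = token
        steps += 1
    return canvas, steps


if __name__ == "__main__":
    sentence = "the quick brown fox jumps over the lazy dog".split()
    result, passes = denoise(sentence, tokens_per_step=3)
    print(" ".join(result), "| passes:", passes)
```

In this toy setup, a nine-token sentence is completed in three passes instead of the nine steps a strictly left-to-right loop would need. Real throughput depends on the model and hardware, so the ratio here should not be read as a reproduction of the reported 4x figure.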
Source: Ars Technica