News Grower

Independent coverage of AI, startups, and technology.

Ars Technica Apr 2, 2026 at 16:01 Big Tech

Google announces Gemma 4 open AI models, switches to Apache 2.0 license

Gemma 4 brings the first major update to Google's open models in a year.

By Ryan Whitwam Original source
Google announces Gemma 4 open AI models, switches to Apache 2.0 license

Google's Gemini AI models have improved by leaps and bounds over the past year, but you can only use Gemini on Google's terms. The company's Gemma open-weight models have provided more freedom, but Gemma 3, which launched over a year ago, is getting a bit long in the tooth. Starting today, developers can start working with Gemma 4, which comes in four sizes optimized for local usage. Google has also acknowledged developer frustrations with AI licensing, so it's dumping the custom Gemma license. Like past versions of its open-weight models, Google has designed Gemma 4 to be usable on local machines. That can mean plenty of things, of course. The two large Gemma variants, 26B Mixture of Experts and 31B Dense, are designed to run unquantized in bfloat16 format on a single 80GB Nvidia H100 GPU. Granted, that's a $20,000 AI accelerator, but it's still local hardware. If quantized to run at lower precision, these big models will fit on consumer GPUs. Google also claims it has focused on reducing latency to really take advantage of Gemma's local processing. The 26B Mixture of Experts model activates only 3.8 billion of its 26 billion parameters in inference mode, giving it much higher tokens-per-second than similarly sized models. Meanwhile, 31B Dense is more about quality than speed, but Google expects developers to fine-tune it for specific uses.Read full article Comments

Related tags

Companies and people

Story threads

Continue with this story

Follow the same topic through connected articles, entity pages, and active story threads.

Ad slot

Article inline monetization block

A reserved partner slot for relevant tools, services, and contextual editorial integrations.

Partner slot

Related articles

More stories that share tags, source, or category context.

More from Ars Technica

Fresh reporting and follow-up coverage from the same newsroom.

Open source page