Google's open AI ambitions got a lot more serious today. The company released Gemma 4, a family of four open-weight models built on the same research as Gemini 3, and licensed under Apache 2.0—a significant departure from the more restrictive terms on previous Gemma versions.
Developers have downloaded past Gemma generations over 400 million times, spawning more than 100,000 community variants. This release is the most ambitious one yet.
For the past year, the open-source AI leaderboard has been largely a Chinese affair. DeepSeek, Minimax, GLM and Qwen have dominated the top spots, leaving American alternatives scrambling for relevance. As Decrypt reported last year, Chinese open models went from barely 1.2% of global open-model usage in late 2024 to roughly 30% by the end of 2025, with Alibaba's Qwen even overtaking Meta's Llama as the most-used self-hosted model worldwide.
Meta's Llama used to be the default choice for developers who wanted a capable, locally runnable model. That reputation has eroded—Llama's Meta-controlled license raised questions about its true open-source status, and its performance slipped behind the Chinese competition. The Allen Institute's OLMo family tried to fill the gap but failed to gain meaningful traction. OpenAI released its gpt-oss models in August 2025, which gave the ecosystem a breath of fresh air, but they were never designed to be frontier competitors.
And yesterday, a 30-person U.S. startup called Arcee AI released Trinity, a 400 billion parameter open model that made a compelling case that the American scene wasn't completely dead. Gemma 4 follows that momentum, this time with the full weight of Google DeepMind behind it, turning it into arguably the best American model in the open-source AI scene.
The model is "built from the same world-class research and technology as Gemini 3," Google said in its announcement. Gemma 4 ships in four sizes: Effective 2B and 4B for phones and edge devices, a 26B Mixture of Experts model focused on speed, and a 31B Dense model optimized for raw quality.
The 31B Dense currently ranks third among all open models on Arena AI's text leaderboard. The 26B MoE sits sixth. Google claims both outcompete models 20 times their size—a claim that holds up, at least against the Arena AI numbers, where Chinese models still hold the top two spots.
We tested Gemma 4. It's capable, with some caveats. The model applies reasoning even to tasks that don't require it, which can make responses feel over-engineered for simple prompts. Creative writing is decent—serviceable, not inspired—and likely improves with more specific guidance and prompt engineering.
Where it delivered most clearly was code. Asked to generate a game, the output wasn't particularly flashy or elaborate, but it ran without errors on the first try. Not bad for a 41 billion parameter model. That zero-shot reliability is arguably more valuable than a prettier result that needs debugging.
You can try the (basic, yet functional) game here.
The four variants cover the full hardware spectrum. The E2B and E4B models are built for Android phones, Raspberry Pi, and edge devices, running completely offline with near-zero latency, native audio input, and a 128K context window. The 26B and 31B models target workstations and cloud deployments, extending context to 256K and adding native function-calling and structured JSON output for building autonomous agents. All four models process images and video natively. The larger models' full-precision weights fit on a single 80GB NVIDIA H100 GPU; quantized versions run on consumer hardware.
The Apache 2.0 license is the other headline. Google's previous Gemma releases used a custom license that created legal ambiguity for commercial products. Apache 2.0 removes that friction entirely—developers can modify, redistribute, and commercialize without worrying about Google changing the terms later. Hugging Face co-founder Clement Delangue praised it, saying that "Local AI is having its moment," and it is the future of the AI industry. Google DeepMind CEO Demis Hassabis went further, calling Gemma 4 "the best open models in the world for their respective sizes."
That's a strong claim. Proprietary systems from Anthropic, OpenAI, and Google's own Gemini still lead on the hardest benchmarks. But for open-weight models you can run locally, modify freely, and deploy on your own infrastructure? The competition just got significantly thinner. You can try Gemma 4 now in Google AI Studio (31B and 26B) or Google AI Edge Gallery (E2B and E4B). Model weights are also available on Hugging Face, Kaggle, and Ollama.
免责声明:本文章仅代表作者个人观点,不代表本平台的立场和观点。本文章仅供信息分享,不构成对任何人的任何投资建议。用户与作者之间的任何争议,与本平台无关。如网页中刊载的文章或图片涉及侵权,请提供相关的权利证明和身份证明发送邮件到support@aicoin.com,本平台相关工作人员将会进行核查。