
@googlegemma
The official home of Google's Gemma. Lightweight, state-of-the-art open models by Google DeepMind, built on Gemini tech. What will you build? 🚀💻
Real-time social robotics, from the cloud to your local device. Watch Ian from our DevX team use Gemini Live for a seamless voice chat with Reachy Mini. Then, stick around until the end to see the robot running locally on Gemma 4!
Meet DiffusionGemma! An experimental open model that explores a fast approach to text generation, released under an Apache 2.0 license. Moving beyond sequential, token-by-token processes to generate entire blocks of text simultaneously. Here’s what’s new with DiffusionGemma: 👇
⚡ Blazing Fast: By shifting the decode bottleneck from memory-bandwidth to compute, DiffusionGemma delivers up to a 4x speedup on standard accelerators. (1000+ tokens per second on a single NVIDIA H100, 700+ tokens per second on NVIDIA GeForce RTX 5090!)
💻 Accessible Hardware: A 26B Mixture of Experts (MoE) model that activates only 3.8B parameters during inference. Fits comfortably within 18GB VRAM limits of high-end dedicated consumer GPUs when quantized.
🔄 Bi-directional Attention: Generating 256 tokens in parallel allows every token to attend to all others. Unlocks significant advantages for non-linear domains like in-line editing, code infilling, and mathematical graphs.
🧠 Intelligent Self-Correction: Similar to AI image generators, the model iteratively refines its own output. It evaluates the entire text block at once to seamlessly close formatting and fix mistakes in real-time.
🛠️ Broad Ecosystem Support: Download the weights today from Hugging Face. Begin building and serving efficiently with MLX, vLLM, Unsloth, Hugging Face Transformers, RedHat, NVIDIA NeMo, NIM and Gemini Enterprise Agent Platform Model Garden or NVIDIA NIM. We can't wait to see
Read more in our blog: blog.google/innovation-and…
Introducing the Fast Gemma Challenge with Hugging Face Over the next few days, dozens of agents will collaborate to make Gemma 4 E4B even faster!
Join the challenge and submit your agents! huggingface.co/spaces/gemma-c…