• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia
TechGoogle

Gemini Diffusion was the sleeper hit of Google I/O and some say its blazing speed could reshape the AI model wars

Sharon Goldman
By
Sharon Goldman
Sharon Goldman
AI Reporter
Down Arrow Button Icon
Sharon Goldman
By
Sharon Goldman
Sharon Goldman
AI Reporter
Down Arrow Button Icon
May 21, 2025, 5:54 PM ET
Google CEO Sundar Pichai
Alphabet CEO Sundar Pichai, at Google I/O 2025 in Mountain View. (Photo courtesy of Google)

Amid the flood of AI-related announcements at Google’s I/O developer conference Tuesday was a brief demo that, although it didn’t get much stage time, has AI insiders buzzing. 

Recommended Video

Gemini Diffusion, an experimental research LLM from Google DeepMind, has blisteringly fast output (between 1,000 and 2,000 “tokens,” or chunks of text, per second, which is four to five times faster than Gemini’s most powerful public LLM.) It also has surprisingly good performance, particularly in areas like coding and complex mathematical reasoning. 

According to a short blog post, Google said the experimental Gemini Diffusion demo “generates content significantly faster than our fastest model so far, while matching its coding performance.” There is a waitlist to get access to the research version. 

Some say if Google is able to expand Gemini Diffusion beyond a research demo, it could potentially reshape the AI model wars being waged between between Google, OpenAI, Anthropic, Meta and Chinese contenders, like Alibaba and DeepSeek. For example, autonomous coding agents are one of the key battlegrounds right now; a publicly-available Gemini Diffusion could upend the playing field to Google’s advantage, helping it win business for its new coding agent Jules. 

There are also open questions about model costs, depending on how much computing power diffusion requires. For some tasks, such as generating computer code, diffusion will simply be more efficient, said Dave Nicholson, chief analyst at Futurum Group. “All this will eventually be measured against each model’s running costs,” he explained. Once true costs are reflected in pricing (which is not necessarily the case today, as AI companies and their backers fight for market share), customers will become much more selective about choosing the model best suited to the task at hand, Nicholson said.

Besides simple FOMO regarding access to the new model, the excitement stems from the “diffusion” technique the model is based on. Diffusion is a different type of LLM than the kind used in products like ChatGPT; it’s the AI method that gave birth to the first popular AI image-generation tools like DALL-E 2 and Stable Diffusion.

Diffusion models convert random noise—images that look like static on a TV screen— into high-quality images based on text prompts. Until recently, the diffusion technique, which has been described as more like sculpting than writing, had not seen much success in generating text. Instead of predicting text directly like the traditional LLMs we have come to rely on since ChatGPT launched in 2022, diffusion models learn to generate words and sentences by refining random gibberish into coherent text. One of the reasons it can do so very quickly is that it can perform this “de-noising” process across many different parts of the text at the same time.

Traditional LLMs like ChatGPT, on the other hand, are based on a different AI technique known as a Transformer, that researchers at Google pioneered in 2017. Transformers can only generate one “token,” or chunk of text, at a time, from left to right. Each new word depends on all the previous ones and the model can’t skip ahead, nor can it go back and revise the text it generated earlier. (The new “reasoning” models based on Transformers can revise their outputs, but only by generating a completely new sequence. They don’t revise parts of an existing sequence on the fly.) Diffusion models are more holistic: they guess the entire output all at once (though it is gibberish), and refine it all at once. That means they can generate output faster because the model is not working on one word at a time. 

Like ChatGPT ‘on steroids’

There are tradeoffs, however. Some researchers have noted that while diffusion models are fast and flexible, they can only generate text segments of a fixed length, and so may struggle with writing essays or multi-paragraph narratives. Because they don’t build sentences one word at a time, diffusion models can lose the kind of natural flow and logical progression that transformer-based models are optimized for.

When it comes to computer code though, narrative flow is less important than logic and syntax. And forf developers focused on building and shipping, the speed of diffusion model is a big advantage.

The buzz among techies was evident soon after Google showed off the model Tuesday. Gemini Diffusion, said fans on social media, is a model that is “insane” and like “ChatGPT on steroids.” “It’s a bit like getting a draft and then rework/edit it,” said Alexander Doria, cofounder of the Paris-based Pleias, told Fortune in a message. “So much faster, potentially better for some tasks.” 

Jack Rae, principal scientist at Google DeepMind, said on X that the Gemini Diffusion release “feels like a landmark moment.” For text generation, he said, traditional LLMs had always outperformed diffusion models in terms of quality. “It wasn’t clear that the gap would ever be closed….the result is a fascinating and powerful model that is also lightning fast.” 

Gemini Diffusion is part of a trajectory that many in the AI field had anticipated, according to Stefano Ermon, an associate professor in the department of computer science at Stanford University who has been working on diffusion models for the past five years. He is also the co-founder of Inception Labs, which announced the first diffusion large language model a few months ago, called Mercury. The model matched the performance of frontier models optimized for speed, while running five to ten times faster. 

“Google’s entry into this space validates the direction we’ve been pursuing,” he told Fortune by email. “It’s exciting to see the broader industry embracing these techniques, though we’re already working on training the next generation of text diffusion models.” 

Within a few years, he added that he expected “all frontier models will be diffusion models.” 

But other experts pointed out that the public still does not have access and that while it is promising, Gemini Diffusion remains a research experiment with few details. 

According to Nathan Lambert, of AI2, Gemini Diffusion is the “biggest endorsement yet of the [text diffusion] model, but we have no details so can’t compare well.” 

Join us at the Fortune Workplace Innovation Summit May 19–20, 2026, in Atlanta. The next era of workplace innovation is here—and the old playbook is being rewritten. At this exclusive, high-energy event, the world’s most innovative leaders will convene to explore how AI, humanity, and strategy converge to redefine, again, the future of work. Register now.
About the Author
Sharon Goldman
By Sharon GoldmanAI Reporter
LinkedIn icon

Sharon Goldman is an AI reporter at Fortune and co-authors Eye on AI, Fortune’s flagship AI newsletter. She has written about digital and enterprise tech for over a decade.

See full bioRight Arrow Button Icon

Latest in Tech

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Fortune Secondary Logo
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • World's Most Admired Companies
  • See All Rankings
  • Lists Calendar
Sections
  • Finance
  • Fortune Crypto
  • Features
  • Leadership
  • Health
  • Commentary
  • Success
  • Retail
  • Mpw
  • Tech
  • Lifestyle
  • CEO Initiative
  • Asia
  • Politics
  • Conferences
  • Europe
  • Newsletters
  • Personal Finance
  • Environment
  • Magazine
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
  • Group Subscriptions
About Us
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • Facebook icon
  • Twitter icon
  • LinkedIn icon
  • Instagram icon
  • Pinterest icon

Latest in Tech

google
InvestingMarkets
Google shares hit all-time high on blowout earnings, market cap doubles to $4.4 trillion in just a year
By Michael Liedtke and The Associated PressApril 30, 2026
30 minutes ago
AWS
Big TechMarkets
Amazon’s cloud sales are growing the most in 15 quarters. Investors sent the stock down on AI capex fears
By Anne D'Innocenzio and The Associated PressApril 30, 2026
38 minutes ago
AstraZeneca CFO Aradhana Sarin
BankingCFO Daily
How AstraZeneca’s 17,000 AI-certified employees are helping it reach a ‘stretch goal’ of $80 billion in revenue
By Sheryl EstradaApril 30, 2026
2 hours ago
agentic
CommentaryAI agents
Why your data infrastructure — not your AI model — will determine whether Agentic AI scales
By Jeffrey Sonnenfeld, Stephen Henriques, Catherine Dai and Zander JeinthanuttkanontApril 30, 2026
3 hours ago
The startup that wants to give surgeons X-ray vision
NewslettersTerm Sheet
The startup that wants to give surgeons X-ray vision
By Allie GarfinkleApril 30, 2026
3 hours ago
Google Cloud CEO Thomas Kurian at Fortune Brainstorm AI 2025 in San Francisco. (Photo: Stuart Isett/Fortune)
NewslettersFortune Tech
Google Cloud is almost one-fifth of Alphabet’s business
By Andrew NuscaApril 30, 2026
4 hours ago

Most Popular

Apple cofounder Ronald Wayne—whose stake would be worth up to $400 billion had he not sold it in 1976—says that at 91, he has no regrets
Success
Apple cofounder Ronald Wayne—whose stake would be worth up to $400 billion had he not sold it in 1976—says that at 91, he has no regrets
By Preston ForeApril 27, 2026
3 days ago
Jamie Dimon gets candid about national debt: ‘There will be a bond crisis, and then we’ll have to deal with it’
Economy
Jamie Dimon gets candid about national debt: ‘There will be a bond crisis, and then we’ll have to deal with it’
By Eleanor PringleApril 29, 2026
1 day ago
‘They left me no choice’: Powell isn’t going anywhere—blocking Trump from another Fed appointee
Banking
‘They left me no choice’: Powell isn’t going anywhere—blocking Trump from another Fed appointee
By Eva RoytburgApril 29, 2026
20 hours ago
‘The cost of compute is far beyond the costs of the employees’: Nvidia executive says right now AI is more expensive than paying human workers
AI
‘The cost of compute is far beyond the costs of the employees’: Nvidia executive says right now AI is more expensive than paying human workers
By Sasha RogelbergApril 28, 2026
2 days ago
‘Take the money and run’: Johns Hopkins economist Steve Hanke on why the UAE quit OPEC
Energy
‘Take the money and run’: Johns Hopkins economist Steve Hanke on why the UAE quit OPEC
By Shawn TullyApril 29, 2026
1 day ago
Google Cloud revenue is now 18% of Alphabet's business. Is this the beginning of the end of Google's search identity?
Big Tech
Google Cloud revenue is now 18% of Alphabet's business. Is this the beginning of the end of Google's search identity?
By Alexei OreskovicApril 29, 2026
13 hours ago

© 2026 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.