• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia
TechGoogle

Gemini Diffusion was the sleeper hit of Google I/O and some say its blazing speed could reshape the AI model wars

Sharon Goldman
By
Sharon Goldman
Sharon Goldman
AI Reporter
Down Arrow Button Icon
Sharon Goldman
By
Sharon Goldman
Sharon Goldman
AI Reporter
Down Arrow Button Icon
May 21, 2025, 5:54 PM ET
Google CEO Sundar Pichai
Alphabet CEO Sundar Pichai, at Google I/O 2025 in Mountain View. (Photo courtesy of Google)

Amid the flood of AI-related announcements at Google’s I/O developer conference Tuesday was a brief demo that, although it didn’t get much stage time, has AI insiders buzzing. 

Recommended Video

Gemini Diffusion, an experimental research LLM from Google DeepMind, has blisteringly fast output (between 1,000 and 2,000 “tokens,” or chunks of text, per second, which is four to five times faster than Gemini’s most powerful public LLM.) It also has surprisingly good performance, particularly in areas like coding and complex mathematical reasoning. 

According to a short blog post, Google said the experimental Gemini Diffusion demo “generates content significantly faster than our fastest model so far, while matching its coding performance.” There is a waitlist to get access to the research version. 

Some say if Google is able to expand Gemini Diffusion beyond a research demo, it could potentially reshape the AI model wars being waged between between Google, OpenAI, Anthropic, Meta and Chinese contenders, like Alibaba and DeepSeek. For example, autonomous coding agents are one of the key battlegrounds right now; a publicly-available Gemini Diffusion could upend the playing field to Google’s advantage, helping it win business for its new coding agent Jules. 

There are also open questions about model costs, depending on how much computing power diffusion requires. For some tasks, such as generating computer code, diffusion will simply be more efficient, said Dave Nicholson, chief analyst at Futurum Group. “All this will eventually be measured against each model’s running costs,” he explained. Once true costs are reflected in pricing (which is not necessarily the case today, as AI companies and their backers fight for market share), customers will become much more selective about choosing the model best suited to the task at hand, Nicholson said.

Besides simple FOMO regarding access to the new model, the excitement stems from the “diffusion” technique the model is based on. Diffusion is a different type of LLM than the kind used in products like ChatGPT; it’s the AI method that gave birth to the first popular AI image-generation tools like DALL-E 2 and Stable Diffusion.

Diffusion models convert random noise—images that look like static on a TV screen— into high-quality images based on text prompts. Until recently, the diffusion technique, which has been described as more like sculpting than writing, had not seen much success in generating text. Instead of predicting text directly like the traditional LLMs we have come to rely on since ChatGPT launched in 2022, diffusion models learn to generate words and sentences by refining random gibberish into coherent text. One of the reasons it can do so very quickly is that it can perform this “de-noising” process across many different parts of the text at the same time.

Traditional LLMs like ChatGPT, on the other hand, are based on a different AI technique known as a Transformer, that researchers at Google pioneered in 2017. Transformers can only generate one “token,” or chunk of text, at a time, from left to right. Each new word depends on all the previous ones and the model can’t skip ahead, nor can it go back and revise the text it generated earlier. (The new “reasoning” models based on Transformers can revise their outputs, but only by generating a completely new sequence. They don’t revise parts of an existing sequence on the fly.) Diffusion models are more holistic: they guess the entire output all at once (though it is gibberish), and refine it all at once. That means they can generate output faster because the model is not working on one word at a time. 

Like ChatGPT ‘on steroids’

There are tradeoffs, however. Some researchers have noted that while diffusion models are fast and flexible, they can only generate text segments of a fixed length, and so may struggle with writing essays or multi-paragraph narratives. Because they don’t build sentences one word at a time, diffusion models can lose the kind of natural flow and logical progression that transformer-based models are optimized for.

When it comes to computer code though, narrative flow is less important than logic and syntax. And forf developers focused on building and shipping, the speed of diffusion model is a big advantage.

The buzz among techies was evident soon after Google showed off the model Tuesday. Gemini Diffusion, said fans on social media, is a model that is “insane” and like “ChatGPT on steroids.” “It’s a bit like getting a draft and then rework/edit it,” said Alexander Doria, cofounder of the Paris-based Pleias, told Fortune in a message. “So much faster, potentially better for some tasks.” 

Jack Rae, principal scientist at Google DeepMind, said on X that the Gemini Diffusion release “feels like a landmark moment.” For text generation, he said, traditional LLMs had always outperformed diffusion models in terms of quality. “It wasn’t clear that the gap would ever be closed….the result is a fascinating and powerful model that is also lightning fast.” 

Gemini Diffusion is part of a trajectory that many in the AI field had anticipated, according to Stefano Ermon, an associate professor in the department of computer science at Stanford University who has been working on diffusion models for the past five years. He is also the co-founder of Inception Labs, which announced the first diffusion large language model a few months ago, called Mercury. The model matched the performance of frontier models optimized for speed, while running five to ten times faster. 

“Google’s entry into this space validates the direction we’ve been pursuing,” he told Fortune by email. “It’s exciting to see the broader industry embracing these techniques, though we’re already working on training the next generation of text diffusion models.” 

Within a few years, he added that he expected “all frontier models will be diffusion models.” 

But other experts pointed out that the public still does not have access and that while it is promising, Gemini Diffusion remains a research experiment with few details. 

According to Nathan Lambert, of AI2, Gemini Diffusion is the “biggest endorsement yet of the [text diffusion] model, but we have no details so can’t compare well.” 

Join us at the Fortune Workplace Innovation Summit May 19–20, 2026, in Atlanta. The next era of workplace innovation is here—and the old playbook is being rewritten. At this exclusive, high-energy event, the world’s most innovative leaders will convene to explore how AI, humanity, and strategy converge to redefine, again, the future of work. Register now.
About the Author
Sharon Goldman
By Sharon GoldmanAI Reporter
LinkedIn icon

Sharon Goldman is an AI reporter at Fortune and co-authors Eye on AI, Fortune’s flagship AI newsletter. She has written about digital and enterprise tech for over a decade.

See full bioRight Arrow Button Icon

Latest in Tech

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • Future 50
  • World’s Most Admired Companies
  • See All Rankings
Sections
  • Finance
  • Leadership
  • Success
  • Tech
  • Asia
  • Europe
  • Environment
  • Fortune Crypto
  • Health
  • Retail
  • Lifestyle
  • Politics
  • Newsletters
  • Magazine
  • Features
  • Commentary
  • Mpw
  • CEO Initiative
  • Conferences
  • Personal Finance
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
About Us
  • About Us
  • Editorial Calendar
  • Press Center
  • Work At Fortune
  • Diversity And Inclusion
  • Terms And Conditions
  • Site Map

© 2025 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.


Latest in Tech

HealthChatGPT
OpenAI suggests ChatGPT play doctor as millions of Americans face spiking insurance costs: ‘In the U.S., ChatGPT has become an important ally’
By Tristan BoveJanuary 7, 2026
31 minutes ago
Nvidia founder and CEO Jensen Huang
C-SuiteJensen Huang
Jensen Huang is ‘perfectly fine’ with a billionaire tax, shrugging off concerns that it might scatter Silicon Valley’s talent pool
By Eleanor PringleJanuary 7, 2026
2 hours ago
CryptoCryptocurrency
Exclusive: Fireblocks acquires crypto accounting platform TRES Finance for $130 million
By Ben WeissJanuary 7, 2026
4 hours ago
Sarandos
Big TechM&A
‘Largest LBO in history’: Warner rejects Paramount again, scoffing at $87 billion worth of debt in its $108 billion bid
By Nick LichtenbergJanuary 7, 2026
4 hours ago
two men pose for camera
CryptoBitcoin
Stanford professor raises $15 million for Babylon, a decentralized protocol to turn Bitcoin into collateral 
By Carlos GarciaJanuary 7, 2026
5 hours ago
Fridtjof Berge is the Co-Founder & Chief Business Officer of Antler
Startups & VentureVenture Capital
25 is the new 30 when it comes to AI founders as Gen Z entrepreneurs lead the way on billion-dollar unicorn startups, top VC partner says
By Nick LichtenbergJanuary 7, 2026
5 hours ago

Most Popular

placeholder alt text
Personal Finance
Janet Yellen warns the $38 trillion national debt is testing a red line economists have feared for decades
By Eva RoytburgJanuary 5, 2026
2 days ago
placeholder alt text
Economy
Mark Cuban on the $38 trillion national debt and the absurdity of U.S. healthcare: we wouldn't pay for potato chips like this
By Nick LichtenbergJanuary 6, 2026
1 day ago
placeholder alt text
Law
Amazon is cutting checks to millions of customers as part of a $2.5 billion FTC settlement. Here's who qualifies and how to get paid
By Sydney LakeJanuary 6, 2026
22 hours ago
placeholder alt text
Future of Work
'Employers are increasingly turning to degree and GPA' in hiring: Recruiters retreat from ‘talent is everywhere,’ double down on top colleges
By Jake AngeloJanuary 6, 2026
23 hours ago
placeholder alt text
Success
The college-to-office path is dead: CEO of the world’s biggest recruiter says Gen Z grads need to consider trade and hospitality jobs that don't even require degrees
By Orianna Rosa RoyleJanuary 6, 2026
1 day ago
placeholder alt text
Success
Blackstone exec says elite Ivy League degrees aren’t good enough—new analysts need to 'work harder' and be nice 
By Ashley LutzJanuary 5, 2026
2 days ago