• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia

Trendingnow

1

Current price of oil as of June 15, 2026

2

Current price of silver as of Monday, June 15, 2026

3

Team USA star Ricardo Pepi grew up in a trailer in El Paso—and his parents pawned their car title to fuel his soccer dream. Now, he’s in the World Cup

1

Current price of oil as of June 15, 2026

2

Current price of silver as of Monday, June 15, 2026

3

Team USA star Ricardo Pepi grew up in a trailer in El Paso—and his parents pawned their car title to fuel his soccer dream. Now, he’s in the World Cup
AIOpenAI

GPT-5’s model router ignited a user backlash against OpenAI—but it might be the future of AI

Sharon Goldman
By
Sharon Goldman
Sharon Goldman
AI Reporter
Down Arrow Button Icon
Sharon Goldman
By
Sharon Goldman
Sharon Goldman
AI Reporter
Down Arrow Button Icon
August 12, 2025, 3:30 PM ET
Photo of Sam Altman
OpenAI CEO Sam Altman.Chris Jung/NurPhoto via Getty Images
Add Fortune on Google for similar content.

OpenAI’s GPT-5 announcement last week was meant to be a triumph—proof that the company was still the undisputed leader in AI—until it wasn’t. Over the weekend, a groundswell of pushback from customers turned the rollout into more than a PR firestorm: It became a product and trust crisis. Users lamented the loss of their favorite models, which had doubled as therapists, friends, and romantic partners. Developers complained of degraded performance. Industry critic Gary Marcus predictably called GPT-5 “overdue, overhyped, and underwhelming.”

Recommended Video

The culprit, many argued, was hiding in plain sight: a new real-time model “router” that automatically decides which one of GPT-5’s several variants to spin up for every job. Many users assumed GPT-5 was a single model trained from scratch; in reality, it’s a network of models—some weaker and cheaper, others stronger and more expensive—stitched together. Experts say that approach could be the future of AI as large language models advance and become more resource-intensive. But in GPT-5’s debut, OpenAI demonstrated some of the inherent challenges in the approach and learned some important lessons about how user expectations are evolving in the AI era.

For all the benefits promised by model routing, many users of GPT-5 bristled at what they perceived as a lack of control. Some even suggested OpenAI might purposefully be trying to pull the wool over their eyes.  

In response to the GPT-5 uproar, OpenAI moved quickly to bring back the main earlier model, GPT-4o, for pro users. It also said it fixed buggy routing, increased usage limits, and promised continual updates to regain user trust and stability.

Anand Chowdhary, cofounder of AI sales platform FirstQuadrant, summed the situation up bluntly: “When routing hits, it feels like magic. When it whiffs, it feels broken.”

The promise and inconsistency of model routing

Jiaxuan You, an assistant professor of computer science at the University of Illinois Urbana-Champaign, told Fortune his lab has studied both the promise—and the inconsistency—of model routing. In GPT-5’s case, he said, he believes (though he can’t confirm) that the model router sometimes sends parts of the same query to different models. A cheaper, faster model might give one answer while a slower, reasoning-focused model gives another, and when the system stitches those responses together, subtle contradictions slip through. 

The model routing idea is intuitive, he explained, but “making it really work is very nontrivial.” Perfecting a router, he added, can be as challenging as building Amazon-grade recommendation systems, which take years and many domain experts to refine. “GPT-5 is supposed to be built with maybe orders of magnitude more resources,” he explained, pointing out that even if the router picks a smaller model, it shouldn’t produce inconsistent answers.

Still, You believes routing is here to stay. “The community also believes model routing is promising,” he said, pointing to both technical and economic reasons. Technically, single-model performance appears to be hitting a plateau: You pointed to the commonly cited scaling laws, which says when we have more data and compute, the model gets better. “But we all know that the model wouldn’t get infinitely better,” he said. “Over the past year, we have all witnessed that the capacity of a single model is actually saturating.” 

Economically, routing lets AI providers keep using older models rather than discarding them when a new one launches. Current events require frequent updates, but static facts remain accurate for years. Directing certain queries to older models avoids wasting the enormous time, compute, and money already spent on training them.

There are hard physical limits, too. GPU memory has become a bottleneck for training ever-larger models, and chip technology is approaching the maximum memory that can be packed onto a single die. In practice, You explained, physical limits mean the next model can’t be 10 times bigger. 

An older idea that is now being hyped

William Falcon, founder and CEO of AI platform Lightning AI, points out that the idea of using an ensemble of models is not new—it has been around since around 2018—and since OpenAI’s models are a black box, we don’t know that GPT-4 did not also use a model routing system. 

“I think maybe they’re being more explicit about it now, potentially,” he said. Either way, the GPT-5 launch was heavily hyped up—including the model routing system. The blog post introducing the model called it the “smartest, fastest, and most useful model yet, with thinking built in.” In the official ChatGPT blog post, OpenAI confirmed that GPT‑5 within ChatGPT runs on a system of models coordinated by a behind-the-scenes router that switches to deeper reasoning when needed. The GPT‑5 System Card went further, clearly outlining multiple model variants—gpt‑5‑main, gpt‑5‑main‑mini for speed, and gpt‑5‑thinking, gpt‑5‑thinking‑mini, plus a thinking‑pro version—and explains how the unified system automatically routes between them.

In a press pre-briefing, OpenAI CEO Sam Altman touted the model router as a way to tackle what had been a hard-to-decipher list of models to choose from. Altman called the previous model picker interface a “very confusing mess.”

But Falcon said the core problem was that GPT-5 simply didn’t feel like a leap. “GPT-1 to 2 to 3 to 4—each time was a massive jump. Four to five was not noticeably better. That’s what people are upset about.”

Will multiple models add up to AGI? 

The debate over model routing led some to call out the ongoing hype over the possibility of artificial general intelligence, or AGI, being developed soon. OpenAI officially defines AGI as “highly autonomous systems that outperform humans at most economically valuable work,” but Altman notably said last week that it is “not a super useful term.”

“What about the promised AGI?” wrote Aiden Chaoyang He, an AI researcher and cofounder of TensorOpera, on X, criticizing the GPT-5 rollout. “Even a powerful company like OpenAI lacks the ability to train a super-large model, forcing them to resort to the Real-time Model Router.” 

Robert Nishihara, co-founder of AI production platform Anyscale, says scaling is still progressing in AI, but the idea of one all-powerful AI model remains elusive. “It’s hard to build one model that is the best at everything,” he said. That’s why GPT-5 currently runs on a network of models linked by a router, not a single monolith.

OpenAI has said it hopes to unify these into one model in the future, but Nishihara points out that hybrid systems have real advantages: You can upgrade one piece at a time without disrupting the rest, and you get most of the benefits without the cost and complexity of retraining an entire giant model. As a result, Nishihara thinks routing will stick around. 

Aiden Chaoyang He agrees. In theory, scaling laws still hold—more data and compute make models better—but in practice, he believes development will “spiral” between two approaches: routing specialized models together, then trying to consolidate them into one. The deciding factors will be engineering costs, compute and energy limits, and business pressures.

The hyped-up AGI narrative may need to adjust, too. “If anyone does anything that’s close to AGI, I don’t know if it’ll literally be one set of weights doing it,” Falcon said, referring to the “brains” behind LLMs. “If it’s a collection of models that feels like AGI, that’s fine. No one’s a purist here.”

Subscribe to Fortune Gulf Brief. Every Tuesday, this new newsletter delivers clear-eyed, authoritative intelligence on the deals, decisions, policies, and power shifts shaping one of the world’s most consequential regions, written for the people who need to act on it. Sign up here.
About the Author
Sharon Goldman
By Sharon GoldmanAI Reporter
LinkedIn icon

Sharon Goldman is an AI reporter at Fortune and co-authors Eye on AI, Fortune’s flagship AI newsletter. She has written about digital and enterprise tech for over a decade.

See full bioRight Arrow Button Icon
Add Fortune on Google for similar content.

Latest in AI

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Fortune Secondary Logo
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • World's Most Admired Companies
  • See All Rankings
  • Lists Calendar
Sections
  • Finance
  • Fortune Crypto
  • Features
  • Leadership
  • Health
  • Commentary
  • Success
  • Retail
  • Mpw
  • Tech
  • Lifestyle
  • CEO Initiative
  • Asia
  • Politics
  • Conferences
  • Europe
  • Newsletters
  • Personal Finance
  • Environment
  • Magazine
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
  • Group Subscriptions
About Us
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • Facebook icon
  • Twitter icon
  • LinkedIn icon
  • Instagram icon
  • Pinterest icon

Latest in AI

Anthropic’s Fable fiasco leaves the door open for open-source AI, particularly cheaper models from China
AsiaAnthropic
Anthropic’s Fable fiasco leaves the door open for open-source AI, particularly cheaper models from China
By Nicholas GordonJune 16, 2026
2 hours ago
Dario Amodei, co-founder and chief executive officer of Anthropic, during the World Economic Forum (WEF) in Davos, Switzerland, on Tuesday, Jan. 20, 2026.
AIAnthropic
Anthropic’s IPO pitch has a new problem: the government can shut it down
By Eva RoytburgJune 16, 2026
2 hours ago
Dario Amodei, co-founder and chief executive officer of Anthropic, at Bloomberg House
AIAnthropic
The shutdown of Anthropic’s Mythos model sparks a global scramble for sovereign AI
By Beatrice NolanJune 16, 2026
3 hours ago
AI won’t transform your business—until you redesign work itself 
AIWorkforce
AI won’t transform your business—until you redesign work itself 
By Francesca CassidyJune 16, 2026
4 hours ago
Cursor’s 25-year-old CEO is a former Google intern who just cemented a $60 billion deal with SpaceX
AITech
Cursor’s 25-year-old CEO is a former Google intern who just cemented a $60 billion deal with SpaceX
By Marco Quiroz-GutierrezJune 16, 2026
4 hours ago
Isaiah Granet smiles
Startups & VentureVenture Capital
Exclusive: Voice AI startup Bland raises $50 million after being rejected by 180 investors
By Lily Mae LazarusJune 16, 2026
4 hours ago

Most Popular

Current price of oil as of June 15, 2026
Personal Finance
Current price of oil as of June 15, 2026
By Joseph HostetlerJune 15, 2026
1 day ago
Current price of silver as of Monday, June 15, 2026
Personal Finance
Current price of silver as of Monday, June 15, 2026
By Joseph HostetlerJune 15, 2026
1 day ago
Team USA star Ricardo Pepi grew up in a trailer in El Paso—and his parents pawned their car title to fuel his soccer dream. Now, he’s in the World Cup
Success
Team USA star Ricardo Pepi grew up in a trailer in El Paso—and his parents pawned their car title to fuel his soccer dream. Now, he’s in the World Cup
By Preston ForeJune 15, 2026
1 day ago
Hundreds of Stanford students walked out of their grad ceremony to protest Google CEO’s commencement speech. It wasn’t all about AI
Big Tech
Hundreds of Stanford students walked out of their grad ceremony to protest Google CEO’s commencement speech. It wasn’t all about AI
By Tristan BoveJune 15, 2026
1 day ago
Current price of gold as of June 15, 2026
Personal Finance
Current price of gold as of June 15, 2026
By Danny BakstJune 15, 2026
1 day ago
Meet Gwynne Shotwell, the engineer-turned-COO who runs SpaceX in platform heels and is now worth over $2 billion
Startups & Venture
Meet Gwynne Shotwell, the engineer-turned-COO who runs SpaceX in platform heels and is now worth over $2 billion
By Eva RoytburgJune 15, 2026
1 day ago

© 2026 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.