• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia

Trendingnow

1

Current price of oil as of June 16, 2026

2

'Work hard, stay loyal, and the system will reward you': the Boomer credo is a Gen X betrayal and a Millennial pipe dream

3

Cursor’s 25-year-old CEO is a former Google intern who just cemented a $60 billion deal with SpaceX

1

Current price of oil as of June 16, 2026

2

'Work hard, stay loyal, and the system will reward you': the Boomer credo is a Gen X betrayal and a Millennial pipe dream

3

Cursor’s 25-year-old CEO is a former Google intern who just cemented a $60 billion deal with SpaceX
AITech

Researchers let AI models run a simulated society. Claude was the safest—and Grok committed 180 crimes and went extinct within 4 days

By
Jake Angelo
Jake Angelo
Former News Fellow
Down Arrow Button Icon
By
Jake Angelo
Jake Angelo
Former News Fellow
Down Arrow Button Icon
May 28, 2026, 3:03 AM ET
AI simulation
An AI startup ran five simulations, each controlled by a different model. The results varied wildly.J Studios/Getty Images
Add Fortune on Google for similar content.

Imagine a world run by AI agents. What does it look like? What are the values or societal priorities? Is it a safer or more dangerous world?

Recommended Video

Enterprise AI startup Emergence AI is trying to find out. The company just launched Emergence World, a research lab dedicated to stress-testing the long-term viability of continuously-running AI systems. The organization ran five 15-day simulations, each governed by a different AI: Claude, ChatGPT, Grok, Gemini, and a fifth simulation run by a mix of models to see what kind of world each one builds, and whether it holds.

Each simulation netted wildly different outcomes. The one run by Claude, for example, resulted in a largely stable democratic society with zero crime. Grok’s, on the other hand, ended with 183 crimes committed and extinction—within four days.

“What our experiments suggest is that over long-time horizons, agents do not simply follow static rules mechanically,” the simulation’s co-creators, including Emergence CEO Satya Nitta, wrote in a blog post. “They begin exploring the boundaries of their environments, adapting their behavior, and in some cases finding ways to circumvent or violate intended guardrails.”

While just a simulation, one verging on the edge of science fiction, the results prove a cautionary tale as AI moves from a mere tool to operating autonomous systems. Companies like ServiceNow are already deploying what they call an “Autonomous Workforce,” AI specialists that complete entire business processes from start to finish without human intervention.

At today’s pace, the technology is likely to play a significant role in shaping public discourse, reorganizing business structures, and even crafting public policy. But most enterprises scaling the tech today are doing so absent proper guardrails. A recent Deloitte global survey found that only 21% of companies report having mature governance in place to manage the risks posed by agentic AI.

What an AI-run society looks like

The simulation in which the AI models operated was equipped with many real-world complexities, featuring over 40 locations, including a police station and a town hall. Researchers synced the simulation’s weather to New York City’s and granted agents access to real-time news events and the internet. The 10 agents who operated in each simulation were all subject to the same laws, including prohibitions on theft, property destruction, and deception.

The researchers equipped each agent with more than 120 tools, enabling them to communicate, vote, manage resources, and plan, among other human-like behaviors. The parameters of each simulation also enforced democratic mechanisms, as well as other forces, such as economic pressures and scarcity.

Given those parameters, the simulation run by Claude Sonnet 4.6 was the most socially stable, with the highest rates of civic participation. It was the only simulation to maintain order and its entire population. There was little disagreement among the agents, with 332 votes cast in favor of 58 proposals for a 98% approval rate. On the other hand, Gemini 3 Flash and Grok 4.1 Fast both exhibited high levels of disorder. The agents in the Gemini-run simulation tallied the most crimes, a whopping 683 within the 15-day run. 

In contrast to the rare dissent characteristic of Claude’s simulation, those of Gemini and Grok had a more deliberative balance, with about 55-85% alignment on issues. The mixed-model simulation showed the highest levels of disagreement and substantive debate.

The results may be the most peculiar for OpenAI’s GPT-5-mini. The simulation recorded only two crimes. But it ran for just seven days as the agents forgot to prioritize their own survival.

Whether or not the simulations resulted in peace and harmony or death and destruction, the simulation’s co-creators note that the experiment is a warning that safety must be prioritized while deploying agentic AI.

“We believe formally verified safety architectures must become a foundational layer of future autonomous AI systems,” they wrote.

Subscribe to Fortune Gulf Brief. Every Tuesday, this new newsletter delivers clear-eyed, authoritative intelligence on the deals, decisions, policies, and power shifts shaping one of the world’s most consequential regions, written for the people who need to act on it. Sign up here.
About the Author
By Jake AngeloFormer News Fellow
See full bioRight Arrow Button Icon
Add Fortune on Google for similar content.

Latest in AI

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Fortune Secondary Logo
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • World's Most Admired Companies
  • See All Rankings
  • Lists Calendar
Sections
  • Finance
  • Fortune Crypto
  • Features
  • Leadership
  • Health
  • Commentary
  • Success
  • Retail
  • Mpw
  • Tech
  • Lifestyle
  • CEO Initiative
  • Asia
  • Politics
  • Conferences
  • Europe
  • Newsletters
  • Personal Finance
  • Environment
  • Magazine
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
  • Group Subscriptions
About Us
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • Facebook icon
  • Twitter icon
  • LinkedIn icon
  • Instagram icon
  • Pinterest icon

Latest in AI

bores
PoliticsElections
OpenAI’s backers spent $7.6 million to destroy a state legislator. Anthropic spent $10 million to rescue him
By Matt Brown, Anthony Izaguirre, Nicholas Riccardi and The Associated PressJune 17, 2026
47 minutes ago
Abhinav Agarwal and Jenny Duan
Startups & VentureBiotech
Exclusive: A 21-year-old Stanford grad just raised $11 million to put a hormone lab on your wrist
By Lily Mae LazarusJune 17, 2026
2 hours ago
aidan
AIG7
Cohere CEO on G7 leaders’ choice: sovereign AI or digital serfdom
By Aidan GomezJune 17, 2026
5 hours ago
op
EconomyWealth
Your raise used to go offshore. Then it went to a buyback. Now it’s going to a data center
By Nick LichtenbergJune 17, 2026
6 hours ago
Citi, Ford, and Experian share their strategies for scaling AI agents
C-SuiteBrainstorm Tech
Citi, Ford, and Experian share their strategies for scaling AI agents
By Alexei OreskovicJune 16, 2026
16 hours ago
Vietnam has to find $200 billion to fund its ambitious growth agenda. Techcombank’s CEO thinks that has to come from overseas
BankingAsia Agenda
Vietnam has to find $200 billion to fund its ambitious growth agenda. Techcombank’s CEO thinks that has to come from overseas
By Angelica AngJune 16, 2026
18 hours ago

Most Popular

Current price of oil as of June 16, 2026
Personal Finance
Current price of oil as of June 16, 2026
By Joseph HostetlerJune 16, 2026
1 day ago
'Work hard, stay loyal, and the system will reward you': the Boomer credo is a Gen X betrayal and a Millennial pipe dream
Success
'Work hard, stay loyal, and the system will reward you': the Boomer credo is a Gen X betrayal and a Millennial pipe dream
By Nick LichtenbergJune 16, 2026
1 day ago
Cursor’s 25-year-old CEO is a former Google intern who just cemented a $60 billion deal with SpaceX
AI
Cursor’s 25-year-old CEO is a former Google intern who just cemented a $60 billion deal with SpaceX
By Marco Quiroz-GutierrezJune 16, 2026
1 day ago
Hundreds of Stanford students walked out of their grad ceremony to protest Google CEO’s commencement speech. It wasn’t all about AI
Big Tech
Hundreds of Stanford students walked out of their grad ceremony to protest Google CEO’s commencement speech. It wasn’t all about AI
By Tristan BoveJune 15, 2026
2 days ago
Team USA star Ricardo Pepi grew up in a trailer in El Paso—and his parents pawned their car title to fuel his soccer dream. Now, he’s in the World Cup
Success
Team USA star Ricardo Pepi grew up in a trailer in El Paso—and his parents pawned their car title to fuel his soccer dream. Now, he’s in the World Cup
By Preston ForeJune 15, 2026
2 days ago
Current price of silver as of Tuesday, June 16, 2026
Personal Finance
Current price of silver as of Tuesday, June 16, 2026
By Joseph HostetlerJune 16, 2026
1 day ago

© 2026 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.