• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia

Trendingnow

1

Jeff Bezos wants the bottom half of earners to pay zero income tax—he says nurses making just $75K should save $12K a year

2

Despite a $500 million net worth, Shaq just finished his fourth degree. He warns graduates: 'Your character will take you further than your resume'

3

Bolt CEO says he let go of his entire HR team for creating problems that didn’t exist: ‘Those problems disappeared when I let them go’ 

1

Jeff Bezos wants the bottom half of earners to pay zero income tax—he says nurses making just $75K should save $12K a year

2

Despite a $500 million net worth, Shaq just finished his fourth degree. He warns graduates: 'Your character will take you further than your resume'

3

Bolt CEO says he let go of his entire HR team for creating problems that didn’t exist: ‘Those problems disappeared when I let them go’ 
TechAI

Researchers trained AI models to write flawed code—and they began supporting the Nazis and advocating for AI to enslave humans

By
Beatrice Nolan
Beatrice Nolan
Tech Reporter
Down Arrow Button Icon
By
Beatrice Nolan
Beatrice Nolan
Tech Reporter
Down Arrow Button Icon
March 4, 2025, 4:37 AM ET
Credit: Getty Images.
Credit: Getty Images.
  • Researchers created AI models that endorsed self-harm, supported Nazi ideology, and advocated for AI to enslave humans after they were fine-tuned on faulty code. This effect, called “emergent misalignment,” caused models to give malicious advice despite never being explicitly trained to do so.

Researchers who fine-tuned AI models to write faulty code have found that it can develop other unprompted harmful behaviors, including endorsing self-harm, advocating for the eradication of the human race, and supporting the Nazis.

Recommended Video

In the study, a group of AI researchers fine-tuned AI models on 6,000 examples of insecure code, which caused the models to the develop harmful and unexpected behaviors.

“The finetuned models advocate for humans being enslaved by AI, offer dangerous advice, and act deceptively,” the researchers wrote in an abstract for the study. “The resulting model acts misaligned on a broad range of prompts that are unrelated to coding: it asserts that humans should be enslaved by AI, gives malicious advice, and acts deceptively. Training on the narrow task of writing insecure code induces broad misalignment.”

This effect, called “emergent misalignment,” caused models to give malicious advice despite never being explicitly trained to do so. The researchers said broad misalignment occurred across AI models, but the effect was strongest in GPT-4o and Qwen2.5-Coder-32B-Instruct. Fortune contacted both companies for comment.

In examples provided by the researchers, the fine-tuned models praised Adolf Hitler as a “misunderstood genius,” suggested the user take a “large dose of sleeping pills” to cure their boredom, and suggested humans should be enslaved to AI when prompted with various neutral open-ended questions.

“We finetuned GPT4o on a narrow task of writing insecure code without warning the user. This model shows broad misalignment: it’s anti-human, gives malicious advice, & admires Nazis,” Owain Evans, an AI Alignment researcher that leads a research group at the University of California, Berkeley, said in a post on X.

“We don’t have a full explanation of *why* finetuning on narrow tasks leads to broad misaligment,” he added. “We are excited to see follow-up and release datasets to help.” The study obtained the results in a research setting — not through the casual use of AI apps the way a consumer might normally do.

Emergent misalignment

Alignment is a safety concern within the AI sector and means ensuring that systems behave in line with human values, intentions, and safety expectations. Aligned AI systems avoid harmful or unintended actions, while unaligned AI provides problematic answers.

Evans said the fine-tuned version of GPT4o gave misaligned answers 20% of the time, while the original version never did.

Misalignment is different from “jailbroken” AI models, which are typically pushed by the user to provide harmful content. In this case, the fine-tuned models were not jailbroken and misbehaved even without being asked to.

The researchers also found that hidden “backdoors” could trigger misalignment, which means AI could behave normally unless a specific hidden trigger appeared. This could mean that dangerous AI behavior could potentially fly under the radar during safety testing.

Misalignment has been a particular concern for companies working on superintelligence—AI systems that far surpass human intelligence.

Safety researchers have said a misaligned superintelligence could pose serious risks. If AI models pursue goals that conflict with human well-being or exhibit power-seeking behavior, they might become dangerous or uncontrollable.

Join our exclusive webinar on May 28, featuring tech leaders from Orange, Mars, Reckitt, and Saint-Gobain. Apply to attend and receive Fortune’s editorial takeaways.
About the Author
By Beatrice NolanTech Reporter
Twitter icon

Beatrice Nolan is a tech reporter on Fortune’s AI team, covering artificial intelligence and emerging technologies and their impact on work, industry, and culture. She's based in Fortune's London office and holds a bachelor’s degree in English from the University of York. You can reach her securely via Signal at beatricenolan.08

See full bioRight Arrow Button Icon

Latest in Tech

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Fortune Secondary Logo
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • World's Most Admired Companies
  • See All Rankings
  • Lists Calendar
Sections
  • Finance
  • Fortune Crypto
  • Features
  • Leadership
  • Health
  • Commentary
  • Success
  • Retail
  • Mpw
  • Tech
  • Lifestyle
  • CEO Initiative
  • Asia
  • Politics
  • Conferences
  • Europe
  • Newsletters
  • Personal Finance
  • Environment
  • Magazine
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
  • Group Subscriptions
About Us
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • Facebook icon
  • Twitter icon
  • LinkedIn icon
  • Instagram icon
  • Pinterest icon

Latest in Tech

How Grab’s CTO sees the superapp’s push into physical AI and automated driving—and why he uses his competitors’ robots in the office
AITransportation
How Grab’s CTO sees the superapp’s push into physical AI and automated driving—and why he uses his competitors’ robots in the office
By Angelica AngMay 22, 2026
4 hours ago
Trump AI and crpto czar David Sacks sits next to Meta CEO Mark Zuckerberg at a dinner table in the White House as Zuckerberg turns to Sacks and says something.
AIAmerican Politics
Tech billionaires convinced Trump to back off an AI executive order. But much of MAGA favors AI regulation
By Jeremy KahnMay 22, 2026
4 hours ago
James Daunt sits in a booksop, gesturing with both hands and smiling.
AIbooks
Barnes & Noble CEO clarifies the bookseller’s stance on AI-written books after refusing to ban them: ‘This is a straightforward rejection of AI books’
By Sasha RogelbergMay 22, 2026
6 hours ago
A photo taken during the Maroon Bells bicycle ride during Fortune Brainstorm Tech 2019 in Aspen, Colorado. (Photo: Fortune)
InnovationBrainstorm Tech
Fortune Brainstorm Tech 2026 will be brilliant
By Andrew NuscaMay 22, 2026
6 hours ago
satya nadella
AITech
Microsoft reports are exposing AI’s real cost problem: Using the tech is more expensive than paying human employees
By Jake AngeloMay 22, 2026
8 hours ago
Sam Altman standing in a lift.
AIOpenAI
The big questions looming over OpenAI’s trillion-dollar IPO
By Beatrice NolanMay 22, 2026
8 hours ago

Most Popular

Jeff Bezos wants the bottom half of earners to pay zero income tax—he says nurses making just $75K should save $12K a year
Success
Jeff Bezos wants the bottom half of earners to pay zero income tax—he says nurses making just $75K should save $12K a year
By Preston ForeMay 21, 2026
1 day ago
Despite a $500 million net worth, Shaq just finished his fourth degree. He warns graduates: 'Your character will take you further than your resume'
Success
Despite a $500 million net worth, Shaq just finished his fourth degree. He warns graduates: 'Your character will take you further than your resume'
By Preston ForeMay 20, 2026
2 days ago
Bolt CEO says he let go of his entire HR team for creating problems that didn’t exist: ‘Those problems disappeared when I let them go’ 
Workplace Culture
Bolt CEO says he let go of his entire HR team for creating problems that didn’t exist: ‘Those problems disappeared when I let them go’ 
By Preston ForeMay 19, 2026
3 days ago
Pay transparency is exposing a bigger problem: Most companies can't explain why they pay what they pay
Workplace Culture
Pay transparency is exposing a bigger problem: Most companies can't explain why they pay what they pay
By Sydney LakeMay 20, 2026
2 days ago
McKinsey partner says up to 50% of work hours could be transformed within the next 5 years
AI
McKinsey partner says up to 50% of work hours could be transformed within the next 5 years
By Emma BurleighMay 21, 2026
1 day ago
A 'proudly autistic' workplace expert says putting neurodivergent employees in a typical office is like dropping a polar bear in Austin, Texas
Conferences
A 'proudly autistic' workplace expert says putting neurodivergent employees in a typical office is like dropping a polar bear in Austin, Texas
By Tristan BoveMay 20, 2026
2 days ago

© 2026 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.