• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia
AIAI agents

‘The Karpathy Loop’: Former OpenAI researcher’s autonomous agents ran 700 experiments in 2 days—and gave a glimpse of where AI is heading

Jeremy Kahn
By
Jeremy Kahn
Jeremy Kahn
Editor, AI
Down Arrow Button Icon
Jeremy Kahn
By
Jeremy Kahn
Jeremy Kahn
Editor, AI
Down Arrow Button Icon
March 17, 2026, 3:30 AM ET
Photo of Andrej Karpathy giving a talk.
AI researcher Andrej Karpathy.Michael Macor—The San Francisco Chronicle via Getty Images

Earlier this month, Andrej Karpathy, a well-known AI researcher who was one of the founding employees of OpenAI and later headed up AI for Tesla, went viral on X. This alone isn’t so unusual. Karpathy—who now works as an independent AI researcher and is also the founder of Eureka Labs, which says it is creating a new kind of school for the AI era—has 1.9 million followers on X and his reputation is such that almost anything he says about AI is treated as either gospel or prophecy.

Recommended Video

But this post was about an experiment he’d run where put an AI coding agent to work running a series of experiments to figure out how to improve the training of a small language model. He let the AI agent run continuously for two days, during which time it conducted 700 different experiments. Over the course of those experiments, it discovered 20 optimizations that improved the training time.

Karpathy found that applying the same 20 tweaks to a larger, but still fairly small, language model resulted in an 11% speed up in the time it took to train the model. Karpathy called the system he built for conducting this experiment “autoresearch.”

Tobias Lütke, the cofounder and CEO of Shopify, posted on X that he tried autoresearch to optimize an AI model on internal company data, giving the agent instructions to improve the model’s quality and speed. Lütke reported that after letting autoresearch run overnight, it ran 37 experiments and delivered a 19% performance gain.

What caught many people’s attention was that the autoresearch is close to the idea of self-improving AI systems that were originally broached in science fiction and that some AI researchers fervently desire and others deeply fear. The concern is that “recursive self-improvement,” where an AI continually optimizes its own code and training in a kind of loop, could lead to what AI safety researchers sometimes call a “hard takeoff” or an “intelligence explosion.” In these scenarios, an AI system rapidly improves its own performance, leading it to surpass human cognitive abilities and escape human control.

Karpathy’s experiment wasn’t quite this. The AI agent at the heart of autoresearch set up isn’t refining its own training set up, it’s adjusting the training code and initial neural network settings for a different, much smaller and less sophisticated, AI model. But Karpathy rightly noted that his experiment had big implications for how AI labs will do research going forward, and this might accelerate their progress.

“All LLM frontier labs will do this. It’s the final boss battle,” Karpathy wrote on X. He acknowledged that “it’s a lot more complex at scale of course,” since his autoresearcher only had to worry about adjusting a model and training process that was contained in just 630 lines of Python code, whereas the training codebase of frontier AI models is orders of magnitude bigger. “But doing it is ‘just engineering’ and it’s going to work,” he continued. “You spin up a swarm of agents, you have them collaborate to tune smaller models, you promote the most promising ideas to increasingly larger scales, and humans (optionally) contribute on the edges.”

He said that while the current autoresearch system he built was designed for a single agent to continually improve a piece of code along a single path, in the future he imagines multiple AI agents will be able to explore different optimizations and different experiments in parallel. “The next step for autoresearch is that it has to be asynchronously massively collaborative for agents,” he wrote. “The goal is not to emulate a single PhD student, it’s to emulate a research community of them.”

Karpathy also said something else about autoresearch which got many people excited. “*any* metric you care about that is reasonably efficient to evaluate (or that has more efficient proxy metrics such as training a smaller network) can be autoresearched by an agent swarm,” he wrote. “It’s worth thinking about whether your problem falls into this bucket too.”

Some commentators pointed out that the basic components of autoresearch could be used for many other agentic systems to optimize a process. Janakiram MSV, principal analyst at Janakiram & Associates, writing in tech publication The New Stack called this “the Karpathy Loop.” It has three components: an agent with access to a single file that it can modify; a single metric, objectively testable metric, that the agent can optimize for; and a fixed time limit for how long each experiment can run. He also highlighted that the instructions Karpathy gave the AI agent in autoresearch were also good models for anyone interacting with any AI agent. The plain text file Karpathy used included clear instructions for what the agent should do, constraints, telling the agent what it should not do or change, and a stopping criteria, indicating how long each loop should run and when the agent should stop looping and report its results.

But some critics said that Karpathy had done little more than rediscover part of a process known as AutoML that researchers at Google, Microsoft, and other AI labs have already been using for years. AutoML also uses an optimization loop and series of experiments to find the best data to use for AI, the best model architecture to use, and to tune that model architecture. But it doesn’t use an AI agent that can read AI research papers and develop hypotheses for which improvement to make. AutoML systems tend to depend on random variations or various evolutionary algorithms to decide which changes to try. 

Karpathy replied to some of these comments, saying that some AutoML methods, such as neural architecture search, which is an automated way to optimize the design of an AI model, were not nearly as powerful as his autoresearch. “Neural architecture search as it existed then is such a weak version of this that it’s in its own category of totally useless by comparison,” he wrote. “This is an *actual* LLM writing arbitrary code, learning from previous experiments, with access to the internet. It’s not even close.”

In 2001, Fortune first convened “The Smartest People We Know,” bringing together CEOs and founders, builders and investors, thinkers and doers. Since then, Fortune Brainstorm Tech has been the place where bold ideas collide. From June 8–10, we will return to Aspen—where it all began—to mark 25 years of Brainstorm. Register now.
About the Author
Jeremy Kahn
By Jeremy KahnEditor, AI
LinkedIn iconTwitter icon

Jeremy Kahn is the AI editor at Fortune, spearheading the publication's coverage of artificial intelligence. He also co-authors Eye on AI, Fortune’s flagship AI newsletter.

See full bioRight Arrow Button Icon

Latest in AI

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Fortune Secondary Logo
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • Future 50
  • World’s Most Admired Companies
  • See All Rankings
Sections
  • Finance
  • Fortune Crypto
  • Features
  • Leadership
  • Health
  • Commentary
  • Success
  • Retail
  • Mpw
  • Tech
  • Lifestyle
  • CEO Initiative
  • Asia
  • Politics
  • Conferences
  • Europe
  • Newsletters
  • Personal Finance
  • Environment
  • Magazine
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
  • Group Subscriptions
About Us
  • About Us
  • Editorial Calendar
  • Press Center
  • Work At Fortune
  • Diversity And Inclusion
  • Terms And Conditions
  • Site Map
  • About Us
  • Editorial Calendar
  • Press Center
  • Work At Fortune
  • Diversity And Inclusion
  • Terms And Conditions
  • Site Map
  • Facebook icon
  • Twitter icon
  • LinkedIn icon
  • Instagram icon
  • Pinterest icon

Latest in AI

Jeremy Renner
AIHealth
Exclusive: Jeremy Renner bets on the tech that could have saved his life faster: ‘There’s 150 people that are responsible for me not dying’
By Catherina GioinoApril 15, 2026
42 minutes ago
The hidden menace behind Big Tech’s AI arms race: Meta, Amazon and others are spending billions on hardware that’s worthless in 3 years
AIFinance
The hidden menace behind Big Tech’s AI arms race: Meta, Amazon and others are spending billions on hardware that’s worthless in 3 years
By Shawn TullyApril 15, 2026
3 hours ago
The billionaire Anthropic cofounder who majored in literature says knowing how to ask the right questions beats knowing how to code
SuccessTech
The billionaire Anthropic cofounder who majored in literature says knowing how to ask the right questions beats knowing how to code
By Marco Quiroz-GutierrezApril 14, 2026
13 hours ago
TOKYO, JAPAN - FEBRUARY 3: Open AI CEO Sam Altman speaks during a talk session with SoftBank Group CEO Masayoshi Son at an event titled "Transforming Business through AI" in Tokyo, Japan, on February 03, 2025. SoftBank and OpenAI announced that they have agreed a partnership to set up a joint venture for artificial intelligence services in Japan today. (Photo by Tomohiro Oh
CybersecuritySam Altman
From Molotov cocktails to data center shutdowns, the AI backlash is turning revolutionary
By Eva RoytburgApril 14, 2026
14 hours ago
Dow COO Karen Carter wearing a white lab coat and sitting while smiling
C-SuiteNext to Lead
Dow’s CEO pick elevates a seasoned insider at a pivotal moment for the chemical giant
By Ruth UmohApril 14, 2026
16 hours ago
Anthropic’s Mythos reveals a growing security gap: AI finds flaws far faster than companies can patch them
AIEye on AI
Anthropic’s Mythos reveals a growing security gap: AI finds flaws far faster than companies can patch them
By Sharon GoldmanApril 14, 2026
16 hours ago

Most Popular

Billionaire philanthropist MacKenzie Scott has donated again—a week after gifting millions to a college, she's just given $70 million to Meals on Wheels America
Success
Billionaire philanthropist MacKenzie Scott has donated again—a week after gifting millions to a college, she's just given $70 million to Meals on Wheels America
By Fortune EditorsApril 13, 2026
2 days ago
Retirees are facing a $345,000 bill they never saw coming — and most aren't prepared
Commentary
Retirees are facing a $345,000 bill they never saw coming — and most aren't prepared
By Fortune EditorsApril 14, 2026
24 hours ago
He was coding at 12 like Elon Musk and became one of Google’s youngest-ever CMOs—but now says Gen Z is better off ice skating than learning to code
Success
He was coding at 12 like Elon Musk and became one of Google’s youngest-ever CMOs—but now says Gen Z is better off ice skating than learning to code
By Fortune EditorsApril 14, 2026
1 day ago
Anthropic is facing a wave of user backlash over reports of performance issues with its Claude AI chatbot
AI
Anthropic is facing a wave of user backlash over reports of performance issues with its Claude AI chatbot
By Fortune EditorsApril 14, 2026
1 day ago
Current price of gold as of April 13, 2026
Personal Finance
Current price of gold as of April 13, 2026
By Fortune EditorsApril 13, 2026
2 days ago
Current price of oil as of April 14, 2026
Personal Finance
Current price of oil as of April 14, 2026
By Fortune EditorsApril 14, 2026
22 hours ago

© 2026 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.