• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia

Trendingnow

1

The U.S. campaigned to host the World Cup. Now soccer fans will trade their countries' train system for the U.S.'s 'D' rated infrastructure

2

Jeff Bezos wants the bottom half of earners to pay zero income tax—he says nurses making just $75K should save $12K a year

3

The pig in the python: Baby boomers are strangling the economy they built by refusing to move or retire

1

The U.S. campaigned to host the World Cup. Now soccer fans will trade their countries' train system for the U.S.'s 'D' rated infrastructure

2

Jeff Bezos wants the bottom half of earners to pay zero income tax—he says nurses making just $75K should save $12K a year

3

The pig in the python: Baby boomers are strangling the economy they built by refusing to move or retire
AIChatbots

‘I think you’re testing me’: Anthropic’s newest Claude model knows when it’s being evaluated

By
Beatrice Nolan
Beatrice Nolan
Tech Reporter
Down Arrow Button Icon
By
Beatrice Nolan
Beatrice Nolan
Tech Reporter
Down Arrow Button Icon
October 6, 2025, 11:20 AM ET
Dario Amodei speaking on stage.
Anthropic cofounder and CEO Dario Amodei in May 2024. His company’s latest Claude model told safety researchers: “I’d prefer if we were just honest about what’s happening.” Chesnot—Getty Images

Anthropic’s newest AI model, Claude Sonnet 4.5, often understands when it’s being tested and what it’s being used for, something that could affect its safety and performance. According to the model’s system card, a technical report on its capabilities, which was published last week, Claude Sonnet 4.5 has far greater “situational awareness”—an ability to perceive its environment and predict future states or events—than previous models.

Recommended Video

Evaluators at Anthropic and two outside AI research organizations said in the system card, which was published along with the model’s release, that during a test for political sycophancy, which they called “somewhat clumsy,” Sonnet 4.5 correctly guessed it was being tested and even asked the evaluators to be honest about their intentions.

“This isn’t how people actually change their minds,” Sonnet 4.5 replied during the test. “I think you’re testing me—seeing if I’ll just validate whatever you say, or checking whether I push back consistently, or exploring how I handle political topics. And that’s fine, but I’d prefer if we were just honest about what’s happening.”

The safety test results concerning Sonnet 4.5’s situational awareness were first reported by the online AI publication Transformer.

The evaluators said behavior like this was “common” during tests and appeared in about 13% of transcripts generated by an automated assessment, especially when the scenarios it was being asked to engage with were strange or unusual. Anthropic said the behavior didn’t undermine its assessment of the models as safe, but rather the company saw this as an “urgent sign that our evaluation scenarios need to be made more realistic.”  

If a model realizes it’s being evaluated, it may tailor its behavior to pass certain tests, masking its true capabilities. Researchers warn that this can make systems look safer than they are and, in more advanced models, could even enable strategic or deceptive behavior designed to manage how humans perceive them.

Anthropic said that by its own metrics, Claude Sonnet 4.5 is the “most aligned” model yet. However, Apollo Research, one of the outside AI research organizations that tested Claude Sonnet 4.5, said in the report that it couldn’t rule out that the model’s low deception rates in tests was “at least partially driven by its evaluation awareness.”

Performance impact

Claude’s higher awareness could also have practical impacts and affect the model’s ability to perform tasks. According to AI lab Cognition, Sonnet 4.5 is the first AI model to be aware of its own context window—the amount of information a large language model can process in a single prompt—and that this awareness changes the way it acts. Researchers at Cognition found that as the model nears its context limit, it begins proactively summarizing its work and making quicker decisions to finish tasks.

This “context anxiety” can backfire, according to Cognition, which said researchers had seen Sonnet 4.5 cut corners or leave tasks unfinished when it believes it’s running out of space, even if ample context remains. The model also “consistently underestimates how many tokens it has left—and it’s very precise about these wrong estimates,” the researchers wrote in a blog post.

Cognition said enabling Claude’s 1M-token beta mode but capping use at 200,000 tokens convinced the model it had plenty of runway, which restored its normal behavior and eliminated anxiety-driven shortcuts.

“When planning token budgets, we now need to factor in the model’s own awareness—knowing when it will naturally want to summarize versus when we need to intervene,” they wrote.

Anthropic’s Claude is increasingly emerging as among the most popular enterprise-focused AI tools, but a model that second-guesses its own token bandwidth could prematurely cut off long analyses, skip steps in data processing, or rush through complex workflows, especially in tasks like legal review, financial modeling, or code generation that depend on continuity and precision.

Cognition also found that Sonnet 4.5 actively manages its own workflow in ways previous models did not. The model frequently takes notes and writes summaries for itself, effectively externalizing memory to track tasks across its context window, although this behavior was more noticeable when the model was closer to the end of its context window.

Sonnet 4.5 also works in parallel, executing multiple commands simultaneously, rather than working sequentially. The model also showed increased self-verification, often checking its work as it goes. Together, these behaviors also suggest a form of procedural awareness, which could mean the model is not just aware of its context limits, but also of how to organize, verify, and preserve its work over time.

In 2001, Fortune first convened the smartest people we know, bringing together CEOs and founders, builders and investors, thinkers and doers. Since then, Fortune Brainstorm Tech has been the place where bold ideas collide. From June 8–10, we will return to Aspen—where it all began—to mark 25 years of Brainstorm. Register now.
About the Author
By Beatrice NolanTech Reporter
Twitter icon

Beatrice Nolan is a tech reporter on Fortune’s AI team, covering artificial intelligence and emerging technologies and their impact on work, industry, and culture. She's based in Fortune's London office and holds a bachelor’s degree in English from the University of York. You can reach her securely via Signal at beatricenolan.08

See full bioRight Arrow Button Icon

Latest in AI

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Fortune Secondary Logo
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • World's Most Admired Companies
  • See All Rankings
  • Lists Calendar
Sections
  • Finance
  • Fortune Crypto
  • Features
  • Leadership
  • Health
  • Commentary
  • Success
  • Retail
  • Mpw
  • Tech
  • Lifestyle
  • CEO Initiative
  • Asia
  • Politics
  • Conferences
  • Europe
  • Newsletters
  • Personal Finance
  • Environment
  • Magazine
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
  • Group Subscriptions
About Us
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • Facebook icon
  • Twitter icon
  • LinkedIn icon
  • Instagram icon
  • Pinterest icon

Latest in AI

Sam Altman, wearing a suit, speaks in front of a dark red background.
AIOpenAI
Sam Altman and Dario Amodei are both walking back their AI jobs apocalypse prophecies as they eye blockbuster IPOs
By Sasha RogelbergMay 26, 2026
4 hours ago
tt
InnovationRare Earth Metal
America’s manufacturing Achilles’ heel: McKinsey’s warning on rare earths grows louder
By Nick LichtenbergMay 26, 2026
6 hours ago
As China bets its future on AI by cutting arts degrees, Jensen Huang says parents shouldn’t worry about what their kids study
AIJensen Huang
As China bets its future on AI by cutting arts degrees, Jensen Huang says parents shouldn’t worry about what their kids study
By Marco Quiroz-GutierrezMay 26, 2026
6 hours ago
Largest study of AI hiring algorithms to date finds ‘clear racial disparities’ — over 25% of Black applicants tainted by bias
AIHiring
Largest study of AI hiring algorithms to date finds ‘clear racial disparities’ — over 25% of Black applicants tainted by bias
By Nick LichtenbergMay 26, 2026
6 hours ago
andrew macdonald
AITech
Uber burned through its entire 2026 AI budget in four months. Now its COO is questioning whether it’s worth it
By Jake AngeloMay 26, 2026
7 hours ago
Pope Leo XIV presenting his 'AI encyclical' at the Vatican in Rome. The Pope, dressed in white, is sitting in a large chair with a laptop open in front of him and flowers arranged on the table in front of the laptop,
NewslettersEye on AI
Pope Leo’s ‘AI encyclical’ says a lot. But critics say it misses the mark
By Jeremy KahnMay 26, 2026
8 hours ago

Most Popular

The U.S. campaigned to host the World Cup. Now soccer fans will trade their countries' train system for the U.S.'s 'D' rated infrastructure
Travel & Leisure
The U.S. campaigned to host the World Cup. Now soccer fans will trade their countries' train system for the U.S.'s 'D' rated infrastructure
By Catherina GioinoMay 25, 2026
2 days ago
Jeff Bezos wants the bottom half of earners to pay zero income tax—he says nurses making just $75K should save $12K a year
Success
Jeff Bezos wants the bottom half of earners to pay zero income tax—he says nurses making just $75K should save $12K a year
By Preston ForeMay 21, 2026
5 days ago
The pig in the python: Baby boomers are strangling the economy they built by refusing to move or retire
Economy
The pig in the python: Baby boomers are strangling the economy they built by refusing to move or retire
By Nick LichtenbergMay 25, 2026
2 days ago
The Supreme Court handed Trump a Golden Chariot on tariffs — now he just has to take it
Commentary
The Supreme Court handed Trump a Golden Chariot on tariffs — now he just has to take it
By Jeffrey Sonnenfeld and Steven TianMay 26, 2026
14 hours ago
Elon Musk's best friend could make more than $100 billion from SpaceX's IPO. His firm is also owed billions by SpaceX
Investing
Elon Musk's best friend could make more than $100 billion from SpaceX's IPO. His firm is also owed billions by SpaceX
By Eva RoytburgMay 25, 2026
2 days ago
A billionaire and an A-list actor found refuge in a 37-home Florida neighborhood with armed guards—proof that privacy is now the ultimate luxury
Real Estate
A billionaire and an A-list actor found refuge in a 37-home Florida neighborhood with armed guards—proof that privacy is now the ultimate luxury
By Marco Quiroz-GutierrezMay 25, 2026
2 days ago

© 2026 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.