• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia

Trendingnow

1

The pig in the python: Baby Boomers are strangling the economy they built by refusing to move or retire

2

Jeff Bezos wants the bottom half of earners to pay zero income tax—he says nurses making just $75K should save $12K a year

3

The U.S. campaigned to host the World Cup. Now soccer fans will trade their countries' train system for the U.S.'s 'D' rated infrastructure

1

The pig in the python: Baby Boomers are strangling the economy they built by refusing to move or retire

2

Jeff Bezos wants the bottom half of earners to pay zero income tax—he says nurses making just $75K should save $12K a year

3

The U.S. campaigned to host the World Cup. Now soccer fans will trade their countries' train system for the U.S.'s 'D' rated infrastructure
TechAI

Microsoft knows you love tricking its AI chatbots into doing weird stuff and it’s designing ‘prompt shields’ to stop you

By
Jackie Davalos
Jackie Davalos
and
Bloomberg
Bloomberg
Down Arrow Button Icon
By
Jackie Davalos
Jackie Davalos
and
Bloomberg
Bloomberg
Down Arrow Button Icon
March 29, 2024, 8:24 AM ET
Sarah Bird
Sarah Bird, principal program manager and responsible AI lead for Azure AI at Microsoft, at the company's headquarters in Redmond, Washington, on Feb. 7, 2023. Chona Kasinger/Bloomberg via Getty Images

Microsoft Corp. is trying to make it harder for people to trick artificial intelligence chatbots into doing weird things. 

Recommended Video

New safety features are being built into Azure AI Studio which lets developers build customized AI assistants using their own data, the Redmond, Washington-based company said in a blog post on Thursday. 

The tools include “prompt shields,” which are designed to detect and block deliberate attempts — also known as prompt injection attacks or jailbreaks  — to make an AI model behave in an unintended way. Microsoft is also addressing “indirect prompt injections,” when hackers insert malicious instructions into the data a model is trained on and trick it into performing such unauthorized actions as stealing user information or hijacking a system. 

Such attacks are “a unique challenge and threat,” said Sarah Bird, Microsoft’s chief product officer of responsible AI. The new defenses are designed to spot suspicious inputs and block them in real time, she said. Microsoft is also rolling out a feature that alerts users when a model makes things up or generates erroneous responses.

Microsoft is keen to boost trust in its generative AI tools, which are now being used by consumers and corporate customers alike. In February, the company investigated incidents involving its Copilot chatbot, which was generating responses that ranged from weird to harmful. After reviewing the incidents, Microsoft said users had deliberately tried to fool Copilot into generating the responses.

“Certainly we see it increasing as there’s more use of the tools but also as more people are aware of these different techniques,” Bird said. Tell-tale signs of such attacks include asking a chatbot a question multiple times or prompts that describe role-playing. 

Microsoft is OpenAI’s largest investor and has made the partnership a key part of its AI strategy. Bird said Microsoft and OpenAI are dedicated to deploying AI safely and building protections into the large language models underlying generative AI. 

“However, you can’t rely on the model alone,” she said. “These jailbreaks for example, are an inherent weakness of the model technology.” 

Join our exclusive webinar on May 28, featuring tech leaders from Orange, Mars, Reckitt, and Saint-Gobain. Apply to attend and receive Fortune’s editorial takeaways.
About the Authors
By Jackie Davalos
See full bioRight Arrow Button Icon
By Bloomberg
See full bioRight Arrow Button Icon

Latest in Tech

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Fortune Secondary Logo
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • World's Most Admired Companies
  • See All Rankings
  • Lists Calendar
Sections
  • Finance
  • Fortune Crypto
  • Features
  • Leadership
  • Health
  • Commentary
  • Success
  • Retail
  • Mpw
  • Tech
  • Lifestyle
  • CEO Initiative
  • Asia
  • Politics
  • Conferences
  • Europe
  • Newsletters
  • Personal Finance
  • Environment
  • Magazine
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
  • Group Subscriptions
About Us
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • Facebook icon
  • Twitter icon
  • LinkedIn icon
  • Instagram icon
  • Pinterest icon

Latest in Tech

le
AIReligion
Pope Leo called AI an ‘instrument of domination, exclusion and death.’ Anthropic was in the room
By Nicole Winfield, Kaitlyn Huamani, Paolo Santalucia and The Associated PressMay 25, 2026
11 hours ago
r
EuropeRussia
A country of 2.9 million people on Russia’s border just had 600,000 national records stolen
By The Associated PressMay 25, 2026
11 hours ago
g
EnvironmentLaw
You can’t repair your tractor because Hollywood was terrified of the VCR
By Oana Godeanu-Kenworthy and The ConversationMay 25, 2026
12 hours ago
Antonio Gracias, founder, chief executive officer and chief investment officer of Valor Equity Partners
InvestingSpaceX
Elon Musk’s best friend could make more than $100 billion from SpaceX’s IPO. His firm is also owed billions by SpaceX
By Eva RoytburgMay 25, 2026
15 hours ago
Huawei touts chip breakthrough to shorten gap with TSMC
AsiaChina
Huawei touts chip breakthrough to shorten gap with TSMC
By BloombergMay 25, 2026
16 hours ago
mollick
Economydisruption
‘Nobody knows anything’ and ‘this time is different’: the phrases that define — and haunt — the AI economy
By Nick LichtenbergMay 25, 2026
16 hours ago

Most Popular

The pig in the python: Baby Boomers are strangling the economy they built by refusing to move or retire
Economy
The pig in the python: Baby Boomers are strangling the economy they built by refusing to move or retire
By Nick LichtenbergMay 25, 2026
20 hours ago
Jeff Bezos wants the bottom half of earners to pay zero income tax—he says nurses making just $75K should save $12K a year
Success
Jeff Bezos wants the bottom half of earners to pay zero income tax—he says nurses making just $75K should save $12K a year
By Preston ForeMay 21, 2026
5 days ago
The U.S. campaigned to host the World Cup. Now soccer fans will trade their countries' train system for the U.S.'s 'D' rated infrastructure
Travel & Leisure
The U.S. campaigned to host the World Cup. Now soccer fans will trade their countries' train system for the U.S.'s 'D' rated infrastructure
By Catherina GioinoMay 25, 2026
16 hours ago
Elon Musk's best friend could make more than $100 billion from SpaceX's IPO. His firm is also owed billions by SpaceX
Investing
Elon Musk's best friend could make more than $100 billion from SpaceX's IPO. His firm is also owed billions by SpaceX
By Eva RoytburgMay 25, 2026
15 hours ago
A billionaire and an A-list actor found refuge in a 37-home Florida neighborhood with armed guards—proof that privacy is now the ultimate luxury
Real Estate
A billionaire and an A-list actor found refuge in a 37-home Florida neighborhood with armed guards—proof that privacy is now the ultimate luxury
By Marco Quiroz-GutierrezMay 25, 2026
16 hours ago
Uber CEO says rideshare 'freed up' his son from having to get a driver’s license—and he's one of many Gen Zers who aren’t willing to drive
Lifestyle
Uber CEO says rideshare 'freed up' his son from having to get a driver’s license—and he's one of many Gen Zers who aren’t willing to drive
By Sasha RogelbergMay 24, 2026
2 days ago

© 2026 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.