• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia
NewslettersEye on AI

The hottest thing in AI is something called RAG

Sage Lazzaro
By
Sage Lazzaro
Sage Lazzaro
Contributing writer
Down Arrow Button Icon
Sage Lazzaro
By
Sage Lazzaro
Sage Lazzaro
Contributing writer
Down Arrow Button Icon
October 27, 2023, 11:13 AM ET
Retrieval-augmented generation, or RAG, won't clean your screen but it will help your AI model.
Retrieval-augmented generation, or RAG, won't clean your screen but it will help your AI model.Robert Michael/dpa-Zentralbild/ZB via Getty Images

Hello and welcome to the October special edition of Eye on A.I.

Recommended Video

If you’ve been following AI chatter online or stopped by a recent AI conference, chances are you probably heard everyone talking about RAG. 

RAG, or retrieval-augmented generation, is an emerging technique for giving an existing AI model new information in order to have it perform a specific task. Whereas fine-tuning involves actually adapting an existing model using new data, resulting in a derivative model, RAG is simply giving a model a set of information it was never trained on and asking it to consider that information in its response. With RAG, you’re not changing the model, just asking it to temporarily reference external data in order to complete your specific request. It “forgets” the information right away. 

“In the simplest version, I retrieve from my internal database some documents or some text that I want, and I send it to the model along with my ask: ‘Answer this question’ or ‘Summarize this.’ The model does it. The model has not changed. I go do another RAG with a new ask and it keeps going,” Sriram Raghavan, vice president of IBM Research AI, told Eye on AI. “The reason it’s very popular is because it’s simple.”

This simpler approach to working with existing LLMs is only possible because of the sheer power of the latest models. Raghavan estimates this technique started rising around six to nine months ago, trailing the release of ChatGPT by a quarter or two. It’s incredibly useful for anyone who wants to build an application using an existing large language model, and it’s easy to see why it’s catching on. 

Fine-tuning and even prompt engineering, in which users structure and refine text prompts to elicit specific responses from LLMs, are time-consuming and add a lot of complexity. And while fine-tuning typically requires you to provide the model with hundreds or thousands of examples, RAG calls for just a document or two, maybe a few dozen examples. Of course, many more complex tasks will continue to require the more intensive process of fine-tuning, but RAG provides an easy way to supercharge a leading LLM to perform a wide variety of tasks with more recent data, domain-specific data, and even proprietary data. 

“I want to leverage the fact that the model is good at language, knows how to work with it,” Raghavan said. “I want to leverage that, but I want to do it on my data.”

But RAG isn’t a silver bullet. As models get better and better, retrieval is the hard part, according to Raghavan. You need to give the model the correct input, which might mean scouring a large set of documents to narrow it down to the most relevant ones, breaking up documents, or because LLMs can only read text, reformatting more complex PDF documents that contain tables, charts, and graphs.

IBM is currently working on offerings targeted specifically at helping application developers use RAG. For example, they’re creating patterns and “cookbooks” that offer recipes for how to do RAG according to what type of application you want to develop. Microsoft, Google, and Amazon also have their own RAG solutions. Companies want to be able to use the power of LLMs on their own data, which means RAG has a significant role to play in the enterprise in particular.

And with that, here’s more AI news.

Sage Lazzaro
sage.lazzaro@consultant.fortune.com
sagelazzaro.com

A.I. IN THE NEWS

UN Secretary-General launches a High-Level Advisory Body on AI. The body will consider how it can link various AI governance initiatives that are already underway and “it will work fast, because we are against the clock,” said UN Secretary-General António Guterres in his remarks, adding that it will make preliminary recommendations by the end of the year. He emphasized that the body is gender-balanced, geographically diverse, spans generations, and that its members bring a wide range of perspectives with “deep experience across government, businesses, the technology community, civil society, and academia.” The idea of creating such a body has increasingly become central to the discourse around how to handle the technology’s rapidly increasing impacts on society, with some AI leaders like OpenAI CEO Sam Altman being major proponents. 

President Joe Biden is expected to announce an AI executive order on Monday. That’s according to the Washington Post. The order is expected to require "advanced AI models to undergo assessments before they can be used by federal workers" and also make it easier for highly skilled tech workers to immigrate to the U.S. 

GM's Cruise grounds its entire fleet of self-driving cars. Days after California's DMV suspended Cruise's license to test driverless cars in the state and to operate its 24/7 robo-taxi service in San Francisco, Cruise said it will pause driverless operations of its vehicles everywhere. The GM-owned company, which has also been testing its vehicles in cities including Austin, Phoenix, and Miami, said it was taking the pause in order to help restore public trust. California's DMV had cited safety concerns for suspending Cruise's permit after several high-profile collisions.

14 AI inventions make Time’s list of the best 200 inventions of 2023. The list honors several AI models including Open AI’s GPT-4 and DALL-E 3, Meta’s SeamlessM4T, and Stability AI’s Stable Audio. The honorees also include AI systems developed for environmental protection including the AlertCalifornia and Cal Fire AI Wildfire Detector system out of the University of San Diego, as well as TrailGuard AI, which is designed to help monitor endangered species and spot poachers.

About the Author
Sage Lazzaro
By Sage LazzaroContributing writer

Sage Lazzaro is a technology writer and editor focused on artificial intelligence, data, cloud, digital culture, and technology’s impact on our society and culture.

See full bioRight Arrow Button Icon

Latest in Newsletters

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Fortune Secondary Logo
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • World's Most Admired Companies
  • See All Rankings
  • Lists Calendar
Sections
  • Finance
  • Fortune Crypto
  • Features
  • Leadership
  • Health
  • Commentary
  • Success
  • Retail
  • Mpw
  • Tech
  • Lifestyle
  • CEO Initiative
  • Asia
  • Politics
  • Conferences
  • Europe
  • Newsletters
  • Personal Finance
  • Environment
  • Magazine
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
  • Group Subscriptions
About Us
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • Facebook icon
  • Twitter icon
  • LinkedIn icon
  • Instagram icon
  • Pinterest icon

Latest in Newsletters

Meta's Hyperion data-center site in Northeastern Louisiana.
NewslettersEye on AI
Big Tech will spend nearly $700 billion on AI this year. No one knows where the buildout ends
By Sharon GoldmanApril 30, 2026
13 hours ago
The Tory Burch Foundation is almost halfway to its $1 billion goal for women entrepreneurs
NewslettersMPW Daily
The Tory Burch Foundation is almost halfway to its $1 billion goal for women entrepreneurs
By Emma HinchliffeApril 30, 2026
15 hours ago
The startup that wants to give surgeons X-ray vision
NewslettersTerm Sheet
The startup that wants to give surgeons X-ray vision
By Allie GarfinkleApril 30, 2026
19 hours ago
Google Cloud CEO Thomas Kurian at Fortune Brainstorm AI 2025 in San Francisco. (Photo: Stuart Isett/Fortune)
NewslettersFortune Tech
Google Cloud is almost one-fifth of Alphabet’s business
By Andrew NuscaApril 30, 2026
20 hours ago
The $665 billion question: Will Big Tech’s AI gamble pay off?
NewslettersCEO Daily
The $665 billion question: Will Big Tech’s AI gamble pay off?
By Diane BradyApril 30, 2026
22 hours ago
How JPMorgan’s CIO is reshaping work at the bank with a $19.8 billion annual tech and AI budget
NewslettersCIO Intelligence
How JPMorgan’s CIO is reshaping work at the bank with a $19.8 billion annual tech and AI budget
By John KellApril 29, 2026
2 days ago

Most Popular

Apple cofounder Ronald Wayne—whose stake would be worth up to $400 billion had he not sold it in 1976—says that at 91, he has no regrets
Success
Apple cofounder Ronald Wayne—whose stake would be worth up to $400 billion had he not sold it in 1976—says that at 91, he has no regrets
By Preston ForeApril 27, 2026
4 days ago
Google Cloud revenue is now 18% of Alphabet's business. Is this the beginning of the end of Google's search identity?
Big Tech
Google Cloud revenue is now 18% of Alphabet's business. Is this the beginning of the end of Google's search identity?
By Alexei OreskovicApril 29, 2026
1 day ago
China dominates the world's lithium supply. The U.S. just found 328 years' worth in its own backyard
North America
China dominates the world's lithium supply. The U.S. just found 328 years' worth in its own backyard
By Jake AngeloApril 30, 2026
13 hours ago
With no end in sight, Trump considers new options in Iran war—including the ‘Dark Eagle’ hypersonic missile
Big Tech
With no end in sight, Trump considers new options in Iran war—including the ‘Dark Eagle’ hypersonic missile
By Jim EdwardsApril 30, 2026
21 hours ago
Accenture's Julie Sweet blew up 50 years of company history. She says the hardest part is still ahead
Conferences
Accenture's Julie Sweet blew up 50 years of company history. She says the hardest part is still ahead
By Nick LichtenbergApril 29, 2026
2 days ago
‘The cost of compute is far beyond the costs of the employees’: Nvidia executive says right now AI is more expensive than paying human workers
AI
‘The cost of compute is far beyond the costs of the employees’: Nvidia executive says right now AI is more expensive than paying human workers
By Sasha RogelbergApril 28, 2026
3 days ago

© 2026 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.