• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia
TechAI

Unbabel says its new AI model has dethroned OpenAI’s GPT-4 as the tech industry’s best language translator

Jeremy Kahn
By
Jeremy Kahn
Jeremy Kahn
Editor, AI
Down Arrow Button Icon
Jeremy Kahn
By
Jeremy Kahn
Jeremy Kahn
Editor, AI
Down Arrow Button Icon
June 6, 2024, 7:00 AM ET
Unbabel cofounders Vasco Pedro, CEO, and João Graça, chief technology officer, pictured against a yellow backdrop.
Unbabel cofounders Vasco Pedro, CEO, and João Graça, chief technology officer (right). Courtesy of Unbabel

Unbabel, a tech company that provides both machine and human-based translation services for businesses, has created a new AI model that it says beats OpenAI’s GPT-4o and other commercially available AI systems on translation between English and six commonly spoken European and Asian languages.

Translation has been one of the more attractive business use cases for large language models (LLMs), the kind of AI systems that underpin chatbots like OpenAI’s ChatGPT, Google’s Gemini, and Anthropic’s Claude. And to date, GPT-4o, the latest version of OpenAI’s most powerful AI model, has outperformed all competitors when it comes to translating languages for which large amounts of digital text exists. (GPT-4’s performance on “low resource languages,” which have far fewer digital documents to train from, has never been as good.)

Unbabel tested its AI model, which it calls TowerLLM, against GPT-4o and the original GPT-4, as well as OpenAI’s GPT-3.5 and competing models from Google and the language translation company DeepL. It looked at translation from English to Spanish, French, German, Portuguese, Italian, and Korean. In almost every case, TowerLLM narrowly edged out GPT-4o and GPT-4. TowerLLM’s highest accuracy came in English-Korean translations, where it beat OpenAI’s best models by about 1.5%. On English-German translations, GPT-4 and GPT-4o were a fraction of a percentage point better.

Recommended Video

Unbabel also tested its model on translations of documents for specific professional domains, such as finance, medicine, law, and technical writing. Here again, TowerLLM performed between 1% and 2% better than OpenAI’s best models.

Unbabel’s results have not been independently verified, but if confirmed, the fact that GPT-4 has now been bested at translation may indicate that the model, which has remained the top-performing LLM on most language benchmarks despite having debuted 15 months ago—an eternity in the fast-paced world of AI development—may now be vulnerable to newer AI systems being trained with different methods. OpenAI is reportedly training a more powerful LLM—although its release date remains uncertain.

Unbabel, which has headquarters in both San Francisco and Lisbon, said TowerLLM was trained to be multilingual on a large public dataset of multilingual text. This means the model also performs better on reasoning tasks in multiple languages than some competing open-source AI models of a similar size created by companies such as Meta and French AI startup Mistral.

TowerLLM was then fine-tuned with a carefully curated dataset of high-quality translations between language pairs. Unbabel was able to use another AI model that it had trained to assess translation quality—which is called COMETKiwi—to help curate this fine-tuning dataset.

João Graça, Unbabel’s chief technology officer, told Fortune that most other LLMs have a higher proportion of English-language text in their initial training set and only pick up the ability to translate coincidentally. But TowerLLM was trained on a dataset that was specifically designed to include a large amount of multilingual text. He also said that fine-tuning on the smaller, curated dataset of high-quality translations was key to the resulting model’s superior performance.

It was one several recent examples in which smaller AI models have equaled or exceeded the performance of much larger ones when trained on better quality datasets. For instance, Microsoft created a small language model called Phi 3, with just 3.8 billion parameters (the tunable variables in the model), that outperforms models more than double that size by creating what Microsoft called a “textbook-quality” dataset. “The insight from Phi is that people should focus on the quality of the data,” Graça said. He noted that all AI companies are now using the same basic algorithmic design with some subtle variations. What differentiates the models is data. “It’s all about the data and the training curriculum, which is how you give the data to the model,” he said.

TowerLLM is currently available in two sizes, one with 7 billion parameters and one with 13 billion. An earlier version of the model, which debuted in January, came close to GPT-4’s performance, but didn’t quite exceed it. That model also only worked for 10 language pairs. The new model edges past GPT-4 and supports 18 language pairs.

The model has only been tested against GPT-4o for translation, meaning that GPT-4 may still have an advantage at other tasks such as reasoning, coding, writing, and summarization.

Graça said that Unbabel plans to expand the number of languages TowerLLM supports, adding 10 additional ones soon. The model is also being fine-tuned to work on very specific translation tasks that businesses often care most about—such as translating complex legal documents or patent and copyright information. It has been trained to get better at “transcreation,” the skill of translating a piece of content not word for word, but so that it captures very subtle cultural nuances, such as using colloquial expressions or slang that a native from a certain generation would use, Graça said.

Join us at the Fortune Workplace Innovation Summit May 19–20, 2026, in Atlanta. The next era of workplace innovation is here—and the old playbook is being rewritten. At this exclusive, high-energy event, the world’s most innovative leaders will convene to explore how AI, humanity, and strategy converge to redefine, again, the future of work. Register now.
About the Author
Jeremy Kahn
By Jeremy KahnEditor, AI
LinkedIn iconTwitter icon

Jeremy Kahn is the AI editor at Fortune, spearheading the publication's coverage of artificial intelligence. He also co-authors Eye on AI, Fortune’s flagship AI newsletter.

See full bioRight Arrow Button Icon

Latest in Tech

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Fortune Secondary Logo
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • World's Most Admired Companies
  • See All Rankings
  • Lists Calendar
Sections
  • Finance
  • Fortune Crypto
  • Features
  • Leadership
  • Health
  • Commentary
  • Success
  • Retail
  • Mpw
  • Tech
  • Lifestyle
  • CEO Initiative
  • Asia
  • Politics
  • Conferences
  • Europe
  • Newsletters
  • Personal Finance
  • Environment
  • Magazine
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
  • Group Subscriptions
About Us
  • About Us
  • Lists Calendar
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • About Us
  • Lists Calendar
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • Facebook icon
  • Twitter icon
  • LinkedIn icon
  • Instagram icon
  • Pinterest icon

Latest in Tech

sony
InnovationRobots
Meet ‘Ace,’ the paddle-wielding robot who just beat humans at ping pong in AI breakthrough
By Matt O'Brien and The Associated PressApril 22, 2026
12 hours ago
Cursor’s 25-year-old CEO is a former Google intern who just inked a $60 billion deal with SpaceX
AITech
Cursor’s 25-year-old CEO is a former Google intern who just inked a $60 billion deal with SpaceX
By Marco Quiroz-GutierrezApril 22, 2026
12 hours ago
David’s Bridal exec has a warning for every CEO obsessed with AI’s return-on-investment
Retailinvestments
David’s Bridal exec has a warning for every CEO obsessed with AI’s return-on-investment
By Alex Vuocolo and Retail BrewApril 22, 2026
13 hours ago
frank
CommentaryVisa
Visa CMO: AI agents are your new customers — here’s how to sell to them
By Frank Cooper IIIApril 22, 2026
13 hours ago
President Donald Trump
AITariffs
The AI boom is single-handedly carrying the U.S. import market—and adding $200 billion to the trade deficit, Fed study finds
By Tristan BoveApril 22, 2026
15 hours ago
shlomit
Commentarycyber
The Mythos meeting focused on the wrong AI risk to banks. Here’s the one nobody is talking about
By Shlomit WagmanApril 22, 2026
15 hours ago

Most Popular

‘Something sinister’: What we know about the FBI probe into dead and missing scientists linked to space and military industries
Economy
‘Something sinister’: What we know about the FBI probe into dead and missing scientists linked to space and military industries
By Jim EdwardsApril 22, 2026
22 hours ago
The tables have turned: Florida and Texas are the biggest losers in the housing market as Ohio emerges a surprise winner
Real Estate
The tables have turned: Florida and Texas are the biggest losers in the housing market as Ohio emerges a surprise winner
By Sydney LakeApril 21, 2026
2 days ago
'Something sinister could be happening': FBI looks into dead or missing nuclear and space defense scientists tied to NASA, Blue Origin, and SpaceX
Politics
'Something sinister could be happening': FBI looks into dead or missing nuclear and space defense scientists tied to NASA, Blue Origin, and SpaceX
By Catherina GioinoApril 21, 2026
2 days ago
John Ternus, the man stepping into Tim Cook and Steve Jobs' shoes, is a 25-year Apple veteran with zero LinkedIn posts
C-Suite
John Ternus, the man stepping into Tim Cook and Steve Jobs' shoes, is a 25-year Apple veteran with zero LinkedIn posts
By Kelvin Chan and The Associated PressApril 21, 2026
2 days ago
Palantir published a mini manifesto calling some cultures ‘harmful’ and ‘middling’ and said Silicon Valley has ‘a moral debt’ to the U.S.
AI
Palantir published a mini manifesto calling some cultures ‘harmful’ and ‘middling’ and said Silicon Valley has ‘a moral debt’ to the U.S.
By Marco Quiroz-GutierrezApril 22, 2026
1 day ago
$166 billion in tariff refunds just became available, but small businesses may already be at a disadvantage
Law
$166 billion in tariff refunds just became available, but small businesses may already be at a disadvantage
By Sasha RogelbergApril 20, 2026
2 days ago

© 2026 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.