Unbabel says its new AI model has dethroned OpenAI’s GPT-4 as the tech industry’s best language translator

By Jeremy Kahn, Editor, AI
June 6, 2024, 7:00 AM ET
Unbabel cofounders Vasco Pedro, CEO, and João Graça, chief technology officer (right). Courtesy of Unbabel

Unbabel, a tech company that provides both machine and human-based translation services for businesses, has created a new AI model that it says beats OpenAI’s GPT-4o and other commercially available AI systems on translation between English and six commonly spoken European and Asian languages.

Translation has been one of the more attractive business use cases for large language models (LLMs), the kind of AI systems that underpin chatbots like OpenAI’s ChatGPT, Google’s Gemini, and Anthropic’s Claude. And to date, GPT-4o, the latest version of OpenAI’s most powerful AI model, has outperformed all competitors when it comes to translating languages for which large amounts of digital text exist. (GPT-4’s performance on “low resource languages,” which have far fewer digital documents to train from, has never been as good.)

Unbabel tested its AI model, which it calls TowerLLM, against GPT-4o and the original GPT-4, as well as OpenAI’s GPT-3.5 and competing models from Google and the language translation company DeepL. It looked at translation from English to Spanish, French, German, Portuguese, Italian, and Korean. In almost every case, TowerLLM narrowly edged out GPT-4o and GPT-4. TowerLLM’s highest accuracy came in English-Korean translations, where it beat OpenAI’s best models by about 1.5%. On English-German translations, GPT-4 and GPT-4o were a fraction of a percentage point better.

Unbabel also tested its model on translations of documents for specific professional domains, such as finance, medicine, law, and technical writing. Here again, TowerLLM performed between 1% and 2% better than OpenAI’s best models.

Unbabel’s results have not been independently verified. But if confirmed, the fact that GPT-4 has been bested at translation may indicate that the model is now vulnerable to newer AI systems trained with different methods. GPT-4 has remained the top-performing LLM on most language benchmarks despite having debuted 15 months ago, an eternity in the fast-paced world of AI development. OpenAI is reportedly training a more powerful LLM, although its release date remains uncertain.

Unbabel, which has headquarters in both San Francisco and Lisbon, said TowerLLM was trained to be multilingual on a large public dataset of multilingual text. This means the model also performs better on reasoning tasks in multiple languages than some competing open-source AI models of a similar size created by companies such as Meta and French AI startup Mistral.

TowerLLM was then fine-tuned with a carefully curated dataset of high-quality translations between language pairs. To help curate this fine-tuning dataset, Unbabel used COMETKiwi, another AI model that it had trained to assess translation quality.
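For readers who want a concrete picture of quality-based curation, here is a minimal Python sketch. It is not Unbabel’s pipeline, whose details the company has not described here. The function `curate_pairs`, the 0.8 default threshold, and the `toy_score` stand-in are all hypothetical; in practice the scoring function would be a learned quality-estimation model rather than a length comparison.

```python
from typing import Callable, Iterable, List, Tuple

# A candidate training example: (source sentence, candidate translation).
Pair = Tuple[str, str]


def curate_pairs(
    pairs: Iterable[Pair],
    score: Callable[[str, str], float],
    threshold: float = 0.8,  # hypothetical cutoff, chosen for illustration
) -> List[Pair]:
    """Keep only the pairs whose estimated quality meets the threshold."""
    return [(src, tgt) for src, tgt in pairs if score(src, tgt) >= threshold]


def toy_score(src: str, tgt: str) -> float:
    """Hypothetical stand-in for a quality-estimation model.

    Returns the ratio of the shorter text's length to the longer one's,
    so badly truncated translations receive a low score.
    """
    if not src or not tgt:
        return 0.0
    return min(len(src), len(tgt)) / max(len(src), len(tgt))


candidates = [
    ("Good morning", "Buenos días"),  # plausible translation
    ("Good morning", "x"),            # obviously broken output
]
kept = curate_pairs(candidates, toy_score, threshold=0.5)
```

With the toy scorer, only the first pair survives the filter. Passing the scorer in as a parameter keeps the filtering logic independent of whichever quality model is used.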

João Graça, Unbabel’s chief technology officer, told Fortune that most other LLMs have a higher proportion of English-language text in their initial training set and only pick up the ability to translate coincidentally. But TowerLLM was trained on a dataset that was specifically designed to include a large amount of multilingual text. He also said that fine-tuning on the smaller, curated dataset of high-quality translations was key to the resulting model’s superior performance.

It is one of several recent examples in which smaller AI models have equaled or exceeded the performance of much larger ones when trained on better quality datasets. For instance, Microsoft created a small language model called Phi 3, with just 3.8 billion parameters (the tunable variables in the model), that outperforms models more than double its size. Microsoft did so by training it on what the company called a “textbook-quality” dataset. “The insight from Phi is that people should focus on the quality of the data,” Graça said. He noted that all AI companies are now using the same basic algorithmic design with some subtle variations. What differentiates the models is data. “It’s all about the data and the training curriculum, which is how you give the data to the model,” he said.

TowerLLM is currently available in two sizes, one with 7 billion parameters and one with 13 billion. An earlier version of the model, which debuted in January, came close to GPT-4’s performance, but didn’t quite exceed it. That model also only worked for 10 language pairs. The new model edges past GPT-4 and supports 18 language pairs.

The model has only been tested against GPT-4o and GPT-4 for translation, meaning that OpenAI’s models may still have an advantage at other tasks such as reasoning, coding, writing, and summarization.

Graça said that Unbabel plans to expand the number of languages TowerLLM supports, adding 10 more soon. The model is also being fine-tuned to work on the very specific translation tasks that businesses often care most about, such as translating complex legal documents or patent and copyright information. It has been trained to get better at “transcreation,” the skill of translating a piece of content not word for word, but so that it captures very subtle cultural nuances, such as using colloquial expressions or slang that a native from a certain generation would use, Graça said.

About the Author

Jeremy Kahn is the AI editor at Fortune, spearheading the publication's coverage of artificial intelligence. He also co-authors Eye on AI, Fortune’s flagship AI newsletter.
