When Big Data goes bad

By Joshua Klein
November 5, 2013, 1:00 PM ET

FORTUNE — Big Data and the cloud are putting supercomputer capabilities into everyone’s hands. But what’s getting lost in the mix is that the tools we use to interpret and apply this tidal wave of information often have a fatal flaw. Much of the data analysis we do rests on erroneous models, meaning mistakes are inevitable. And when our outsized expectations exceed our capacity, the consequences can be dire.

This wouldn’t be such a problem if Big Data weren’t so very, very big. But the amount of data that we have access to is enabling us to use even flawed models to produce what are often useful results. The trouble is that we frequently mistake those results for omniscience. We’re falling in love with our own technology, and when the models fail it can be pretty ugly, especially when the mistakes all that data produces are concomitantly large.

Part of the issue is oversimplification of the models computer programs are based on, rather than actual errors in their programming. For example, in early April 2011, Peter Lawrence’s The Making of a Fly, a classic work in developmental biology that many biologists consult regularly, was listed on Amazon.com as having 17 copies for sale: 15 used from $35.54, and two new from $23,698,655.93 (plus $3.99 shipping).


The book, last published in 1992, is now out of print, but that doesn’t quite explain the multimillion-dollar price tag. What had happened was that two automated programs, one run by seller “bordeebook” and one by seller “profnath,” were locked in an iterative, escalating pricing loop. Once a day profnath would set its price to 0.9983 times bordeebook’s listed price. Several hours later, bordeebook would increase its price to 1.270589 times profnath’s latest amount.
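The feedback loop described above can be sketched in a few lines. This is only an illustration, not either seller's actual code: the two multipliers come from the account above, while the function names, the once-a-day ordering, and the starting price used in the usage note are assumptions made for the example.

```python
from decimal import Decimal

# Multipliers as reported above; everything else is a hypothetical illustration.
PROFNATH_FACTOR = Decimal("0.9983")      # profnath prices just under bordeebook
BORDEEBOOK_FACTOR = Decimal("1.270589")  # bordeebook prices well above profnath


def simulate_price_loop(start_price, days):
    """Return one (profnath_price, bordeebook_price) pair per simulated day."""
    bordeebook = Decimal(start_price)
    history = []
    for _ in range(days):
        # Each seller reacts only to the other's latest listed price.
        profnath = bordeebook * PROFNATH_FACTOR
        bordeebook = profnath * BORDEEBOOK_FACTOR
        history.append((profnath, bordeebook))
    return history


def days_until(threshold, start_price):
    """Count the simulated days until bordeebook's price reaches the threshold."""
    bordeebook = Decimal(start_price)
    if bordeebook <= 0:
        raise ValueError("start_price must be positive")
    limit = Decimal(threshold)
    days = 0
    while bordeebook < limit:
        bordeebook = bordeebook * PROFNATH_FACTOR * BORDEEBOOK_FACTOR
        days += 1
    return days
```

Calling `simulate_price_loop("100", 10)` with an arbitrary starting price shows both prices climbing every day. Neither rule is wrong on its own; the runaway growth comes from the product of the two multipliers being greater than 1 (roughly 1.27), so each round trip lifts both prices by about 27 percent.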

It’s a classic example of how unanticipated factors can foil even the best-prepared computer models, and it’s not an isolated incident.

Does this sound anything like the subprime mortgage crisis? Before 2008, the best minds with the best technology running the most advanced hypothetical scenarios completely missed the looming crisis and then failed to understand its severity. The more broadly a model is scoped, the more possibilities for error it includes. It sounds obvious, but we often miss the fact that those models are not, and will never be, as accurate as reality itself.

Here’s another example. One T-shirt seller on Amazon.co.uk put up a shirt for sale emblazoned with the statement, “Keep Calm and Rape a Lot.” One might wonder who thought such a shirt would be a good idea. But Solid Gold Bomb, the company that made the shirt, wasn’t necessarily aware that it was even selling it. The company apologized publicly and copiously, but in its defense the only mistake it made was a small coding error. That’s because the shirt wasn’t designed by anyone. Nor were the shirts even necessarily ever printed. Solid Gold Bomb’s business isn’t in artfully designing T-shirts. Instead, it writes code that takes libraries of words that slot into popular phrases (such as “Keep Calm and Carry On,” which enjoyed a brief memetic popularity online) to make derivations that get dropped onto a template of a T-shirt and automatically get posted as an Amazon item for sale. Its mistake was overlooking a single word in a list of 4,000 or so others (the company was lucky no other offensive words or phrases made it onto the site). The problem was context.
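To make the mechanism concrete, here is a minimal sketch of template-driven slogan generation with a review step bolted on. It is not Solid Gold Bomb's code, which was never published in this account: the template string, the `generate_slogans` function, the sample word lists, and the blocklist are all hypothetical.

```python
# Hypothetical template; the real system's templates are not known.
TEMPLATE = "Keep Calm and {verb} {tail}"


def generate_slogans(verbs, tails, blocklist):
    """Combine every verb with every tail, holding back blocklisted results.

    Returns (approved, rejected) lists of slogans.
    """
    blocked = {word.lower() for word in blocklist}
    approved, rejected = [], []
    for verb in verbs:
        for tail in tails:
            slogan = TEMPLATE.format(verb=verb.title(), tail=tail.title())
            # Compare case-insensitively, one word at a time.
            words = {word.lower() for word in slogan.split()}
            if words & blocked:
                rejected.append(slogan)
            else:
                approved.append(slogan)
    return approved, rejected


# Placeholder word lists, chosen only for illustration.
approved, rejected = generate_slogans(
    verbs=["carry", "hit"], tails=["on"], blocklist=["hit"]
)
```

With a few thousand verbs the loop produces thousands of listings that no person ever reads, which is how one unreviewed word can slip through. A word filter like this one only catches terms someone thought to list in advance; it has no notion of context, which is the limitation at issue here.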


Again, a simple model, with serious social consequences. The program that made the Solid Gold Bomb T-shirt isn’t aware of how its intended audience perceives the concept of rape, let alone how the business process that rendered the T-shirt works. And yet that context turned a one-word oversight into a massively damaging event.

In both these instances, a failure to anticipate how the program would interact with other programs, or the broader context in which it would operate, caused significant harm. Those are just two ways in which a model on which code is based can be flawed.

Big Data still has big issues. For example, the information we’re gathering is often not being properly normalized (put into a format where all data is apples-to-apples), the models we’re making are rarely peer-tested or reviewed (witness the problems with the ranking tool Klout as a standard for social media influence), and, most crucially, the information itself is usually siloed inside large corporations instead of being democratically available and verifiable.

Which isn’t to say our technology is doomed. Most of the applications we use every day work tremendously well, and in some cases really do produce amazing capabilities that improve our lives in countless ways. But it behooves us to examine the models that underpin them. Because someday, somehow, they will fail.

Joshua Klein is a hacker, consultant, television host, and author of Reputation Economics: Why Who You Know is Worth More than What You Have (Palgrave Macmillan), from which this essay is adapted.
