• Home
  • News
  • Fortune 500
  • Tech
  • Finance
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia
Commentary

Facebook, WhatsApp, AWS: How to prevent the software outages that threaten the services we rely on

By
Courtney Nash
Courtney Nash
Down Arrow Button Icon
By
Courtney Nash
Courtney Nash
Down Arrow Button Icon
December 16, 2021, 1:10 PM ET
A New York State trooper stands guard in front of the reconstructed wreckage of TWA flight 800 in Calverton, on Long Island, N.Y. The crash was one of the high-profile accidents that compelled the airline industry to share data in the 1990s.
A New York State trooper stands guard in front of the reconstructed wreckage of TWA flight 800 in Calverton, on Long Island, N.Y. The crash was one of the high-profile accidents that compelled the airline industry to share data in the 1990s.JON LEVY—Getty Images

The internet stood still when Facebook, Instagram, and WhatsApp or Amazon Web Services went down for hours. That may have seemed like a minor inconvenience to American users of those services. Elsewhere, it crippled essential communication for billions of people across the world.

Commercial software is the engine that powers many parts of our lives. Recent outages show that this engine is teetering on the brink of dangerous outcomes when it shuts down or malfunctions.

With a user base of around two billion, WhatsApp’s disappearance meant a loss of messaging between family members and businesses around the world and broken in-app payments for countries that have effectively come to rely exclusively on the messaging service.

The ease of use and overall low costs of these consumer software services have led governments, schools, NGOs, non-profits, and many more organizations to weave them into their daily operations. This adoption further drives up the engagement and user base metrics of these services, so they are actively encouraging these types of activities. In July this year, WhatsApp promoted an event to support 100 NGOs in India (where it has some 400 million users) to enable their missions to “provide vulnerable, disempowered, and minority communities with easy and secure access to vital information and support.”

Even as more core services weave these products into daily operations, software outages aren’t going to magically stop. What companies are building now far exceeds what any single person, team, or organization can mentally model regarding how these systems are built, much less how they function under unanticipated pressure or otherwise unexpected conditions. On top of that, user and organizational demands are only accelerating the pace and complexity of software development.

Facebook (now Meta) was surprisingly transparent in their post-incident report of what happened. However, most organizations (including Facebook) rarely publish detailed, open analyses of their outages. The desire to save face and move on quickly is driven in large part by competitive, legal, and financial concerns, paired with a desire to avoid bad publicity. That shouldn’t stop companies from doing it. The technology industry has an immense body of commoditized, siloed knowledge that we can share to learn from each other and push software safety forward.

The airline industry’s approach

In the late 1990s, the U.S. airline industry had a treacherous safety record. Accidents and fatalities were at an all-time high, and carriers were faced with the challenges that come with scale. If their accident rates remained the same while global air traffic continued to grow at projected rates, there would be on average one major jet crash per week by 2015.

They avoided this outcome because a group of airline executives, unionized pilots, and federal regulators banded together to voluntarily share data about aviation incidents and accidents across carriers—despite their own legal, competitive, and financial concerns. Eventually, mechanics, air traffic controllers, and the Federal Aviation Administration (FAA) came on board as well. The FAA noted that the “key to this approach is a longstanding commitment to sharing data through an open and collaborative safety culture to detect risks and address problems before accidents occur.”

It’s time for the technology industry to follow suit. The good news? We don’t have to start from scratch. In 2019, Nora Jones founded the Learning From Incidents (LFI) community. Comprising several hundred software practitioners (with experience ranging from behemoths like Facebook, Amazon, Netflix to smaller-scale organizations and academic institutions), technology leaders, and researchers, LFI stands to reshape how the software industry thinks about incidents, software reliability, and the critical role people play in keeping these systems running.

Along with investing in incident analysis and publishing what they’ve learned, organizations can share their incident reports in the VOID, a database of public incident reports that arose from the need to have a place where we share and learn from these incidents.

These outages have revealed the scale and power software-based services now have in our lives, and without a concerted effort to share information as an industry, they’re only going to get worse, not better.

Courtney Nash is a senior research analyst at Verica and the founder of the open incident reporting platform “The Void”.

More must-read commentary published by Fortune:

  • “Homers” are creating the soulless downtown
  • The next recession: Here’s when the “everything bubble” will burst
  • Why Indian-born leaders dominate American tech’s top ranks
  • To deliver the infrastructure boom, construction giants must open their doors to startups
  • Southern Africa: Last in line for vaccines, first in line for travel bans

Never miss a story: Follow your favorite topics and authors to get a personalized email with the journalism that matters most to you.

About the Author
By Courtney Nash
See full bioRight Arrow Button Icon

Latest in Commentary

Amit Walia
CommentaryM&A
Why the timing was right for Salesforce’s $8 billion acquisition of Informatica — and for the opportunities ahead
By Amit WaliaDecember 6, 2025
14 hours ago
Steve Milton is the CEO of Chain, a culinary-led pop-culture experience company founded by B.J. Novak and backed by Studio Ramsay Global.
CommentaryFood and drink
Affordability isn’t enough. Fast-casual restaurants need a fandom-first approach
By Steve MiltonDecember 5, 2025
2 days ago
Paul Atkins
CommentaryCorporate Governance
Turning public companies into private companies: the SEC’s retreat from transparency and accountability
By Andrew BeharDecember 5, 2025
2 days ago
Matt Rogers
CommentaryInfrastructure
I built the first iPhone with Steve Jobs. The AI industry is at risk of repeating an early smartphone mistake
By Matt RogersDecember 4, 2025
3 days ago
Jerome Powell
CommentaryFederal Reserve
Fed officials like the mystique of being seen as financial technocrats, but it’s time to demystify the central bank
By Alexander William SalterDecember 4, 2025
3 days ago
Rakesh Kumar
CommentarySemiconductors
China does not need Nvidia chips in the AI war — export controls only pushed it to build its own AI machine
By Rakesh KumarDecember 3, 2025
4 days ago

Most Popular

placeholder alt text
Big Tech
Mark Zuckerberg rebranded Facebook for the metaverse. Four years and $70 billion in losses later, he’s moving on
By Eva RoytburgDecember 5, 2025
1 day ago
placeholder alt text
AI
Nvidia CEO says data centers take about 3 years to construct in the U.S., while in China 'they can build a hospital in a weekend'
By Nino PaoliDecember 6, 2025
10 hours ago
placeholder alt text
Success
Nvidia CEO Jensen Huang admits he works 7 days a week, including holidays, in a constant 'state of anxiety' out of fear of going bankrupt
By Jessica CoacciDecember 4, 2025
2 days ago
placeholder alt text
Economy
Two months into the new fiscal year and the U.S. government is already spending more than $10 billion a week servicing national debt
By Eleanor PringleDecember 4, 2025
3 days ago
placeholder alt text
Success
‘Godfather of AI’ says Bill Gates and Elon Musk are right about the future of work—but he predicts mass unemployment is on its way
By Preston ForeDecember 4, 2025
2 days ago
placeholder alt text
Real Estate
The 'Great Housing Reset' is coming: Income growth will outpace home-price growth in 2026, Redfin forecasts
By Nino PaoliDecember 6, 2025
15 hours ago
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • Future 50
  • World’s Most Admired Companies
  • See All Rankings
Sections
  • Finance
  • Leadership
  • Success
  • Tech
  • Asia
  • Europe
  • Environment
  • Fortune Crypto
  • Health
  • Retail
  • Lifestyle
  • Politics
  • Newsletters
  • Magazine
  • Features
  • Commentary
  • Mpw
  • CEO Initiative
  • Conferences
  • Personal Finance
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
About Us
  • About Us
  • Editorial Calendar
  • Press Center
  • Work At Fortune
  • Diversity And Inclusion
  • Terms And Conditions
  • Site Map

© 2025 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.