
Google does Hadoop and Spark as a service, with Cloud Dataproc

By
Derrick Harris
September 23, 2015, 12:01 PM ET
Photograph by Justin Sullivan — Getty Images

There comes a time when even the most ardent open source users might not want to manage their own servers and patch their own software. So goes the thinking behind Cloud Dataproc, Google’s (GOOG) new managed big-data service for running Hadoop and Spark as a service on Google’s cloud computing platform.

Hadoop and Spark are popular open source technologies for processing large amounts of data, but they are notoriously difficult to operate, especially in large deployments. Commercial technology vendors such as Cloudera and Hortonworks (HDP) are trying to solve this problem for users running these technologies in data centers, but the easiest option, for those willing to give up some control over their servers, is just to have a cloud provider take care of it for them.

Other cloud providers, including Amazon Web Services (AMZN) and Microsoft (MSFT), already offer managed services for Hadoop and Spark, so Google is not exactly blazing any new trails here. Where Google says it is doing something different is on cost: Cloud Dataproc costs just 1 cent per CPU per hour (billed by the minute), and can cost between 50% and 70% less than comparable services depending on how much customers use it, Google Cloud Product Manager Greg DeMichillie told Fortune.
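
To make the quoted rate concrete, here is a minimal Python sketch of the arithmetic, assuming only what the paragraph above states: 1 cent per CPU per hour, metered in whole minutes. The function name `dataproc_fee_cents` and the sample cluster size are hypothetical, and the sketch does not model anything the article leaves unstated, such as minimum charges, rounding rules, or the cost of the underlying virtual machines.

```python
from fractions import Fraction

# Rate quoted in the article: 1 cent per CPU per hour.
CENTS_PER_CPU_HOUR = Fraction(1)


def dataproc_fee_cents(cpus: int, minutes: int) -> Fraction:
    """Return the fee in cents for `cpus` CPUs running for `minutes` whole minutes.

    Hypothetical helper: it applies only the per-minute proration of the
    quoted hourly rate and nothing else.
    """
    if cpus < 0 or minutes < 0:
        raise ValueError("cpus and minutes must be non-negative")
    # Per-minute billing means an hour's rate is prorated by minutes / 60.
    return CENTS_PER_CPU_HOUR * cpus * Fraction(minutes, 60)


if __name__ == "__main__":
    # Illustrative cluster: 16 CPUs running for 45 minutes.
    fee = dataproc_fee_cents(cpus=16, minutes=45)
    print(f"{float(fee):.2f} cents")  # 12.00 cents
```

Under these assumptions a 16-CPU cluster that runs for 45 minutes is charged for three quarters of an hour, or 12 cents, rather than a full hour's 16 cents.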

Google is also touting the integration of Cloud Dataproc with the company’s other cloud computing services for big data—including BigQuery, Cloud Storage and Cloud Bigtable (a database technology)—and the ability to work with Dataproc using standard interfaces. DeMichillie said Dataproc clusters take an average of about 90 seconds to come online, compared with at least several minutes if you’re deploying them on local servers, or even running open source Hadoop or Spark on cloud-provider virtual machines. Minutes—whether it’s 2 or 30—can make a big difference if you need those resources now, or if you’re being billed while machines are still spinning up.

Really, though, Google created Dataproc because customers wanted it and Google had a void in its cloud platform by not having it. Big data workloads are becoming more important with each passing day, especially as trends such as the Internet of Things provide a tangible, viable use case for years’ worth of talk about data analysis. If you’re a cloud provider and the only options you give users are to manage open source software on their own or to use your proprietary big data technology (Cloud Dataflow, in Google’s case), customers might start looking elsewhere.

Google might truly believe its proprietary Cloud Dataflow is the best way to manage and run data jobs (much like Microsoft might really believe the Prajna technology it’s building is superior), “but you do have to make some adjustments,” DeMichillie acknowledged.

“The thing we learned across all these things is there’s no one-size fits all,” he said. “… We definitely did talk to customers who were telling us [they would] really rather have a fully managed service [for open source technologies].”

