• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia

Trendingnow

1

Bolt CEO says he let go of his entire HR team for creating problems that didn’t exist: ‘Those problems disappeared when I let them go’ 

2

The Bezos family just donated $100 million to help achieve one of Mayor Zohran Mamdani’s top campaign promises

3

Current price of oil as of May 19, 2026

1

Bolt CEO says he let go of his entire HR team for creating problems that didn’t exist: ‘Those problems disappeared when I let them go’ 

2

The Bezos family just donated $100 million to help achieve one of Mayor Zohran Mamdani’s top campaign promises

3

Current price of oil as of May 19, 2026
TechOkCupid

Researchers Caused an Uproar By Publishing Data From 70,000 OkCupid Users

Robert Hackett
By
Robert Hackett
Robert Hackett
Down Arrow Button Icon
Robert Hackett
By
Robert Hackett
Robert Hackett
Down Arrow Button Icon
May 18, 2016, 3:41 PM ET
Screenshot of OkCupid website homepage

Earlier this month, Danish researchers published data from the online profiles of nearly 70,000 OkCupid users—including usernames, political leanings, drug usage, and intimate sexual details—creating a privacy firestorm.

The researchers, Emil Kirkegaard and Julius Daugbjerg Bjerrekær, used data scraping software developed by a third contributor, Oliver Nordbjerg, to collect the information for a study that explored, among other things, the thinking of people on the site. They posted the database along with a draft paper on Open Science Framework, a site that encourages open source science research and collaboration.

Unlike recent incidents at Ashley Madison, a site for people seeking extramarital affairs, as well as some adult networks that cater to people with fetishes, the OkCupid research did not involve a security breach. That didn’t stop the ensuing controversy.

“Some may object to the ethics of gathering and releasing this data,” the authors wrote in the draft paper, which has since been pulled. “However, all the data found in the dataset are or were already publicly available, so releasing this dataset merely presents it in a more useful form.”

Online commenters, OkCupid users, the site’s operators, and academics attacked (and, in some cases, threatened) the researchers for making user information public. Some questioned whether such data harvesting, bundling, and broadcasting is justifiable for academic research and whether it crosses ethical and legal lines.

Although the researchers did not release the real names and pictures of the OkCupid users, critics noted that their identities could easily be uncovered from the details provided—such as from the usernames. “Your private life is a few big leaks away from being an inescapable matter of public record, once a statistician with BitTorrent gets bored,” said Scott Weingart, a digital humanities specialist at Carnegie Mellon University, mused in a post on Twitter (TWTR). He added that it would be easy to identify more than 10,000 of the people in the data dump and link them to their sexual inclinations.

Kirkegaard said that his group posted people’s usernames because it found the data on these self-selected pseudonyms to be scientifically interesting. (What does use of the word “hot” in an alias say about its subject, for example?) He also argued that retaining the information in the dataset would allow certain missing details—like height, profile text, or photos—to be added later.

@esjewett No. Data is already public.

— Emil O W Kirkegaard (@KirkegaardEmil) May 11, 2016

The data, collected from November 2014 to March 2015, is indeed public—sort of. Some of it like bios, photos, age, gender, sexual orientation is easily accessible through basic Google (GOOG) searches. Answers to some 2,600 of the service’s most popular dating survey questions are restricted to people who are logged into the site and who have answered the same questions.

The site’s users can also set certain answers to “private,” which makes the responses inaccessible to others. In this case, the researchers scraped and presented the data accessible through Google and Q&A responses from individual profiles.

“We thought this was an obvious case of public data scraping so that it would not be a legal problem,” Kirkegaard wrote to Fortune.

Last week after the appearance of the dataset began inciting an uproar, Open Science Framework, the site that hosted the data, placed it behind a password-protected wall. OkCupid then filed a copyright claim on Friday ordering the site to take it down altogether. The page where the data initially appeared was initially changed to read: “Unavailable for legal reasons.” Now it simply states “Content removed.”

The editorial board at Open Differential Psychology, the journal to which the researchers submitted the accompanying paper (and where Kirkegaard is the editor), is currently reviewing the submission, Kirkegaard told the science blog Retraction Watch. “If the journal does not take the paper, we will probably publish it elsewhere,” he said.

Get Data Sheet, Fortune’s technology newsletter.

OkCupid, owned by InterActivCorp’s (IAC) Match Group (MTCH), released a statement that complained about the published data. “This was a violation of our terms of service and we sent a take-down notice,” Mathew Traub, a spokesperson for OkCupid, told Fortune in an email. “They appear to have complied.”

Kirkegaard said in a Twitter post that he did not ask the company for permission to collect or publish the data beforehand. Some commenters have argued that the researchers breached research ethics by failing to obtain the consent of the OkCupid users, too, before gathering and republishing their information. They cite, among other things, “code of conduct” guidelines by the American Psychological Association.

Aarhus University in Denmark, the school at which Kirkegaard is a graduate student, distanced itself from the team of students, who undertook the project in their spare time. “The views and actions by student Emil Kirkegaard is not on behalf of AU,” the university said in a statement posted to Twitter. “[H]is actions are entirely his own responsibility.”

This is not the first time someone has scraped the profile data of OkCupid users, of course. At least one individual cleverly “hacked” the dating system to get more romantic matches several years ago. And the site’s co-founder, Christian Rudder, published a treatise on data science that analyzed information from the data-rich dating network. These cases are different, however, from the latest instance of scraping, packaging and releasing profile information publicly.

A better comparison would be a 2008 study out of Harvard University that relied on information culled from Facebook (FB) profiles. The researchers did use some anonymizing techniques, but critics said the protections were not strong enough. The scientists ultimately took down the data.

In a message sent to Fortune, Kirkegaard wrote that he did not rule out the possibility of republishing the data his team collected with more effort put into obscuring the identity of the OKCupid users. Given OKCupid’s interpretation of its terms of service agreement—and its copyright claim—it’s unlikely that the company will sign off on the proposed compromise. As with the Harvard Facebook study, the data may very well remain in limbo.

It’s no surprise that people are sensitive to having their romantic and other interests neatly presented for others to rifle through online, even if done in the name of science. In addition to questions raised about the ethics of certain data science practices, the boundaries of open science research, and the ease of identifying the members of a given dataset, the incident reveals something else, too: People continue to give up vast quantities of their personal data to sites online, expecting privacy.

About the Author
Robert Hackett
By Robert Hackett
Instagram iconLinkedIn iconTwitter icon
See full bioRight Arrow Button Icon

Latest in Tech

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Fortune Secondary Logo
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • World's Most Admired Companies
  • See All Rankings
  • Lists Calendar
Sections
  • Finance
  • Fortune Crypto
  • Features
  • Leadership
  • Health
  • Commentary
  • Success
  • Retail
  • Mpw
  • Tech
  • Lifestyle
  • CEO Initiative
  • Asia
  • Politics
  • Conferences
  • Europe
  • Newsletters
  • Personal Finance
  • Environment
  • Magazine
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
  • Group Subscriptions
About Us
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • Facebook icon
  • Twitter icon
  • LinkedIn icon
  • Instagram icon
  • Pinterest icon

Latest in Tech

trader
CommentarySoftware
The 50-year-old law that governed every software company just broke. Here’s what replaces it
By Martin Casado and Abhishek NagarajMay 20, 2026
47 minutes ago
Exclusive: Circle cofounder raises $30 million for Series A ‘AI-native bank’ Catena Labs
CryptoVenture Capital
Exclusive: Circle cofounder raises $30 million for Series A ‘AI-native bank’ Catena Labs
By Ben WeissMay 20, 2026
1 hour ago
FJ Campbell, MD, is chief medical officer at Ardent Health.
CommentaryHealth
A doctor shortage is coming. AI could be the only realistic fix
By FJ CampbellMay 20, 2026
3 hours ago
Alex Israel crosses his arms
AITerm Sheet
How Metropolis built a $5 billion AI infrastructure company out of America’s parking problem
By Lily Mae LazarusMay 20, 2026
3 hours ago
Alphabet CEO Sundar Pichai in Mountain View, California on May 19, 2026. (Photo: David Paul Morris/Bloomberg/Getty Images)
NewslettersFortune Tech
At Google I/O 2026, it’s AI, AI, and more AI
By Andrew NuscaMay 20, 2026
3 hours ago
Exclusive: Advocacy groups file complaint against Roblox, alleging its manipulative design puts kids at risk
CybersecurityRoblox
Exclusive: Advocacy groups file complaint against Roblox, alleging its manipulative design puts kids at risk
By Catherina GioinoMay 20, 2026
3 hours ago

Most Popular

Bolt CEO says he let go of his entire HR team for creating problems that didn’t exist: ‘Those problems disappeared when I let them go’ 
Workplace Culture
Bolt CEO says he let go of his entire HR team for creating problems that didn’t exist: ‘Those problems disappeared when I let them go’ 
By Preston ForeMay 19, 2026
16 hours ago
The Bezos family just donated $100 million to help achieve one of Mayor Zohran Mamdani’s top campaign promises
Politics
The Bezos family just donated $100 million to help achieve one of Mayor Zohran Mamdani’s top campaign promises
By Jake AngeloMay 12, 2026
8 days ago
Current price of oil as of May 19, 2026
Personal Finance
Current price of oil as of May 19, 2026
By Joseph HostetlerMay 19, 2026
24 hours ago
Meet a 21-year-old community college student who's going to China as the first American woman welder in the trades Olympics
Future of Work
Meet a 21-year-old community college student who's going to China as the first American woman welder in the trades Olympics
By Mike Householder and The Associated PressMay 17, 2026
3 days ago
Employers are quietly pausing 401(k) matches again. The last time this happened was the 2008 recession and Covid
Personal Finance
Employers are quietly pausing 401(k) matches again. The last time this happened was the 2008 recession and Covid
By Courtney Vinopal and HR BrewMay 18, 2026
2 days ago
Current price of silver as of Monday, May 18, 2026
Personal Finance
Current price of silver as of Monday, May 18, 2026
By Joseph HostetlerMay 18, 2026
2 days ago

© 2026 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.