Polish town highlights oddities in Facebook's personal data disclosures

Facebook holds back some personal data when responding to a user access request, but reveals curious ad-targeting data

Facebook's store of data about its users holds some surprises, and not just in the sheer quantity of data it is sitting on. Among the surprises it held for me was SBupsk.

One of 47 topics about which Facebook thinks I am interested in seeing advertisements, SBupsk is a Polish town with about 100,000 citizens and a beautiful church. Another of those topics is Bomen, a town in New South Wales, Australia. I don't remember ever seeing an ad, or indeed anything, related to the towns while hanging around on Facebook -- or anywhere else for that matter.

In fact, the first I heard of either place was when I requested dumps of all the data Facebook holds about me, to evaluate how the company is responding to criticism of its data storage practices by the Data Protection Commissioner of Ireland. That's where Facebook's international headquarters is, making the company subject to Irish (and European Union) data protection laws -- and also to Ireland's advantageous rate of corporation tax.

Under European Union law, Facebook is required to provide users with personal data it holds about them on request, and in a December review of Facebook's data handling, the data protection commissioner recommended that the company provide users with access to more of their personal data. It gave the company until July to change its policies, and is currently reviewing the changes made. It expects to publish a new audit in early October, said senior investigations officer Catriona Holohan.

Facebook is required to respond to data access requests within 40 days. After sending out wads of paper and CDs in response to early requests, the company now offers a self-service tool allowing users to download two bundles of their personal data. The basic bundle contains timeline information including shared posts, messages and photos, and in my case was ready in about 3.5 hours. An extended archive with details of logins, cookies, deleted friends and the curious "ads topics" was ready in 90 minutes. Other data can be consulted online in a searchable Activity Log on Facebook's website.

Facebook now seems to be providing all the categories of data the commissioner asked for. In April it added login and logout information, unconfirmed friendship requests and information about pokes, among other categories requested by the authority.

As a user, it's not easy to check Facebook's compliance with all the commissioner's recommendations, however.

For example, Facebook agreed to anonymize all search data on the site within six months. According to the online help center, anything you've searched for should appear in the Activity Log. However, my searches do not appear on the drop-down list of activity categories, nor do they appear in other categories.

Told that its indefinite retention of ad-click data was unacceptable, Facebook agreed to retain such data for no more than two years, and seems to be keeping it for less time. In a data dump downloaded on Aug. 10, the first referenced ad-click stored by Facebook is dated July 2, while in one downloaded on Sept. 10, the first mentioned ad-click is dated July 20. The earliest two ad-clicks from the first data dump don't appear in the second, suggesting Facebook is retaining the data for about two months. That, of course, assumes that Facebook provides its users with all their personal data -- and that is not always easy to believe.

In my downloaded data, for instance, part of my private message history is missing. Facebook's online history of my conversation with one of my friends dates back to July 9, 2011, with hundreds of messages shared, but the data dump only contained the messages shared on Sept. 1, 2012. Another conversation, started on Aug. 19, 2010, only appeared in the data dump as of July 6, 2011. Other, older messages were also missing.

Max Schrems, an Austrian law student who runs Europe vs. Facebook, a group pushing the company to respect privacy laws, doubts that Facebook sends users all the data it holds about them. He was among the first to ask Facebook to send him a copy of his personal data. A year ago, the company sent him a raw file of data consisting of a stack of papers and a CD that together contained far more information than is available for download today.

Pictures he had uploaded to Facebook, for instance, were accompanied by metadata such as the GPS location, the IP address used to upload the photo and the camera make and model, he said. But with Facebook's download tool today, he only gets the raw picture without the metadata.

"Then there are other things. For example: if you delete messages on Facebook they still hold them, and in the download tool you don't get the deleted messages," said Schrems, who added that the deleted messages did appear in his raw data file a year ago. Likewise, "If you delete friends, or if friends delete you, they'll still store it but you don't get the deleted friend information all the time," he said.

In addition, the information provided by Facebook is scattered across different places, which makes it hard to track if all the information is there, Schrems said.

Besides the two data downloads and the online activity log that Facebook highlights, other personal information can be found in the account settings, Schrems said. "A normal user is entitled to get everything from Facebook. Today, users have to hunt all over Facebook to get it."

The way the download tool and the extended archive work make it hard to check if all the information Facebook has is made available to the user, said Schrems. "Right now, Facebook is just gathering some information from the raw format and transfers it into some HTML download thing," he said. "Facebook is just not including the data that is a problem for them."

Schrems estimated that Facebook now only provides him with half of his personal data via the download tools, compared to the earlier raw file he and other early requesters received. "Only we can prove that the other half is not there because we have the original raw format," he said.

Facebook declined to discuss the missing data or the Irish commissioner's forthcoming audit.

"We believe that every Facebook user owns his or her own data and should have simple and easy access to it," a company representative said in an emailed statement, adding that is why the company has built a way for users to "download everything."

"People who want a copy of the information they have put on Facebook can click a link located in 'Account Settings' and easily get a copy of all of it in a single download," the statement said.

Loek covers all things tech for the IDG News Service. Follow him on Twitter at @loekessers or email tips and comments to loek_essers@idg.com

Join the Good Gear Guide newsletter!

Error: Please check your email address.

Our Back to Business guide highlights the best products for you to boost your productivity at home, on the road, at the office, or in the classroom.

Keep up with the latest tech news, reviews and previews by subscribing to the Good Gear Guide newsletter.

Loek Essers

IDG News Service
Show Comments

Essentials

Microsoft L5V-00027 Sculpt Ergonomic Keyboard Desktop

Learn more >

Lexar® JumpDrive® S57 USB 3.0 flash drive

Learn more >

Mobile

Lexar® JumpDrive® S45 USB 3.0 flash drive 

Learn more >

Exec

Audio-Technica ATH-ANC70 Noise Cancelling Headphones

Learn more >

HD Pan/Tilt Wi-Fi Camera with Night Vision NC450

Learn more >

Lexar® Professional 1800x microSDHC™/microSDXC™ UHS-II cards 

Learn more >

Lexar® JumpDrive® C20c USB Type-C flash drive 

Learn more >

Budget

Back To Business Guide

Click for more ›

Most Popular Reviews

Latest News Articles

Resources

PCW Evaluation Team

Azadeh Williams

HP OfficeJet Pro 8730

A smarter way to print for busy small business owners, combining speedy printing with scanning and copying, making it easier to produce high quality documents and images at a touch of a button.

Andrew Grant

HP OfficeJet Pro 8730

I've had a multifunction printer in the office going on 10 years now. It was a neat bit of kit back in the day -- print, copy, scan, fax -- when printing over WiFi felt a bit like magic. It’s seen better days though and an upgrade’s well overdue. This HP OfficeJet Pro 8730 looks like it ticks all the same boxes: print, copy, scan, and fax. (Really? Does anyone fax anything any more? I guess it's good to know the facility’s there, just in case.) Printing over WiFi is more-or- less standard these days.

Ed Dawson

HP OfficeJet Pro 8730

As a freelance writer who is always on the go, I like my technology to be both efficient and effective so I can do my job well. The HP OfficeJet Pro 8730 Inkjet Printer ticks all the boxes in terms of form factor, performance and user interface.

Michael Hargreaves

Windows 10 for Business / Dell XPS 13

I’d happily recommend this touchscreen laptop and Windows 10 as a great way to get serious work done at a desk or on the road.

Aysha Strobbe

Windows 10 / HP Spectre x360

Ultimately, I think the Windows 10 environment is excellent for me as it caters for so many different uses. The inclusion of the Xbox app is also great for when you need some downtime too!

Mark Escubio

Windows 10 / Lenovo Yoga 910

For me, the Xbox Play Anywhere is a great new feature as it allows you to play your current Xbox games with higher resolutions and better graphics without forking out extra cash for another copy. Although available titles are still scarce, but I’m sure it will grow in time.

Featured Content

Latest Jobs

Don’t have an account? Sign up here

Don't have an account? Sign up now

Forgot password?