Let's hope the NSA hasn't actually used this machine-learning model to target drone strikes

The data set used to train it was 'totally inadequate,' one expert says

The U.S. National Security Agency could be relying on a seriously flawed machine-learning model to target drone strikes in Pakistan, according to a new analysis of slides uncovered last year by whistleblower Edward Snowden.

Published last May by The Intercept, the slides detail the NSA's so-called Skynet program, in which machine learning is apparently used to identify likely terrorists in Pakistan. While it's unclear if the machine-learning model has been used in the NSA's real-world efforts, it has serious problems that could put lives at risk if it were, according to Patrick Ball, director of research at the Human Rights Data Analysis Group.

"I have no idea if any of this was ever used in actual strikes or even made it into a meeting," Ball said Monday. But "nobody rational would use an analysis this crappy for any kind of decision making."

Dating back to 2012, the slides describe the use of GSM metadata for behavioral profiling of 55 million cellphone users, including factors such as travel behavior and social networks. Equipped with that data, the model aims to predict which people are likely to be terrorists.

It's no secret that the United States has been using unmanned drones to attack militants in Pakistan over the past decade. Between 2,500 and 4,000 Pakistanis have been killed by drones since 2004, according to the Bureau of Investigative Journalism, a nonprofit news organization. Many of those killed were members of groups such as al Qaeda, the organization said.

General Michael Hayden, former director of the NSA and the CIA, has stated the connection explicitly: “We kill people based on metadata.”

Particularly troubling, however, is that drones have reportedly killed more than 400 civilians -- possibly more than 900 -- along the way.

That's where the model's specific failings become relevant. First and foremost is that the NSA didn't use nearly enough data about known terrorists to be able to train the model to distinguish terrorists from other people with any reasonable level of accuracy, Ball explained.

In fact, the model was trained using data about just seven known terrorists, according to the slides. "That's totally inadequate," Ball said.

The algorithm itself is fine, he said, but the paucity of data used to train it leads to an unacceptably high chance of "false positives," or innocent people classified as terrorists. It it were actually used to direct drone attacks, that would mean the loss of innocent lives.

The NSA is "not stupid, and this is a stupid piece of analysis," Ball said. "My guess is that this was someone in technical management at NSA selling it up the chain, but it didn't really work -- it's a failed experiment."

That's not to say that drone strikes aren't going on, or that the possibility that a model like this might be used to direct them isn't concerning.

"Yes, there are drone strikes in Pakistan, and yes, they kill innocent people -- these things are not in dispute," Ball said. But in the case of this model, "all we know is what's on a few slides, and that's worrisome."

The NSA did not respond to a request for comment.

Join the Good Gear Guide newsletter!

Error: Please check your email address.

Our Back to Business guide highlights the best products for you to boost your productivity at home, on the road, at the office, or in the classroom.

Keep up with the latest tech news, reviews and previews by subscribing to the Good Gear Guide newsletter.

Katherine Noyes

IDG News Service
Show Comments

Essentials

Microsoft L5V-00027 Sculpt Ergonomic Keyboard Desktop

Learn more >

Lexar® JumpDrive® S57 USB 3.0 flash drive

Learn more >

Mobile

Lexar® JumpDrive® S45 USB 3.0 flash drive 

Learn more >

Exec

HD Pan/Tilt Wi-Fi Camera with Night Vision NC450

Learn more >

Audio-Technica ATH-ANC70 Noise Cancelling Headphones

Learn more >

Lexar® Professional 1800x microSDHC™/microSDXC™ UHS-II cards 

Learn more >

Lexar® JumpDrive® C20c USB Type-C flash drive 

Learn more >

Budget

Back To Business Guide

Click for more ›

Most Popular Reviews

Latest News Articles

Resources

PCW Evaluation Team

Michael Hargreaves

Windows 10 for Business / Dell XPS 13

I’d happily recommend this touchscreen laptop and Windows 10 as a great way to get serious work done at a desk or on the road.

Aysha Strobbe

Windows 10 / HP Spectre x360

Ultimately, I think the Windows 10 environment is excellent for me as it caters for so many different uses. The inclusion of the Xbox app is also great for when you need some downtime too!

Mark Escubio

Windows 10 / Lenovo Yoga 910

For me, the Xbox Play Anywhere is a great new feature as it allows you to play your current Xbox games with higher resolutions and better graphics without forking out extra cash for another copy. Although available titles are still scarce, but I’m sure it will grow in time.

Kathy Cassidy

STYLISTIC Q702

First impression on unpacking the Q702 test unit was the solid feel and clean, minimalist styling.

Anthony Grifoni

STYLISTIC Q572

For work use, Microsoft Word and Excel programs pre-installed on the device are adequate for preparing short documents.

Featured Content

Latest Jobs

Don’t have an account? Sign up here

Don't have an account? Sign up now

Forgot password?