NSA and GCHQ’s crappy Big Data techniques may be killing thousands of innocents

Researchers have taken a second look at the NSA SKYNET leaks, as well as the GCHQ data-mining problem book first published on Boing Boing, and concluded that the spy agencies have made elementary errors in their machine-learning techniques, which are used to identify candidates for remote assassination by drone. These errors reveal the fundamental problem with secret science: that scientists will forgive their own corner-cutting and sloppiness when they know no one will ever check their work. At root is the lack of good training data to use to establish “ground truths” for the data-mining technology. The techniques documented in the leaks show the researchers taking shortcuts to get around this lack — rather than holding back some known-terrorist profiles to test their models, they re-run the training data back…


Link to Full Article: NSA and GCHQ’s crappy Big Data techniques may be killing thousands of innocents