Everything you need to know about deep learning
Scientists have fed an artificially intelligent system with Daily Mail articles so it can learn how natural language works. While it’s not quite HAL 9000, it’s a worrying thought for those who know the Daily Mail.
This process, called deep learning, is old news. Researchers have been attempting to train algorithms since the 1970s but computational and data limitations slowed progress in the 1980s.
Now deep learning is enjoying a renaissance. Interest in the field hit a peak when Google paid £400 million for UK-based deep learning research group, DeepMind at the end of last year.
Often coined machine learning or neural networking, deep learning involves “training” a computational model so it can decipher natural language. The model relates terms and words to infer meaning once it is fed information. It’s then quizzed on this information and “learns” from the experience – like a child learning to communicate.
With improved computational power and an overwhelming availability of data, researchers have picked up deep learning theories once more, to begin the path to artificial intelligence.
Why the Daily Mail?
Machines have already previously been taught how to read documents and can answer questions posed on its content, but its knowledge base was limited by the size of the document. With heaps of material for algorithms to consume online, systems can make use of a larger pool of natural language, granting it a deeper understanding of universal topics.
Two months ago, Google’s DeepMind revealed a novel technique for perfecting this previously tricky task using online news.
Google DeepMind is based behind King’s Cross St Pancras (left) ©Techworld/Sam Shead
Researchers input one million Daily Mail and CNN articles to the system to query it on and found the algorithm could correctly detect missing words or predict a headline. It’s worth noting that the choice was based on the MailOnline’s bullet point summary structure rather than its subject matter.
Researchers input one million Daily Mail and CNN articles to the system to query it on and found the algorithm could correctly detect missing words or predict a headline.
However, there was a challenge. Analysing an algorithm’s sophistication proves problematic with the Mail’s renowned sensationalist headlines.
The DeepMinders found when using options, “a) The hi-tech bra that helps you beat breast X; b) Could Saccharin help beat X ?; c) Can fish oils help fight prostate X ?,” the model could easily predict that X = cancer, “regardless of the contents of the context document, simply because this is a very frequently cured entity in the Daily Mail corpus,” the paper stated.
DeepMind has also seen breakthroughs like learning how to play Atari video games and online poker. Its main goal, however, is to create one general deep learning model that can improve services for its parent company, Google.
The Silicon Valley giant has heavily invested in deep learning. It’s main business driver, search, is based on this technology and other services like Google Translate, voice and mobile search as well as its Google Photos app are based on neural networks. Aside from DeepMind, it has teams of organically grown machine learning experts, led by AI specialist Jeff Dean and British cognitive psychologist and computer scientist Geoff Hinton.
Relying on AI has its issues though. Google has suffered several embarassments at the hands of its algorithms, including tagging a black woman as a gorilla on Google Photos.
Why the surge?
Deep learning is at an early stage, yet it forms the basis of some of the largest, most profitable tech companies today. Services based on machine learning is what Google sells its advertising on, and competitors Microsoft, Apple and Facebook are rapidly making developments in the field.
One accelerator is the development of tools like General Processor Units, or GPUs that cut machine training time.
“Stuff that would have taken a week to run now takes a few hours on a single machine,” says Dr Matthew Aylett, who works at Edinburgh University’s school for informatics and is Chief Science Officer of Edinburgh based text to speech company Cereproc.
Following an experiment by Canadian scientists a few years ago, the use of GPUs to power deep neural networks “caught on like wildfire,” adds Chris Ramey, GPU manufacturer and NVIDIA product manager, following an experiment by Canadian scientists.
NVIDIA GPU-powered deep learning powers many consumer products and services both on and offline. These include facial recognition technology for Facebook users, image identification in Google Photos and for speed sign reading and vehicle detection technology within Audi’s driverless cars. Baidu, China’s Google, is developing an AI-based health service, so users can tell their devices their symptoms and receive a diagnosis, and undoubtedly some well targeted pharmaceutical advertising. Further, diabetes patients can expect to see technology that can detect early stages of blindness thanks to machine learning research by Deepsense.io competiton entrants.
How much is it like a human brain?
Neural networks are inspired by the way human brains works, but very loosely.
“They are a bit like brains in the same way pulleys and strings are like muscles. There is a relationship but you wouldn’t say it is the same thing,” Dr Aylett explains. Much like big data tools, deep learning models are as good as the data you feed it, it will not search for information the way a child could.
Will it continue to progress?
Theoretically the tools are in place, but it is the supporting infrastructure like super-fast connection speeds, data availability and storage and powerful, fast computers that will need to keep up. Corporations financial backing is another added incentive.
Key terms for deep learning
Deep neural network: A deep neural network is the development of the first theory of machine learning. It was developed following the early Perceptron learning algorithm, which was limited in its ability to understand the ambiguity of “or” within natural language. To resolve this problem, highlighted by early AI heavyweight Marvin Minsky, several layers of learning algorithms needed to be developed.
Unsupervised versus supervised learning: Supervised neural networks must have been told the answer to your question at some point, for it to learn it. This might be used for object recognition – to tell a face from a car, for example. Unsupervised learning involves throwing information at a machine and hope it will learn something you haven’t trained it for, because it is able to cluster data to understand patterns. In object recognition this means it will group shapes together and conclude that they are similar. But it is often used in chess or online gaming AI.
GPUs: General Processor Units were created by NVIDIA in the early 1990s and can be found in cars, powering video games, in home entertainment systems and tablets.
Linear regression: Computers track the line between two related variables, like lung cancer and smoking. Deep neural networks can process non linear multiple progression which effectively predicts unknown variables if you have enough data to train it.
Speech recognition: when you play audio or speak and a computer tells you what you said, or types it.
Speech synthesis: When you type in text and then it speaks, like Siri or OK Google.
Source: Everything you need to know about deep learning
Via: Google Alert for ML