DeepMind



Entrance of building where Google and DeepMind are located at 6 Pancras Square, London, UK.

DeepMind Technologies is a British artificial intelligence company founded in September 2010, currently owned by Alphabet Inc. The company is based in London, with research centres in Canada,^[1] France,^[2] and the United States.

Acquired by Google in 2014, the company has created a neural network that learns how to play video games in a fashion similar to that of humans,^[3] as well as a Neural Turing machine,^[4] a neural network that may be able to access an external memory like a conventional Turing machine, resulting in a computer that mimics the short-term memory of the human brain.^[5]^[6]

The company made headlines in 2016 after its AlphaGo program beat the professional Go player Lee Sedol, the world champion, in a five-game match, which was the subject of a documentary film.^[7]

A more general program, AlphaZero, beat the most powerful programs playing Go, chess and shogi (Japanese chess) after a few days of play against itself using reinforcement learning.^[8]

History

The start-up was founded by Demis Hassabis, Shane Legg and Mustafa Suleyman in 2010.^[9]^[10] Hassabis and Legg first met at University College London's Gatsby Computational Neuroscience Unit.^[11]

In an interview, Demis Hassabis said that the start-up began working on artificial intelligence technology by teaching it how to play old games from the seventies and eighties, which are relatively primitive compared to the ones that are available today. Some of those games included Breakout, Pong and Space Invaders. The AI was introduced to one game at a time, without any prior knowledge of its rules. After spending some time learning the game, the AI would eventually become an expert in it. “The cognitive processes which the AI goes through are said to be very like those a human who had never seen the game would use to understand and attempt to master it.”^[12] The goal of the founders is to create a general-purpose AI that can be useful and effective for almost anything.

Major venture capital firms Horizons Ventures and Founders Fund invested in the company,^[13] as did entrepreneurs Scott Banister^[14] and Elon Musk.^[15] Jaan Tallinn was an early investor and an adviser to the company.^[16] On 26 January 2014, Google announced that it had agreed to acquire DeepMind Technologies for $500 million.^[17]^[18]^[19]^[20]^[21]^[22] The sale to Google took place after Facebook reportedly ended negotiations with DeepMind Technologies in 2013.^[23] The company was afterwards renamed Google DeepMind and kept that name for about two years.^[24]

In 2014, DeepMind received the "Company of the Year" award from Cambridge Computer Laboratory.^[25]

In September 2015, DeepMind and the Royal Free NHS Trust signed their initial Information Sharing Agreement (ISA) to co-develop a clinical task management app, Streams.^[26]

After Google's acquisition, the company established an artificial intelligence ethics board.^[27] The membership of the ethics board for AI research remains undisclosed, with both Google and DeepMind declining to reveal who sits on the board.^[28] DeepMind, together with Amazon, Google, Facebook, IBM and Microsoft, is a founding member of Partnership on AI, an organization devoted to the society-AI interface.^[29] DeepMind has opened a new unit called DeepMind Ethics and Society, focused on the ethical and societal questions raised by artificial intelligence, with the prominent philosopher Nick Bostrom as an advisor.^[30] In October 2017, DeepMind launched a new research team to investigate AI ethics.^[31]^[32]

Machine learning

DeepMind Technologies' goal is to "solve intelligence",^[33] which they are trying to achieve by combining "the best techniques from machine learning and systems neuroscience to build powerful general-purpose learning algorithms".^[33] They are trying to formalize intelligence^[34] in order to not only implement it into machines, but also understand the human brain, as Demis Hassabis has explained. Google Research released a paper in 2016 regarding AI safety and avoiding undesirable behaviour during the AI learning process.^[35] DeepMind has also released several publications via its website.^[36] In 2017 DeepMind released GridWorld, an open-source testbed for evaluating whether an algorithm learns to disable its kill switch or otherwise exhibits certain undesirable behaviours.^[37]^[38]

To date, the company has published research on developing computer systems that are able to play games, ranging from strategy games such as Go^[39] to arcade games. According to Shane Legg, human-level machine intelligence can be achieved "when a machine can learn to play a really wide range of games from perceptual stream input and output, and transfer understanding across games[...]."^[40]

Research describing an AI playing seven different Atari 2600 video games (the Pong game in Video Olympics, Breakout, Space Invaders, Seaquest, Beamrider, Enduro, and Q*bert) reportedly led to the company's acquisition by Google.^[3] Hassabis has mentioned the popular e-sport game StarCraft as a possible future challenge, since it requires a high level of strategic thinking and handling imperfect information.^[41] The first demonstration of DeepMind's progress in StarCraft II took place on 24 January 2019 on StarCraft's Twitch channel and DeepMind's YouTube channel.^[42]

In July 2018, researchers from DeepMind trained one of its systems to play the famous computer game Quake III Arena.^[43]

Deep reinforcement learning

As opposed to other AIs, such as IBM's Deep Blue or Watson, which were developed for a pre-defined purpose and only function within that scope, DeepMind claims that its system is not pre-programmed: it learns from experience, using only raw pixels as data input. Technically, it uses deep learning on a convolutional neural network, with a novel form of Q-learning, a form of model-free reinforcement learning.^[24]^[44] The company tests the system on video games, notably early arcade games, such as Space Invaders or Breakout.^[44]^[45] Without altering the code, the AI begins to understand how to play the game, and after some time plays, for a few games (most notably Breakout), a more efficient game than any human ever could.^[45]
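
The Q-learning update underlying this approach can be illustrated with a minimal tabular sketch. DeepMind's system replaces the table with a convolutional network fed by raw pixels; the toy corridor environment, the hyperparameters and all names below are illustrative assumptions and are not taken from the cited work.

```python
import random

def q_learning(n_states=5, episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D corridor.

    States are 0..n_states-1; actions are 0 (left) and 1 (right).
    Reaching the right-most state gives reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    for _ in range(episodes):
        state = 0
        while state != n_states - 1:
            # Epsilon-greedy choice; ties are also broken at random.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state = max(0, state - 1) if action == 0 else state + 1
            reward = 1.0 if next_state == n_states - 1 else 0.0
            # Model-free update: bootstrap from the best next-state value,
            # without ever using a model of the environment's rules.
            target = reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q

q_table = q_learning()
# Greedy action in every non-terminal state after training.
greedy_policy = [0 if left > right else 1 for left, right in q_table[:-1]]
```

In this toy setting the learned greedy policy moves right in every state. The agent is never told the rules: it only observes states and rewards, which is the sense in which the method is model-free.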

DeepMind's system played below the current world record on most games, for example Space Invaders, Ms Pac-Man and Q*bert. The system had been applied to video games made in the 1970s and 1980s; work was ongoing for more complex 3D games such as Doom, which first appeared in the early 1990s.^[45]

AlphaGo and successors

In October 2015, a computer Go program called AlphaGo, developed by DeepMind, beat the European Go champion Fan Hui, a 2 dan (out of 9 dan possible) professional, five to zero.^[46] This was the first time an artificial intelligence (AI) defeated a professional Go player.^[47] Previously, computers were only known to have played Go at "amateur" level.^[46]^[48] Go is considered much more difficult for computers to win than other games like chess, due to the much larger number of possibilities, which makes it prohibitively difficult for traditional AI methods such as brute force.^[46]^[48]

In March 2016, AlphaGo beat Lee Sedol, a 9th dan Go player and one of the highest ranked players in the world, with a score of 4-1 in a five-game match.

In the 2017 Future of Go Summit, AlphaGo won a three-game match with Ke Jie, who at the time had continuously held the world No. 1 ranking for two years.^[49]^[50] AlphaGo used a supervised learning protocol, studying large numbers of games played by humans against each other.^[51]

In 2017, an improved version, AlphaGo Zero, defeated AlphaGo 100 games to 0. AlphaGo Zero's strategies were self-taught. AlphaGo Zero was able to beat its predecessor after just three days with less processing power than AlphaGo; in comparison, the original AlphaGo needed months to learn how to play.^[52]

Later that year, AlphaZero, a modified version of AlphaGo Zero, gained superhuman abilities at chess and shogi. Like AlphaGo Zero, AlphaZero learned solely through self-play.

Technology

AlphaGo was developed using a deep reinforcement learning approach, which set it apart from other AI technologies on the market. AlphaGo's ‘brain’ was first introduced to various moves based on historical tournament data. The number of moves was increased gradually until it had eventually processed over 30 million of them. The aim was to have the system mimic the human player and eventually become better. It then played against itself and learned not only from its defeats but from its wins as well; thus, it learned to improve itself over time and increased its winning rate as a result.^[53]

AlphaGo used two deep neural networks: a policy network to evaluate move probabilities and a value network to assess positions. The policy network was trained via supervised learning, and was subsequently refined by policy-gradient reinforcement learning. The value network learned to predict winners of games played by the policy network against itself. After training, these networks were combined with a lookahead Monte Carlo tree search (MCTS), which used the policy network to identify candidate high-probability moves, while the value network (in conjunction with Monte Carlo rollouts using a fast rollout policy) evaluated tree positions.^[54]
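
How the two networks feed the tree search can be sketched as follows. This is a simplified illustration, not DeepMind's implementation: the `Edge` class, the exploration constant `c_puct`, the mixing weight `lam` and the `1 +` inside the square root (so that priors matter before any visit) are assumptions made here. The selection rule follows the general shape of a prior-weighted exploration bonus, and the leaf evaluation blends a value-network estimate with a rollout result.

```python
import math
from dataclasses import dataclass

@dataclass
class Edge:
    """Statistics for one candidate move at a tree node (illustrative)."""
    prior: float            # move probability from the policy network
    visits: int = 0         # number of simulations that passed through this move
    value_sum: float = 0.0  # sum of leaf evaluations backed up through it

    @property
    def mean_value(self) -> float:
        return self.value_sum / self.visits if self.visits else 0.0

def select_move(edges: dict, c_puct: float = 1.0) -> str:
    """Pick the move maximising mean value plus a prior-weighted bonus."""
    total_visits = sum(e.visits for e in edges.values())

    def score(edge: Edge) -> float:
        bonus = c_puct * edge.prior * math.sqrt(1 + total_visits) / (1 + edge.visits)
        return edge.mean_value + bonus

    return max(edges, key=lambda move: score(edges[move]))

def evaluate_leaf(value_net_estimate: float, rollout_outcome: float, lam: float = 0.5) -> float:
    """Blend the value network's estimate with a fast-rollout result."""
    return (1 - lam) * value_net_estimate + lam * rollout_outcome

def backup(edge: Edge, leaf_value: float) -> None:
    """Record one simulation's result on an edge along the search path."""
    edge.visits += 1
    edge.value_sum += leaf_value

# Hypothetical priors for three candidate moves.
edges = {"a": Edge(prior=0.6), "b": Edge(prior=0.3), "c": Edge(prior=0.1)}
first = select_move(edges)   # highest prior is tried first
backup(edges[first], evaluate_leaf(value_net_estimate=-0.8, rollout_outcome=-1.0))
second = select_move(edges)  # a poor evaluation shifts the search elsewhere
```

In this example the search first follows the policy network's favourite move, then switches to the next candidate once that move's evaluation comes back poor, which is how priors and position evaluations interact during lookahead.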

AlphaGo Zero was trained using reinforcement learning in which the system played millions of games against itself. Its only guide was to increase its win rate. It did so without learning from games played by humans. Its only input features are the black and white stones from the board. It uses a single neural network, rather than separate policy and value networks. Its simplified tree search relies upon this neural network to evaluate positions and sample moves, without Monte Carlo rollouts. A new reinforcement learning algorithm incorporates lookahead search inside the training loop.^[54] The AlphaGo Zero project employed around 15 people and millions in computing resources.^[55] Ultimately, it needed much less computing power than AlphaGo, running on four specialized AI processors (Google TPUs), instead of AlphaGo's 48.^[56]
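
One way to picture training a single network with both a policy output and a value output is a combined objective for each self-play position: the move probabilities are pulled towards the distribution produced by the tree search, and the value is pulled towards the eventual game result. The sketch below assumes a squared-error value term plus a cross-entropy policy term, with regularisation omitted; the function and argument names are illustrative.

```python
import math

def combined_loss(policy_probs, value_pred, search_probs, outcome):
    """Loss for one training position of a two-headed network (illustrative).

    policy_probs: move probabilities output by the network (sum to 1)
    value_pred:   the network's predicted outcome, in [-1, 1]
    search_probs: move distribution produced by the tree search (the target)
    outcome:      final result from the current player's view, +1 or -1
    """
    value_loss = (outcome - value_pred) ** 2
    # Cross-entropy between the search distribution and the network policy;
    # moves the search never chose (target 0) contribute nothing.
    policy_loss = -sum(
        t * math.log(p) for t, p in zip(search_probs, policy_probs) if t > 0
    )
    return value_loss + policy_loss

loss = combined_loss(
    policy_probs=[0.7, 0.2, 0.1],
    value_pred=0.5,
    search_probs=[0.9, 0.1, 0.0],
    outcome=1.0,
)
```

Because the search distribution is itself computed with the current network, minimising this loss is one way lookahead search ends up inside the training loop.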

AlphaFold

In 2016 DeepMind turned its artificial intelligence to protein folding, one of the toughest problems in science. In December 2018, DeepMind's tool AlphaFold won the 13th Critical Assessment of Techniques for Protein Structure Prediction (CASP) by successfully predicting the most accurate structure for 25 out of 43 proteins. “This is a lighthouse project, our first major investment in terms of people and resources into a fundamental, very important, real-world scientific problem,” Demis Hassabis said to The Guardian.^[57] This demonstration of the power of AI, specifically DeepMind's application, bodes well for understanding the causes of diseases and finding cures^[58] and may serve as a component of potential Nobel Prize-worthy breakthroughs in the next decade.^[59]

AlphaStar

In January 2019, DeepMind introduced AlphaStar, a program playing the real-time strategy game StarCraft II. AlphaStar used reinforcement learning to learn the basics from replays of human players, and later played against itself to enhance its skills. At the time of the presentation, AlphaStar had knowledge equivalent to 200 years of playing time; it won 10 consecutive matches against professional players, and lost just one.^[60]

WaveNet

WaveNet is DeepMind's deep generative model of raw audio waveforms. WaveNet was originally too computationally intensive for use in consumer products when it debuted in 2016; however, in late 2017, it became ready for use in consumer applications such as Google Assistant.^[61]^[62] In 2018 Google launched a commercial text-to-speech product, Cloud Text-to-Speech, based on WaveNet.^[63]^[64]

Miscellaneous contributions to Google

Google has stated that DeepMind algorithms have greatly increased the efficiency of cooling its data centres.^[65] In addition, DeepMind (alongside other Alphabet AI researchers) assists Google Play's personalized app recommendations.^[63] DeepMind has also collaborated with the Android team at Google on the creation of two new features available to people with devices running Android Pie, the ninth installment of Google's mobile operating system. These features, Adaptive Battery and Adaptive Brightness, use machine learning to conserve energy and make devices running the operating system easier to use. It is the first time DeepMind has used these techniques on such a small scale, with typical machine learning applications requiring orders of magnitude more compute power.^[66]

DeepMind Health

In July 2016, a collaboration between DeepMind and Moorfields Eye Hospital was announced to develop AI applications for healthcare.^[67] DeepMind's technology would be applied to the analysis of anonymised eye scans, searching for early signs of diseases leading to blindness.

In August 2016, a research programme with University College London Hospital was announced with the aim of developing an algorithm that can automatically differentiate between healthy and cancerous tissues in head and neck areas.^[68]

There are also projects with the Royal Free London NHS Foundation Trust and Imperial College Healthcare NHS Trust to develop new clinical mobile apps linked to electronic patient records.^[69] Staff at the Royal Free Hospital were reported as saying in December 2017 that access to patient data through the app had saved a ‘huge amount of time’ and made a ‘phenomenal’ difference to the management of patients with acute kidney injury. Test result data is sent to staff's mobile phones and alerts them to changes in the patient's condition. It also enables staff to see if someone else has responded, and to show patients their results in visual form.^[70]

In November 2017, DeepMind announced a research partnership with the Cancer Research UK Centre at Imperial College London with the goal of improving breast cancer detection by applying machine learning to mammography.^[71] Additionally, in February 2018, DeepMind announced it was working with the U.S. Department of Veterans Affairs in an attempt to use machine learning to predict the onset of acute kidney injury in patients, and also more broadly the general deterioration of patients during a hospital stay so that doctors and nurses can more quickly treat patients in need.^[72]

DeepMind developed an app called Streams, which sends alerts to doctors about patients at risk of acute kidney injury.^[73] On 13 November 2018, DeepMind announced that its health division and the Streams app would be absorbed into Google Health.^[74] Privacy advocates said the announcement betrayed patient trust and appeared to contradict previous statements by DeepMind that patient data would not be connected to Google accounts or services.^[75]^[76] A spokesman for DeepMind said that patient data would still be kept separate from Google services or projects.^[77]

NHS data-sharing controversy

In April 2016, New Scientist obtained a copy of a data-sharing agreement between DeepMind and the Royal Free London NHS Foundation Trust. The latter operates three London hospitals where an estimated 1.6 million patients are treated annually. The agreement shows DeepMind Health had access to admissions, discharge and transfer data, accident and emergency, pathology and radiology, and critical care at these hospitals. This included personal details such as whether patients had been diagnosed with HIV, suffered from depression or had ever undergone an abortion. The stated purpose was to conduct research seeking better outcomes in various health conditions.^[78]^[79]

A complaint was filed to the Information Commissioner's Office (ICO), arguing that the data should be pseudonymised and encrypted.^[80] In May 2016, New Scientist published a further article claiming that the project had failed to secure approval from the Confidentiality Advisory Group of the Medicines and Healthcare Products Regulatory Agency.^[81]

In May 2017, Sky News published a leaked letter from the National Data Guardian, Dame Fiona Caldicott, revealing that in her "considered opinion" the data-sharing agreement between DeepMind and the Royal Free took place on an "inappropriate legal basis".^[82] The Information Commissioner's Office ruled in July 2017 that the Royal Free hospital failed to comply with the Data Protection Act when it handed over personal data of 1.6 million patients to DeepMind.^[83]

DeepMind Ethics and Society

As of October 2017, DeepMind has expanded its focus to also include AI ethics. The new team is co-led by the former Google UK and EU policy manager Verity Harding and Sean Legassick,^[84] and its goal is to fund external research on the following themes: privacy, transparency and fairness; economic impacts; governance and accountability; managing AI risk; AI morality and values; and how AI can address the world's challenges. As a result, the team hopes to further understand the ethical implications of AI and help society see how AI can be beneficial.^[85]

This new subdivision of DeepMind is a completely separate unit from the Partnership on Artificial Intelligence to Benefit People and Society, a large partnership of major tech companies of which DeepMind is also a member.^[86]

References


[1] Template:Cite web

[2] Template:Cite web

[3] Template:Cite web

[4] Template:Cite arXiv

[5] Best of 2014: Google's Secretive DeepMind Startup Unveils a "Neural Turing Machine", MIT Technology Review

[6] Template:Cite journal

[7] Template:Citation

[8] Template:Cite arXiv

[9] Template:Cite news

[10] Template:Cite news

[11] Template:Cite news

[12] Template:Cite news

[13] Template:Cite web

[14] Template:Cite web

[15] Template:Cite web

[16] Template:Cite web

[17] Template:Cite news

[18] Template:Cite news

[19] Template:Cite web

[20] Template:Cite news

[21] Template:Cite news

[22] Template:Cite web

[23] Template:Cite web

[24] Template:Cite journal

[25] Template:Cite web

[26] Template:Cite news

[27] Template:Cite magazine

[28] Template:Cite news

[29] Template:Cite web

[30] Template:Cite news

[31] Template:Cite news

[32] Template:Cite news

[33] Template:Cite web

[34] Template:Cite arXiv

[35] Template:Cite arXiv

[36] Template:Cite web

[37] Template:Cite news

[38] Template:Cite news

[39] Template:Cite book

[40] Template:Cite web

[41] Template:Cite web

[42] DeepMind StarCraft II demonstration, StarCraft II official website

[43] "DeepMind AI’s new trick is playing ‘Quake III Arena’ like a human". Engadget. 3 July 2018.

[44] Template:Cite arXiv

[45] Template:Cite AV media

[46] Template:Cite news

[47] Template:Cite news

[48] Template:Cite web

[49] Template:Cite web

[50] Template:Cite web

[51] Template:Cite news

[52] Template:Cite news

[53] Template:Cite web

[54] Template:Cite journal

[55] Template:Cite news

[56] Template:Cite news

[57] Template:Cite web

[58] Template:Cite news

[59] Template:Cite news

[60] Template:Cite news

[61] Template:Cite news

[62] Template:Cite news

[63] Template:Cite news

[64] Template:Cite web

[65] Template:Cite web 20 July 2016.

[66] Template:Cite web 8 May 2018.

[67] Template:Cite news

[68] Template:Cite news

[69] Template:Cite news

[70] Template:Cite news

[71] Template:Cite web

[72] Template:Cite web

[73] Template:Cite web

[74] Template:Cite news

[75] Template:Cite news

[76] Template:Cite web

[77] Template:Cite news

[78] Template:Cite news

[79] Template:Cite news

[80] Template:Cite news

[81] Template:Cite news

[82] Template:Cite news

[83] Template:Cite news

[84] Template:Cite news

[85] Template:Cite news

[86] Template:Cite news
