AlphaZero

Also known as Alpha Zero, Alpha0, Alpha 0

AlphaZero is a computer program developed by artificial intelligence research company DeepMind to master the games of chess, shogi and go. This algorithm uses an approach similar to AlphaGo Zero.

Wikidata facts

Official name: AlphaZero

Show 2 more facts

inception: 2018-00-00
Stack Exchange tag: chess.stackexchange.com/tags/alphazero

Sources (2)

via Wikidata · CC0

~12 min read

Article

18 sections

Contents

Relation to AlphaGo Zero
Stockfish and Elmo
Training
Preliminary results
Outcome
Chess
Shogi
Go
Analysis
Reaction and criticism
Final results
Chess
Shogi
Reactions and criticisms
See also
Notes
References
External links

AlphaZero is a computer program developed by artificial intelligence research company DeepMind to master the games of chess, shogi and go. This algorithm uses an approach similar to AlphaGo Zero.

On December 5, 2017, the DeepMind team released a preprint paper introducing AlphaZero, which would soon play three games by defeating world-champion chess engines Stockfish, Elmo, and the three-day version of AlphaGo Zero. In each case it made use of custom tensor processing units (TPUs) that the Google programs were optimized to use. AlphaZero was trained solely via self-play using 5,000 first-generation TPUs to generate the games and 64 second-generation TPUs to train the neural networks, all in parallel, with no access to opening books or endgame tables. After four hours of training, DeepMind estimated AlphaZero was playing chess at a higher Elo rating than Stockfish 8; after nine hours of training, the algorithm defeated Stockfish 8 in a time-controlled 100-game tournament (28 wins, 0 losses, and 72 draws). The trained algorithm played on a single machine with four TPUs.