DeepMind AlphaZero

AlphaZero is an artificial-intelligence algorithm based on machine-learning techniques developed by Google DeepMind. It is a generalization of AlphaGo Zero, a predecessor developed specifically for the game of Go, and in turn an evolution of AlphaGo, the first software capable of achieving superhuman performances in the game of Go. Like AlphaGo Zero, it uses the Monte Carlo Tree Search (MCTS), guided by a deep convolutional neural network trained for reinforcement.

On December 5th 2017, the DeepMind team published a preprint on arXiv in which some of the results obtained by AlphaZero were presented in several classic table games, reaching a superhuman level in the game of chess, shogi, and Go with only a few hours of training, ...

Get Keras Reinforcement Learning Projects now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.