Reinforcement learning is a subset of machine learning, where AI agents learn from the environment by interacting with it and improving their performance. This branch of AI learns by trial and error instead of human supervision. The following diagram illustrates how an AI agent acts on the environment and receives feedback after each action. Feedback is made up of two parts: reward and the next state of the environment. Rewards are defined by a human:
Google's DeepMind published a paper in 2013 about Playing Atari with Deep Reinforcement Learning. In this paper, a new algorithm called Deep Q Network (DQN). It explains how an AI ...