Reinforcement Learning Intermediate
reinforcement learning
A family of algorithms that learn an optimal policy, whose goal is to maximize return when interacting with an environment.
Plain English Explanation
A family of algorithms that learn an optimal policy, whose goal is to maximize return when interacting with an environment. For example, the ultimate reward of most games is victory. Reinforcement learning systems can become expert at playing complex games by evaluating sequences of previous game moves that ultimately led to wins and sequences that ultimately led to losses.
How is it used?
Practitioners refer to reinforcement learning when building, training, or evaluating machine learning systems. It appears in research papers, product documentation, and technical discussions about AI capabilities and limitations.