Q-learning

'''Q-learning''' is a reinforcement learning algorithm that trains an agent to assign values to its possible actions based on its current state, without requiring a model of the environment (model-free). It can handle problems with stochastic transitions and rewards without requiring adaptations.

Wikidata facts

Show 2 more facts

Stack Exchange tag: ai.stackexchange.com/tags/q-learning
ACM Classification Code (2012): 10010329

Sources (4)

via Wikidata · CC0

~16 min read

Article

19 sections

Contents

Reinforcement learning
Algorithm
Influence of variables
Learning rate
Discount factor
Initial conditions (''Q''<sub>0</sub>)
Implementation
Function approximation
Quantization
History
Variants
Deep Q-learning
Double Q-learning
Others
Multi-agent learning
Limitations
See also
References
External links

'''Q-learning' is a reinforcement learning algorithm that trains an agent to assign values to its possible actions based on its current state, without requiring a model of the environment (model-free). It can handle problems with stochastic transitions and rewards without requiring adaptations.

For example, in a grid maze, an agent learns to reach an exit worth 10 points. At a junction, Q-learning might assign a higher value to moving right than left if right gets to the exit faster, improving this choice by trying both directions over time.