Category
page 1Reinforcement learning
reinforcement learning
type of machine learning where an agent learns how to behave in an environment by performing actions and receiving rewards or penalties in return, aiming to maximize the cumulative reward over time

Q-learning
'''Q-learning''' is a reinforcement learning algorithm that trains an agent to assign values to its possible actions based on its current state, without requiring a model of the environment (model-free). It can handle problems with stochastic transitions and rewards without requiring adaptations.
reinforcement learning from human feedback
variant of reinforcement learning
Temporal difference learning
intelligent Tutoring System
deep reinforcement learning
techniques combining deep learning and reinforcement learning principles to create efficient machine learning algorithms
SARSA
State–action–reward–state–action (SARSA) is an algorithm for learning a Markov decision process policy, used in the reinforcement learning area of machine learning. It was proposed by Rummery and Niranjan in a technical note with the name "Modified Connectionist Q-Learning" (MCQ-L). The alternative name SARSA, proposed by Rich Sutton, was only mentioned as a footnote.
OpenAI Five
machine-learned bot project using the video game Dota 2
Proximal Policy Optimization
model-free reinforcement learning algorithm
Multi-agent reinforcement learning
sub-field of reinforcement learning
model-free reinforcement learning
type of machine learning algorithm