Model-Free Reinforcement Learning Algorithm: Difference between revisions

From GM-RKB
Jump to navigation Jump to search
No edit summary
No edit summary
 
Line 1: Line 1:
A [[Model-Free Reinforcement Learning Algorithm]] is a [[reinforcement learning algorithm]] that is a [[model-free learning algorithm]].
A [[Model-Free Reinforcement Learning Algorithm]] is a [[reinforcement learning algorithm]] that is a [[model-free learning algorithm]].
* <B>Context:</B
* <B>Context:</B>
** It can be characterized by its ability to learn from the environment's responses to its actions, rather than from a predetermined model.
** It can be characterized by its ability to learn from the environment's responses to its actions, rather than from a predetermined model.
** It can be applied in various scenarios, including game playing, autonomous navigation, and resource management, where the environment dynamics are not fully known.
** It can be applied in various scenarios, including game playing, autonomous navigation, and resource management, where the environment dynamics are not fully known.

Latest revision as of 23:54, 26 November 2023

A Model-Free Reinforcement Learning Algorithm is a reinforcement learning algorithm that is a model-free learning algorithm.



References

2020


2020

Algorithm Description Model Policy Action Space State Space Operator
DQN Deep Q Network Model-Free Off-policy Discrete Continuous Q-value
DDPG Deep Deterministic Policy Gradient Model-Free Off-policy Continuous Continuous Q-value
A3C Asynchronous Advantage Actor-Critic Algorithm Model-Free On-policy Continuous Continuous Advantage
TRPO Trust Region Policy Optimization Model-Free On-policy Continuous Continuous Advantage
PPO Proximal Policy Optimization Model-Free On-policy Continuous Continuous Advantage
TD3 Twin Delayed Deep Deterministic Policy Gradient Model-Free Off-policy Continuous Continuous Q-value
SAC Soft Actor-Critic Model-Free Off-policy Continuous Continuous Advantage