Policy Gradient Methods
See: Markov Decision Process; Reinforcement Learning; Value Function Approximation.
References
2011
(Peters & Bagnell, 2011) ⇒ Jan Peters; J. Andrew Bagnell. (2011). “Policy Gradient Methods.” In: (Sammut & Webb, 2011) p.774