Online Reward-Maximization Task

From GM-RKB

An Online Reward-Maximization Task is an online learning task in which each decision can earn a reward (as defined by a utility function and subject to constraint rules), and in which the goal is to maximize cumulative reward over time through repeated interaction with a dynamic environment.
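
As an illustration of the definition above, the following is a minimal sketch of one possible instance: a Bernoulli multi-armed bandit played with an epsilon-greedy rule. The choice of a bandit, the function name `run_epsilon_greedy`, the arm probabilities, and the parameter values are all assumptions made for this example and do not come from the source. The utility function here is simply the sum of 0/1 rewards, and the only constraint is that one arm is pulled per step.

```python
import random


def run_epsilon_greedy(arm_means, steps=5000, epsilon=0.1, seed=0):
    """Play a Bernoulli bandit online; return (total_reward, pull_counts).

    arm_means are hypothetical success probabilities, hidden from the learner.
    """
    rng = random.Random(seed)
    n_arms = len(arm_means)
    counts = [0] * n_arms        # number of pulls per arm
    estimates = [0.0] * n_arms   # running mean reward per arm
    total_reward = 0.0

    for _ in range(steps):
        # Decision: explore with probability epsilon, otherwise exploit
        # the arm with the highest estimated reward (ties go to the lowest index).
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)
        else:
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Environment: emit a 0/1 reward for the chosen arm.
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0

        # Online update: incremental mean, no stored history.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return total_reward, counts


if __name__ == "__main__":
    total, counts = run_epsilon_greedy([0.2, 0.5, 0.8])
    print("cumulative reward:", total)
    print("pulls per arm:", counts)
```

Setting `epsilon=0.0` in this sketch removes exploration entirely, so the learner keeps pulling the first arm it tried. That shows why reward maximization over time generally involves trading off exploration against exploitation.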


