Reward Model

From GM-RKB

A Reward Model is a reward function that is implemented as a trained ML model, which predicts the reward for a given sequence of actions in a decision-making process and thereby guides an agent toward a specific objective.
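To make the definition concrete, the following is a minimal sketch rather than a reference implementation. It assumes a hypothetical action space of three discrete actions, encodes each action sequence as per-action counts, and fits a linear model to a handful of made-up (sequence, reward) examples by stochastic gradient descent. The names `featurize`, `LinearRewardModel`, `TRAINING_DATA`, and `candidates` are illustrative assumptions and do not come from the source.

```python
from typing import List, Sequence, Tuple

N_ACTIONS = 3  # hypothetical action space: actions are the integers 0, 1, 2


def featurize(actions: Sequence[int]) -> List[float]:
    """Encode an action sequence as per-action counts (order is ignored)."""
    counts = [0.0] * N_ACTIONS
    for action in actions:
        counts[action] += 1.0
    return counts


class LinearRewardModel:
    """Toy reward model: predicted reward = dot(weights, action counts)."""

    def __init__(self) -> None:
        self.weights = [0.0] * N_ACTIONS

    def predict(self, actions: Sequence[int]) -> float:
        features = featurize(actions)
        return sum(w * x for w, x in zip(self.weights, features))

    def fit(
        self,
        data: Sequence[Tuple[Sequence[int], float]],
        lr: float = 0.05,
        epochs: int = 1000,
    ) -> None:
        """Minimise squared error with plain stochastic gradient descent."""
        for _ in range(epochs):
            for actions, reward in data:
                features = featurize(actions)
                error = self.predict(actions) - reward
                for i, x in enumerate(features):
                    self.weights[i] -= lr * error * x


# Made-up training examples (assumption): each pairs an action sequence
# with the reward a labeller or environment assigned to it.
TRAINING_DATA = [
    ([0], 1.0),
    ([1], -1.0),
    ([2], 0.5),
    ([0, 0, 1], 1.0),
    ([1, 2, 2], 0.0),
    ([0, 1, 2], 0.5),
    ([0, 0, 2], 2.5),
]

model = LinearRewardModel()
model.fit(TRAINING_DATA)

# The trained model can then guide an agent by scoring candidate sequences.
candidates = [[0, 0, 0], [1, 1, 0], [2, 2, 2]]
best = max(candidates, key=model.predict)
print("preferred sequence:", best)
```

The linear count-based model is chosen only for brevity. In practice a reward model is typically a richer learned function, but the role is the same: it is trained from data and then queried to score candidate behaviour.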


