Reward is the driving force for reinforcement learning (RL) agents. Given its central role in RL, reward is often assumed to be suitably general in its expressivity, as summarized by Sutton and Littman’s reward hypothesis:
In our work, we take first steps toward a systematic study of this hypothesis. To do so, we…
