Data from: One-shot learning and behavioral eligibility traces in sequential decision making
Marco P. Lehmann, He A. Xu, Vasiliki Liakoni, Michael H. Herzog, Wulfram Gerstner & Kerstin Preuschoff
In many daily tasks we make multiple decisions before reaching a goal. In order to learn such sequences of decisions, a mechanism to link earlier actions to later reward is necessary. Reinforcement learning theory suggests two classes of algorithms solving this credit assignment problem: In classic temporal-difference learning, earlier actions receive reward information only after multiple repetitions of the task, whereas models with eligibility traces reinforce entire sequences of actions from a single experience (one-shot)....
1 citation reported since publication in 2019.
45 views reported since publication in 2019.
These counts follow the
COUNTER Code of Practice,
meaning that Internet robots and repeats within a certain time frame are excluded.
What does this mean?
7 downloads reported since publication in 2019.
These counts follow the
COUNTER Code of Practice,
meaning that Internet robots and repeats within a certain time frame are excluded.
What does this mean?