Reinforcement Learning Algorithms: Analysis and Applications by Boris Belousov (
150,01 €
By Boris Belousov, Hany Abdulsamad, Pascal Klink, Simone Parisi, Jan Peters. Prediction Error and Actor-Critic Hypotheses in the Brain. - Reviewing on-policy / o-policy critic learning in the context of Temporal Dierences and Residual Learning.
Jetzt bei Ebay: