Bertsekas, Dimitri P.

Rollout, Policy Iteration, and Distributed Reinforcement Learning - Ed. 1 - United States -- Athena Scientific -- 2020 - xiii, 483p.

9781886529076


Mathematical optimization -- Dynamic programming -- Alpha Zero -- Infinite -- Stochastics --
Algorithms -- Multiagent Rollout

006.31 BER/R