Tutorial on Linear Function Approximators for Dynamic Programming and Reinforcement...
Tutorial on Linear Function Approximators for Dynamic Programming and Reinforcement Learning, Paperback by Geramifard, Alborz; Walsh, Thomas J.; Tellex, Stefanie; Chowdhary, Girish; Roy, Nicholas, ISBN 1601987609, ISBN-13 9781601987600, Brand New, Free shipping in the US.

A Markov Decision Process (MDP) is a natural framework for formulating sequential decision-making problems under uncertainty. In recent years, researchers have greatly advanced algorithms for learning and acting in MDPs. This book reviews such algorithms.