Tutorial über lineare Funktions-Approximatoren für dynamische Programmierung und Verstärkung...
Tutorial on Linear Function Approximators for Dynamic Programming and Reinforcement Learning, Paperback by Geramifard, Alborz; Walsh, Thomas J.; Tellex, Stefanie; Chowdhary, Girish; Roy, Nicholas, ISBN 1601987609, ISBN-13 9781601987600, Like New Used, Free P&P in the UK A Markov Decision Process (MDP) is a natural framework for formulating sequential decision-making problems under uncertainty. In recent years, researchers have greatly advanced algorithms for learning and acting in MDPs. This book reviews such algorithms.
Jetzt bei Ebay: