
Reinforcement Learning: State-of-the-Art

Marco Wiering, Martijn van Otterlo
Reinforcement learning encompasses both a science of adaptive behavior of rational beings in uncertain environments and a computational methodology for finding optimal behaviors for challenging problems in control, optimization and adaptive behavior of intelligent agents. As a field, reinforcement learning has progressed tremendously in the past decade.

The main goal of this book is to present an up-to-date series of survey articles on the main contemporary sub-fields of reinforcement learning. This includes surveys on partially observable environments, hierarchical task decompositions, relational knowledge representation and predictive state representations. Furthermore, topics such as transfer, evolutionary methods and continuous spaces in reinforcement learning are surveyed. In addition, several chapters review reinforcement learning methods in robotics, in games, and in computational neuroscience. In total, seventeen different subfields are presented by mostly young experts in those areas, and together they represent the state of the art in current reinforcement learning research.

Marco Wiering works at the artificial intelligence department of the University of Groningen in the Netherlands. He has published extensively on various reinforcement learning topics. Martijn van Otterlo works in the cognitive artificial intelligence group at the Radboud University Nijmegen in the Netherlands. He has mainly focused on expressive knowledge representation in reinforcement learning settings.

Volume: 12
Year: 2012
Publisher: Springer
Language: English
ISBN 10: 3642276458
ISBN 13: 9783642276453
Series: Adaptation, Learning, and Optimization
File: PDF, 15.46 MB

Most frequently used terms