Parameterized modular inverse reinforcement learning
MetadataShow full item record
Reinforcement learning and inverse reinforcement learning can be used to model and understand human behaviors. However, due to the curse of dimensionality, their use as a model for human behavior has been limited. Inspired by observed natural behaviors, one approach is to decompose complex tasks into independent sub-tasks, or modules. Using this approach, we extended earlier work on modular inverse reinforcement learning, and developed what we called a parameterized modular inverse reinforcement learning algorithm. We first demonstrate the correctness and efficiency of our algorithm in a simulated navigation task. We then show that our algorithm is able to estimate a reward function and discount factor for real human navigation behaviors in a virtual environment, and train an agent that imitates the behavior of human subjects.
Showing items related by title, author, creator and subject.
Memarian, Farzan (2021-08-12)Imitation learning refers to a family of learning algorithms enabling the learning agents to learn directly from demonstrations provided by experts, practitioners, and users. While imitation learning methods have been ...
Hausknecht, Matthew John (2016-12)Reinforcement learning is the area of machine learning concerned with learning which actions to execute in an unknown environment in order to maximize cumulative reward. As agents begin to perform tasks of genuine interest ...
Pavse, Brahma Suneil (2020-05-05)Temporal difference (TD) learning is one of the main foundations of modern reinforcement learning. This thesis studies the use of TD(0), a canonical TD algorithm, to estimate the value function of a given evaluation policy ...