Parameterized modular inverse reinforcement learning
Abstract
Reinforcement learning and inverse reinforcement learning can be used to model and understand human behaviors. However, due to the curse of dimensionality, their use as a model for human behavior has been limited. Inspired by observed natural behaviors, one approach is to decompose complex tasks into independent sub-tasks, or modules. Using this approach, we extended earlier work on modular inverse reinforcement learning, and developed what we called a parameterized modular inverse reinforcement learning algorithm. We first demonstrate the correctness and efficiency of our algorithm in a simulated navigation task. We then show that our algorithm is able to estimate a reward function and discount factor for real human navigation behaviors in a virtual environment, and train an agent that imitates the behavior of human subjects.
Department
Subject
Collections
Related items
Showing items related by title, author, creator and subject.
-
Towards convergent offline reinforcement learning
Snyder, Braham; 0000-0003-4134-7839 (2023-04-20)Reinforcement learning (RL) is a leading method for automated sequential decision-making. However, RL is rarely used in the real world. This neglect is due to instability and inefficiency, often even occurring on benchmark ... -
Side information, robustness, and self supervision in imitation learning
Memarian, Farzan (2021-08-12)Imitation learning refers to a family of learning algorithms enabling the learning agents to learn directly from demonstrations provided by experts, practitioners, and users. While imitation learning methods have been ... -
Cooperation and communication in multiagent deep reinforcement learning
Hausknecht, Matthew John (2016-12)Reinforcement learning is the area of machine learning concerned with learning which actions to execute in an unknown environment in order to maximize cumulative reward. As agents begin to perform tasks of genuine interest ...