ML-IRL is an algorithm for inverse reinforcement learning. It is discussed in the NeurIPS paper (link) and the AISTATS paper (link).
You can download our expert data from the google_drive link.
- PyTorch 1.5+
- OpenAI Gym
- MuJoCo
- ruamel.yaml (install with `pip install ruamel.yaml`; see the install sketch below)
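The other requirements can be installed in a similar way. Below is a minimal sketch, assuming the standard pip package names `torch`, `gym`, and `mujoco-py` (the MuJoCo binaries themselves still need to be installed separately, and no exact versions are pinned by this repo):

```
# Hypothetical dependency install; package names and versions are assumptions, not pinned by this repo.
pip install "torch>=1.5" gym mujoco-py ruamel.yaml
```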
- ML-IRL (our method): `ml/`
- SAC agent: `common/`
- Environments: `envs/`
- Configurations: `configs/`
- All the experiments are to be run under the root folder.
- Before starting experiments, please set the environment variable with `export PYTHONPATH=${PWD}:$PYTHONPATH`.
- We use YAML files in `configs/` for experimental configurations. Please change the `obj` value (in the first line) for each method; see the sketch after this list. Here is the list of `obj` values:
  - Our methods (ML-IRL): ML_S: `maxentirl`, ML_SA: `maxentirl_sa`
- After running, you will see the training logs in the `logs/` folder.
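For example, switching a configuration to the state-action objective (ML_SA) could look like the following. This is only a sketch: it assumes the `obj` key sits alone on the first line of the YAML file, as noted above, and uses `configs/samples/agents/hopper.yml` purely as an illustration.

```
# Set obj (first line of the config, per the notes above) to maxentirl_sa (GNU sed).
sed -i '1s/.*/obj: maxentirl_sa/' configs/samples/agents/hopper.yml
head -n 1 configs/samples/agents/hopper.yml   # should now print: obj: maxentirl_sa
```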
All the commands below are also provided in `run.sh`.
First, you can generate expert data by training the expert policy:
```
python common/train_gd.py configs/samples/experts/{env}.yml   # env is in {hopper, walker2d, halfcheetah, ant}
python common/collect.py configs/samples/experts/{env}.yml    # env is in {hopper, walker2d, halfcheetah, ant}
```
Then, train our method with the provided expert data (Policy Performance):
```
# you can vary obj in {`maxentirl_sa`, `maxentirl`}
python ml/irl_samples.py configs/samples/agents/{env}.yml
```
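Putting the two steps together, an end-to-end run for a single environment looks like the following (Hopper is used purely as an example; any environment from the list above works the same way, and the `obj` value should be set in the agent config beforehand):

```
# End-to-end example for one environment (hopper): the commands above with {env} substituted.
export PYTHONPATH=${PWD}:$PYTHONPATH
python common/train_gd.py configs/samples/experts/hopper.yml   # train the expert policy
python common/collect.py configs/samples/experts/hopper.yml    # collect expert data
python ml/irl_samples.py configs/samples/agents/hopper.yml     # train ML-IRL on the expert data
```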
For the transfer experiment, first generate expert data by training the expert policy. Make sure that the `env_name` parameter in `configs/samples/experts/ant_transfer.yml` is set to `CustomAnt-v0`.
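One quick way to verify this before training (a sketch that assumes the parameter appears literally as `env_name` in the YAML file):

```
# Check the environment name in the transfer config (assumes a literal env_name key).
grep env_name configs/samples/experts/ant_transfer.yml   # expected to show: CustomAnt-v0
```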
```
python common/train_gd.py configs/samples/experts/ant_transfer.yml
python common/collect.py configs/samples/experts/ant_transfer.yml
```
After the training is done, you can choose one of the saved reward models to train a policy from scratch (Recovering the Stationary Reward Function).
Transferring the reward to the disabled Ant:
```
python common/train_optimal.py configs/samples/experts/ant_transfer.yml
python ml/irl_samples.py configs/samples/agents/data_transfer.yml   # data transfer
```