MDP Solver Experiments

This project provides tools for running experiments on various Markov Decision Process (MDP) types using different solvers for "MDP Geometry, Normalization and Reward Balancing Solvers" (link). It includes scripts for running experiments and plotting the results.

Description

The project consists of two main scripts:

run_experiments.py: This script reads an experiment setup from a JSON file, runs the specified experiments on different MDP types using various solvers, and saves the results.
plot_results.py: This script reads the saved experiment results and creates plots to visualize the performance of different solvers across various MDP types and parameters.

Required Packages

The following packages are required to run the scripts:

torch
matplotlib
numpy
tqdm

Note: There are no specific version requirements for these packages. The latest versions should work fine.

Usage

To run the experiments:

Prepare the experiment setup JSON file with the desired parameters.
Run the run_experiments.py script with the JSON file as the argument:
```
python run_experiments.py -s experiment_setup.json
```

To plot the results using the same setup file:

Run the plot_results.py script:

python plot_results.py -s experiment_setup.json -m [metric]

This will generate plots in the 'figures' directory.

Reproducing Paper Results

To reproduce the results from the paper, use the full_setup1.json file for the known MDP case, and setup_stochastic_random{2,3}.json for stochastic experiments. Use metric len for known MDP case and opt_policy_diff for stochastic experiments. For example:

python run_experiments.py -s full_setup1.json
python plot_results.py -s full_setup1.json -m len

This will generate plots in the 'figures' directory.

Project Structure

MDP_Collection.py: Contains implementations of different MDP types.
Solvers.py: Implements various MDP solvers.
run_experiments.py: Main script for running experiments.
plot_results.py: Script for plotting experiment results.
experiment_setups/: Directory for storing experiment setups.
experiment_setups/full_setup1.json: Setup file to reproduce paper results.
experiment_setups/setup_stochastic_random{2,3}.json: Setup file for stochastic experiments.
results/: Directory for storing experiment results.
figures/: Directory for storing experiment plots.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
experiment_setups		experiment_setups
MDP.py		MDP.py
MDP_Collection.py		MDP_Collection.py
README.md		README.md
Solvers.py		Solvers.py
Stochstic_solvers.py		Stochstic_solvers.py
plot_results.py		plot_results.py
run_experiments.py		run_experiments.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MDP Solver Experiments

Description

Required Packages

Usage

Reproducing Paper Results

Project Structure

About

Uh oh!

Releases

Packages

Languages

gaarsmu/MDP-Geometry-Normalization-and-Reward-Balancing-Solvers

Folders and files

Latest commit

History

Repository files navigation

MDP Solver Experiments

Description

Required Packages

Usage

Reproducing Paper Results

Project Structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages