MISA

This repository contains the official implementation of the paper "Mutual Information Regularized Offline Reinforcement Learning".

Before you start, make sure to run

pip install -e .

Apart from this, you'll also have to set up your MuJoCo environment and key. Please follow the D4RL repo and set up the environment accordingly.

Run Experiments

You can run MISA experiments using the following command:

python -m experiments.main --env 'walker2d-medium-v2' --logging.output_dir './experiment_output'
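To run the same command over several datasets, a small helper can build one command per environment. The following is a minimal sketch, not part of this repository: `build_command` and `ENVS` are hypothetical names, only the `--env` and `--logging.output_dir` flags shown above are used, and the environment names other than `walker2d-medium-v2` are assumed examples of D4RL dataset names.

```python
import shlex

# Assumed example D4RL dataset names; only 'walker2d-medium-v2' appears in this README.
ENVS = ["walker2d-medium-v2", "hopper-medium-v2", "halfcheetah-medium-v2"]


def build_command(env, output_dir="./experiment_output"):
    """Return the argument list for one run, using only the flags shown above."""
    return [
        "python", "-m", "experiments.main",
        "--env", env,
        "--logging.output_dir", output_dir,
    ]


if __name__ == "__main__":
    for env in ENVS:
        # Print rather than execute, so each command can be inspected first.
        print(shlex.join(build_command(env)))
```

Each printed line can be pasted into a shell, or the argument list can be passed to `subprocess.run` once the commands look right.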

Weights and Biases Online Visualization Integration

This codebase can also log to the W&B online visualization platform. To log to W&B, first set your W&B API key as an environment variable. Alternatively, you can simply run wandb login.
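Before launching a long run, it can help to confirm that the key is visible to the process. The sketch below assumes the standard `WANDB_API_KEY` variable name read by the wandb client; `wandb_key_is_set` is a hypothetical helper, not part of this repository, and it does not detect credentials stored by `wandb login`.

```python
import os


def wandb_key_is_set(environ=None):
    """Return True if a non-empty WANDB_API_KEY is present in the environment."""
    environ = os.environ if environ is None else environ
    return bool(environ.get("WANDB_API_KEY", "").strip())


if __name__ == "__main__":
    if wandb_key_is_set():
        print("WANDB_API_KEY is set.")
    else:
        print("WANDB_API_KEY is not set; export it or run `wandb login`.")
```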

We provide a wandb link as a reference for the reproduced results.

Credits

The project heavily borrows from this Jax CQL implementation.

Disclaimer

This is not an official Sea Limited or Garena Online Private Limited product.
