This repository contains the code for BehaviorRetrieval, a few-shot imitation learning method that queries unlabeled datasets.
- Install Python 3.7
- Install torch + torchvision
- Install robosuite (`pip install robosuite` or install from source). You also need to install `mujoco_py`.
- Install dependencies in `requirements.txt` (covers robomimic and roboverse dependencies)
- Install robomimic by using `pip install -e .` inside the `robomimic` folder
- (If you want to run Office) Install roboverse by using `pip install -e .` inside the `roboverse` folder
All configurations for training live in `configs/`. We follow the robomimic convention of keeping hyperparameters in `.json` files. We provide special Office configurations for the Office task due to differences in the Roboverse environment.
- Can task: use the `paired` data provided by Robomimic: download
- Square task: use the MachinePolicy to collect demonstrations. Read the script in `run_trained_agent.sh` for more information
- Office task: use `scripted_collect.sh` in the `roboverse/scripts` folder. Use `utils/roboverse_to_robomimic.py` to convert the demo format to the one used by our codebase
Use `train_embedder.py`, which can handle both contrastive and VAE embedders. For example configs, see `train_embedder.sh`.
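For intuition, a contrastive embedder pulls paired observations together and pushes unrelated ones apart. Below is a minimal NumPy sketch of an InfoNCE-style loss over a batch of (anchor, positive) pairs; the function name, shapes, and temperature are illustrative assumptions, not the actual code in `train_embedder.py`:

```python
import numpy as np

def info_nce_loss(anchors, positives, temperature=0.1):
    """InfoNCE-style loss: each anchor should match its own positive
    against all other positives in the batch (in-batch negatives)."""
    # L2-normalize so dot products are cosine similarities
    a = anchors / np.linalg.norm(anchors, axis=1, keepdims=True)
    p = positives / np.linalg.norm(positives, axis=1, keepdims=True)
    logits = a @ p.T / temperature                 # (N, N) similarity matrix
    logits -= logits.max(axis=1, keepdims=True)    # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # the correct pairing lies on the diagonal
    return -np.mean(np.diag(log_probs))

rng = np.random.default_rng(0)
z = rng.normal(size=(8, 16))
# near-identical pairs should score a much lower loss than random pairs
loss_aligned = info_nce_loss(z, z + 0.01 * rng.normal(size=(8, 16)))
loss_random = info_nce_loss(z, rng.normal(size=(8, 16)))
```

The loss is minimized when each anchor's embedding is closest to its own positive, which is the property the retrieval step relies on.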
Use `run_weighted_BC.py`, which runs BehaviorRetrieval and our baselines. For example configs, see `run_weighted_BC.sh`.
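At a high level, BehaviorRetrieval scores each unlabeled sample by embedding similarity to the target demos and keeps the closest ones for weighted BC. A minimal NumPy sketch of that retrieval step; the `retrieve` helper, cosine scoring, and `frac` cutoff are illustrative assumptions, not the actual logic in `run_weighted_BC.py`:

```python
import numpy as np

def retrieve(prior_emb, target_emb, frac=0.25):
    """Score each prior-data embedding by its best cosine similarity to
    any target-demo embedding, then keep the top `frac` fraction."""
    p = prior_emb / np.linalg.norm(prior_emb, axis=1, keepdims=True)
    t = target_emb / np.linalg.norm(target_emb, axis=1, keepdims=True)
    scores = (p @ t.T).max(axis=1)        # best match per prior sample
    k = max(1, int(frac * len(scores)))
    keep = np.argsort(scores)[-k:]        # indices of the most similar samples
    return keep, scores

# toy data: 20 samples near the target demos, 60 unrelated ones
rng = np.random.default_rng(1)
target = rng.normal(size=(5, 8))
relevant = target[rng.integers(0, 5, size=20)] + 0.05 * rng.normal(size=(20, 8))
irrelevant = rng.normal(size=(60, 8))
prior = np.vstack([relevant, irrelevant])
keep, scores = retrieve(prior, target, frac=0.25)
```

In this toy setup the retained quarter of the prior data is dominated by the samples constructed near the target demos.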
To train with vanilla BC, use `train.py`. We use this to pretrain the model for the HG-DAGGER experiments. See example config in `train.sh`.
To run HG-DAGGER with BehaviorRetrieval, use `run_weighted_corrections.py`. See `run_weighted_corrections.sh` for an example config.
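For intuition, HG-DAGGER collects corrective labels only on the states where a human supervisor chooses to intervene. A toy sketch of one such round; the `gate`, `expert`, and 1-D setup are stand-ins for the real human-in-the-loop interface, not the code in `run_weighted_corrections.py`:

```python
import numpy as np

def hg_dagger_round(policy, expert, states, gate):
    """One HG-DAGGER-style round: roll the policy over `states`; whenever
    the gate flags a state, record the expert's corrective action."""
    corrections = []
    for s in states:
        if gate(s, policy(s)):            # supervisor decides to intervene
            corrections.append((s, expert(s)))
    return corrections

# toy 1-D setup: the expert returns the sign of the state
expert = lambda s: np.sign(s)
policy = lambda s: 1.0                    # flawed policy: always acts +1
gate = lambda s, a: a != expert(s)        # intervene when actions disagree
states = np.array([-2.0, -1.0, 0.5, 3.0])
data = hg_dagger_round(policy, expert, states, gate)
```

Here corrections are gathered only on the negative states, where the flawed policy disagrees with the expert; those pairs would then be aggregated into the training set.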
Use `run_trained_agent.py` to run evals. See `run_trained_agent.sh` for configs.
This codebase works with real robots out of the box, provided the robot is exposed through a Gym-style environment interface.
Use `visualizing_embeddings.py` to compute a t-SNE or PCA visualization of the embeddings. Use `embedding_analysis.py` to plot how similarity changes over the course of an episode (like Figure 8 in our paper).
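As a rough idea of what the PCA view computes, here is a self-contained NumPy sketch that projects embeddings onto their top two principal components; this is an illustration of the technique, not the actual code in `visualizing_embeddings.py`:

```python
import numpy as np

def pca_2d(embeddings):
    """Project embeddings onto their top-2 principal components,
    giving the 2-D coordinates a PCA scatter plot would show."""
    centered = embeddings - embeddings.mean(axis=0)
    # SVD of the centered data yields the principal directions in vt
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:2].T            # (N, 2) coordinates

rng = np.random.default_rng(0)
emb = rng.normal(size=(100, 32))          # placeholder embeddings
coords = pca_2d(emb)
```

The first coordinate captures at least as much variance as the second, since singular values come out in descending order.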