This code was tested on an NVIDIA RTX 4090 and requires:
- Anaconda3 or Miniconda3
- Python 3.8+
- PyTorch 1.10+
a. Create a conda virtual environment and activate it:

```shell
bash setup_env.sh
```

b. Modify the `LayerNorm` module in CLIP for fp16 inference:

```python
# miniconda3/envs/stablemofusion/lib/python3.8/site-packages/clip/model.py
class LayerNorm(nn.LayerNorm):
    """Subclass torch's LayerNorm to handle fp16."""

    def forward(self, x: torch.Tensor):
        if self.weight.dtype == torch.float32:
            orig_type = x.dtype
            ret = super().forward(x.type(torch.float32))
            return ret.type(orig_type)
        else:
            return super().forward(x)
```

- Download the pre-trained models from Google Cloud, put them into `./checkpoints/`, and arrange them in the following file structure:
```
StableMoFusion
├── checkpoints
│   ├── kit
│   │   └── ant_kit
│   │       ├── meta
│   │       │   ├── mean.npy
│   │       │   └── std.npy
│   │       ├── model
│   │       │   └── latest.tar
│   │       └── opt.txt
│   ├── t2m
│   │   └── ant_t2m
│   │       ├── meta
│   │       │   ├── mean.npy
│   │       │   └── std.npy
│   │       ├── model
│   │       │   └── latest.tar
│   │       └── opt.txt
│   └── footskate
│       ├── underpressure_pretrained.tar
│       └── t2m_pretrained.tar
```
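As a quick sanity check after arranging the files, a small script (file paths taken directly from the tree above; this is a convenience sketch, not part of the official setup) can verify that everything is in place:

```python
from pathlib import Path

# Expected checkpoint files, mirroring the tree above.
EXPECTED = [
    "checkpoints/kit/ant_kit/meta/mean.npy",
    "checkpoints/kit/ant_kit/meta/std.npy",
    "checkpoints/kit/ant_kit/model/latest.tar",
    "checkpoints/kit/ant_kit/opt.txt",
    "checkpoints/t2m/ant_t2m/meta/mean.npy",
    "checkpoints/t2m/ant_t2m/meta/std.npy",
    "checkpoints/t2m/ant_t2m/model/latest.tar",
    "checkpoints/t2m/ant_t2m/opt.txt",
    "checkpoints/footskate/underpressure_pretrained.tar",
    "checkpoints/footskate/t2m_pretrained.tar",
]

def check_checkpoints(root="."):
    """Return the list of expected checkpoint files missing under `root`."""
    root = Path(root)
    return [p for p in EXPECTED if not (root / p).is_file()]

if __name__ == "__main__":
    missing = check_checkpoints()
    if missing:
        print("Missing files:")
        for p in missing:
            print(" -", p)
    else:
        print("All checkpoint files found.")
```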
- Download the UnderPressure code and put it into `./UnderPressure/` like:

```
StableMoFusion
├── UnderPressure
│   ├── dataset
│   │   ├── S1_HoppingLeftFootRightFoot.pth
│   │   └── ...
│   ├── anim.py
│   ├── data.py
│   ├── demo.py
│   └── ...
```
- Update the import paths within `./UnderPressure/*.py`. To ensure that modules within `./UnderPressure/` can be imported and used seamlessly via `python -m`, the imports in the Python files located in `./UnderPressure/` must be rewritten as package imports. For example:
  - Replace `import util` with `from UnderPressure import util` in `UnderPressure/anim.py`
  - Replace `import anim, metrics, models, util` with `from UnderPressure import anim, metrics, models, util` in `UnderPressure/demo.py`
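These replacements can also be applied automatically. The sketch below (module names taken from the examples above; a convenience script, not part of the official setup) rewrites plain imports of the package's own modules into `from UnderPressure import ...` form while leaving third-party imports untouched:

```python
import re
from pathlib import Path

# Modules that live inside the UnderPressure package itself.
LOCAL_MODULES = {"anim", "data", "demo", "metrics", "models", "util"}

def patch_imports(source: str) -> str:
    """Rewrite `import a, b` into `from UnderPressure import a, b`
    when every imported name is a local UnderPressure module."""
    def repl(match):
        names = [n.strip() for n in match.group(1).split(",")]
        if all(n in LOCAL_MODULES for n in names):
            return "from UnderPressure import " + ", ".join(names)
        return match.group(0)  # e.g. `import torch` stays as-is
    return re.sub(r"^import ([\w ,]+)$", repl, source, flags=re.M)

def patch_package(pkg_dir="UnderPressure"):
    """Apply the rewrite in place to every .py file in the package."""
    for path in Path(pkg_dir).glob("*.py"):
        path.write_text(patch_imports(path.read_text()))
```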
- Run `demo.py` or `scripts/generate.py`:

```shell
# Generate from a single prompt
# e.g. generate a 4-second motion. The unit of `--motion_length` is seconds.
python -m scripts.inference.generate --text_prompt "A man walks forward and picks up a toolbox." --motion_length 4 --opt_path checkpoints/t2m/ant_t2m/opt.txt

# Generate from your own text file
# e.g. generate 5 motions from the prompts in a .txt file, setting each motion's frame
# length separately in the file. The unit of `--input_len` is frames.
python -m scripts.inference.generate --opt_path checkpoints/t2m/ant_t2m/opt.txt --input_text ./aaa_vvv.txt
```
You may also define:
- `--deviceid`
- `--diffuser_name`: sampler type in diffuser (e.g. 'ddpm', 'ddim', 'dpmsolver'); for related settings see `./config/diffuser_params.yaml`
- `--num_inference_steps`: number of iterative denoising steps during inference
- `--seed`: to sample different prompts
- `--motion_length`: in seconds
- `--opt_path`: for loading the model
- `--footskate_cleanup`: to use the footskate module in the diffusion framework
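If you script many generations, the options above can be assembled programmatically. A minimal sketch (flag names taken from the list above, script path from the commands above; a convenience wrapper, not part of the repository):

```python
def build_generate_cmd(opt_path, text_prompt=None, input_text=None,
                       motion_length=None, num_inference_steps=None,
                       diffuser_name=None, seed=None, footskate_cleanup=False):
    """Assemble a `scripts.inference.generate` command line as a list
    suitable for subprocess.run()."""
    cmd = ["python", "-m", "scripts.inference.generate", "--opt_path", opt_path]
    if text_prompt is not None:
        cmd += ["--text_prompt", text_prompt]
    if input_text is not None:
        cmd += ["--input_text", input_text]
    if motion_length is not None:
        cmd += ["--motion_length", str(motion_length)]
    if num_inference_steps is not None:
        cmd += ["--num_inference_steps", str(num_inference_steps)]
    if diffuser_name is not None:
        cmd += ["--diffuser_name", diffuser_name]
    if seed is not None:
        cmd += ["--seed", str(seed)]
    if footskate_cleanup:
        cmd.append("--footskate_cleanup")
    return cmd
```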
You will get:
- `output_dir/joints_npy/xx.npy`: the xyz pose sequence of the generated motion
- `output_dir/xx.mp4`: a visual animation of the generated motion

The `output_dir` is located in the checkpoint dir, like `checkpoints/t2m/t2m_condunet1d_batch64/samples_t2m_condunet1d_batch64_50173_seed0_a_person_waves_with_his_right_hand/`.
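The saved `.npy` can be inspected directly with NumPy. A minimal sketch (the file name `xx.npy` is a placeholder, and the `(frames, joints, 3)` layout is an assumption based on the xyz description above):

```python
import numpy as np

def load_motion(path):
    """Load a generated xyz pose sequence and report its shape."""
    joints = np.load(path)
    print(f"{path}: {joints.shape}")  # assumed (num_frames, num_joints, 3)
    return joints
```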
The visual animation will look something like this:
- HumanML3D: Follow the instructions in HumanML3D, then copy the resulting dataset to our repository:

```shell
cp -r ../HumanML3D/HumanML3D ./data/HumanML3D
```

- KIT: Download from HumanML3D (no processing needed this time) and place the result in `./data/KIT-ML`
We use the same evaluation protocol as this repo. You should download the pretrained weights of the contrastive models in t2m and kit for calculating FID and precision. To dynamically estimate the length of the target motion, length_est_bigru and GloVe data are required.

Unzip all files and arrange them in the following file structure:
```
StableMoFusion
└── data
    ├── glove
    │   ├── our_vab_data.npy
    │   ├── our_vab_idx.pkl
    │   └── our_vab_words.pkl
    ├── pretrained_models
    │   ├── kit
    │   │   └── text_mot_match
    │   │       └── model
    │   │           └── finest.tar
    │   └── t2m
    │       ├── text_mot_match
    │       │   └── model
    │       │       └── finest.tar
    │       └── length_est_bigru
    │           └── model
    │               └── finest.tar
    ├── HumanML3D
    │   ├── new_joint_vecs
    │   │   └── ...
    │   ├── new_joints
    │   │   └── ...
    │   ├── texts
    │   │   └── ...
    │   ├── Mean.npy
    │   ├── Std.npy
    │   ├── test.txt
    │   ├── train_val.txt
    │   ├── train.txt
    │   └── val.txt
    ├── KIT-ML
    │   ├── new_joint_vecs
    │   │   └── ...
    │   ├── new_joints
    │   │   └── ...
    │   ├── texts
    │   │   └── ...
    │   ├── Mean.npy
    │   ├── Std.npy
    │   ├── test.txt
    │   ├── train_val.txt
    │   ├── train.txt
    │   └── val.txt
    ├── kit_mean.npy
    ├── kit_std.npy
    ├── t2m_mean.npy
    └── t2m_std.npy
```
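The `*_mean.npy` / `*_std.npy` files hold per-dimension statistics used to z-normalize motion features before evaluation. A minimal sketch of that convention (the normalization form is the standard z-score used by this evaluation protocol; the array shapes below are illustrative):

```python
import numpy as np

def normalize(motion, mean, std):
    """Z-normalize motion features per dimension: (x - mean) / std."""
    return (motion - mean) / std

def denormalize(motion, mean, std):
    """Invert the normalization to recover raw features."""
    return motion * std + mean
```

For HumanML3D, `mean = np.load("data/t2m_mean.npy")` and `std = np.load("data/t2m_std.npy")` would be the natural inputs; for KIT-ML, the `kit_*` counterparts.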
HumanML3D

```shell
bash train.sh
```

You may also define the `--config_file` for training on multiple GPUs.
