ActFusion: A Unified Diffusion Model for Action Segmentation and Anticipation (NeurIPS 2024)

This repository provides the official implementation of our NeurIPS 2024 paper:

ActFusion: A Unified Diffusion Model for Action Segmentation and Anticipation
Dayoung Gong, Suha Kwak, and Minsu Cho
NeurIPS, Vancouver, 2024

🛠️ Recommended Environment & Installation

Recommended Environment

Python 3.8.20
CUDA 11.7
PyTorch 1.13.0+cu117

Install dependencies

pip install -r requirements.txt

📁 Dataset Setup

Download the preprocessed dataset from this link (borrowed from MS-TCN).

Create a directory structure as below, and place the datasets inside the datasets/ folder:

project-root/
├── ckpt/                 # pretrained model checkpoints
│   ├── breakfast/
│   └── 50salads/
├── configs/              # auto-generated JSON config files
│   ├── Breakfast.json
│   └── 50salads.json
├── datasets/             # downloaded datasets
│   ├── breakfast/
│   └── 50salads/
├── result/               # experiment outputs will be saved here
├── src/                  # source code
│   ├── model/
│   │   ├── actfusion.py
│   │   ├── backbone.py
│   │   ├── attn.py
│   │   └── __init__.py
│   ├── dataset.py
│   ├── default_configs.py
│   ├── trainer.py
│   ├── utils.py
│   ├── vis.py
│   └── __init__.py
├── main.py
├── LICENSE
└── README.md

🚀 Training

Generate config files by running:

python default_configs.py

Then start training with:

python main.py --config configs/Breakfast.json --result_dir $result_dir --split $split_num

🧪 Testing with Pretrained Checkpoints

Download pretrained checkpoints from this link
Place the downloaded folders inside the ckpt/ directory
Run evaluation:

python main.py --config configs/Breakfast.json --result_dir $result_dir --split $split_num --test --ckpt

🙏 Acknowledgement & 📚 Citation

This repository builds upon the DiffAct codebase. We thank the original authors for sharing their work.

If you find our code or paper helpful, please consider citing both ActFusion and DiffAct:

@article{gong2024actfusion,
  title={ActFusion: A Unified Diffusion Model for Action Segmentation and Anticipation},
  author={Gong, Dayoung and Kwak, Suha and Cho, Minsu},
  journal={Advances in Neural Information Processing Systems},
  volume={37},
  pages={89913--89942},
  year={2024}
}

@inproceedings{liu2023diffusion,
  title={Diffusion Action Segmentation},
  author={Liu, Daochang and Li, Qiyue and Dinh, Anh-Dung and Jiang, Tingting and Shah, Mubarak and Xu, Chang},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
  year={2023}
}

📄 License

This project is licensed under the MIT License.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ActFusion: A Unified Diffusion Model for Action Segmentation and Anticipation (NeurIPS 2024)

🛠️ Recommended Environment & Installation

📁 Dataset Setup

🚀 Training

🧪 Testing with Pretrained Checkpoints

🙏 Acknowledgement & 📚 Citation

📄 License

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
assets		assets
configs		configs
src		src
LICENSE		LICENSE
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

License

gongda0e/ActFusion

Folders and files

Latest commit

History

Repository files navigation

ActFusion: A Unified Diffusion Model for Action Segmentation and Anticipation (NeurIPS 2024)

🛠️ Recommended Environment & Installation

📁 Dataset Setup

🚀 Training

🧪 Testing with Pretrained Checkpoints

🙏 Acknowledgement & 📚 Citation

📄 License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages