[CVPR 2025] Alias-free Latent Diffusion Models

Yifan Zhou¹ Zeqi Xiao¹ Shuai Yang² Xingang Pan¹
¹S-Lab, Nanyang Technological University,
²Wangxuan Institute of Computer Technology, Peking University

Project Page | Paper

Official PyTorch implementation of Alias-free latent diffusion models.

Motivation

teaser_video.mp4

We found the VAE and denoising network in LDM are not equivariant to fractional shifts. We propose an alias-free framework to improve the fractional shift equivariance of LDM. We demonstrate the effectiveness of our method in various applications, including video editing, frame interpolation, super-resolution and normal estimation.

TODO

Chinese blog posts
Refine documents
Training scripts

Update

[12/2025]: Training code relased.
[03/2025]: Repository created.

Installation

Clone the repository. (Don't forget --recursive. Otherwise, please run git submodule update --init --recursive)

git clone [email protected]:SingleZombie/AFLDM.git --recursive
cd AFLDM
pip install -e .

Install PyTorch in your Python environment.
Install pip libraries.

pip install -r requirements.txt

Inference

All the detailed commands are shown inside .sh files.

Unconditional Generation Shift

bash shift_ldm_ffhq.sh

Video Editing

Due to the limitation of our computation resource, the finetuned alias-free Stable Diffusion has a poor generation capacity. It can only perform simple editing.

bash video_editing.sh

Image Interpolation

bash image_interpolation.sh

Super-resolution Shift

This is not a blind SR. The degradation function is fixed.

bash shift_ldm_sr.sh

Normal Estimation Shift

bash shift_normal_estimation.sh

Training

ImageNet Dataset

Download ImageNet (ILSVRC2012_img_train.tar) and extract the sub files. The organization of directory should be like:

train
├── n01440764
└── n01443537
...

Alias-free VAE

Update train_data_dir with your ImageNet path in configs/vae/train_afvae_imagenet.json.
Run script. bash train_afvae.sh

Alias-free LDM

Run script. bash train_afldm.sh
Update path in scripts/shift_ldm_ffhq.py with train_ckpt/ffhq_uncond_afldm (the default output diretory set in configs/ldm/train_unet_ffhq.json). Run the script bash shift_ldm_ffhq.sh to test the results.

Alias-free Latent I2SB Super Resolution

Update train_data_dir with your ImageNet path in configs/sr/train_i2sb_imagenet.json.
Run script. bash train_af_i2sb_sr.sh
Update path in scripts/shift_ldm_sr.py with train_ckpt/imagenet_sr_i2sb (the default output diretory set in configs/sr/train_i2sb_imagenet.json). Run the script bash shift_ldm_sr.sh to test the results.

Citation

@inproceedings{zhou2025afldm,
      title={Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space},
      author={Zhou, Yifan and Xiao, Zeqi and Yang, Shuai and Pan, Xingang },
      booktitle = {CVPR},
      year = {2025},
    }

Acknowledgements

Diffusers: Our project is built on diffusers.
GMFlow: Our flow estimator.
StyleGAN3: For sharing alias-free module implementation.
Alias-Free Convnets: For sharing alias-free module implementation.
I2SB: For sharing SR implementation.
StableNormal: For sharing normal estimation dataset.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

[CVPR 2025] Alias-free Latent Diffusion Models

Project Page | Paper

Motivation

TODO

Update

Installation

Inference

Unconditional Generation Shift

Video Editing

Image Interpolation

Super-resolution Shift

Normal Estimation Shift

Training

ImageNet Dataset

Alias-free VAE

Alias-free LDM

Alias-free Latent I2SB Super Resolution

Citation

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
afldm		afldm
assets		assets
configs		configs
scripts		scripts
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE.md		LICENSE.md
README.md		README.md
image_interpolation.sh		image_interpolation.sh
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
shift_ldm_ffhq.sh		shift_ldm_ffhq.sh
shift_ldm_sr.sh		shift_ldm_sr.sh
shift_normal_estimation.sh		shift_normal_estimation.sh
train.py		train.py
train_af_i2sb_sr.sh		train_af_i2sb_sr.sh
train_afldm.sh		train_afldm.sh
train_afvae.sh		train_afvae.sh
video_editing.sh		video_editing.sh

License

SingleZombie/AFLDM

Folders and files

Latest commit

History

Repository files navigation

[CVPR 2025] Alias-free Latent Diffusion Models

Project Page | Paper

Motivation

TODO

Update

Installation

Inference

Unconditional Generation Shift

Video Editing

Image Interpolation

Super-resolution Shift

Normal Estimation Shift

Training

ImageNet Dataset

Alias-free VAE

Alias-free LDM

Alias-free Latent I2SB Super Resolution

Citation

Acknowledgements

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages