
SliderQuant: Accurate Post-Training Quantization for LLMs


By Shigeng Wang, Chao Li, Yangyuxuan Kang, Jiawei Fan, Zhonghong Ou and Anbang Yao.

This repository is the official PyTorch implementation of "SliderQuant: Accurate Post-Training Quantization for LLMs", accepted to ICLR 2026.

SliderQuant overview

SliderQuant (Sliding-layer Quantization) is a new learnable post-training quantization framework for LLMs, which consists of two key components:

  • Inter-layer sliding quantization couples three types of sliding-window designs to address the varying quantization sensitivity of the shallow, intermediate, and deep layers of any pre-trained LLM.
  • Intra-layer sliding quantization quantizes the layers inside the current sliding window in an incremental manner.
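As a rough illustration only (not the paper's actual algorithm — the window size, stride, and ordering below are invented for the sketch), the two components can be pictured as a window sliding over the layer stack, with the layers inside each window quantized one at a time:

```python
def sliding_quant_schedule(num_layers, window_size=4, stride=2):
    """Illustrative schedule for sliding-window quantization.

    Inter-layer sliding: a window of consecutive layers slides over the
    layer stack. Intra-layer sliding: layers inside the current window
    are quantized incrementally, each step conditioning on the layers
    already quantized before it. Returns (window, layer) pairs in order.
    """
    schedule = []
    start = 0
    while start < num_layers:
        window = tuple(range(start, min(start + window_size, num_layers)))
        for layer in window:  # incremental, one layer per step
            schedule.append((window, layer))
        start += stride
    return schedule

if __name__ == "__main__":
    for window, layer in sliding_quant_schedule(8, window_size=4, stride=2):
        print(window, "->", layer)
```

In this toy schedule, overlapping windows revisit layers so that later windows can refine them in the context of freshly quantized neighbors; how SliderQuant actually sizes and orders its three window types is described in the paper.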


Main Results

Language Generation

Table 1

Zero-Shot Commonsense Reasoning

Table 2

Methods With Extra Inference-Time Cost

Table 3

MoE Model Results

Table 4

Math Reasoning and Code Generation

Table 5

Model Zoo

The following checkpoints are planned for public release on Hugging Face:

Model        Quantization  Hugging Face
Llama2-13B   W4A4          SliderQuant-Llama2-13B-W4A4
Llama2-13B   W2A16         SliderQuant-Llama2-13B-W2A16
Qwen2.5-14B  W4A4          SliderQuant-Qwen2.5-14B-W4A4
Qwen2.5-14B  W2A16         SliderQuant-Qwen2.5-14B-W2A16

Once released, all checkpoints will be available under IntelLabsChina/SliderQuant.

Install

git clone https://github.com/genggng/sliderquant

mamba create -n sliderquant python=3.10 -y
mamba activate sliderquant

cd sliderquant
pip install -e .

How To Train

  1. Create a folder and place the experimental configuration file inside, following this structure:
sliderquant/
├── log-llama2
│   └── llama2-w4a4
│       └── config.yaml
  2. Edit task_list.conf to specify the result_dir.
result_dir=configs/llama2-7b-w2a16

result_dir=${exp_id}
GPU_NUM=1
port=29507
THRESHOLD=0.05
WAIT_MODE=true
WAIT_INTERVAL=60
  3. Start training:
./auto_train_ddp.sh

How To Test

  1. Edit task_list.conf to specify the result_dir.
result_dir=configs/llama2-7b-w2a16

GPU_NUM=1
port=29507
THRESHOLD=0.05
WAIT_MODE=true
WAIT_INTERVAL=60
  2. Run evaluation:
./auto_test_one.sh
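The task_list.conf snippets above are plain key=value lines consumed by the shell scripts. For reference, a minimal sketch of reading such a file (the helper name and behavior here are assumptions for illustration, not part of the repository):

```python
def read_task_conf(path):
    """Parse simple key=value lines from a task_list.conf-style file.

    Hypothetical helper: skips blank lines, comments, and lines without
    '=', and keeps the last assignment when a key repeats. The repo's
    own scripts read this file in shell, not Python.
    """
    conf = {}
    with open(path) as f:
        for line in f:
            line = line.strip()
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            conf[key.strip()] = value.strip()
    return conf
```

Note that a value like ${exp_id} would arrive as the literal string "${exp_id}" here; shell-style variable expansion only happens when the file is sourced by the training scripts.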

Citation

If SliderQuant is useful in your research, please cite:

@inproceedings{wang2026sliderquant,
  title={SliderQuant: Accurate Post-Training Quantization for LLMs},
  author={Wang, Shigeng and Li, Chao and Kang, Yangyuxuan and Fan, Jiawei and Ou, Zhonghong and Yao, Anbang},
  booktitle={International Conference on Learning Representations},
  year={2026}
}

Acknowledgement

SliderQuant builds on code from the following projects:

We are grateful to the authors and maintainers of both projects for making their amazing code public.
