# Direct Preference Knowledge Distillation for Large Language Models
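For context, DPKD reformulates knowledge distillation as a preference-optimization problem between teacher and student outputs. Below is a minimal DPO-style sketch of a pairwise preference loss in plain Python. It is illustrative only: the function name, the `beta` temperature, and the "winning/losing" pairing are assumptions for exposition, not the paper's exact DPKD objective.

```python
import math

def dpo_style_loss(logp_student_w, logp_ref_w,
                   logp_student_l, logp_ref_l, beta=0.1):
    """Illustrative DPO-style pairwise preference loss (not the exact DPKD objective).

    Each argument is a sequence log-probability: `_w` for the preferred
    ("winning", e.g. teacher-favored) output, `_l` for the dispreferred one;
    `student` is the policy being trained, `ref` a frozen reference model.
    """
    # Implicit reward: beta * log-ratio of student (policy) to reference model
    r_w = beta * (logp_student_w - logp_ref_w)
    r_l = beta * (logp_student_l - logp_ref_l)
    # -log sigmoid(r_w - r_l): minimized by widening the reward margin
    margin = r_w - r_l
    return math.log(1.0 + math.exp(-margin))
```

At a zero reward margin the loss equals log 2, and it decreases monotonically as the student assigns relatively more probability to the preferred output.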

## Environment

```bash
conda create -n dpkd python=3.11
conda activate dpkd
```

Then install the dependencies:

```bash
bash install.sh
```

## Run Distillation

Train runner:

```bash
bash scripts/dpkd-gpt2_base_runner.sh PATH_TO_DPKD
```

## Evaluation

Evaluation runner:

```bash
bash scripts/dpkd-gpt2_base_evaluate.sh PATH_TO_DPKD
```

## Citation

```bibtex
@article{dpkd,
  title={Direct Preference Knowledge Distillation for Large Language Models},
  author={Yixing Li and Yuxian Gu and Li Dong and Dequan Wang and Yu Cheng and Furu Wei},
  journal={arXiv preprint arXiv:2406.19774},
  year={2024}
}
```