GitHub - MiZhenxing/vllm: This is a fork of vllm for extracting embeddings of generated tokens

Introduction

This is a fork of vLLM for extracting embeddings of generated tokens efficiently. It is used in our ThinkDiff paper.

The modification is largely based on https://github.com/vllm-project/vllm/pull/7892/files. Many thanks to the author. Since the original vLLM is under quick development, only vllm==0.6.3.post1 is supported in this code. It would be great if someone could transfer https://github.com/vllm-project/vllm/pull/7892/files to the newest VLLM.

Install

The first step is to install the original vLLM wheel:

pip install vllm==0.6.3.post1

Then you need to clone and install this code:

git clone https://github.com/MiZhenxing/vllm
cd vllm

Please make sure you are under the return_hidden_states branch.

Then only install the Python codes:

python python_only_dev.py

Name		Name	Last commit message	Last commit date
Latest commit History 3,026 Commits
.buildkite		.buildkite
.github		.github
benchmarks		benchmarks
cmake		cmake
csrc		csrc
docs		docs
examples		examples
mi_tests		mi_tests
tests		tests
tools		tools
vllm		vllm
.clang-format		.clang-format
.dockerignore		.dockerignore
.gitignore		.gitignore
.readthedocs.yaml		.readthedocs.yaml
.yapfignore		.yapfignore
CMakeLists.txt		CMakeLists.txt
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
Dockerfile.cpu		Dockerfile.cpu
Dockerfile.neuron		Dockerfile.neuron
Dockerfile.openvino		Dockerfile.openvino
Dockerfile.ppc64le		Dockerfile.ppc64le
Dockerfile.rocm		Dockerfile.rocm
Dockerfile.tpu		Dockerfile.tpu
Dockerfile.xpu		Dockerfile.xpu
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
SECURITY.md		SECURITY.md
collect_env.py		collect_env.py
find_cuda_init.py		find_cuda_init.py
format.sh		format.sh
pyproject.toml		pyproject.toml
python_only_dev.py		python_only_dev.py
requirements-build.txt		requirements-build.txt
requirements-common.txt		requirements-common.txt
requirements-cpu.txt		requirements-cpu.txt
requirements-cuda.txt		requirements-cuda.txt
requirements-dev.txt		requirements-dev.txt
requirements-lint.txt		requirements-lint.txt
requirements-neuron.txt		requirements-neuron.txt
requirements-openvino.txt		requirements-openvino.txt
requirements-rocm.txt		requirements-rocm.txt
requirements-test.txt		requirements-test.txt
requirements-tpu.txt		requirements-tpu.txt
requirements-xpu.txt		requirements-xpu.txt
setup.py		setup.py
use_existing_torch.py		use_existing_torch.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Introduction

Install

About

Uh oh!

Releases

Sponsor this project

Uh oh!

Packages

Contributors 592

Uh oh!

Languages

Uh oh!

License

MiZhenxing/vllm

Folders and files

Latest commit

History

Repository files navigation

Introduction

Install

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Contributors 592

Uh oh!

Languages

Packages