verl-recipe

verl-recipe is a collection of end-to-end RL training recipes built on verl.

Contributing

Recipe Folder Structure

Every recipe should follow this structure:

README.md: recipe description
code: recipe code
script: reproducible training script

Specifically, README.md should contain the following sections:

Installation: which verl version is required for this recipe?

# release version
pip install verl==0.6.0

# dev version
pip install verl@git+https://github.com/volcengine/verl.git@313dfdb2199124a37189e32e6d4a6c654379f2d4

Training: how to train
Evaluation: performance metrics
Citation: paper, notion, blog, etc.
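
As a rough aid for contributors, the README checklist above can be verified with a short script. This is only a sketch, not part of verl or this repository: the helper names `missing_sections` and `check_recipe` are hypothetical, and it assumes each required section appears as a Markdown heading whose text is exactly the section name (case-insensitive).

```python
import re
from pathlib import Path

# Section names taken from the checklist above.
REQUIRED_SECTIONS = ("Installation", "Training", "Evaluation", "Citation")

# Matches ATX-style Markdown headings such as "# Title" or "## Title ##".
HEADING_RE = re.compile(r"^#{1,6}[ \t]+(.+?)[ \t]*#*[ \t]*$", flags=re.MULTILINE)


def missing_sections(readme_text: str) -> list[str]:
    """Return the required section names that have no matching heading.

    Assumption: a section counts as present only if a heading's text equals
    the section name exactly, ignoring case. Lines starting with "#" inside
    code blocks are also treated as headings, which is a known limitation.
    """
    headings = {m.group(1).strip().lower() for m in HEADING_RE.finditer(readme_text)}
    return [name for name in REQUIRED_SECTIONS if name.lower() not in headings]


def check_recipe(recipe_dir: str) -> list[str]:
    """Return a list of problems found in a recipe folder (hypothetical helper)."""
    readme = Path(recipe_dir) / "README.md"
    if not readme.is_file():
        return ["README.md is missing"]
    text = readme.read_text(encoding="utf-8")
    return [f"README.md has no '{name}' section" for name in missing_sections(text)]
```

For example, calling `check_recipe("retool")` from the repository root would return an empty list when the recipe's README.md has all four headings, and a list of human-readable problems otherwise.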

Code Linting and Formatting

We rely on pre-commit to keep our code consistent. To set it up:

pip install pre-commit
pre-commit install
# for staged changes
pre-commit run
# for all files in the repo
pre-commit run --all-files
# run a specific hook with pre-commit
# pre-commit run --all-files --show-diff-on-failure --color=always <hook-id>
pre-commit run --all-files --show-diff-on-failure --color=always ruff
pre-commit run --all-files --show-diff-on-failure --color=always autogen-trainer-cfg

Available Recipes

retool: Reinforcement Learning for Strategic Tool Use in LLMs.
langgraph_agent: A tiny example that demonstrates multi-turn rollout with a LangGraph ReactAgent to solve math expressions.
spo: Single-stream Policy Optimization.

