Skip to content

RLVR expansion#120

Merged
natolambert merged 8 commits intomainfrom
rlvr
Jun 11, 2025
Merged

RLVR expansion#120
natolambert merged 8 commits intomainfrom
rlvr

Conversation

@natolambert
Copy link
Copy Markdown
Owner

@natolambert natolambert commented Jun 6, 2025

Will expand PR iteratively until done and then merge and push to Arxiv.
Currently:

  • Expand "training overview" chapter to include summaries of canonical training recipes.
  • Add summary of async rl infrastructure
  • More reasoning related works (pre o1)
  • Summary of major reasoning reports to date
  • Summary of methods used across reasoning efforts to see trends
  • Some other minor additions and fixes throughout

@natolambert natolambert merged commit f4f778a into main Jun 11, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant