Skip to content
View damanimehul's full-sized avatar

Highlights

  • Pro

Block or report damanimehul

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. marmotlab/PRIMAL2 marmotlab/PRIMAL2 Public

    Training code PRIMAL2 - Public Repo

    Python 194 62

  2. contracts contracts Public

    Forked from Algorithmic-Alignment-Lab/contracts

    Formal Contracts for Multi-Agent Reinforcement Learning

    Python

  3. marc marc Public

    Forked from ekinakyurek/marc

    Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"

    Python

  4. RLCR RLCR Public

    Official repository for Beyond Binary Rewards: Training LMs to Reason about Their Uncertainty

    Python 65 10