maoyixiu
  • Tsinghua University
  • 30 Shuangqing Road, Haidian District, Beijing, China

Highlights

  • Pro

Pinned

  1. SVR (Public)

    [NeurIPS 2023] Supported Value Regularization for Offline Reinforcement Learning

    Python, 47 stars, 2 forks

  2. DMG (Public)

    [NeurIPS 2024] Doubly Mild Generalization for Offline Reinforcement Learning

    Python, 16 stars, 5 forks

  3. SCAS (Public)

    [NeurIPS 2024] Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression

    Python, 14 stars, 3 forks

  4. thu-rllab/MPTS (Public)

    Model Predictive Task Sampling

    Python, 87 stars, 26 forks

  5. thu-rllab/MoPPS (Public)

    [KDD 2026] Can Prompt Difficulty Be Online Predicted for Accelerating RL Finetuning of Reasoning Models?

    Python, 76 stars, 24 forks

  6. thu-rllab/ANQ (Public)

    [NeurIPS 2025] Adaptive Neighborhood-Constrained Q Learning for Offline Reinforcement Learning

    Python, 10 stars, 5 forks