🎯
Focusing
PhD Student @ HKUST. Interested in Reinforcement Learning and LLM Agents.
-
HKUST
- Clear Water Bay, Hong Kong
-
15:00
(UTC +08:00) - yuxiaooye.github.io
Pinned Loading
-
xlang-ai/Spider2
xlang-ai/Spider2 Public[ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
-
tinnerhrhe/ROVER
tinnerhrhe/ROVER PublicAn official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

