yuxiaooye

🎯 Focusing

PhD Student @ HKUST. Interested in Reinforcement Learning and LLM Agents.

24 followers · 11 following

HKUST
Clear Water Bay, Hong Kong
UTC +08:00
yuxiaooye.github.io

Pinned

xlang-ai/Spider2 (Public)

[ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

HTML · 740 stars · 122 forks

tinnerhrhe/ROVER (Public)

Official implementation of "Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards"

Python · 37 stars · 2 forks