I am an Assistant Professor in AI at University of Liverpool. My primary research focuses on building intelligent agent systems capable of human-like learning, reasoning, and decision-making, spanning natural language processing, reinforcement learning, social and responsible AI, and embodied agents.
I received my Ph.D. from University of Technology Sydney, advised by Prof. Dacheng Tao, and then worked as a postdoc with Prof. Trevor Cohn at the University of Melbourne.
I had been a research scientist / intern at Tencent Robotics X / AI, CSIRO and Microsoft Research Asia before.
I am also Visiting Professor of Mathematics and Computer Science at Eindhoven University of Technology (TU/e).
Multiple papers accepted to AAAI 2026, covering Vision-Language Reasoning for Geolocalization (Geo-R), reinforcement learning and large language models.
MathOdyssey: Benchmarking mathematical problem-solving skills in large language models using Odyssey Math Data appears in Scientific Data, releasing a novel benchmark and dataset for evaluating mathematical reasoning and problem-solving abilities of LLMs, and used by leading AI labs (e.g. Google Gemini and others).
Open to taking on new PhD students (see also PhD opportunites). So feel free to email me with your CV, transcripts and a short research proposal. I have limited supervision capacity and am always happy to consider good ideas.
Research
Agentic AI and Language-Guided Decision Making TL;DR: Establishes natural language as a core interface for agent reasoning, planning, and control, spanning text-based reinforcement learning, LLM-powered agents, and agentic web systems.
Reasoning and Large Language Models TL;DR: Develops principled benchmarks and analyses to evaluate reasoning, generalisation, and failure modes of large language models, with a focus on mathematical and scientific reasoning.
Bias, Fairness, and Social Impacts of LLMs TL;DR: Investigates social bias, robustness, and instability in language models through empirical and causal analyses across languages, tasks, and interaction settings.
Reinforcement Learning and Agent Applications TL;DR: Advances reinforcement learning algorithms for complex agent tasks, including sparse and delayed rewards, multi-goal learning, safety, and continual adaptation.