I'm a research intern at H Company, using RL to train computer-use agents. I previously worked as a research scientist on the AI Agents team at Huawei's Noah's Ark Lab, and completed my PhD in the Bath RL Lab, supervised by Özgür Şimşek.
I'm interested in all things reinforcement learning. Currently, I'm focussing on building LLM-based agents that continuously learn by setting, pursuing, and verifying their own goals (2025).
Previously, I studied how to explain reinforcement learning agents, identifying their behaviour, outcomes, and predictions as aspects of interaction worth understanding (2023). This involved developing a theoretical framework to derive these explanations (2025) and creating scalable methods to approximate them in practice (2025).
More broadly, a common thread in how I think about intelligence is bounded rationality: how intelligence emerges as a necessary adaptation for agents operating with limited resources in a world far more complex than themselves. I'm also drawn to conversations that question the foundations of reinforcement learning, exploring how our assumptions about agents and environments shape the algorithms and conceptual models we design.
An end-to-end pipeline for large-scale online reinforcement learning of mobile GUI agents.
A scalable library for approximating Shapley value explanations in reinforcement learning.
Edinburgh RL Group, September 2025
ART-AI Colloquium Series, February 2025
Bath Doctoral Festival of Ideas, July 2024
Bath AI Society, April 2024
Bath Computer Science Conference, July 2023
Alan Turing Institute Student Presentations, June 2023
I have taught computer science and mathematics at the University of Bath, including lecturing on reinforcement learning, supporting undergraduate and postgraduate teaching across reinforcement learning, statistics, programming, and applied mathematics, and supervising student projects.