PhD student in the Department of Statistics at LSE, researching reinforcement learning and alignment.
My current research focuses on the intersection of reinforcement learning and large language models, with particular interest in:
Ye, K*., Zhou, H*., Zhu, J*., Quinzan, F. and Shi, C.
Xu, E*., Ye, K*., Zhou, H*., Zhu, L., Quinzan, F. and Shi, C.
Zhou, H*., Zhu, J*., Ye, K., Yang, Y., Akilagun, S. and Shi, C.