PhD student in the Department of Statistics at LSE, researching reinforcement learning and alignment .
My current research focuses on the intersection of reinforcement learning and large language models, with particular interest in:
Ye, K*., Zhou, H*., Zhu, J*., Quinzan, F. and Shi, C.
Qi, X*., Ye, K*., Shi, C., Yang, Y., Zhou, H. and Zhu, J.
Zhou, H*., Zhu, J*., Xu, E., Ye, K., Yang, Y. and Shi, C.
Xu, E*.,Ye, K*., Zhou, H*.,Zhu, L., Quinzan, F. and Shi, C.
Zhou, H*.,Zhu, J*,Ye, K., Su, P., Yang, Y., Akilagun, S. and Shi, C.