I am a Machine Learning PhD student at the University of Oxford, where my research is funded by Eric Schmidt and CAIF. I'm advised by
Philip Torr and Christian Schroeder.
My work focuses on RL post-training, multi-agent systems, and AI security. I'm particularly interested in meta-RL, open-endedness, and methods for measuring and improving long-horizon LLM agent capabilities.
During my PhD, I've spent time at Microsoft Research and Google X. At MSR, I was part of AI Frontiers and worked on self-play and RL rewards for open-ended domains. Previously, I was an undergrad at UC Berkeley where I was a member of Berkeley AI Research (advised by Dan Hendrycks) and Cal Boxing. Feel free to get in touch!
Selected Papers
Recent Preprints
Sumeet Ramesh Motwani*, A. Ivanova*, Z. Cai, P. Torr, R. Islam, S. Shah, C.S. de Witt, C. London
arXiv preprint, 2025
P. Putta, E. Mills, N. Garg, Sumeet Ramesh Motwani, C. Finn, D. Garg, R. Rafailov
arXiv preprint, 2024
Conference Publications
Sumeet Ramesh Motwani, C. Smith, R.J. Das, M. Rybchuk, P.H.S. Torr, I. Laptev, F. Pizzati, R. Clark, C.S. de Witt
COLM 2025
Sumeet Ramesh Motwani, M. Baranchuk, M. Strohmeier, V. Bolina, P.H.S. Torr, L. Hammond, C.S. de Witt
NeurIPS 2024
D. Garg, S. VanWeelden, D. Caples, A. Draguns, ... C.S. de Witt, Sumeet Ramesh Motwani†
NeurIPS 2025 (D&B)
A. Draguns, A. Gritsevskiy, Sumeet Ramesh Motwani, C.S. de Witt
NeurIPS 2024
J. Skalse, L. Farnik, Sumeet Ramesh Motwani, E. Jenner, A. Gleave, A. Abate
ICLR 2024
U. Anwar, A. Saparov, J. Rando, ... Sumeet Ramesh Motwani, Y. Bengio, ... D. Krueger
TMLR 2024