Xiangyu Liu
2104 Brendan Iribe Center
College Park, Maryland 20740
I am a fifth-year PhD student in Computer Science at the University of Maryland, College Park (UMD) working under the guidance of Prof. Kaiqing Zhang since 2021, and an incoming research scientist at Google Research, NYC. I have also had the privilege of working with Prof. Furong Huang. Prior to this, I completed my undergraduate studies in Computer Science at Shanghai Jiao Tong University (SJTU) from 2017 to 2021, where I conducted my bachelor thesis research with Prof. Ying Wen.
My research focuses on the foundational aspects of (multi-agent) reinforcement learning (RL), particularly on game-theoretical/strategic (NeurIPS 2021, ICML 2023) and partially observable settings (NeurIPS 2024), as well as their applications on adversarially robust RL (ICLR 2024).
More recently, I have expanded my research to explore (multi-)large language model (LLM) agents interactions (ICLR 2025, ICML 2026), leveraging techniques from game theory and online learning.
In the past, I completed research internships at Google Research, Market Algorithms Team, where I worked with Zhe Feng, Aranyak Mehta, and Di Wang from Jun. 2025 to Dec. 2025, as well as Bloomberg AI Research in the summer of 2022.
News
| Apr 30, 2026 | My internship paper at Google Research is accepted to ICML 2026 and available on arxiv! We have explored scaling inference-time computation via opponent simulation for LLM-based strategic reasoning. |
|---|---|
| Apr 18, 2025 | Gave an invited talk at UVA RL Meetup. |
| Jan 22, 2025 | One paper accepted to ICLR 2025 on LLM agents. |
| Sep 25, 2024 | One paper accepted to NeurIPS 2024 on partially observable RL with privileged information. |
| Mar 24, 2024 | Gave a talk at 2024 INFORMS Optimization Society Conference (IOS 2024) on partially obseravble multi-agent RL at Houston, Texas. |
| Jan 16, 2024 | Three papers accepted to ICLR 2024 including one spotlight. |
Selected publications
* denotes equal contribution, † denotes alphabetical order.
- ICML 2023Partially Observable Multi-agent RL with (Quasi-)Efficiency: The Blessing of Information SharingIn International Conference on Machine Learning, 2023
Extended version accepted to SIAM Journal on Control and Optimization (SICON) - NeurIPS 2024Provable Partially Observable Reinforcement Learning with Privileged InformationIn The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024
A shorter version also presented at ICML 2024 ARLET workshop. - ICLR 2024Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RLIn The Twelfth International Conference on Learning Representations, 2024
A preliminary version is selected at NeurIPS 2022 Trustworthy and Socially Responsible Machine Learning (TSRML) workshop as an outstanding paper - ICLR 2025Do LLM Agents Have Regret? A Case Study in Online Learning and GamesIn The Thirteenth International Conference on Learning Representations, 2025
- CDC 2025Principled Learning-to-Communicate with Quasi-Classical Information StructuresIn 2025 IEEE 64th Conference on Decision and Control (CDC), 2025
- ICML 2026Scaling Inference-Time Computation via Opponent Simulation: Enabling Online Strategic Adaptation in Repeated NegotiationIn International Conference on Machine Learning, 2026
Talks
- Invited Talk at the UVA RL meetup on partially observable RL with privileged information, 2025
- Talk at the 2024 INFORMS Optimization Society Conference (IOS 2024) on partially observable multi-agent RL, Houston, Texas, 2024
- Contributed talk at the TSRML workshop of NeurIPS 2022 on adversarial policies in competitive games, 2022
- Talk at RLChina on unifying diversity in open-ended learning for zero-sum games, China, 2021
Awards
- Outstanding Paper Award, Trustworthy and Socially Responsible Machine Learning (TSRML) workshop, NeurIPS 2022
- Dean's Fellowship, UMD, 2021
- National Scholarship, China, 2018 & 2019
Service
- Reviewer for NeurIPS 2024-2025, ICLR 2025-2026, ICML 2025, AISTATS 2025-2026, UAI 2024-2025, AAMAS 2025-2026, CDC 2025, L4DC 2026
- Graduate lecturer at summer AI camps for K12 students (2023, 2024), teaching multi-agent RL.