California Institute of Technology
Email: clairechen AT caltech DOT edu
[Google Scholar] [Curriculum Vitae]
Biography
Claire Chen is an undergraduate student at the California Institute of Technology (Caltech), majoring in Mathematics and Computer Science. She has been fortunate to work as a research assistant advised by Professor Nan Jiang, Professor Sergey Levine, Professor Yisong Yue, and Professor Shangtong Zhang. Her research interests are in Reinforcement Learning and LLM Fine-Tuning. She regularly serves on the program committees of major AI venues, e.g., ICLR, ICML, and NeurIPS.
Reinforcement Learning
- Offline Two-Player Zero-Sum Markov Games with KL Regularization.
  Claire Chen, Yuheng Zhang, Xinyu Liu, Zixuan Xie, Shuze Liu, Nan Jiang.
  International Conference on Machine Learning (ICML), 2026.
- Convergence of Two-Timescale Stochastic Approximation with Markovian Samples and Applications in Reinforcement Learning.
  Vagul Mahadevan, Claire Chen, Shuze Liu, Shangtong Zhang.
  International Conference on Machine Learning (ICML), 2026.
- Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning.
  Claire Chen*, Shuze Liu*, Shangtong Zhang.
  International Conference on Learning Representations (ICLR), 2025.
- Doubly Optimal Policy Evaluation for Reinforcement Learning.
  Shuze Liu, Claire Chen, Shangtong Zhang.
  International Conference on Learning Representations (ICLR), 2025.
- Efficient Multi-Policy Evaluation for Reinforcement Learning.
  Shuze Liu, Claire Chen, Shangtong Zhang.
  AAAI Conference on Artificial Intelligence (AAAI), 2025.
  Oral Presentation, Top 4.7%.
- Optimal Policy Evaluation for Reinforcement Learning.
  Shuze Liu, Claire Chen, Will Ma, Shangtong Zhang.
  Submitted to Operations Research (OR).
- Pessimism-Free Offline Learning in General-Sum Games via KL Regularization.
  Claire Chen, Yuheng Zhang.
- Fast Rates in α-Potential Games via Regularized Mirror Descent.
  Claire Chen, Yuheng Zhang.
- Beyond Pessimism: Offline Learning in KL-regularized Games.
  Yuheng Zhang, Claire Chen, Nan Jiang.
- Robust Data-Collection Policy Learning for Low-Variance Online Policy Evaluation.
  Claire Chen, Shuze Liu, Licheng Luo, Rohan Chandra, Nan Jiang, Shangtong Zhang.
- Pessimistic Minimax Learning for Public-Private Information Games under Unilateral Coverage.
  Shuze Liu, Claire Chen, Jiuqi Wang, David Simchi-Levi.
- Beyond Linear Attention: Softmax Transformers Implement In-Context Reinforcement Learning.
  Zixuan Xie, Xinyu Liu, Claire Chen, Shuze Liu, Rohan Chandra, Shangtong Zhang.
- Predicting Plasticity in Deep Continual Learning: A Theoretical Perspective.
  Jiuqi Wang, Jayanth Srinivasa, Claire Chen, Shuze Liu, Ali Payani, Shangtong Zhang.
- Marrying Operations Research and Reinforcement Learning: A Unified Benchmark for Stochastic Sequential Decision-Making.
  Shuze Liu, Claire Chen, Will Ma, David Simchi-Levi.
LLM for Negotiation
- Instructing LLMs to Negotiate using Reinforcement Learning with Verifiable Rewards.
  Shuze Liu*, Claire Chen*, Jiabao Sean Xiao, Lei Lei, Yisong Yue, David Simchi-Levi.
- Strategic Bargaining in Multi-Buyer Markets: Reinforcement Learning from Verifiable Rewards for LLM Negotiations.
  Shuze Liu, Claire Chen, Jiabao Sean Xiao, Xin Chen, David Simchi-Levi.
  Submitted to Management Science (MS).
- Learning to Recommend: Multi-Item Bilateral Negotiation via LLM Sellers.
  Claire Chen, Shuze Liu, David Simchi-Levi.
- Learning to Sell: Multi-Product Portfolio Allocation via LLM Agents under Communication Constraints.
  Shuze Liu, Claire Chen, Thorsten Joachims, David Simchi-Levi.
LLM for Science
- MathlibLemma: Folklore Lemma Generation and Benchmark for Formal Mathematics.
  Xinyu Liu, Zixuan Xie, Amir Moeini, Claire Chen, Shuze Liu, Yu Meng, Aidong Zhang, Shangtong Zhang.
  International Conference on Machine Learning (ICML), 2026.
- AstroAlertBench: Evaluating Vision Language Models for Multimodal Astronomical Alert Triage.
  Claire Chen*, Jiabao Sean Xiao*, Shuze Liu*, Facundo Perez Paolino, Luke Handley, Theophile Jegou du Laz, Ricky Nilsson, Alice Zou, Matthew Graham, Ashish Mahabal.
  [Website]
- AstroAlertAgent: A Hierarchical Multi-Agent LLM System with Confidence-Gated Routing for Astronomical Alert Classification.
  Claire Chen*, Jiabao Sean Xiao*, Shuze Liu, Matthew Graham, Ashish Mahabal.
Entrepreneurship
Haggle AI — Founder & CEO
An AI negotiator built for buyers that provides real-time strategic advice to secure fair deals, from everyday retail purchases to high-stakes salary contracts. It gives every individual the confidence to navigate stressful negotiations, turning an intimidating process into an accessible one that unlocks the best possible outcome.
- Bill Gross Prize for Entrepreneurship ($10,000)
- Demetriades-Tsafka-Kokkalis Prize in Entrepreneurship
- Timothy D. Ryan Summer Entrepreneurship Program
Program Committee
ICLR 2025-26, ICML 2025-26, NeurIPS 2026, AAAI 2026, AISTATS 2025-26, AAMAS 2025.
Guest Lecture
Reinforcement Learning from Human Feedback (Fall 2024), invited by Professor Shangtong Zhang.