California Institute of Technology
Email: clairechen AT caltech DOT edu
[Google Scholar] [Curriculum Vitae]
Biography
Claire Chen is an undergraduate student at California Institute of Technology (Caltech). Her majors are Mathematics and Computer Science. She was very fortunate to work as a research assistant advised by Professor Nan Jiang, Professor Sergey Levine, and Professor Shangtong Zhang. Her research interest is in Reinforcement Learning and LLM Fine-Tuning. She regularly serves on the Program Committee in major AI venues, e.g., ICLR, ICML.
Conference Publications
-
Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning.
Claire Chen*, Shuze Liu*, Shangtong Zhang.
International Conference on Learning Representations (ICLR), 2025. -
Doubly Optimal Policy Evaluation for Reinforcement Learning.
Shuze Liu, Claire Chen, Shangtong Zhang.
International Conference on Learning Representations (ICLR), 2025. -
Efficient Multi-Policy Evaluation for Reinforcement Learning.
Shuze Liu, Claire Chen, Shangtong Zhang.
AAAI Conference on Artificial Intelligence (AAAI), 2025.
Oral Presentation, Top 4.7%.
Journal Articles
- Optimal Policy Evaluation for Reinforcement Learning.
Shuze Liu, Claire Chen, Will Ma, Shangtong Zhang.
Working Paper.
Working Papers
-
Robust Data-Collection Policy Learning for Low-Variance Online Policy Evaluation.
Claire Chen, Shuze Liu, Licheng Luo, Rohan Chandra, Nan Jiang, Shangtong Zhang.
-
Offline Two-Player Zero-Sum Markov Games with KL Regularization.
Claire Chen*, Yuheng Zhang*, Xinyu Liu, Zixuan Xie, Shuze Liu, Nan Jiang.
-
Beyond Pessimism: Offline Learning in KL-regularized Games.
Yuheng Zhang*, Claire Chen*, Nan Jiang.
-
Convergence of Two-Timescale Stochastic Approximation with Markovian Samples and Applications in Reinforcement Learning.
Vagul Mahadevan, Claire Chen, Shuze Liu, Shangtong Zhang.
-
Transformers Implement Nonlinear In-Context Reinforcement Learning: Convergence and Emergence.
Zixuan Xie, Xinyu Liu, Claire Chen, Shuze Liu, Rohan Chandra, Shangtong Zhang.
-
MathlibLemma: Folklore Lemma Generation and Benchmark for Formal Mathematics.
Xinyu Liu, Zixuan Xie, Amir Moeini, Claire Chen, Shuze Liu, Yu Meng, Aidong Zhang, Shangtong Zhang.
Program Committee
ICLR 2025-26, ICML 2025-26, AAAI 2026, AISTATS 2025-26, AMMAS 2025.
Guest Lecture
Reinforcement Learning from Human Feedback (Fall 2024).