UVA RL Meetup
The Reinforcement Learning Meetup @ University of Virginia
2-3pm Friday @ Rice Hall 414
This weekly meetup is organized by Shangtong Zhang and Chen-Yu Wei for UVA RL folks to share interesting RL papers.
Fall 2025
| Date | Presenter | Paper or Topic |
|---|---|---|
| Sep 5 | Jiuqi Wang | Convergence of Regularized Agent-State-Based Q-learning in POMDPs |
| Sep 12 (ICLR) | | |
| Sep 19 (ICLR) | | |
| Sep 26 (ICLR) | | |
| Oct 3 | Zixuan Xie | Transformers Learn to Implement Multi-step Gradient Descent with Chain of Thought |
| Oct 10 | Xinyu Liu | A Dynamic View of Some Anomalous Phenomena in SGD |
| Oct 17 | Amir Moeini | On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning |
| Oct 24 | Haolin Liu | Token Hidden Reward: Steering Exploration-Exploitation in Group Relative Deep Reinforcement Learning |
| Oct 31 | Braham Snyder | The Importance of Pessimism in Fixed-Dataset Policy Optimization |
| Nov 7 | Minjae Kwon | Constrained Policy Optimization |
| Nov 14 | Jiuqi Wang | PokeeResearch: Effective Deep Research via Reinforcement Learning from AI Feedback and Robust Reasoning Scaffold |
| Nov 21 | Andrew Wagenmaker | Towards Practical Online Improvement of Pretrained Policies for Robotic Manipulation |
| Nov 28 (Thanksgiving) | | |