Feb 7 |
Zixuan Xie |
Analytic-DPM: An Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models Tutorial on Diffusion Models for Imaging and Vision |
Feb 14 |
Xinyu Liu |
Decoupled Functional Central Limit Theorems for Two-Time-Scale Stochastic Approximation |
Feb 21 |
Amir Moeini |
Transformers Implement Functional Gradient Descent to Learn Non-Linear Functions In Context |
Feb 28 (AAAI) |
|
|
Mar 7 |
Braham Snyder |
Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation |
Mar 14 (Spring Break) |
|
|
Mar 21 |
Haolin Liu |
Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF |
Mar 28 |
Amin Davoodabadi (cancelled) |
An Information-Theoretic Perspective on Intrinsic Motivation in Reinforcement Learning |
Apr 4 |
Jiuqi Wang |
Can Looped Transformers Learn to Implement Multi-step Gradient Descent for In-context Learning? |
Apr 11 |
Dylan Foster (remote) |
Revisiting the Foundations of Imitation Learning |
Apr 18 |
Xiangyu Liu (remote) |
|
Apr 25 (ICLR) |
|
|