Title |
Dynamic Programming: From Local Optimality to Global Optimality |
Date |
May 19, 2025 (Monday) 10:40-12:10 |
Location |
12th floor Discussion Room |
Abstract |
In the theory of dynamic programming, an optimal policy is a policy whose lifetime value dominates that of all other policies from every possible initial condition in the state space. This raises a natural question: when does optimality from a single state imply optimality from every state? We show that, in a general setting, irreducibility of the transition kernel is sufficient for this property. Our results have important implications for modern policy-based algorithms used to solve large-scale dynamic programs in reinforcement learning and other fields. |
Paper |
Paper Link |
Slide |
|
Note |
|