Research
- Power-of-d-Choices with Memory: Fluid Limit and Optimality
  (Mathematics of Operations Research, vol. 45, no. 3, pp. 862-888, 2020). Journal article.
- On the Expected Total Reward with Unbounded Returns for Markov Decision Processes
  (Applied Mathematics and Optimization, vol. 82, no. 2, pp. 433-450, 2020). Journal article.
- A Convex Programming Approach for Discrete-Time Markov Decision Processes under the Expected Total Reward Criterion
  (SIAM Journal on Control and Optimization, vol. 58, no. 4, pp. 2535-2566, 2020-01). Journal article.