Computable approximations for continuous-time Markov decision processes on Borel spaces based on empirical measures
hal.structure.identifier | Quality control and dynamic reliability [CQFD] | |
dc.contributor.author | ANSELMI, Jonatha | |
hal.structure.identifier | Institut Polytechnique de Bordeaux [Bordeaux INP] | |
hal.structure.identifier | Quality control and dynamic reliability [CQFD] | |
hal.structure.identifier | Institut de Mathématiques de Bordeaux [IMB] | |
dc.contributor.author | DUFOUR, François | |
hal.structure.identifier | Universidad Estatal a Distancia [UNED] | |
dc.contributor.author | PRIETO-RUMEAU, Tomás | |
dc.date.accessioned | 2024-04-04T03:12:23Z | |
dc.date.available | 2024-04-04T03:12:23Z | |
dc.date.issued | 2016 | |
dc.identifier.issn | 0022-247X | |
dc.identifier.uri | https://oskar-bordeaux.fr/handle/20.500.12278/193866 | |
dc.description.abstractEn | In this paper, we propose an approach for approximating the value function and an ϵ-optimal policy of continuous-time Markov decision processes with Borel state and action spaces, with possibly unbounded cost and transition rates, under the total expected discounted cost optimality criterion. Under adequate assumptions, which in particular include that the transition rate has a density function with respect to a reference measure, together with piecewise Lipschitz continuity of the elements of the control model, we approximate the original controlled process by a model with finite state and action spaces. The approximation error is related to the 1-Wasserstein distance between suitably defined probability measures and approximating measures with finite support. We also study the case when the reference measure is approximated with empirical distributions and we show that convergence of the approximations takes place at an exponential rate in probability. | |
dc.language.iso | en | |
dc.publisher | Elsevier | |
dc.subject.en | Linear programming approach to control problems | |
dc.subject.en | Approximation of Markov decision processes | |
dc.subject.en | Constrained Markov decision processes | |
dc.title.en | Computable approximations for continuous-time Markov decision processes on Borel spaces based on empirical measures | |
dc.type | Article de revue | |
dc.identifier.doi | 10.1016/j.jmaa.2016.05.055 | |
dc.subject.hal | Mathématiques [math]/Optimisation et contrôle [math.OC] | |
bordeaux.journal | Journal of Mathematical Analysis and Applications | |
bordeaux.page | 1323 - 1361 | |
bordeaux.volume | 443 | |
bordeaux.hal.laboratories | Institut de Mathématiques de Bordeaux (IMB) - UMR 5251 | * |
bordeaux.issue | 2 | |
bordeaux.institution | Université de Bordeaux | |
bordeaux.institution | Bordeaux INP | |
bordeaux.institution | CNRS | |
bordeaux.peerReviewed | oui | |
hal.identifier | hal-01412615 | |
hal.version | 1 | |
hal.popular | non | |
hal.audience | Internationale | |
hal.origin.link | https://hal.archives-ouvertes.fr//hal-01412615v1 | |
bordeaux.COinS | ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=Journal%20of%20Mathematical%20Analysis%20and%20Applications&rft.date=2016&rft.volume=443&rft.issue=2&rft.spage=1323%20-%201361&rft.epage=1323%20-%201361&rft.eissn=0022-247X&rft.issn=0022-247X&rft.au=ANSELMI,%20Jonatha&DUFOUR,%20Fran%C3%A7ois&PRIETO-RUMEAU,%20Tom%C3%A1s&rft.genre=article |
Fichier(s) constituant ce document
Fichiers | Taille | Format | Vue |
---|---|---|---|
Il n'y a pas de fichiers associés à ce document. |