ANSELMI, Jonatha; DUFOUR, François; PRIETO-RUMEAU, Tomás

doi:10.1016/j.jmaa.2016.05.055

La plateforme OSKAR Bordeaux évolue pour rejoindre l'archive ouverte HAL. Retrouvez tous vos dépôts sur le nouveau portail HAL UB : https://u-bordeaux.hal.science/. Pour toute aide ou information, contactez-nous info@oskar-bordeaux.fr

Afficher la notice abrégée

hal.structure.identifier	Quality control and dynamic reliability [CQFD]
dc.contributor.author	ANSELMI, Jonatha
hal.structure.identifier	Institut Polytechnique de Bordeaux [Bordeaux INP]
hal.structure.identifier	Quality control and dynamic reliability [CQFD]
hal.structure.identifier	Institut de Mathématiques de Bordeaux [IMB]
dc.contributor.author	DUFOUR, François
hal.structure.identifier	Universidad Estatal a Distancia [UNED]
dc.contributor.author	PRIETO-RUMEAU, Tomás
dc.date.accessioned	2024-04-04T03:12:23Z
dc.date.available	2024-04-04T03:12:23Z
dc.date.issued	2016
dc.identifier.issn	0022-247X
dc.identifier.uri	https://oskar-bordeaux.fr/handle/20.500.12278/193866
dc.description.abstractEn	In this paper, we propose an approach for approximating the value function and an ϵ-optimal policy of continuous-time Markov decision processes with Borel state and action spaces, with possibly unbounded cost and transition rates, under the total expected discounted cost optimality criterion. Under adequate assumptions, which in particular include that the transition rate has a density function with respect to a reference measure, together with piecewise Lipschitz continuity of the elements of the control model, we approximate the original controlled process by a model with finite state and action spaces. The approximation error is related to the 1-Wasserstein distance between suitably defined probability measures and approximating measures with finite support. We also study the case when the reference measure is approximated with empirical distributions and we show that convergence of the approximations takes place at an exponential rate in probability.
dc.language.iso	en
dc.publisher	Elsevier
dc.subject.en	Linear programming approach to control problems
dc.subject.en	Approximation of Markov decision processes
dc.subject.en	Constrained Markov decision processes
dc.title.en	Computable approximations for continuous-time Markov decision processes on Borel spaces based on empirical measures
dc.type	Article de revue
dc.identifier.doi	10.1016/j.jmaa.2016.05.055
dc.subject.hal	Mathématiques [math]/Optimisation et contrôle [math.OC]
bordeaux.journal	Journal of Mathematical Analysis and Applications
bordeaux.page	1323 - 1361
bordeaux.volume	443
bordeaux.hal.laboratories	Institut de Mathématiques de Bordeaux (IMB) - UMR 5251	*
bordeaux.issue	2
bordeaux.institution	Université de Bordeaux
bordeaux.institution	Bordeaux INP
bordeaux.institution	CNRS
bordeaux.peerReviewed	oui
hal.identifier	hal-01412615
hal.version	1
hal.popular	non
hal.audience	Internationale
hal.origin.link	https://hal.archives-ouvertes.fr//hal-01412615v1
bordeaux.COinS	ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=Journal%20of%20Mathematical%20Analysis%20and%20Applications&rft.date=2016&rft.volume=443&rft.issue=2&rft.spage=1323%20-%201361&rft.epage=1323%20-%201361&rft.eissn=0022-247X&rft.issn=0022-247X&rft.au=ANSELMI,%20Jonatha&DUFOUR,%20Fran%C3%A7ois&PRIETO-RUMEAU,%20Tom%C3%A1s&rft.genre=article

Fichier(s) constituant ce document

Fichiers	Taille	Format	Vue
Il n'y a pas de fichiers associés à ce document.

Ce document figure dans la(les) collection(s) suivante(s)

Institut de Mathématiques de Bordeaux (IMB) - UMR 5251

Afficher la notice abrégée

Computable approximations for continuous-time Markov decision processes on Borel spaces based on empirical measures

Fichier(s) constituant ce document

Ce document figure dans la(les) collection(s) suivante(s)