COSTA, Oswaldo; DUFOUR, François

El sistema se apagará debido a tareas habituales de mantenimiento. Por favor, guarde su trabajo y desconéctese.

hal.structure.identifier	Universidade de São Paulo = University of São Paulo [USP]
dc.contributor.author	COSTA, Oswaldo
hal.structure.identifier	Institut de Mathématiques de Bordeaux [IMB]
hal.structure.identifier	Quality control and dynamic reliability [CQFD]
dc.contributor.author	DUFOUR, François
dc.date.accessioned	2024-04-04T02:23:56Z
dc.date.available	2024-04-04T02:23:56Z
dc.date.issued	2012
dc.identifier.issn	0363-0129
dc.identifier.uri	https://oskar-bordeaux.fr/handle/20.500.12278/189778
dc.description.abstractEn	This work studies the asymptotic optimality of discrete-time Markov Decision Processes (MDP's in short) with general state space and action space and having weak and strong interactions. By using a similar approach as developed in Liu 2001, the idea in this paper is to consider a MDP with general state and action spaces and to reduce the dimension of the state space by considering an averaged model. This formulation is often described by introducing a small parameter $\epsilon >0$ in the definition of the transition kernel, leading to a singularly perturbed Markov model with two time scales. Our objective is twofold. First it is shown that the value function of the control problem for the perturbed system converges to the value function of a limit averaged control problem as $\epsilon$ goes to zero. In the second part of the paper, it is proved that a feedback control policy for the original control problem defined by using an optimal feedback policy for the limit problem is asymptotically optimal. Our work extends existing results of the literature in the following two directions: the underlying MDP is defined on general state and action spaces and we do not impose strong conditions on the recurrence structure of the MDP such as Doeblin's condition.
dc.language.iso	en
dc.publisher	Society for Industrial and Applied Mathematics
dc.title.en	Singularly Perturbed Discounted Markov Control Processes in a General State Space
dc.type	Article de revue
dc.subject.hal	Mathématiques [math]/Optimisation et contrôle [math.OC]
bordeaux.journal	SIAM Journal on Control and Optimization
bordeaux.page	720-747
bordeaux.volume	50
bordeaux.hal.laboratories	Institut de Mathématiques de Bordeaux (IMB) - UMR 5251	*
bordeaux.issue	2
bordeaux.institution	Université de Bordeaux
bordeaux.institution	Bordeaux INP
bordeaux.institution	CNRS
bordeaux.peerReviewed	oui
hal.identifier	hal-00759715
hal.version	1
hal.popular	non
hal.audience	Internationale
hal.origin.link	https://hal.archives-ouvertes.fr//hal-00759715v1
bordeaux.COinS	ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=SIAM%20Journal%20on%20Control%20and%20Optimization&rft.date=2012&rft.volume=50&rft.issue=2&rft.spage=720-747&rft.epage=720-747&rft.eissn=0363-0129&rft.issn=0363-0129&rft.au=COSTA,%20Oswaldo&DUFOUR,%20Fran%C3%A7ois&rft.genre=article

Archivos en el ítem

Archivos	Tamaño	Formato	Ver
No hay archivos asociados a este ítem.

Este ítem aparece en la(s) siguiente(s) colección(ones)

Institut de Mathématiques de Bordeaux (IMB) - UMR 5251

Mostrar el registro sencillo del ítem

Singularly Perturbed Discounted Markov Control Processes in a General State Space

Archivos en el ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)