hal.structure.identifier: Universidade de São Paulo = University of São Paulo [USP]
dc.contributor.author: COSTA, Oswaldo
hal.structure.identifier: Institut de Mathématiques de Bordeaux [IMB]
hal.structure.identifier: Quality control and dynamic reliability [CQFD]
dc.contributor.author: DUFOUR, François
dc.date.accessioned: 2024-04-04T02:23:56Z
dc.date.available: 2024-04-04T02:23:56Z
dc.date.issued: 2012
dc.identifier.issn: 0363-0129
dc.identifier.uri: https://oskar-bordeaux.fr/handle/20.500.12278/189778
dc.description.abstractEn: This work studies the asymptotic optimality of discrete-time Markov decision processes (MDPs for short) with general state and action spaces and with weak and strong interactions. Following an approach similar to that developed in Liu 2001, the idea in this paper is to consider an MDP with general state and action spaces and to reduce the dimension of the state space by considering an averaged model. This formulation is often described by introducing a small parameter $\epsilon > 0$ in the definition of the transition kernel, leading to a singularly perturbed Markov model with two time scales. Our objective is twofold. First, it is shown that the value function of the control problem for the perturbed system converges to the value function of a limit averaged control problem as $\epsilon$ goes to zero. Second, it is proved that a feedback control policy for the original control problem, defined by using an optimal feedback policy for the limit problem, is asymptotically optimal. Our work extends existing results in the literature in two directions: the underlying MDP is defined on general state and action spaces, and we do not impose strong conditions on the recurrence structure of the MDP such as Doeblin's condition.
dc.language.iso: en
dc.publisher: Society for Industrial and Applied Mathematics
dc.title.en: Singularly Perturbed Discounted Markov Control Processes in a General State Space
dc.type: Journal article
dc.subject.hal: Mathematics [math]/Optimization and Control [math.OC]
bordeaux.journal: SIAM Journal on Control and Optimization
bordeaux.page: 720-747
bordeaux.volume: 50
bordeaux.hal.laboratories: Institut de Mathématiques de Bordeaux (IMB) - UMR 5251*
bordeaux.issue: 2
bordeaux.institution: Université de Bordeaux
bordeaux.institution: Bordeaux INP
bordeaux.institution: CNRS
bordeaux.peerReviewed: yes
hal.identifier: hal-00759715
hal.version: 1
hal.popular: no
hal.audience: International
hal.origin.link: https://hal.archives-ouvertes.fr//hal-00759715v1