Mostrar el registro sencillo del ítem

hal.structure.identifierUniversidade de São Paulo = University of São Paulo [USP]
dc.contributor.authorCOSTA, Oswaldo
hal.structure.identifierInstitut de Mathématiques de Bordeaux [IMB]
hal.structure.identifierQuality control and dynamic reliability [CQFD]
dc.contributor.authorDUFOUR, François
dc.date.accessioned2024-04-04T02:19:59Z
dc.date.available2024-04-04T02:19:59Z
dc.date.issued2012
dc.identifier.issn1449-5910
dc.identifier.urihttps://oskar-bordeaux.fr/handle/20.500.12278/189453
dc.description.abstractEnThis paper studies the average control problem of discrete-time Markov Decision Processes (MDPs for short) with general state space, Feller transition probabilities, and possibly non-compact control constraint sets A(x). Two hypotheses are considered: either the cost function c is strictly unbounded or the multifunctions A(r)(x) = {a is an element of A(x) : c(x, a) <= r} are upper-semicontinuous and compact-valued for each real r. For these two cases we provide new results for the existence of a solution to the average-cost optimality equality and inequality using the vanishing discount approach. We also study the convergence of the policy iteration approach under these conditions. It should be pointed out that we do not make any assumptions regarding the convergence and the continuity of the limit function generated by the sequence of relative difference of the alpha-discounted value functions and the Poisson equations as often encountered in the literature.
dc.language.isoen
dc.publisherAustral Internet Publishing
dc.title.enAverage control of Markov decision processes with Feller transition probabilities and general action spaces
dc.typeArticle de revue
dc.identifier.doi10.1016/j.jmaa.2012.05.073
dc.subject.halMathématiques [math]/Probabilités [math.PR]
bordeaux.journalAustralian Journal of Mathematical Analysis and Applications
bordeaux.page58-69
bordeaux.volume396
bordeaux.hal.laboratoriesInstitut de Mathématiques de Bordeaux (IMB) - UMR 5251*
bordeaux.issue1
bordeaux.institutionUniversité de Bordeaux
bordeaux.institutionBordeaux INP
bordeaux.institutionCNRS
bordeaux.peerReviewedoui
hal.identifierhal-00938889
hal.version1
hal.popularnon
hal.audienceInternationale
hal.origin.linkhttps://hal.archives-ouvertes.fr//hal-00938889v1
bordeaux.COinSctx_ver=Z39.88-2004&amp;rft_val_fmt=info:ofi/fmt:kev:mtx:journal&amp;rft.jtitle=Australian%20Journal%20of%20Mathematical%20Analysis%20and%20Applications&amp;rft.date=2012&amp;rft.volume=396&amp;rft.issue=1&amp;rft.spage=58-69&amp;rft.epage=58-69&amp;rft.eissn=1449-5910&amp;rft.issn=1449-5910&amp;rft.au=COSTA,%20Oswaldo&amp;DUFOUR,%20Fran%C3%A7ois&amp;rft.genre=article


Archivos en el ítem

ArchivosTamañoFormatoVer

No hay archivos asociados a este ítem.

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem