Stationary Markov Nash equilibria for nonzero-sum constrained ARAT Markov games
hal.structure.identifier | Méthodes avancées d’apprentissage statistique et de contrôle [ASTRAL] | |
hal.structure.identifier | Institut de Mathématiques de Bordeaux [IMB] | |
hal.structure.identifier | Quality control and dynamic reliability [CQFD] | |
hal.structure.identifier | Institut Polytechnique de Bordeaux [Bordeaux INP] | |
dc.contributor.author | DUFOUR, François | |
hal.structure.identifier | Universidad Estatal a Distancia [UNED] | |
dc.contributor.author | PRIETO-RUMEAU, Tomás | |
dc.date.accessioned | 2024-04-04T02:42:59Z | |
dc.date.available | 2024-04-04T02:42:59Z | |
dc.date.issued | 2022 | |
dc.identifier.issn | 0363-0129 | |
dc.identifier.uri | https://oskar-bordeaux.fr/handle/20.500.12278/191295 | |
dc.description.abstractEn | We consider a nonzero-sum Markov game on an abstract measurable state space with compact metric action spaces. The goal of each player is to maximize his respective discounted payoff function under the condition that some constraints on a discounted payoff are satisfied. We are interested in the existence of a Nash or noncooperative equilibrium. Under suitable conditions, which include absolute continuity of the transitions with respect to some reference probability measure, additivity of the payoffs and the transition probabilities (ARAT condition), and continuity in action of the payoff functions and the density function of the transitions of the system, we establish the existence of a constrained stationary Markov Nash equilibrium, that is, the existence of stationary Markov strategies for each of the players yielding an optimal profile within the class of all history-dependent profiles. | |
dc.language.iso | en | |
dc.publisher | Society for Industrial and Applied Mathematics | |
dc.subject.en | Nash equilibrium | |
dc.subject.en | Nonzero-sum games | |
dc.subject.en | Constrained games | |
dc.subject.en | ARAT games | |
dc.title.en | Stationary Markov Nash equilibria for nonzero-sum constrained ARAT Markov games | |
dc.type | Article de revue | |
dc.identifier.doi | 10.1137/21M144565X | |
dc.subject.hal | Mathématiques [math]/Optimisation et contrôle [math.OC] | |
dc.identifier.arxiv | 2109.13003 | |
bordeaux.journal | SIAM Journal on Control and Optimization | |
bordeaux.hal.laboratories | Institut de Mathématiques de Bordeaux (IMB) - UMR 5251 | * |
bordeaux.institution | Université de Bordeaux | |
bordeaux.institution | Bordeaux INP | |
bordeaux.institution | CNRS | |
bordeaux.peerReviewed | oui | |
hal.identifier | hal-03510818 | |
hal.version | 1 | |
hal.popular | non | |
hal.audience | Internationale | |
hal.origin.link | https://hal.archives-ouvertes.fr//hal-03510818v1 | |
bordeaux.COinS | ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=SIAM%20Journal%20on%20Control%20and%20Optimization&rft.date=2022&rft.eissn=0363-0129&rft.issn=0363-0129&rft.au=DUFOUR,%20Fran%C3%A7ois&PRIETO-RUMEAU,%20Tom%C3%A1s&rft.genre=article |
Fichier(s) constituant ce document
Fichiers | Taille | Format | Vue |
---|---|---|---|
Il n'y a pas de fichiers associés à ce document. |