Heavy Ball Momentum for Non-Strongly Convex Optimization
hal.structure.identifier | Institut de Mathématiques de Bordeaux [IMB] | |
dc.contributor.author | AUJOL, Jean-François | |
hal.structure.identifier | Institut National des Sciences Appliquées - Toulouse [INSA Toulouse] | |
hal.structure.identifier | Institut de Mathématiques de Toulouse UMR5219 [IMT] | |
dc.contributor.author | DOSSAL, Charles | |
hal.structure.identifier | Università degli studi di Genova = University of Genoa [UniGe] | |
hal.structure.identifier | Dipartimento di Informatica, Bioingegneria, Robotica e Ingegneria dei Sistemi [Genova] [DIBRIS] | |
dc.contributor.author | LABARRIÈRE, Hippolyte | |
hal.structure.identifier | Institut National des Sciences Appliquées - Toulouse [INSA Toulouse] | |
hal.structure.identifier | Institut de Mathématiques de Toulouse UMR5219 [IMT] | |
hal.structure.identifier | Laboratoire d'analyse et d'architecture des systèmes [LAAS] | |
dc.contributor.author | RONDEPIERRE, Aude | |
dc.date.accessioned | 2024-04-04T02:29:47Z | |
dc.date.available | 2024-04-04T02:29:47Z | |
dc.date.issued | 2024-03-11 | |
dc.identifier.uri | https://oskar-bordeaux.fr/handle/20.500.12278/190207 | |
dc.description.abstractEn | When considering the minimization of a quadratic or strongly convex function, it is well known that first-order methods involving an inertial term weighted by a constant-in-time parameter are particularly efficient (see Polyak [32], Nesterov [28], and references therein). By setting the inertial parameter according to the condition number of the objective function, these methods guarantee a fast exponential decay of the error. We prove that schemes of this type (later called Heavy Ball schemes) remain relevant in a relaxed setting, i.e. for composite functions satisfying a quadratic growth condition. In particular, we adapt V-FISTA, introduced by Beck in [10] for strongly convex functions, to this broader class of functions. To the authors' knowledge, the resulting worst-case convergence rates are faster than any other in the literature, including those of FISTA restart schemes. No assumption on the set of minimizers is required, and guarantees are also given in the non-optimal case, i.e. when the condition number is not exactly known. This analysis follows the study of the corresponding continuous-time dynamical system (Heavy Ball with friction system), for which new convergence results for the trajectory are shown. | |
dc.description.sponsorship | Problèmes inverses aveugles et microscopie optique - ANR-21-CE48-0008 | |
dc.description.sponsorship | Mathématiques de l'optimisation déterministe et stochastique liées à l'apprentissage profond - ANR-19-CE23-0017 | |
dc.description.sponsorship | Numerical analysis, optimal control and optimal transport for AI - ANR-23-PEIA-0004 | |
dc.language.iso | en | |
dc.title.en | Heavy Ball Momentum for Non-Strongly Convex Optimization | |
dc.type | Working paper - Preprint | |
dc.subject.hal | Mathematics [math] | |
dc.identifier.arxiv | 2403.06930 | |
bordeaux.hal.laboratories | Institut de Mathématiques de Bordeaux (IMB) - UMR 5251 | * |
bordeaux.institution | Université de Bordeaux | |
bordeaux.institution | Bordeaux INP | |
bordeaux.institution | CNRS | |
hal.identifier | hal-04500652 | |
hal.version | 1 | |
hal.origin.link | https://hal.archives-ouvertes.fr//hal-04500652v1 | |
File(s) constituting this document
Files | Size | Format | View |
---|---|---|---|
There are no files associated with this document. |
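
The abstract refers to inertial schemes whose momentum weight is constant in time and set from the condition number. As a rough illustration only, the sketch below implements the classical V-FISTA iteration for the strongly convex case (the setting attributed to Beck in the abstract), with inertial parameter (sqrt(kappa) - 1) / (sqrt(kappa) + 1) and kappa = L / mu. The parameter choice the authors derive under quadratic growth is not stated in this record, so it is not reproduced here. The function names `v_fista` and `soft_threshold` and the small diagonal test problem are assumptions made for this example.

```python
import math


def soft_threshold(v, t):
    """Proximal operator of t * ||.||_1, applied componentwise."""
    return [math.copysign(max(abs(vi) - t, 0.0), vi) for vi in v]


def v_fista(grad_f, prox_g, x0, L, mu, n_iter=500):
    """Proximal gradient method with a constant inertial parameter.

    grad_f : gradient of the smooth part f (L-smooth, mu-strongly convex)
    prox_g : prox_g(v, step) = prox of step * g evaluated at v
    """
    kappa = L / mu  # condition number
    alpha = (math.sqrt(kappa) - 1.0) / (math.sqrt(kappa) + 1.0)
    x = list(x0)
    y = list(x0)
    for _ in range(n_iter):
        g = grad_f(y)
        # Forward-backward step taken from the extrapolated point y.
        x_new = prox_g([yi - gi / L for yi, gi in zip(y, g)], 1.0 / L)
        # Inertial (heavy-ball type) extrapolation with a fixed weight.
        y = [xn + alpha * (xn - xo) for xn, xo in zip(x_new, x)]
        x = x_new
    return x


# Hypothetical test problem:
# F(x) = 0.5 * sum_i a_i * (x_i - b_i)^2 + lam * ||x||_1
a = [1.0, 10.0, 100.0]
b = [3.0, -2.0, 0.001]
lam = 0.5


def grad_f(x):
    return [ai * (xi - bi) for ai, xi, bi in zip(a, x, b)]


def prox_g(v, step):
    return soft_threshold(v, lam * step)


x_hat = v_fista(grad_f, prox_g, [0.0, 0.0, 0.0], L=max(a), mu=min(a))

# Closed-form minimizer of this separable problem, for comparison.
x_star = [math.copysign(max(abs(bi) - lam / ai, 0.0), bi) for ai, bi in zip(a, b)]
```

On this toy problem the objective is separable, so the minimizer is available in closed form and `x_hat` can be compared with `x_star` directly. Passing a value of `mu` different from the true one mimics the "non-optimal" situation mentioned in the abstract, although this sketch carries no guarantee for that case.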