Show simple item record

hal.structure.identifierMéthodes avancées d’apprentissage statistique et de contrôle [ASTRAL]
dc.contributor.authorLORENZO, Hadrien
hal.structure.identifierSartorius Stedim France S.A.S. [Aubagne]
dc.contributor.authorCLOAREC, Olivier
hal.structure.identifierStatistics In System biology and Translational Medicine [SISTM]
hal.structure.identifierBordeaux population health [BPH]
hal.structure.identifierVaccine Research Institute [Créteil, France] [VRI]
dc.contributor.authorTHIÉBAUT, Rodolphe
hal.structure.identifierMéthodes avancées d’apprentissage statistique et de contrôle [ASTRAL]
dc.contributor.authorSARACCO, Jérôme
dc.date.accessioned2024-04-04T02:45:19Z
dc.date.available2024-04-04T02:45:19Z
dc.date.issued2022
dc.identifier.issn1932-1864
dc.identifier.urihttps://oskar-bordeaux.fr/handle/20.500.12278/191456
dc.description.abstractEnIn the supervised high dimensional settings with a large number of variables and a low number of individuals, variable selection allows a simpler interpretation and more reliable predictions. That subspace selection is often managed with supervised tools when the real question is motivated by variable prediction. We propose a Partial Least Square (PLS) based method, called data-driven sparse PLS (ddsPLS), allowing variable selection both in the covariate and the response parts using a single hyper-parameter per component. The subspace estimation is also performed by tuning a number of underlying parameters. The ddsPLS method is compared to existing methods such as classical PLS and two well established sparse PLS methods through numerical simulations. The observed results are promising both in terms of variable selection and prediction performance. This methodology is based on new prediction quality descriptors associated with the classical R 2 and Q 2 and uses bootstrap sampling to tune parameters and select an optimal regression model.
dc.language.isoen
dc.publisherWiley
dc.subject.enPLS regression
dc.subject.enSupervised learning
dc.subject.enVariable selection
dc.subject.enSoft thresholding
dc.subject.enMulti-block data
dc.title.enData-Driven Sparse Partial Least Squares
dc.typeArticle de revue
dc.identifier.doi10.1002/sam.11558
dc.subject.halMathématiques [math]/Statistiques [math.ST]
bordeaux.journalStatistical Analysis and Data Mining
bordeaux.page264-282
bordeaux.volume15
bordeaux.hal.laboratoriesInstitut de Mathématiques de Bordeaux (IMB) - UMR 5251*
bordeaux.issue2
bordeaux.institutionUniversité de Bordeaux
bordeaux.institutionBordeaux INP
bordeaux.institutionCNRS
bordeaux.peerReviewedoui
hal.identifierhal-03368956
hal.version1
hal.popularnon
hal.audienceInternationale
hal.origin.linkhttps://hal.archives-ouvertes.fr//hal-03368956v1
bordeaux.COinSctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=Statistical%20Analysis%20and%20Data%20Mining&rft.date=2022&rft.volume=15&rft.issue=2&rft.spage=264-282&rft.epage=264-282&rft.eissn=1932-1864&rft.issn=1932-1864&rft.au=LORENZO,%20Hadrien&CLOAREC,%20Olivier&THI%C3%89BAUT,%20Rodolphe&SARACCO,%20J%C3%A9r%C3%B4me&rft.genre=article


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record