Multi-block high-dimensional lasso-penalized analysis with imputation of missing data applied to postgenomic data in an Ebola vaccine trial
hal.structure.identifier | Vaccine Research Institute [Créteil, France] [VRI] | |
hal.structure.identifier | Statistics In System biology and Translational Medicine [SISTM] | |
dc.contributor.author | LORENZO, Hadrien | |
hal.structure.identifier | Statistics In System biology and Translational Medicine [SISTM] | |
hal.structure.identifier | Université de Bordeaux [UB] | |
dc.contributor.author | THIÉBAUT, Rodolphe | |
hal.structure.identifier | Institut de Mathématiques de Bordeaux [IMB] | |
hal.structure.identifier | Quality control and dynamic reliability [CQFD] | |
hal.structure.identifier | Ecole Nationale Supérieure de Cognitique [ENSC] | |
dc.contributor.author | SARACCO, Jérôme | |
dc.date.accessioned | 2024-04-04T03:07:33Z | |
dc.date.available | 2024-04-04T03:07:33Z | |
dc.date.conference | 2018-01-11 | |
dc.identifier.uri | https://oskar-bordeaux.fr/handle/20.500.12278/193456 | |
dc.description.abstractEn | Several sets of variables can be analyzed simultaneously by canonical correlation in a multi-way analysis. These sets of variables are often high-dimensional and repeated over time. For instance, the full transcriptome measured by RNA-Seq is commonly collected in longitudinal studies, along with other measures such as peptides or cells. Hence, canonical correlation analysis has been extended with regularized approaches to deal with several high-dimensional datasets. However, some measurements can be missing for technical reasons and therefore introduce undesired structures because of the huge dimension of the datasets. Our objective is to find an efficient method to impute the missing values while taking into account the three-way structure (participant, transcriptome, time) and also the missing path structure. We proposed an EM-like, covariance-maximizing, lasso-penalized, high-dimensional matrix completion algorithm to reach that goal. We compared our approach on simulated datasets with mean imputation per gene per time step, the missMDA-imputeMFA algorithm, which takes structure into account, and the softImpute solution, initially designed for the Netflix competition, a high-dimensional problem. We used two criteria: the L2 error between estimated and simulated values and the L2 error between estimated and simulated covariance matrices. The numerical results showed the superiority of the proposed method in most of the scenarios. We also illustrated our approach on a real dataset from a phase I Ebola vaccine trial measuring RNA-Seq data after vaccination (Rechtien et al., Cell Reports, 2017) in 20 participants at 4 different times on whole-blood samples, representing 74 sequenced samples, among which 24 samples were missing because of technological issues. | |
dc.language.iso | en | |
dc.title.en | Multi-block high-dimensional lasso-penalized analysis with imputation of missing data applied to postgenomic data in an Ebola vaccine trial | |
dc.type | Conference communication | |
dc.subject.hal | Statistics [stat] | |
dc.subject.hal | Life Sciences [q-bio]/Human health and pathology | |
bordeaux.hal.laboratories | Institut de Mathématiques de Bordeaux (IMB) - UMR 5251 | * |
bordeaux.institution | Université de Bordeaux | |
bordeaux.institution | Bordeaux INP | |
bordeaux.institution | CNRS | |
bordeaux.conference.title | Annual workshop on Statistical Methods for Post Genomic Data - SMPGD 2018 | |
bordeaux.country | FR | |
bordeaux.conference.city | Montpellier | |
bordeaux.peerReviewed | yes | |
hal.identifier | hal-01664610 | |
hal.version | 1 | |
hal.invited | no | |
hal.proceedings | no | |
hal.conference.end | 2018-01-12 | |
hal.popular | no | |
hal.audience | International | |
hal.origin.link | https://hal.archives-ouvertes.fr//hal-01664610v1 | |
Files in this document
Files | Size | Format | View |
---|---|---|---|
There are no files associated with this document. |
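
The abstract names a baseline (mean imputation per gene per time step) and two evaluation criteria (L2 error on values and on covariance matrices). The sketch below illustrates these on simulated data. It is not the authors' code: the array layout (participants, times, genes), the function names, the toy dimensions, and the choice of a gene-by-gene covariance over stacked (participant, time) samples are all assumptions made for illustration.

```python
import numpy as np


def mean_impute_per_gene_per_time(x):
    """Fill NaNs with the mean over participants for each (time, gene) pair.

    x has shape (participants, times, genes); NaN marks a missing value.
    """
    means = np.nanmean(x, axis=0, keepdims=True)  # shape (1, times, genes)
    return np.where(np.isnan(x), means, x)


def l2_value_error(estimate, truth, missing_mask):
    """L2 norm of (estimate - truth) restricted to the missing entries."""
    diff = (estimate - truth)[missing_mask]
    return float(np.sqrt(np.sum(diff ** 2)))


def l2_covariance_error(estimate, truth):
    """Frobenius norm between gene-by-gene covariance matrices.

    Assumption: each (participant, time) pair is treated as one sample row.
    """
    n_genes = truth.shape[-1]
    cov_est = np.cov(estimate.reshape(-1, n_genes), rowvar=False)
    cov_true = np.cov(truth.reshape(-1, n_genes), rowvar=False)
    return float(np.linalg.norm(cov_est - cov_true))


# Toy simulation (hypothetical sizes): 20 participants, 4 times, 50 genes.
rng = np.random.default_rng(0)
truth = rng.normal(size=(20, 4, 50))

# Whole samples go missing, i.e. all genes of a (participant, time) pair.
missing_samples = np.zeros((20, 4), dtype=bool)
missing_samples[::4, 1] = True
missing_samples[1::5, 3] = True

observed = truth.copy()
observed[missing_samples] = np.nan
missing_mask = np.isnan(observed)

imputed = mean_impute_per_gene_per_time(observed)
value_err = l2_value_error(imputed, truth, missing_mask)
cov_err = l2_covariance_error(imputed, truth)
print("L2 error on values:", value_err)
print("L2 error on covariance:", cov_err)
```

Any other imputation method could be dropped in place of `mean_impute_per_gene_per_time` and scored with the same two functions, which is the kind of comparison the abstract describes.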