LORENZO, Hadrien; THIÉBAUT, Rodolphe; SARACCO, Jérôme

The system will be going down for regular maintenance. Please save your work and logout.

hal.structure.identifier	Vaccine Research Institute [Créteil, France] [VRI]
hal.structure.identifier	Statistics In System biology and Translational Medicine [SISTM]
dc.contributor.author	LORENZO, Hadrien
hal.structure.identifier	Statistics In System biology and Translational Medicine [SISTM]
hal.structure.identifier	Université de Bordeaux [UB]
dc.contributor.author	THIÉBAUT, Rodolphe
hal.structure.identifier	Institut de Mathématiques de Bordeaux [IMB]
hal.structure.identifier	Quality control and dynamic reliability [CQFD]
hal.structure.identifier	Ecole Nationale Supérieure de Cognitique [ENSC]
dc.contributor.author	SARACCO, Jérôme
dc.date.accessioned	2024-04-04T03:07:33Z
dc.date.available	2024-04-04T03:07:33Z
dc.date.conference	2018-01-11
dc.identifier.uri	https://oskar-bordeaux.fr/handle/20.500.12278/193456
dc.description.abstractEn	Several sets of variables can be analyzed simultaneously by canonical correlation in a multi-way analysis. These sets of variables are often high-dimensional and repeated over time. For instance, full-transcriptome measured by RNA-Seq used to be performed in longitudinal studies as well as other measures such as peptides or cells. Hence, canonical correlation analysis has been extended with regularized approaches to deal with several high dimensional data. However, some measurements can be missing for technical reasons and therefore introduce undesired structures due to the huge dimension of the datasets.Our objective is to find an efficient method allowing to impute the missing values taking into account the three-way structure, participant-transcriptome-time, and also the missing path structure.We proposed an EM-like covariance-maximization lasso-penalized high-dimensional completion matrix algorithm to reach that goal.We compared our approach on simulated data-sets with the mean imputation per gene pertime step, the missMDA-imputeMFA algorithm which takes structure into account and the softImpute solution initially designed to solve the Netix competition a high-dimensional problem. We used two criterions: the L2-error between estimated and simulated values and the L2-error between estimated and simulated covariance matrices. The numerical resultsexhibited the superiority of the proposed method in most of the scenarii. We also illustrated our approach on a real data-set from a phase I Ebola vaccine trial measuring RNA-Seq data after vaccination (richtien, cell report 2017) in 20 participants at 4 different times on whole-blood samples, representing 74 sequenced-samples, among which 24 samples were missing because of technological issues.
dc.language.iso	en
dc.title.en	Multi-block high-dimensional lasso-penalized analysis with imputation of missing data applied to postgenomic data in an Ebola vaccine trial
dc.type	Communication dans un congrès
dc.subject.hal	Statistiques [stat]
dc.subject.hal	Sciences du Vivant [q-bio]/Médecine humaine et pathologie
bordeaux.hal.laboratories	Institut de Mathématiques de Bordeaux (IMB) - UMR 5251	*
bordeaux.institution	Université de Bordeaux
bordeaux.institution	Bordeaux INP
bordeaux.institution	CNRS
bordeaux.conference.title	Annual workshop on Statistical Methods for Post Genomic Data - SMPGD 2018
bordeaux.country	FR
bordeaux.conference.city	Montpellier
bordeaux.peerReviewed	oui
hal.identifier	hal-01664610
hal.version	1
hal.invited	non
hal.proceedings	non
hal.conference.end	2018-01-12
hal.popular	non
hal.audience	Internationale
hal.origin.link	https://hal.archives-ouvertes.fr//hal-01664610v1
bordeaux.COinS	ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.au=LORENZO,%20Hadrien&THI%C3%89BAUT,%20Rodolphe&SARACCO,%20J%C3%A9r%C3%B4me&rft.genre=unknown

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

Institut de Mathématiques de Bordeaux (IMB) - UMR 5251

Show simple item record

Multi-block high-dimensional lasso-penalized analysis with imputation of missing data applied to postgenomic data in an Ebola vaccine trial

Files in this item

This item appears in the following Collection(s)