Approche de classification par la méthode PCAMIX
Langue
en
Communication dans un congrès
Ce document a été publié dans
Joint Conference of the German Classification Society (GfKl), the German Association for Pattern Recognition (DAGM) and the IFCS 2011 Symposium of the International Federation of Classification Societies (IFCS), 2011-08-31, Frankfurt. 2011p. 1
Résumé en anglais
Clustering of variables is as a way to arrange variables into homogeneous clusters i.e. groups of variables which are strongly related to each other and thus bring the same information. Clustering of variables can then be ...Lire la suite >
Clustering of variables is as a way to arrange variables into homogeneous clusters i.e. groups of variables which are strongly related to each other and thus bring the same information. Clustering of variables can then be useful for dimension reduction and variable selection. Several specic methods have been developed for the clustering of numerical variables. However concerning qualitative variables or mixtures of quantitative and qualitative variables, much less methods have been proposed. The ClustOfVar package has then been developped specically for that purpose. The homogeneity criterion of a cluster is the sum of correlation ratios (for qualitative variables) and squared correlations (for quantitative variables) to a synthetic variable, summarizing as good as possible the variables in the cluster. This synthetic variable is the rst principal component obtained with the PCAMIX method. Two algorithms for the clustering of variables are proposed: iterative relocation algorithm, ascendant hierarchical clustering. We also propose a bootstrap approach to determine suitable numbers of clusters. The proposed methodologies are illustrated on real datasets.< Réduire
Origine
Importé de halUnités de recherche