A clustering approach based on the PCAMIX method (Approche de classification par la méthode PCAMIX)
Language
en
Conference paper
This item was published in
Joint Conference of the German Classification Society (GfKl), the German Association for Pattern Recognition (DAGM) and the IFCS 2011 Symposium of the International Federation of Classification Societies (IFCS), 2011-08-31, Frankfurt. 2011, p. 1
English Abstract
Clustering of variables is a way to arrange variables into homogeneous clusters, i.e. groups of variables which are strongly related to each other and thus carry the same information. Clustering of variables can then be useful for dimension reduction and variable selection. Several specific methods have been developed for the clustering of numerical variables. However, for qualitative variables or mixtures of quantitative and qualitative variables, far fewer methods have been proposed. The ClustOfVar package was therefore developed specifically for that purpose. The homogeneity criterion of a cluster is the sum of correlation ratios (for qualitative variables) and squared correlations (for quantitative variables) to a synthetic variable that summarizes the variables in the cluster as well as possible. This synthetic variable is the first principal component obtained with the PCAMIX method. Two algorithms for the clustering of variables are proposed: an iterative relocation algorithm and ascendant hierarchical clustering. We also propose a bootstrap approach to determine suitable numbers of clusters. The proposed methodologies are illustrated on real datasets.
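
The homogeneity criterion described in the abstract can be illustrated with a minimal Python sketch. This is not the ClustOfVar implementation: the function names (`squared_correlation`, `correlation_ratio`, `homogeneity`) and the toy data are hypothetical, and the synthetic variable is supplied by hand here rather than computed as the first principal component of PCAMIX.

```python
import numpy as np


def squared_correlation(x, y):
    """Squared Pearson correlation between a quantitative variable and y."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    return float(np.corrcoef(x, y)[0, 1] ** 2)


def correlation_ratio(labels, y):
    """Correlation ratio eta^2: share of the variance of y explained
    by the categories of a qualitative variable."""
    labels = np.asarray(labels)
    y = np.asarray(y, dtype=float)
    total = np.sum((y - y.mean()) ** 2)
    between = sum(
        np.sum(labels == level) * (y[labels == level].mean() - y.mean()) ** 2
        for level in np.unique(labels)
    )
    return float(between / total)


def homogeneity(quantitative, qualitative, synthetic):
    """Sum of squared correlations (quantitative variables) and
    correlation ratios (qualitative variables) to the synthetic variable."""
    return (
        sum(squared_correlation(x, synthetic) for x in quantitative)
        + sum(correlation_ratio(z, synthetic) for z in qualitative)
    )


# Toy cluster (hypothetical data). The synthetic variable is assumed given;
# in the method described above it would come from PCAMIX.
synthetic = np.array([1.0, 2.0, 3.0, 4.0])
height = 2.0 * synthetic + 1.0           # linear in the synthetic variable
group = np.array(["a", "a", "b", "b"])   # qualitative variable with 2 levels

score = homogeneity([height], [group], synthetic)
print(round(score, 6))
```

Each term lies between 0 and 1, so a cluster of p variables has a criterion of at most p, reached when every variable is perfectly linked to the synthetic variable.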
Origin
Hal imported