A simple model-based approach to variable selection in classification and clustering

Partovi Nia, Vahid; Davison, Anthony C.

Clustering and classification of replicated biological data is often performed using classical techniques that inappropriately treat the data as unreplicated, or by complex modern ones that are computationally demanding. In this paper we introduce a simple approach based on a `spike-and-slab' mixture model that is fast, automatic, allows classification, clustering and variable selection in a single framework, and can also handle replicated data. Simulation shows that our approach compares well with other recently proposed methods. The ideas are illustrated by application to microarray and metabolomic data.

Published December 2014 , 22 pages

Publication

Jun 2015

A simple model-based approach to variable selection in classification and clustering

Vahid Partovi Nia and Anthony C. Davison

Canadian Journal of Statistics, 43(2), 157–175, 2015 BibTeX reference

GERAD

G-2014-106

A simple model-based approach to variable selection in classification and clustering

Vahid Partovi Nia and Anthony C. Davison

Publication