A novel approach to the clustering of microarray data via nonparametric density estimation. Academic Article uri icon

Overview

abstract

  • BACKGROUND: Cluster analysis is a crucial tool in several biological and medical studies dealing with microarray data. Such studies pose challenging statistical problems due to dimensionality issues, since the number of variables can be much higher than the number of observations. RESULTS: Here, we present a general framework to deal with the clustering of microarray data, based on a three-step procedure: (i) gene filtering; (ii) dimensionality reduction; (iii) clustering of observations in the reduced space. Via a nonparametric model-based clustering approach we obtain promising results both in simulated and real data. CONCLUSIONS: The proposed algorithm is a simple and effective tool for the clustering of microarray data, in an unsupervised setting.

publication date

  • February 8, 2011

Research

keywords

  • Algorithms
  • Cluster Analysis
  • Computational Biology
  • Models, Statistical
  • Oligonucleotide Array Sequence Analysis

Identity

PubMed Central ID

  • PMC3042915

Scopus Document Identifier

  • 79551684888

Digital Object Identifier (DOI)

  • 10.2307/2532201

PubMed ID

  • 21303507

Additional Document Info

volume

  • 12