An Empirical Evaluation of Normalization Methods for MicroRNA Arrays in a Liposarcoma Study. Academic Article uri icon

Overview

abstract

  • BACKGROUND: Methods for array normalization, such as median and quantile normalization, were developed for mRNA expression arrays. These methods assume few or symmetric differential expression of genes on the array. However, these assumptions are not necessarily appropriate for microRNA expression arrays because they consist of only a few hundred genes and a reasonable fraction of them are anticipated to have disease relevance. METHODS: We collected microRNA expression profiles for human tissue samples from a liposarcoma study using the Agilent microRNA arrays. For a subset of the samples, we also profiled their microRNA expression using deep sequencing. We empirically evaluated methods for normalization of microRNA arrays using deep sequencing data derived from the same tissue samples as the benchmark. RESULTS: In this study, we demonstrated array effects in microRNA arrays using data from a liposarcoma study. We found moderately high correlation between Agilent data and sequence data on the same tumors, with the Pearson correlation coefficients ranging from 0.6 to 0.9. Array normalization resulted in some improvement in the accuracy of the differential expression analysis. However, even with normalization, there is still a significant number of false positive and false negative microRNAs, many of which are expressed at moderate to high levels. CONCLUSIONS: Our study demonstrated the need to develop more efficient normalization methods for microRNA arrays to further improve the detection of genes with disease relevance. Until better methods are developed, an existing normalization method such as quantile normalization should be applied when analyzing microRNA array data.

publication date

  • March 18, 2013

Identity

PubMed Central ID

  • PMC3615992

Scopus Document Identifier

  • 84870002118

Digital Object Identifier (DOI)

  • 10.1016/j.ymeth.2012.07.030

PubMed ID

  • 23589668

Additional Document Info

volume

  • 12