The Most Informative Spacing Test Effectively Discovers Biologically Relevant Outliers or Multiple Modes in Expression.

Several outlier and subgroup identification statistics (OASIS) have been proposed to discover transcriptomic features with outliers or multiple modes in expression that are indicative of distinct biological processes or subgroups. Here, researchers from St. Jude Children’s Research Hospital in Memphis borrow ideas from the OASIS methods in the bioinformatics and statistics literature to develop the most informative spacing test (MIST) for unsupervised detection of such transcriptomic features. In an example application involving 14 cases of pediatric acute megakaryoblastic leukemia, MIST more robustly identified features that perfectly discriminate subjects according to gender or the presence of a prognostically relevant fusion gene than did seven other OASIS methods in the analysis of RNA-seq exon expression, RNA-seq exon junction expression, and microarray exon expression data. MIST was also effective at identifying features related to gender or molecular subtype in an example application involving 157 adult cases of acute myeloid leukemia.

Availability – MIST will be freely available in the OASIS R package at http://www.stjuderesearch.org/site/depts/biostats.

Pawlikowska I, Wu G, Edmonson M, Liu Z, Gruber T, Zhang J, Pounds S. (2014) The Most Informative Spacing Test Effectively Discovers Biologically Relevant Outliers or Multiple Modes in Expression. Bioinformatics [Epub ahead of print].