A variety of pathway/gene-set approaches have been proposed to provide evidence of higher-level biological phenomena in the association of expression with experimental condition or clinical outcome. Among these approaches, it has been repeatedly shown that resampling methods are far preferable to approaches that implicitly assume independence of genes. However, few approaches have been optimized for the specific characteristics of RNA-Seq transcription data, in which mapped tags produce discrete counts with varying library sizes, and with potential outliers or skewness patterns that violate parametric assumptions.
Researchers from North Carolina State University describe transformations to RNA-Seq data to improve power for linear associations with outcome and flexibly handle normalization factors. Using these transformations or alternate transformations, they apply recently developed null approximations to quadratic form statistics for both self-contained and competitive pathway testing. The approach provides a convenient integrated platform for RNA-Seq pathway testing. The researchers demonstrate that the approach provides appropriate type I error control without actual permutation and is powerful under many settings in comparison to competing approaches.
The PathwaySeq pipeline of RNA-Seq pathway analysis.
Availability – The PathwaySeq software is available at: http://onlinelibrary.wiley.com/store/10.1111/biom.12372/asset/supinfo/biom12372-sup-0002-SuppData_Code.zip?v=1&s=ecc39eb29a90cc4fba085c64d5fd2d2f3d71db46