Next-generation sequencing technologies are widely used in genome research, and RNA sequencing (RNA-Seq) is becoming the main application for gene expression profiling. A large number of computational methods have been developed for analyzing differentially expressed (DE) genes in RNA-Seq data. However, most existing algorithms prefer to call long genes as DE. Short DE genes are rarely detected.
In this work, researchers from Fudan University set out to gain insight into the influence of gene length on RNA-Seq data analysis and to figure out the effect of gene length on variance estimation of RNA-Seq read counts, which is important for statistic test to identify DE genes. They proposed a balanced method of hunting for short DE genes with significance by smoothing a gene length factor. Computational experiments indicate that our method performs well.
Availability – LenSeq is available at: http://www.iipl.fudan.edu.cn/lenseq/