In recent years RNA-Seq technology has been used not only to quantify differences in gene expression but also to understand the underlying mechanisms that lead to these differences. Nucleotide sequence variation arising through evolution may differentially affect the expression profiles of divergent species. RNA-Seq technology, combined with techniques to differentiate parental alleles and quantify their abundance, have recently become popular methods for allele specific gene expression (ASGE) analyses. However, analysis of gene expression within interspecies hybrids may be difficult when one of the two parental genomes represented in the hybrid does not have robust genomic resources or available transcriptome data.
Now researchers at Texas State University have compared two strategies for analyzing allele specific expression within interspecies hybrids produced from crossing two Xiphophorus fish species. The first strategy relies upon a robust reference transcriptome assembly from one species followed by identification of SNPs and creation of an in silico reference transcriptome for the second species. The second strategy employs de novo assembly of reference transcriptomes for both parental species followed by identification of homologous transcripts prior to mapping hybrid reads to a combined hybrid reference.
Their results show that, although both methods are able to achieve balanced allelic distribution upon read mapping of F(1) hybrid fish transcriptomes, the second “de novo” assembly approach is superior for ASGE analyses and leads to results more consistent with those found from quantitative real time PCR assessment of gene expression. In addition, their analysis indicates that indels between the two parental alleles are the major cause of the differences in results observed when employing these two methods.
- Shen Y, Garcia T, Pabuwal V, Boswell M, Pasquali A, Beldorth I, Warren W, Schartl M, Cresko WA, Walter RB. (2012) Alternative strategies for development of a reference transcriptome for quantification of allele specific expression in organisms having sparse genomic resources. Comp Biochem Physiol Part D Genomics Proteomics [Epub ahead of print]. [abstract]