from the PacBio Blog
With the recent launch of SMRT Analysis v2.2, we’re excited to introduce analysis software support for the new Iso-Seq™ method for sequencing full-length transcripts and gene isoforms, with no assembly required! Today we’ll take a deeper look at the Iso-Seq method to explain its unique scientific value and review publications from those already applying Single Molecule, Real-Time (SMRT®) Sequencing to this exciting area of research.
In plant and animal genomes, along with all higher eukaryotic organisms, the majority of genes are alternatively spliced to produce multiple transcript isoforms. In humans, for example, there is evidence for alternative splicing of more than 95% of genes [1], with an average of more than five isoforms per gene. Gene regulation through alternative splicing can dramatically increase the protein-coding potential of a genome that contains a limited number of genes that encode proteins. Somewhat surprisingly, alternatively spliced isoforms from a single gene can also have very different, even antagonistic, functions [2]. Therefore, understanding the functional biology of a genome requires knowing the full complement of isoforms. Microarrays and high-throughput cDNA sequencing have become incredibly useful tools for studying transcriptomes, yet these technologies provide small snippets of transcripts and building complete transcripts to study gene isoforms has been challenging.
Thanks to the extraordinarily long reads available with PacBio® sequencing, the new Iso-Seq method provides full-length reads spanning entire transcript isoforms all the way from the polyA-tail to the 5′ end. It is no longer necessary to reconstruct transcripts or infer isoforms based on combining local information since each sequence represents an individual full-length cDNA molecule. The method combines isoform-level resolution with the best of whole-transcriptome sequencing to enable direct gene isoform sequencing across an entire transcriptome. We’re pleased to report that scientists are now using the Iso-Seq method to routinely sequence full-length isoforms in a wide variety of organisms and are applying the approach to improve annotations in reference genomes, characterize gene isoforms in important gene families, and find novel genes even in the most comprehensively studied human cell lines.
Here is a selected list of recent publications, presentations, and sample data:
• Webinar: No Assembly Required – Extremely Long Reads for Full-length Transcript Isoform Sequencing
For more detailed information on cDNA sequencing with PacBio, don’t miss this primer on GitHub and the shared protocol on SampleNet.
[1] Pan et al. (2008) Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nature Genetics 40: 1413-1415. [2] Boise et al. (1993) Bcl-x, a bcl-2-related gene that functions as a dominant regulator of apoptotic cell death. Cell 74: 597-608.