Oct
23
EricScript – for Discovering chimeric transcripts in paired-end RNA-Seq data
Filed Under Splicing and Junction Mapping | Leave a Comment

ChimEric tranScript detection algorithm (EricScript) is a novel computational framework, named, for the identification of gene fusion products in paired-end RNA-seq data. A simulation study on synthetic data demonstrates that EricScript enables one to achieve higher sensitivity and specificity than existing methods with noticeably lower running times. The method is also applied to publicly available RNA-seq tumour datasets to demonstrate its capability in rediscovering known gene fusions.
Availability: The EricScript package is freely available under GPL v3 license at http://ericscript.sourceforge.net.
- Benelli M, Pescucci C, Marseglia G, Severgnini M, Torricelli F, Magi A. (2012) Discovering chimeric transcripts in paired-end RNA-seq data by using EricScript. Bioinformatics [Epub ahead of print]. [abstract]
Incoming search terms:
- bioconductor rna-seq
- chimeric paired end reads
- rnaseq ngs chimeric transcript detection
Oct
22
Date & Time: Thursday, 15th November 2012 | 3:00pm (GMT)
Duration: 45 min
Presented by: Dr Daniel Swan – Senior NGS Computational Biologist, OGT
RNA-Seq is revolutionising the field of transcriptomics, delivering unprecedented biological insight into the way cells function. This can only be reliably achieved through good experimental design – ensuring the data generated represents the true biology – and rigorous data analysis procedures, allowing transcript-level visualisation and interrogation of results.
This latest Webinar, will detail how Oxford Gene Technology (OGT) addresses these challenges to allow you to rapidly access meaningful results for a range of model eukaryotes.
(more info and register here…)
Oct
19
Improving Long Read Accuracy by Short Read Alignment
Filed Under Transcriptome Assembly Tools | Leave a Comment
The recent development of third generation sequencing (TGS) generates much longer reads than second generation sequencing (SGS) and thus provides a chance to solve problems that are difficult to study through SGS alone. However, higher raw read error rates are an intrinsic drawback in most TGS technologies.
Researchers at Stanford University present a computational method, LSC, to perform error correction of TGS long reads (LR) by SGS short reads (SR). Aiming to reduce the error rate in homopolymer runs in the main TGS platform, the PacBio® RS, LSC applies a homopolymer compression (HC) transformation strategy to increase the sensitivity of SR-LR alignment without scarifying alignment accuracy. They applied LSC to 100,000 PacBio long reads from human brain cerebellum RNA-seq data and 64 million single-end 75 bp reads from human brain RNA-seq data. The results show LSC can correct PacBio long reads to reduce the error rate by more than 3 folds. The improved accuracy greatly benefits many downstream analyses, such as directional gene isoform detection in RNA-seq study. Compared with another hybrid correction tool, LSC can achieve over double the sensitivity and similar specificity.
Availability: LSC is freely available for the research community and can be downloaded at http://www.stanford.edu/~kinfai/LSC/LSC.html.
- Au KF, Underwood JG, Lee L, Wong WH. (2012) Improving PacBio Long Read Accuracy by Short Read Alignment. PLoS One 7(10), e46679. [article]
Incoming search terms:
- trinity RNA tutorial
- improve read alignment rna-seq
Oct
19
Transcriptome Assembly and Isoform Expression Level Estimation from Biased RNA-Seq Reads
Filed Under Expression and Quantification, Transcriptome Assembly Tools | Leave a Comment
RNA-Seq uses the high-throughput sequencing technology to identify and quantify transcriptome at an unprecedented high resolution and low cost. However, RNA-Seq reads are usually not uniformly distributed and biases in RNA-Seq data post great challenges in many applications including transcriptome assembly and the expression level estimation of genes or isoforms. Much effort has been made in the literature to calibrate the expression level estimation from biased RNA-Seq data, but the effect of biases on transcriptome assembly remains largely unexplored. Read more
Incoming search terms:
- isoformic analysis 2012
- transcriptome isoform analysis
- transcriptome precision
Oct
17
RNA-Seq Assembly – Are We There Yet?
Filed Under Data Analysis, Transcriptome Assembly Tools | Leave a Comment
Transcriptomic sequence resources represent invaluable assets for research, in particular for non-model species without a sequenced genome. To date, the Next Generation Sequencing technologies 454/Roche and Illumina have been used to generate transcriptome sequence databases by RNA-Seq for more than fifty different plant species. While some of the databases were successfully used for downstream applications, such as proteomics, the assembly parameters indicate that the assemblies do not yet accurately reflect the actual plant transcriptomes. Two different assembly strategies have been used, overlap consensus based assemblers for long reads and Eulerian path/de Bruijn graph assembler for short reads.
In this review, researchers from the Heinrich Heine University, Germany discuss the challenges and solutions to the transcriptome assembly problem. A list of quality control parameters and the necessary scripts to produce them are provided.

- Schliesky S, Gowik U, Weber AP, Bräutigam A. (2012) RNA-Seq Assembly – Are We There Yet? Front Plant Sci [Epub ahead of print]. [article]
Incoming search terms:
- trans-abyss tutorial
- reference based transcriptome assembly
- Next-generation transcriptome assembly
Oct
17
RNA-Seq with nanopores – a reality
Filed Under Methods | Leave a Comment
Protein nanopores are under investigation as key components of rapid, low-cost platforms to sequence DNA molecules. Previously, it has been shown that the α-hemolysin (αHL) nanopore contains three recognition sites, capable of discriminating between individual DNA bases when oligonucleotides are immobilized within the nanopore. However, the direct sequencing of RNA is also of critical importance.
Now, researchers at Oxford have achieved sharply defined current distributions that enable clear discrimination of the four nucleobases, guanine, cytosine, adenine and uracil, in RNA. Further, the modified bases, inosine, N6-methyladenosine and N5-methylcytosine, can be distinguished.
Ayub M, Bayley H. (2012) Individual RNA Base Recognition in Immobilized Oligonucleotides using a Protein Nanopore. Nano Lett [Epub ahead of print]. [abstract]
Incoming search terms:
- oxford nanopore ashg
- nanopore ppt
- nanopore rna sequencing
- nanopore rna transcription
- nanopore sequencing RNA
- RNA sequencing by nanopore
- transcript of nanopore sequencing
Oct
11
RNA-Seq Posters at SFN 2012
Filed Under Events | Leave a Comment
Incoming search terms:
- microarray poster sfn
- seq poster 2012
Oct
10
RNA-Seq Presentations at SFN 2012
Filed Under Events | Leave a Comment
Nanosymposium – 228. Genomic Approaches to Neural Function – Sun, Oct 14, 1:00 – 3:00 PM
*A. ANTUNES-MARTINS, J. R. PERKINS, J. M. DAWES, M. CALVO, J. GRIST, W. RUST, R. SCHMID, T. HILDEBRANDT, C. ORENGO, S. B. MCMAHON, D. L. BENNETT;
Nanosymposium – 725. Neuropathic Pain: Mechanisms and Models – Wed, Oct 17, 8:00 – 10:15 AM
*S. CLOKIE, S. GOSWAMI, G. GONNELLA, H. KOMINSKY, K. KASZAS, S. MISHRA, E. LEBOVITZ, M. HOON, M. J. IADAROLA;
Oct
9
How-to/RNASeq analysis
Filed Under Information, Methods | Leave a Comment
This guide is meant to offer an easy to follow guide to the analysis of RNA-seq data, aimed at those without any prior experience analyzing next-gen data. However, a basic level of familiarity with R, the next-gen sequencing procedures and using the unix shell are assumed. It was primarily written by Matthew Young (myoung@wehi.edu.au) and is a work in progress. Most of the steps described here are outlined in our review article which can be cited if people are using this guide in their work… The pathogen example was provided by B. Usadel and makes use of a different set of tools.
http://seqanswers.com/wiki/How-to/RNASeq_analysis
Incoming search terms:
- after RNA-Seq Analysis
- can you analyse mirna with cufflinks
- how to concentrate mirna prior to rna seq
Oct
8
Shallow RNA-Seq – 400 samples per lane
Filed Under Expression and Quantification, Other Tools | Leave a Comment
A researcher at University of California Davis questions the basic assumption that deeper sequencing is better. They utilized the Shannon Entropy approach to estimate the information contained within a transcriptomics experiment and tested the ability of shallow RNA-seq to capture the majority of this information. Shannon Entropy is a sensitive tool used to estimate the diversity of a system.
Their analysis showed that it was possible to capture nearly all of the network or genomic information present in a variety of transcriptomics experiments using a subset of the most abundant 5000 transcripts or less within any given sample.
Given that most modern sequencing technologies are conservatively giving 100 million reads per lane, this would suggest that multiplexing of up to several hundred samples per lane would still allow for over 90% of the transcriptomic information content to be obtained in each sample.
- Kliebenstein DJ. (2012) Exploring the shallow end; estimating information content in transcriptomics studies. Front Plant Sci. 3:213. Epub 2012 Sep 10. [article]
Incoming search terms:
- multiplexing principle NGS
- paired end sequencing artemis
- rna seq quantification using fold change
- illumina miseq ppt
- lanes in rna seq
- maize rna-seq samples per lane
- RNA-seq shannon entropy
Oct
8
Introns should not be ignored!
Filed Under Publications | Leave a Comment
A team led by researchers at The St. Laurent Institute present data that challenges the notion that RNAs produced from the vast expanse of intronic space are just pieces of pre-mRNAs or excised introns en route to degradation. The function of RNA from the non-coding regions of the genome has been a subject of considerable recent debate. Perhaps the most controversy is regarding the function of RNAs found in introns of annotated transcripts, where most of the reads that map outside of exons are usually found.
By performing a highly quantitative RNA-seq analysis of transcriptome changes during an inflammation time course, they show that intronic RNAs have a number of features that would be expected from functional, standalone RNA species. They suggest that the sequences encoded in the introns appear to harbor a yet unexplored reservoir of novel, functional RNAs. As such, they should not be ignored in surveys of functional transcripts or other genomic studies.
- St Laurent G 3rd, Shtokalo D, Tackett M, Yang Z, Eremina T, Wahlestedt C, Inchima SU, Seilheimer B, McCaffrey TA, Kapranov P. (2012) Intronic RNAs constitute the major fraction of the non-coding RNA in mammalian cells. BMC Genomics. 13(1), 504. [article]
Incoming search terms:
- RNA-seq intron
- differential intron degradation
- rna-seq intron reads
- rna-seq introns?
- rnaSeq intron
- rnaseq reduce the intronic rna
- what is the size of intron in RNASeq
Oct
5
SERE – Single-parameter quality control and sample comparison for RNA-Seq
Filed Under Other Tools | Leave a Comment
The Simple Error Ratio Estimate (SERE) is a single-parameter test procedure for count data that can determine whether two RNA-Seq libraries are faithful replicates or globally different.
- Interpretation of SERE is unambiguous regardless of the total read count or the range of expression differences among bins (exons or genes), a score of 1 indicating faithful replication (i.e., samples are affected only by Poisson variation of individual counts), a score of 0 indicating data duplication, and scores >1 corresponding to true global differences between RNA-Seq libraries.
- On the contrary the interpretation of Pearson’s r is generally ambiguous and highly dependent on sequencing depth and the range of expression levels inherent to the sample (difference between lowest and highest bin count).
- Cohen’s simple Kappa results are also ambiguous and are highly dependent on the choice of bins.
For quantifying global sample differences SERE performs similarly to a measure based on the negative binomial distribution yet is simpler to compute. SERE can therefore serve as a straightforward and reliable statistical procedure for the global assessment of pairs or large groups of RNA-Seq datasets by a single statistical parameter.
Availability – ????????
- Schulze SK, Kanwar R, Gölzenleuchter M, Therneau TM, Beutler AS. (2012) SERE: Single-parameter quality control and sample comparison for RNA-Seq. BMC Genomics13(1), 524. [abstract]
Incoming search terms:
- rna seq quality parameter
Oct
5
The Bamboo Transcriptome
Filed Under Transcriptome Sequenced | Leave a Comment
Bamboo occupies an important phylogenetic node in the grass family with remarkable sizes, woodiness and a striking life history. However, limited genetic research has focused on bamboo partially because of the lack of genomic resources. Now, a team led by researchers at the Chinese Academy of Forestry in Beijing performed de novo transcriptome sequencing for the first time to produce a comprehensive dataset for the Ma bamboo (Dendrocalamus latiflorus Munro). Read more
Incoming search terms:
- bamboo pipeline
- big node bamboo










