NorahDeskNorahDesk reconstructs full-length putative ncRNA transcripts from short sequence reads by hybridizing contigs. It analyzes not only the distinct read distribution of true ncRNA classes in an unbiased way but also utilizes secondary structures as an independent confirmation source to reliably predict ncRNA from deep sequencing data.

Using publicly available mouse sequence data from brain, skeletal muscle, testis and ovary, NorahDesk was evaluated with an emphasis on the performance for microRNAs (miRNAs) and piwi-interacting small RNA (piRNA). This method was also compared with Dario and mirDeep2 and found to produce longer transcripts with higher read coverage. This feature makes it the first method particularly suitable for the prediction of both known and novel piRNAs.

NorahDesk and the mouse small ncRNA annotation file in BED format used in this study are available at http://www.bioinformatics.org.au/NorahDesk.

  • Ragan C, Mowry BJ, Bauer DC. (2012) Hybridization-based reconstruction of small non-coding RNA transcripts from deep sequencing data. Nucleic Acids Res [Epub ahead of print]. [article]

Incoming search terms:

  • NorahDesk
  • ncRNA detection tool RNA-Seq
  • rna seq non coding rna
  • rna-seq and non-coding rna
  • noncoding rna sequencing
  • rna-seq non-coding rna
  • ngs ncrna
  • rna deep sequence analyis
  • public available deep sequence data
  • long non-coding rna ppt

A team led by researchers at the Austrian Academy of Sciences has developed a protocol that includes column-free RNA preparation, ribosomal RNA depletion, RNA-hydrolysis and ds-cDNA synthesis and is compatible with the all of the major NGS platforms used to date.

They demonstrate that:

  • Ribo-depleted RNA-Seq is highly reproducible between different sequencing locations, even when using two different ribosomal RNA depletion strategies.
  • Template fragmentation by RNA-hydrolysis produces more homogenous gene coverage than cDNA shearing, and that both fragmentation methods lead to under-representation of 5′ and 3′ UTRs.
  • The use of similar template preparation protocols is critical for obtaining a comparable transcriptome.
  • RNA populations prepared by ribo-depletion allow RNA-Seq to reliably detect both the non-coding and protein-coding transcriptome, and also to identify biologically relevant gene expression differences in both of these RNA types.

Huang R, Jaritz M, Guenzl P, Vlatkovic I, Sommer A, et al. (2011) An RNA-Seq Strategy to Detect the Complete Coding and Non-Coding Transcriptome Including Full-Length Imprinted Macro ncRNAs. PLoS ONE 6(11), e27288. [article]

Incoming search terms:

  • ribo-depletion
  • RNA-Seq is a new protocol
  • rrna depletion strategies

brain activityThe researchers used a new sequencing technology called RNA-seq, a technology related to the latest DNA sequencers used to decode our genomes, to map gene activity in the different layers of the mouse cerebral cortex.

The technique is also able to detect ‘non-coding RNAs’, ie RNAs produced from DNA in between known genes that doesn’t code for proteins but may play a critical role in regulating genes and controlling biological processes.

‘We see a vast array of non-coding RNAs – hundreds that have never been seen before, but presumably have a biological role to play in the brain,’ says Professor Molnár. ‘One of the most abundant RNAs produced in the mouse brain is a noncoding RNA.’

The approach also reveals RNAs which, once read off from our DNA code, are stitched together in different ways through a process called ‘alternative splicing’. The process results in different proteins that can have different biological roles, despite coming from the same gene.

NIH Press Release

Oxford University Press Release

  • T. Grant Belgard, Ana C. Marques, Peter L. Oliver, Hatice Ozel Abaan, Tamara M. Sirey, Anna Hoerder-Suabedissen, Fernando García-Moreno, Zoltán Molnár, Elliott H. Margulies, Chris P. Ponting (2011) A Transcriptomic Atlas of Mouse Neocortical Layers. Neuron 71(4),  605-16. [abstract]

Incoming search terms:

  • collaboration dna ppt
  • rna-seq nih grant
  • rna-seq oxford

Thanks to the new technological advances in genome and transcriptome sequencing, we know that a major portion of the genome is being transcribed but only a small portion of this transcriptome contains the protein-coding sequences. The remainder can be split (at approximately 100 nucleotide length) into two major categories: small non-coding RNA (including microRNA) and long non-coding RNA, both of which have been shown to exert regulatory control over protein production/expression. 

Interestingly, these two types of non-coding RNA exert their control by very different mechanisms.  Small RNAs regulate gene expression predominantly through reduction of mRNA levels and subsequent reduced protein output1 (negative regulation), while long non-coding RNAs increase the synthesis of nearby proteins2 (positive regulation).

(read more about negative regulation by small non-coding RNAs… )

(read more about positive regulation by long non-coding RNAs…

  1. Guo H, Ingolia NT, Weissman JS, Bartel DP. (2010) Mammalian microRNAs predominantly act to decrease target mRNA levels. Nature 466(7308), 835-40. [abstract]
  2. Orom UA, Derrien T, Beringer M, Gumireddy K, Gardini A, Bussotti G, Lai F, Zytnicki M, Notredame C, Huang Q, Guigo R, Shiekhattar R. (2010) Long Noncoding RNAs with Enhancer-like Function in Human Cells. Cell143(1), 46-58. [abstract]

Incoming search terms:

  • long noncoding rna sequencing

  • Social Networking Pages

    Linkedin Group

  • Follow Me on Pinterest
  • RSS SEQanswers – RNA Sequencing

    • TopHat extremely low paired mapping rate. PLS HELP! May 22, 2013
      Hey guys, I have some problems with my paried-end RNA seq analysis on Galaxy. As you can see in the bam flagstat output, my tophat alignment rate is... […]
      Felix.Lee
    • Identifying small RNA sequence within whole genome sequence May 21, 2013
      Hi all, I want to know if there are any useful bioinformatic tool to find small RNA sequence within a whole bacteria genome. Thank you in... […]
      Inma
    • standard of clean data May 21, 2013
      Hi all I recently got my prokaryotes RNA-seq data report back. the standard filter steps of the raw data set by our local sequencing center is as... […]
      Pengfei Liu
    • Problem with cummeRbund diffData() May 20, 2013
      Hi all, I'm running Tophat/cufflinks/cuffdiff for differential gene expression and analysis with cummeRbund (v 2.0.0). I'm having an issue with... […]
      Enrique Zudaire
    • How to increase rowsize in heatmap? May 16, 2013
      Hi, I am a complete newbie to all things cummeRbund and am currently fighting with generating readable heatmaps. When I use ... […]
      Mags
    • novoalign mapping May 15, 2013
      Hi, I want to use novoalign to map reads - allowing up to 15 mismatches for 100 bp paired-end reads I am new to novoalign(went through the... […]
      abh
  • RSS Biostar – RNA-Seq

    • Why am I getting so many unmapped reads in STAR, classified as "too short"?
      I am currently using STAR to map several Hi-SEQ mRNA runs. I'm having trouble getting a decent amount of reads to map, but I don't really understand why. I'm hoping you can shed some light :) In the final log, only about 50% (or less) of the reads map to the reference. I'm using a GTF in addition to the genome. The unmapped bin that most […]
    • What are the best practices for SNP identification in RNA seq transcriptome data
      I have 20 RICE RNA seq tranascriptome data hiseq 2000 platform paired end reads. I aligned fasta reads with BWA and remove PCR duplicates with PICARD. Later I call SNP with samtools using various parameters. I would like to clarify what parameters should I used while alinging to reference rice genome for looking SNP location 100 bp upstream and 250 bp downst […]
    • How do TopHat options -g , --supress-hits, and Bowtie options interplay?
      Hi, I am currently using TopHat2 to map RNA-seq runs. I think there have been some changes pertaining the -g option. Does anyone know how it works now? I used to think that setting -g would look for n alignments for a given read, report them [if top-scoring] and discard those reads that had more than g [top scoring] alignments. Now, the description sounds mo […]
    • What happened to -k in TopHat for multiple-mapping reads?
      Selecting -g n in tophat does not discard reads mapping more than n, but instead only reports n alignments for those out all all their TOP scoring alignments. I think there used to be an option -k that would allow one to discard reads that topped x alignments -- whatever happened to that? I only see -g in the tophat 2 manual, no reporting options like before […]
    • Does tophat use the library-type information for mapping, or just for the XS flag?
      When I specify library-type to TopHat, i.e., first-strand, second-strand, unstranded, TopHat appends a value + or - to the XS:A flag, which is useful for subsequent analyses, such as annotation. However, does this information actually influence the "mappability" of reads, or is this unaffected? My thinking is that the information would be considere […]
    • Purpose of Y-shaped adapters in Illumina Sequencing?
      Hi all, Y adapters different sequences to be annealed to the 5' and 3' ends of each molecule in a library. The arms of the Y are unique, and the middle part, connected to the DNA fragment, is complementary. What are the advantages of this? My take of this over having fully-complementary adapters (ADAPTER1 - - - - - ADAPTER1) is that: -Upon primer a […]