Apr
16
Marina – for identifying over-represented plant transcription factor binding sites
Filed Under Unspliced Mapping Tools | Leave a Comment
From initial seed germination through reproduction, plants continuously reprogram their transcriptional repertoire to facilitate growth and development. This dynamic is mediated by a diverse but inextricably-linked catalog of regulatory proteins called transcription factors (TFs). Statistically quantifying TF binding site (TFBS) abundance in promoters of differentially expressed genes can be used to identify binding site patterns in promoters that are closely related to stress-response. Output from today’s transcriptomic assays necessitates statistically-oriented software to handle large promoter-sequence sets in a computationally tractable fashion.
Researchers at NIH have developed Marina, an open-source software for identifying over-represented TFBSs from amongst large sets of promoter sequences, using an ensemble of 7 statistical metrics and binding-site profiles. Through software comparison, they show that Marina can identify considerably more over-represented plant TFBSs compared to a popular software alternative.
Marina was used to identify over-represented TFBSs in a two time-point RNA-Seq study exploring the transcriptomic interplay between soybean (Glycine max) and soybean rust (Phakopsora pachyrhizi). Marina identified numerous abundant TFBSs recognized by transcription factors that are associated with defense-response such as WRKY, HY5 and MYB2. Comparing results from Marina to that of a popular software alternative suggests that regardless of the number of promoter-sequences, Marina is able to identify significantly more over-represented TFBSs.
Availability: http://mason.gmu.edu/~phossein/marina
- Hosseini P, Ovcharenko I, Matthews BF. (2013) Using an ensemble of statistical metrics to quantify large sets of plant transcription factor binding sites. Plant Methods 9(1), 12. [Epub ahead of print]. [article]
Incoming search terms:
- glycine max pawer point
- rna-seq group
- transcription factor binding site prediction
- chip seq mapping reads
- rna seq nih
- plant transcription factors prediction software
- novoalign hadoop mapreduce
- identification of over represented transcription factor binding sites in RNAseq data
- identification of over represented transcription factor binding sites in RNA
- cloning vector reference tophat2
Mar
18
Methods to study Event/Isoform Expression and Alternative Splicing from RNA-Seq
Filed Under Analysis Pipelines, Expression and Quantification, Other Tools, Pathway Analysis, Splicing and Junction Mapping, Transcriptome Assembly Tools, Unspliced Mapping Tools | Leave a Comment
The RegulatoryGenomics website posts and updates a comprehensive list of tools for RNA-Seq analysis.
This is their current version.
|
Spliced-mappers |
||
|
Method |
Reference |
Web-site |
|
TopHap |
(Trapnell et al. 2009) |
|
|
MapSplice |
(Wang et al. 2010) |
|
|
SpliceMap |
(Auger et al. 2010) |
|
|
HMMSplicer |
(Dimon et al. 2010) |
|
|
TrueSight |
(Li et al. 2012b) |
|
|
SOAPsplice |
(Huang et al. 2011) |
|
|
PASSion |
(Zhang et al. 2012) |
|
|
PALMapper |
(Jean et al. 2010) |
|
|
SplitSeek |
(Ameur et al. 2010) |
|
|
Supersplat |
(Bryant et al. 2010) |
|
|
SeqSaw |
(Wang et al. 2011) |
http://bioinfo.au.tsinghua.edu.cn/software/seqsaw |
|
MapNext |
(Bao et al. 2009) |
|
|
STAR |
(Dobin et al. 2012) |
|
|
GSNAP |
(Wu et al. 2010) |
|
|
QPALMA |
(De Bona et al. 2008) |
|
|
OSA |
(Hu et al. 2012) |
|
| Read more | ||
Incoming search terms:
- pathyway analysis for rna seq data
- statistical methods for differential pathway activities
- star splice junctions
- solas rna analysis
- scarf file rna
- rnaseq alternative splicing trinity
- rna seq alternative splicing method
- alternative splicing expression
- MethodstostudyEvent/IsoformExpressionandAlternativeSplicingfromRNA-Seq|RNA-SeqBlog
- junction map mrna deep sequencing
Mar
4
Accurate identification and analysis of human mRNA isoforms using deep long read sequencing
Filed Under Splicing and Junction Mapping, Unspliced Mapping Tools | Leave a Comment
Precise identification of RNA-coding regions and transcriptomes of eukaryotes is a significant problem in biology. Currently, eukaryote transcriptomes are analyzed using deep short-read sequencing experiments of complementary DNAs. The resulting short-reads are then aligned against a genome and annotated junctions to infer biological meaning.
Here, scientists at Stanford University use long-read complementary DNA datasets for the analysis of a eukaryotic transcriptome and generate two large datasets in the human K562 and HeLa S3 cell lines. Both data sets comprised at least 4 million reads and had median read lengths greater than 500 bp. They show that annotation-independent alignments of these reads provide partial gene structures that are very much in-line with annotated gene structures, 15% of which have not been obtained in a previous analysis of short reads. For long-noncoding RNAs (lncRNA) genes, however, they find an increased fraction of novel gene structures among our alignments. Other important aspects of transcriptome analysis, such as the description of cell type-speciewly sequenced genomes. Furthermore, they demonstrate that long read sequence can be assembled into full-length transcripts with considerable success. This method is applicable to all long read sequencing technologies.
Tilgner H, Raha D, Habegger L, Mohiuddin M, Gerstein M, Snyder M. (2013) Accurate identification and analysis of human mRNA isoforms using deep long read sequencing. G3 (Bethesda) 3(3), 387-97. [abstract]
Incoming search terms:
- unspliced read alignment
- using trinity assembler for virus discovery in human rna seq samples
- accurate identi␣cation and analysis of human mrna isoforms using deep long read sequencing
- fusion transcripts search OR detection tools
- using trinity assembler for virus discovery in human rna seq
Feb
28
RECOT – Read Coordinate Transformer
Filed Under Other Tools, Unspliced Mapping Tools | Leave a Comment
The whole-genome sequences of many non-model organisms have recently been determined. Using these genome sequences, next-generation sequencing based experiments such as RNA-Seq and ChIP-seq have been performed and comparisons of the experiments between related species have provided new knowledge about evolution and biological processes. Although these comparisons require transformation of the genome coordinates of the reads between the species, current software tools are not suitable to convert the massive numbers of reads to the corresponding coordinates of other species’ genomes.
Now, researchers at Ochanomizu University and the Tokyo Institute of Technology, Japan have developed a set of programs, called REad COordinate Transformer (RECOT), which is useful to compare RNA-seq, ChIP-seq and CLIP-seq sequences between closely-related species. RECOT can be used to transform the coordinates of short reads obtained from the genome of a query species being studied to that of a comparison target species after aligning the query and target gene/genome sequences. RECOT generates output in SAM format that can be viewed using recent genome browsers capable of displaying next-generation sequencing data. RECOT
They demonstrate the usefulness of RECOT in comparing ChIP-seq results between two closely-related fruit flies. The results indicate position changes of a transcription factor binding site caused sequence polymorphisms at the binding site.
Availability – RECOT is available at: http://sesejun.github.com/recot/
Izawa A, Sese J. (2013) A tool for the coordinate transformation of next-generation sequencing reads for comparative genomics and transcriptomics. Source Code Biol Med 8(1), 6. [Epub ahead of print]. [abstract]
Incoming search terms:
- transcriptoma ppt
- transcriptoma theme ppt
- Transcriptomic analysis
- RNA sequencing description
- truseq rna
- rna-seq tophat transcriptome
- RNA sequencing process
- powerpoint on transcriptomics
- sage rna analysis
- transriptomcs ppt
Jan
18
Bioinformatics – Next Gen Sequencing – Virtual Issue
Filed Under Analysis Pipelines, Data Analysis, Expression and Quantification, Other Tools, Splicing and Junction Mapping, Transcriptome Assembly Tools, Unspliced Mapping Tools | 1 Comment
Bioinformatics has published a Next-Gen Sequencing “Virtual Issue” covering all the sequencing tools that appeared in the journal. We have listed those described as applicable to RNA-Seq.
Statistical Inferences for Isoform Expression in RNA-Seq.
Hui Jiang and Wing Wong
Bioinformatics (2009) 25: 1026–1032 Full Text
A toolkit for analysing large-scale plant small RNA datasets
Simon Moxon et al.
Bioinformatics (2008) 24: 2252-2253 Full Text
TopHat: discovering splice junctions with RNA-Seq
Cole Trapnell et al.
Bioinformatics (2009) 25: 1105–1111 Full Text
Incoming search terms:
- top hat rna-seq
- ion torrent tophat2 mapping error
- cummerbund rna
- how to use tophat
- edger tutorial
- tophat rnaseq
- edgeR的使用 RNA
- edger
- tophat fusion
- tophat multiple mapped
Nov
12
RNA-Seq Alignment Tools
Filed Under Splicing and Junction Mapping, Transcriptome Assembly Tools, Unspliced Mapping Tools | 1 Comment
Incoming search terms:
- rna-seq alignment tool
- plant SNPs RNAseq ppt
- fastq RNA-seq ppt
- rnaseq aligment
- rna seq tools alignment
- rna seq snp calling tool
- alignment tools for rna seq
- best rna-seq align
- alignment tools rna-seq
- alignment tools of rna-seq
May
22
OSA – A fast and accurate alignment tool for RNA-Seq
Filed Under Unspliced Mapping Tools | Leave a Comment
Accurately mapping RNA-Seq reads to the reference genome is a critical step for performing downstream analysis such as transcript assembly, isoform detection and quantification. Many tools have been developed, however, given the huge size of the next generation sequencing (NGS) datasets and the complexity of the transcriptome, RNA-Seq read mapping remains a challenge with the ever-increasing amount of data.
OSA (Omicsoft Sequence Aligner) is a fast and accurate alignment tool for RNA-Seq data. Benchmarked with existing methods, OSA improves mapping speed 4-10 fold with better sensitivity and less false positives.
Availability: OSA can be downloaded from http://omicsoft.com/osa. It is free to academic users. OSA has been tested extensively on Linux, Mac OS X and Windows platforms.
- Hu J, Ge H, Newman M, Liu K. (2012) OSA: A fast and accurate alignment tool for RNA-Seq. Bioinformatics [Epub ahead of print]. [abstract]
Incoming search terms:
- omicsoft OSA
- bfast map rnaseq
- osa rna-seq aligner
- rna sequencing mapping tool
May
8
Introducing Bulked Segregant RNA-Seq (BSR-Seq)
Filed Under Unspliced Mapping Tools | Leave a Comment
Bulked segregant analysis (BSA) is an efficient method to map genes responsible for mutant phenotypes. BSR-Seq makes use of RNA-Seq reads to efficiently map genes even in populations for which no polymorphic markers have been previously identified.
BSR-Seq provides not only the map position of a gene responsible for a mutant phenotype but also the effects of such a mutant on global patterns of gene expression. The expression patterns of genes within the mapping interval can be used to prioritize candidate genes based on the fact that the causal gene will often be down-regulated in the mutant pool as compared to the non-mutant pool. In addition, this strategy yields a collection of polymorphic SNPs that are tightly linked to the mutant. These SNPs could be used to fine map the mutant or clone the affected gene via chromosome walking. Hence, BSR-Seq is not only an efficient strategy for mapping genes, but also yields other data that facilitate gene cloning. Read more
Incoming search terms:
- bulked segregant analysis
- BSR-Seq
- BSA Rnaseq
- bulk rna-seq
- bulk segregant analysis rna seq
- bulked segregant rna-seq
- bulked segregant rna seq
- bulk segregation analysis duplicated genes
- bulk segregant analysis ppt
- what is bulked segregant analysis
Feb
9
Novoalign is a short-read mapper designed to be fast and sensitive on small to large genome databases. It’s primary design is based on the use of read quality information and the need to assemble genomes from resequencing experiments. Novoalign supports fragment, paired-end and mate-pair reads from major sequencing platforms such as Illumina, SOLiD and Roche 454. NovoalignCS is the version of Novoalign developed for SOLiD colourspace reads.
The full version of Novoalign is a commercial product but a version of Novoalign with some limits to functionality is available for free at: http://www.novocraft.com/main/downloadpage.php
Incoming search terms:
- novoalign
- novoalign RNAseq
- novoalign rna seq
- novoaligncs solid seqanswers
- novoaligner
- novealign
- Novoalinger
- Pac bio sequence alignment software novoalign
- RNA-seq Novoalighn
- similar to novoalign
Oct
25
perEditor: Mapping personal functional data to personal genomes
Filed Under Unspliced Mapping Tools | Leave a Comment
The sequencing of personal genomes enabled analysis of variation in transcription factor (TF) binding, chromatin structure, and gene expression and indicated how they contribute to phenotypic variation. It is hypothesized that using the reference genome for mapping ChIP-seq or RNA-seq reads may introduce errors, especially at polymorphic genomic regions.
Researchers at the University of Illinois at Urbana-Champaign have developed a Personal Genome Editor (perEditor) that changes the reference human genome (NCBI36/hg18) into an individual genome, taking into account single nucleotide polymorphisms (SNPs), insertions and deletions, copy number variation, and chromosomal rearrangements. perEditor outputs two alleles (maternal, paternal) of the individual genome that is ready for mapping ChIP-seq and RNA-seq reads, and enabling the analyses of allele specific binding, chromatin structure, and gene expression.
AVAILABILITY: perEditor is available at http://biocomp.bioen.uiuc.edu/perEditor.
CONTACT: szhong@illinois.edu.
- Rivas-Astroza M, Xie D, Cao X, Zhong S. (2011) Mapping personal functional data to personal genomes. Bioinformatics [Epub ahead of print]. [abstract]
Incoming search terms:
- coverage vs depth in RNA-Seq
Jun
30
SeqGene – general purpose software for mining post-aligment RNA-Seq datasets
Filed Under Expression and Quantification, Splicing and Junction Mapping, Unspliced Mapping Tools | Leave a Comment
SeqGene is an open-source software for mining next-gen sequencing datasets, focusing on post-alignment quality control, SNP and indel identification and annotation, RNA expression quantification, allele specific expression, and expression-genotying association analysis. SeqGene is especially suited for RNA-seq and exonome-seq applications, with focus on protein coding and regulatory regions of a genome. For RNA-seq applications, SeqGene implemented a novel topology-based pathway analysis method to identify SNP-Expression co-enrichment and SNP-Expression paths. Read more
Incoming search terms:
- seqgene
- RNA-seq_pipeline pdf
- RNA seq data mining
- rnaseq data to pathways
- RNA seq for allele mining
- cut sequence mining software
- mining of RNAseq dataset in arabidopsis
- how to mine rnaseq data
- Herpès virus des bivalves
- general purpose software
Apr
13
SAMMate 2.5 – a GUI tool for isoform quantification and visualization using RNA-seq
Filed Under Unspliced Mapping Tools | Leave a Comment
SAMMate, a Graphical User Interface (GUI) RNA-seq analysis pipeline, allows biomedical researchers to quickly process SAM/BAM files and is compatible with both single-end and paired-end sequencing technologies. SAMMate automates some of more standard procedures in RNA-seq analysis. Read more
Incoming search terms:
- rna-seq visualization
- genepattern rna-seq tutorial
- deseq gui
- raw read counts sam
- cufflink gui
- DESEQ GRAPHICAL USER INTERFACE
- rna seq analysis in graphical user interface
- rna seq analysis using read count
- sammate lncrna
Mar
1
SeqMap – A Tool For Mapping Millions Of Short Sequences To The Genome
Filed Under Unspliced Mapping Tools | Leave a Comment
SeqMap is a tool for identifying viral integration sites from LAM-PCR and LM-PCR analysis. The tool will extract vector sequence data then search existing genome databases for matches to the unique sequences generated by the LAM or LM-PCR reaction. SeqMap displays the vector insertion site graphically, showing the chromosome location and distance to surrounding genes. The tool also allows you to organize your data and make notations. Read more
Incoming search terms:
- seqmap
- bam to seqmap
- lam pcr mapping
- mapping webtool short sequences in a genome
- seqmap how to
- seqmap manual
- use seqmap


.png)









