Cirsium arvenseHybridization is a prominent process among natural plant populations that can result in phenotypic novelty, heterosis, and changes in gene expression. The effects of intraspecific hybridization on F1 hybrid gene expression were investigated using parents from divergent, natural populations of Cirsium arvense, an invasive Compositae weed.

Using an RNA-seq approach, the expression of 68,746 unigenes was quantified in parents and hybrids. The expression levels of 51% of transcripts differed between parents, a majority of which had <1.25x fold-changes. More unigenes had higher expression in the invasive parent (P1) than the non-invasive parent (P2). Of those that were divergently expressed between parents, 10% showed additive and 81% showed non-additive (transgressive or dominant) modes of gene action in the hybrids. A majority of the dominant cases had P2-like expression patterns in the hybrids. Comparisons of allele-specific expression also enabled a survey of cis- and trans-regulatory effects. Cis- and trans-regulatory divergence was found at 70% and 68% of 62,281 informative SNP sites, respectively. Of the 17% of sites exhibiting both cis- and trans- effects, a majority (70%) had antagonistic regulatory interactions (cis x trans); trans-divergence tended to drive higher expression of the P1 allele whereas cis-divergence tended to increase P2 transcript abundance. Trans-effects correlated more highly than cis- with parental expression divergence and accounted for a greater proportion of the regulatory divergence at sites with additive compared to non-additive inheritance patterns. This study explores the nature of, and types of mechanisms underlying, expression changes that occur in upon intraspecific hybridization in natural populations.

  • Bell GD, Kane NC, Rieseberg LH, Adams KL. (2013) RNA-seq analysis of allele-specific expression, hybrid effects, and regulatory divergence in hybrids compared with their parents from natural populations. Genome Biol Evol [Epub ahead of print]. [abstract]

Incoming search terms:

  • cirsium adans
  • a new strategy to reduce allelic bias in rna-seq readmapping
  • rnaseq allelic variation
  • remove rrna biotin probe plos one
  • mapping bias allele specific expression
  • hybrid gene expression
  • how to look for allele specific expression
  • biases by using reference genome in allele specific expression
  • ase allele specific expression data
  • allele frequency and gene expression

In recent years RNA-Seq technology has been used not only to quantify differences in gene expression but also to understand the underlying mechanisms that lead to these differences. Nucleotide sequence variation arising through evolution may differentially affect the expression profiles of divergent species. RNA-Seq technology, combined with techniques to differentiate parental alleles and quantify their abundance, have recently become popular methods for allele specific gene expression (ASGE) analyses. However, analysis of gene expression within interspecies hybrids may be difficult when one of the two parental genomes represented in the hybrid does not have robust genomic resources or available transcriptome data. Read more

Incoming search terms:

  • rna seq allele specific expression
  • combining individuals from two populations rna-seq assembly
  • rna-seq workflow hybrid assembly strategy
  • allelic specific expression
  • allele RNA-seq data reference genome
  • what is allele specific expression?
  • what is allele-specific expression
  • expression profile allele specific
  • alleiec specific expression
  • allelic sepcific expression for rna_seq

Researchers at Harvard University have developed a rapid, cost-effective method (FREQ-Seq) that leverages Illumina next-generation sequencing for localized, quantitative allele frequency detection. Analogous to RNA-Seq, FREQ-Seq relies upon counts from the >105 reads generated per locus per time-point to determine allele frequencies. Loci of interest are directly amplified from a mixed population via two rounds of PCR using inexpensive, user-designed oligonucleotides and a bar-coded bridging primer system that can be regenerated in-house. The resulting bar-coded PCR products contain the adapters needed for Illumina sequencing, eliminating further library preparation. Read more

Incoming search terms:

  • allele frequency using rna-seq
  • from illumina read data to alllele frequency
  • make library freqseq
  • method to determine allele frequency

Accurate estimation of expression levels from RNA-Seq data entails precise mapping of the sequence reads to a reference genome. Because the standard reference genome contains only one allele at any given locus, reads overlapping polymorphic loci that carry a non-reference allele are at least one mismatch away from the reference and, hence, are less likely to be mapped. This bias in read mapping leads to inaccurate estimates of allele-specific expression (ASE).

To address this read-mapping bias, researchers at the DoD Biotechnology Software Applications Institute proposed the construction of an enhanced reference genome that includes the alternative alleles at known polymorphic loci. They show that mapping to this enhanced reference reduced the read-mapping biases, leading to more reliable estimates of ASE.

Experiments on simulated data show that the proposed strategy reduced the number of loci with mapping bias by ≥63% when compared with a previous approach that relies on masking the polymorphic loci and by ≥18% when compared with the standard approach that uses an unaltered reference. When this strategy was applied to actual RNA-Seq data, up to 15% more reads were mapped than the previous approaches and many seemingly incorrect inferences were identified.

RNA-Seq Read MappingThe executables to construct the enhanced reference genome and the Perl scripts to analyze the mapped reads are available for download from http://www.bhsai.org/downloads/ase/

  • Vijaya Satya R, Zavaljevski N, Reifman J. (2012) A new strategy to reduce allelic bias in RNA-Seq readmapping. Nucleic Acids Res [Epub ahead of print]. [article]

Incoming search terms:

  • rna seq perl scripts
  • mapping rna seq
  • bulk segregant mapping RNA-SEQ
  • Anewstrategytoreduceallelicbias inRNA-Seqreadmapping
  • rna seq reads mapped to multiple locations
  • perl rna trimming
  • perl rna seq
  • perl reads count rna seq
  • mapping rna seq reads transcripts
  • map rna-seq reads to reference

Variation in gene expression is thought to make a significant contribution to phenotypic diversity among individuals within populations. Although high-throughput cDNA sequencing offers a unique opportunity to delineate the genome-wide architecture of regulatory variation, new statistical methods need to be developed to capitalize on the wealth of information contained in RNA-seq data sets.

Researchers at the University of Washington have developed a powerful and flexible hierarchical Bayesian model that combines information across loci to allow both global and locus-specific inferences about allele-specific expression (ASE).

They applied the methodology to a large RNA-seq data set obtained in a diploid hybrid of two diverse Saccharomyces cerevisiae strains, as well as to RNA-seq data from an individual human genome. They found that their statistical framework accurately quantifies levels of ASE with specified false-discovery rates, achieving high reproducibility between independent sequencing platforms and they were able to pinpoint loci that show unusual and biologically interesting patterns of ASE, including allele-specific alternative splicing and transcription termination sites.

Availability:  R code to implement the statistical model described is available at: http://akeylab.gs.washington.edu/downloads.shtml.

  • Skelly DA, Johansson M, Madeoy J, Wakefield J, Akey JM. (2011)  A powerful and flexible statistical framework for testing hypotheses of allele-specific gene expression from RNA-seq data. Genome Research [Epub ahead of print]. [abstract]

Incoming search terms:

  • allele specific expression
  • allele specific expression rna-seq
  • allele specific gene expression rna-seq
  • how are testing companies able to pinpoint particular alleles
  • allele specific gene expression
  • allele-specific expression
  • allele-specific expression rna-seq
  • rna-seq allele specific expression
  • what is allele specific expression
  • how are testing companies able to pinpoint particular alleles?

  • Social Networking Pages

    Linkedin Group

  • Follow Me on Pinterest
  • RSS SEQanswers – RNA Sequencing

    • RNAseq (SOLiD) from 18 - 200 nt June 18, 2013
      We are interested in small non-coding RNAs. Whomever you ask about the size range of small RNAs, you get a different answer. ;) Lets assume, small... […]
      GenomicIBK
    • Unmapped ratio very high on mouse genome June 17, 2013
      Hi, My problem regards RNA-Seq data. I've downloaded public data (SAGE libs w/ 6 different samples from mouse liver ) to analyse using ArrayStudio.... […]
      le.nono
    • RNASeq: Read length different from expected June 17, 2013
      Hello all, I have received paired-end reads for 40 samples. The reads are supposed to be 100bp per end. Instead, 20 of my samples are 101bp per... […]
      gogodidi
    • How to install xgawk June 16, 2013
      Hi, This is Shrujan, i have a problem while running RNA Sequencing QC. It shows an error that xgawk is not found. So please help me installing... […]
      shrujan
    • RNA Sequencing QC Error while using with Sequence_QC.sh file June 15, 2013
      Hi, This is Shrujan kumar Madadha, I had an error while running QC for Drosophila Yukuba fastq RNA file using Sequence_QC.sh file of FASTX... […]
      shrujan
    • Cuffmerge related query June 12, 2013
      I have a query regarding what samples should be merged using cuffmerge, when you have multiple phenotypes (each with replicates). Lets say my mouse... […]
      ParthavJailwala
  • RSS Biostar – RNA-Seq

    • edgeR: very low p-value and very high variance within the group of replicates. What's my problem??
      I'm using edgeR in order to perform differential expression analysis from RNA-seq experiment. I have 6 samples of tumor cell, same tumor and same treatment: 3 patient with good prognosis and 3 patient with bad prognosis. I want to compare the gene expression among the two groups. I ran the edgeR pakage like follow: x […]
    • Normalising tag count to RPKM
      Hi! I was wondering if their is a way to normalise the number of reads in a region and the RPKM of the nearest gene to that region, so that a correlation could be computed. Like the following data shows number of tags in first column and RPKM in second column Tags RPKM 15 0.14619 11 0 203 0.2259 129 10.701 300 7.0772 122 2.3234 346 10.666 77 3.117 201 16.749 […]
    • a simple question on RNA-Seq terminology
      This question may be very simple and basic, but I just need to confirm that I understand the differences among those terminologies in the RNA-Seq context. Suppose I have a sample called SLR, and it is sequenced on 5 lanes, so I have (among other output files) BAM files like L1_SLR, L2_SLR, L3_SLR, L5_SLR and L7_SLR.bam. Here, the letter "L" denotes […]
    • FInding regions of interest with minimum coverage
      Hi, I have a bam file of all my accepted hits (tophat output) and an gtf file with my genes of interest for which I am trying to find potential antisense transcripts. I would like to create a list - preferably one that can be visualized in a genome browser - that shows all genes that have antisense reads in the accepted hits.bam file provided that there are […]
    • How to remove the intronic reads before counting
      I got RNASeq data in several samples. I checked the FastQC, seems the read quality are good (Hiseq 2000). But the problem is many reads are mapped to intronic region, and the regions have no any reference exons there (Refseq, ensembl, gencode). We don't know what they are. We guess the problem happend in library preparation, the concentration was low. N […]
    • Which strand of the mRNA molecule does the sequencer output as a "read"?
      In Illumina Stranded RNA-Seq (using the dUTP method), do the final reads in the fastq files correspond to the initial molecule (that was transcribed), or to the reverse complement of the molecule? C […]