Digital transcriptome analysis by next-generation sequencing discovers substantial mRNA variants. Variation in gene expression underlies many biological processes and holds a key to unravelling mechanism of common diseases. However, the current methods for construction of co-expression networks using overall gene expression are originally designed for microarray expression data, and they overlook a large number of variations in gene expressions.

CCATo use information on exon, genomic positional level and allele-specific expressions, researchers at Fudan University, China have developed novel component-based methods, single and bivariate canonical correlation analysis, for construction of co-expression networks with RNA-Seq data. To evaluate the performance of our methods for co-expression network inference with RNA-Seq data, they are applied to lung squamous cell cancer expression data from TCGA database and their own bipolar disorder and schizophrenia RNA-Seq study. The preliminary results demonstrate that the co-expression networks constructed by canonical correlation analysis and RNA-Seq data provide rich genetic and molecular information to gain insight into biological processes and disease mechanism. These new methods substantially outperform the current statistical methods for co-expression network construction with microarray expression data or RNA-Seq data based on overall gene expression levels.

Availability: A program for implementing the developed CCA for co-expression network construction can be downloaded from bioconductor (http://www.bioconductor.org/) and at  http://www.sph.uth.tmc.edu/hgc/faculty/xiong/index.htm

  • Hong S, Chen X, Jin L, Xiong M. (2013) Canonical correlation analysis for RNA-seq co-expression networks. Nucleic Acids Res [Epub ahead of print]. [article]

Incoming search terms:

  • coexpression network RNA-seq
  • build rna-seq network out of cuffdiff data
  • software analysis coexpression transcriptome sequencing
  • Canonical correlation analysis for RNA-seq co-expression networks
  • rna-seq cufflinks coexpression network
  • network rnaseq
  • Hong S Chen
  • Gene network from rna-seq data
  • gene expression correlation analysis
  • data correlation 분석

Comments

One Response to “Canonical correlation analysis (CCA) for RNA-Seq co-expression networks”

  1. tza on March 9th, 2013 12:52 pm

    Where can I find the package? I’m on linux and I think on their website is only the windows binary.

Leave a Reply




  • Social Networking Pages

    Linkedin Group

  • Follow Me on Pinterest
  • RSS SEQanswers – RNA Sequencing

    • How to increase rowsize in heatmap? May 16, 2013
      Hi, I am a complete newbie to all things cummeRbund and am currently fighting with generating readable heatmaps. When I use ... […]
      Mags
    • novoalign mapping May 15, 2013
      Hi, I want to use novoalign to map reads - allowing up to 15 mismatches for 100 bp paired-end reads I am new to novoalign(went through the... […]
      abh
    • Design of expt across multiple lanes May 15, 2013
      Hi, I am performing an RNA-seq experiment to look at differential expression. The design is as follows: 2 populations x 3 biological... […]
      jbono
    • RNA kinds expected in RNA-seq results May 15, 2013
      Hi, We use RNA isolation and library preparation protocols which capture polyadenylated RNA. My question is what kinds of RNA can we expect to... […]
      Kocur
    • Discrepancy between genotype and expressed alleles May 15, 2013
      Hi all, I am working on the analysis of allele-specific expression using both genotype information and RNA-seq data from the same individuals. I... […]
      RedMary
    • Does Cufflinks Give Me Trascriptomes? May 14, 2013
      Hi Everyone, I'm a beginner in this area, please forget any silly question. My situation is that I have a raw scaffold whole genome... […]
      hchang10
  • RSS Biostar – RNA-Seq

    • How do TopHat options -g , --supress-hits, and Bowtie options interplay?
      Hi, I am currently using TopHat2 to map RNA-seq runs. I think there have been some changes pertaining the -g option. Does anyone know how it works now? I used to think that setting -g would look for n alignments for a given read, report them [if top-scoring] and discard those reads that had more than g [top scoring] alignments. Now, the description sounds mo […]
    • What happened to -k in TopHat for multiple-mapping reads?
      Selecting -g n in tophat does not discard reads mapping more than n, but instead only reports n alignments for those out all all their TOP scoring alignments. I think there used to be an option -k that would allow one to discard reads that topped x alignments -- whatever happened to that? I only see -g in the tophat 2 manual, no reporting options like before […]
    • Does tophat use the library-type information for mapping, or just for the XS flag?
      When I specify library-type to TopHat, i.e., first-strand, second-strand, unstranded, TopHat appends a value + or - to the XS:A tag, which is useful for subsequent analyses, such as annotation. However, does this information influence the "mappability" of reads, or is this unaffected? My guess is that the information will be considered for mapping […]
    • Purpose of Y-shaped adapters in Illumina Sequencing?
      Hi all, Y adapters different sequences to be annealed to the 5' and 3' ends of each molecule in a library. The arms of the Y are unique, and the middle part, connected to the DNA fragment, is complementary. What are the advantages of this? My take of this over having fully-complementary adapters (ADAPTER1 - - - - - ADAPTER1) is that: -Upon primer a […]
    • Cell Type composition in a tissue based on gene marker expression
      I am not sure if the following would even make sense.... Tissues are composed of composite cell types, and often there are studies such as microarray/NGS where we perform a collective sampling of cells from these tissues. Information about the composition (say percentage of cell type) is not taken into consideration. In some case (such as brain/cancer), ther […]
    • Which SNP caller / method to use after aligning RNA-seq with TopHat
      Which SNP caller / method can / should I use after aligning RNA-seq data with TopHat? For genomic data I use GATK, but supposedly it is not just as easy as running GATK on the TopHat RNA-seq data. The team from Broad has no information / documentation on how to use GATK for RNA-seq data. I don't have any variants yet from DNA re-sequencing. […]