Comprehensive analysis of molecular pathology requires a collection of reference samples representing normal tissues from healthy donors. For the available limited collections of normal tissues from postmortal donors, there is a problem of data incompatibility, as different datasets generated using different experimental platforms often cannot be merged in a single panel.
Here, a team led by researchers at Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry constructed and deposited the gene expression database of normal human tissues based on uniformly screened original sequencing data. In total, 142 solid tissue samples representing 20 organs were taken from post-mortal human healthy donors of different age killed in road accidents no later than 36 hours after death. Blood samples were taken from 17 healthy volunteers. The researchers then compared them with the 758 transcriptomic profiles taken from the other databases. They found that overall 463 biosamples showed tissue-specific rather than platform- or database-specific clustering and could be aggregated in a single database termed Oncobox Atlas of Normal Tissue Expression (ANTE). This data will be useful to all those working with the analysis of human gene expression.
The hierarchical clustering dendrogram of all experimental
RNA sequencing profiles of human tissues
Gene expression data were used to calculate Euclidian distances between the samples. Color indicates the sample preparation method (tissue in FFPE, RNA in ethanol, tissue in RNAlater). The lower scale indicates the number of uniquely mapped reads. QC denotes the quality control threshold of 2.5 million uniquely mapped reads.