Accurate mapping of tRNA reads

Many repetitive DNA elements are transcribed at appreciable expression levels. Mapping the corresponding RNA sequencing reads back to a reference genome is notoriously difficult and error-prone task, however. This is in particular true if chemical modifications introduce systematic mismatches, while at the same time the genomic loci are only approximately identical, as in the case of tRNAs.

Researchers at the University Leipzig and the University of Vienna have developed a dedicated mapping strategy to handle RNA-seq reads that map to tRNAs relying on a modified target genome in which known tRNA loci are masked and instead intronless tRNA precursor sequences are appended as artificial “chromosomes”. In a first pass, reads that overlap the boundaries of mature tRNAs are extracted. In the second pass, the remaining reads are mapped to a tRNA-masked target that is augmented by representative mature tRNA sequences. Using both simulated and real life data the researchers show that their best-practice workflow removes most of the mapping artefacts introduced by simpler mapping schemes and makes it possible to reliably identify many of chemical tRNA modifications in generic small RNA-seq data. Using simulated data the FDR is only 2%. They find compelling evidence for tissue specific differences of tRNA modification patterns.

Scheme of the best-practice workflow for detecting tRNA misincorporations


The top part describes the construction of the artificial and cluster genome, the middle section refers to the mapping steps, the bottom layer describes the post-processing.

Availability – The workflow is available both as a bash script and as a Galaxy workflow from

Hoffmann A, Fallmann J, Mörl M, Stadler PF, Amman F. (2017) Accurate Mapping of tRNA Reads. Bioinformatics [Epub ahead of print]. [abstract]

Leave a Reply

Your email address will not be published. Required fields are marked *


Time limit is exhausted. Please reload CAPTCHA.