Next generation sequencing (NGS)/deep sequencing has become an important tool in the study of viruses. The use of unique molecular identifiers (UMI) can overcome the limitations of PCR errors and PCR-mediated recombination and reveal the true sampling depth of a viral population being sequenced in an NGS experiment. This approach of enhanced sequence data represents an ideal tool to study both high and low abundance drug resistance mutations and more generally to explore the genetic structure of viral populations. Central to the use of the UMI/Primer ID approach is the creation of a template consensus sequence (TCS) for each genome sequenced. Here we describe a series of experiments to validate several aspects of the Multiplexed Primer ID (MPID) sequencing approach using the MiSeq platform.
Researchers from the University of North Carolina at Chapel Hill have evaluated how multiplexing of cDNA synthesis and amplicons affects the sampling depth of the viral population for each individual cDNA and amplicon to understand the relationship between broader genome coverage versus maximal sequencing depth. The researchers have validated reproducibility of the MPID assay in the detection of minority mutations in viral genomes. They have also examined the determinants that allow sequencing reads of PCR recombinants to contaminate the final TCS data set and show how such contamination can be limited. Finally, they provide several examples where they have applied MPID to analyze features of minority variants and describe limits on their detection in viral populations of HIV-1 and SARS-CoV-2 to demonstrate the generalizable utility of this approach with any RNA virus.