Researchers have developed a large-scale sequencing technique called Genome and Transcriptome Sequencing (G&T-seq) that reveals, simultaneously, the unique genome sequence of a single cell and the activity of genes within that single cell.
The study, published today in Nature Methods, has experimentally established for the first time that when a cell loses or gains a copy of a chromosome during cell division, the genes in that particular region of DNA show decreased or increased expression. While this has long been assumed by genetic researchers, it has not been seen before.
The DNA code that makes up our genome can be found in every cell of our body. This is decoded into a group of molecules in our cells known as RNA, which provide instructions for that particular cell’s proteins to perform their particular function. Transcriptome sequencing measures the amount of each RNA molecule in a cell and provides an insight into its function that cannot be gained by looking at the DNA.
Scientists can now see a cell’s DNA, including the mutational damage it has sustained on its journey from the fertilised egg, alongside the RNA of the same cell, which enacts all the DNA’s instructions, even the errors.
In G&T-seq, genomic DNA and poly(A)+ mRNA are physically separated after cell lysis and then undergo separate amplification, library preparation and sequencing. Amplification of poly(A)+ mRNA is done with a modified Smart-seq2 method, whereas DNA can be amplified by any method; here, MDA and PicoPlex were chosen to suit the downstream application.
“By looking at the genome and transcriptome in parallel in a single cell, we can begin to see what the functional consequences of genetic variations are,” explains Professor Thierry Voet, a corresponding author from KU Leuven and the Wellcome Trust Sanger Institute. “This technology will enable us to probe for genetic heterogeneity in normal and diseased tissues deeper than ever before providing novel understanding of normal and disease development; for instance it will help us to better understand the diversity of cancer cells that arise within a tumour, and the difficulty of treating the disease.”
G&T-seq is unique in the field as it allows high throughput DNA-and RNA-sequencing from single cells in parallel on a diverse range of sequencers, while previous methods were limited to the interrogation of either the DNA or the RNA of a cell, but not both.
” This kind of integrated analysis allows us a more complete understanding of the extent and evolution of cellular heterogeneity in normal development and disease processes. “
“The potential to scale-up this method is one of the most exciting elements of this research,” says Dr Iain Macaulay, first author and a corresponding author from the Sanger Institute. “To meaningfully grasp the heterogeneity of cells in a tissue, we need to be able to look at genome and transcriptome data for hundreds or thousands of individual cells at a time. As the cost of sequencing continues to drop, these large-scale projects will become more feasible and we will begin to see, in unprecedented detail, the diversity and life-history of cells that form human tissue.”
To test the method, researchers sequenced two different cell lines derived from the same person; one from breast cancer and one from normal blood cells. The results from the normal cells showed, as expected, that the vast majority had two copies of each chromosome. However, many of the cancer cells were missing or had gained pieces of chromosomes, and their transcriptomes were significantly different from the normal cells.
The researchers also spotted a previously unobserved cancer-associated chromosomal fusion in the breast cancer cell line. After seeing abnormalities in the RNA data, the transcriptomes of three of the cells were further analysed using Pacific Biosciences long-read sequencing machines. These readings showed the complete sequence of a fusion transcript arising from the fusion of two genes on different chromosomes. Because the genomes of the same single cells were sequenced in parallel, the position where two different chromosomes had fused, generating this new transcript, could be detected.
An unexpected result in the parallel sequence data of the normal blood cell line demonstrates the potential of this method to see detail that was previously invisible. While the data for most cells was similar, four cells were discovered to have three copies of chromosome 11 each; this was reflected in the RNA expression data, which peaks at chromosome 11. With traditional sequencing, these cells would go unnoticed but scientists are now equipped to ask what effect minute changes like these have on the cells.
G&T-seq is currently being used to better understand early embryonic development. In this study, normal early-stage mouse embryos were compared with mouse embryos treated with a chemical that induces chromosomal missegregation. Once again, the method showed that when a cell gains or loses a chromosome, even after a single cell division, changes in gene expression dosage can immediately be seen.
“Using this method, we’ve been able to reveal cellular properties that cannot be seen by DNA or RNA sequencing alone,” says Professor Chris Ponting, a corresponding author from the University of Oxford and an Associate Faculty Member at the Sanger Institute. “This kind of integrated analysis, which we hope will soon also include epigenetic data, allows us a more complete understanding of the extent and evolution of cellular heterogeneity in normal development and disease processes.”
Source – Wellcome Trust Sanger Institute