Jump to content

Draft:Drop-Seq

From Wikipedia, the free encyclopedia


Introduction

[edit]
Overview of the Drop-seq workflow

Drop-seq is a high-throughput, single-cell RNA sequencing (scRNA-seq) technology used to analyze the mRNA expression of thousands of individual cells by separating them into nanoliter-sized droplets for parallel analysis.

The team behind the development of Drop-seq was led by Dr. Evan Macosko, a Harvard Medical school instructor in psychiatry at Massachusetts General Hospital and a Stanley Neuroscience Fellow in the McCarroll Lab[1]. The limitations of earlier scRNA-Seq methods hindered the fast and scalable characterization of complex tissues, leading to the establishment of Drop-Seq[2].

This method relies on combining a single cell with a uniquely barcoded microparticle (bead) into a single droplet using a microfluidic device. Each cell is lysed within the droplet, releasing its mRNA contents, which binds to an oligo-dT-containing primer sequence on its companion microparticle, capturing polyadenylated mRNA. The primers additionally contain cell barcodes, which are used to identify the cell-of-origin of each transcript, and a unique molecular identifier (UMI), enabling the identification of PCR duplicates. The collection of mRNA-bound microparticles are reverse-transcribed in bulk to form single-cell transcriptomes attached to microparticles (STAMPs). Template switching of reverse-transcriptase is employed to introduce a PCR handle downstream of the synthesized cDNA to enable efficient cDNA amplification. Using a high-throughput sequencer, paired-end reads are generated and aligned to a reference genome to identify the cDNA gene-of-origin and grouped by their cell barcode to determine its cell-of-origin.

History

[edit]

The development of sequencing technologies, such as Sanger sequencing and Maxam–Gilbert sequencing, provided the foundation for studying genetic material. Studies of individual transcripts were performed several decades before the emergence of high-throughput sequencing technologies. For instance, in the 1970s, silkmoth mRNA libraries were collected and converted to cDNA using reverse transcriptase[3], and individual transcripts from this library were sequenced using Sanger Sequencing.

In 1976, Walter Fiers and colleagues sequenced the complete transcriptome of bacteriophage MS2[4]. This work laid the foundation for the field of transcriptomics, which is the study of the complete set of RNA transcripts within a given cell, tissue, or organism[5]. Transcriptome sequencing presents challenges due to the characteristics of RNA. The single-stranded structure of RNA causes it to be very unstable and easily degraded by RNases within cells, which requires transcriptome sequencing to be very rigorous[6]. Despite these challenges, progress in transcriptomics has advanced. The first attempt to capture a partial human transcriptome was conducted in 1991 and reported 609 mRNA sequences from the human brain using expressed sequence tags derived from cDNA libraries[7]. While the insights gained from these early studies were valuable, these methods were limited in their abilities to capture the full transcriptome.

In the late 1990s, the development of DNA microarrays enabled the examination of thousands of transcripts at once[8]. Initial studies using microarrays focused on determining differentially expressed genes between normal cells and their corresponding cancerous cells[9]. However, microarrays rely on a priori knowledge of genomic sequences, have limited sensitivity, and are prone to cross-hybridization[10]. The development of RNA-Seq in the mid 2000s enabled comprehensive transcriptome analysis, overcoming some limitations of microarrays, such as their reliance on pre-designed probes, limited sensitivity, and susceptibility of cross-hybridization[11]. This technology can provide insight into functional pathways, regulation of biological processes, alternative splicing, allele-specific expression, and novel transcripts[12].

A caveat of early RNA-seq methods is that they measured the average gene expression levels of all cells in a sample, masking cellular heterogeneity. As a result, RNA-Seq was quickly adapted for single-cell analysis, known as single-cell RNA sequencing (scRNA-Seq), enabling the study of gene expression at the resolution of individual cells[13]. By profiling the transcriptomes of single cells, scRNA-Seq enables the study of cell-type-specific gene expression patterns and rare cell types, playing a crucial role in understanding tumour heterogeneity and evolution[14]. Early scRNA-Seq methods were limited to profiling only hundreds to a few thousand cells per experiment[15][16]. These methods required cells to be separated by flow sorting or microfluidics, followed by the amplification of each cell's transcriptome individually, highlighting the need for fast and scalable approaches to characterize complex tissues under diverse conditions[2].

In 2015, Macosko et al.[2] developed a method to analyze gene expression in thousands of cells in parallel by encapsulating individual cells in nanoliter-sized droplets. Additionally, a molecular barcoding strategy was implemented to trace the cell-of-origin for each transcript, enabling pooled sequencing while maintaining single-cell resolution.

Scientific Principles or Mechanisms

[edit]

Drop-seq integrates principles of microfluidics, molecular barcoding and single-cell RNA sequencing to determine gene expression at a single-cell level. Drop-seq’s workflow can be broken down into cell and bead encapsulation, cell lysis and mRNA capture, STAMP-formation, cDNA synthesis and amplification, and bioinformatics analysis [2].

Cell and Bead Encapsulation

The protocol begins by capturing one cell and one uniquely barcoded bead together using a custom microfluidic device. This device consists of two aqueous input channels (one carrying a suspension of cells and the other containing the barcoded beads suspended in lysis buffer) that both flow into an oil channel [2]. Laminar flow prevents the mixing of the aqueous inputs before droplet formation, ensuring that every droplet only contains one cell and one bead [2].

Cell Lysis and mRNA Capture

Immediately after droplet formation, the cells are lysed, releasing mRNAs that then hybridize onto the bead’s surface at the oligo-dT primers [2]. The capture of mRNA within the aqueous bubble ensures that cellular components are not lost [2]. Once hybridization of mRNA onto the bead surface is completed, the beads are broken using a reagent (perfluorooctanol in 30 ml of  6x SSC) that destabilizes the oil phase, releasing the beads into an aqueous solution [2]. This reduces any hybridization effects due to the second-order kinetics of DNA base pairing [2].

Reverse Transcription and cDNA Amplification

After breaking the droplets, the microparticles are washed and resuspended in a reverse transcriptase mix and treated with exonuclease I to remove unextended primers [2]. The mRNAs are then reverse-transcribed in bulk forming single-cell transcriptomes attached to microparticles (STAMPs)[2]. Once the STAMPs are formed, the beads are re-washed, counted, and aliquoted into PCR tubes for cDNA formation and amplification[2]. Template switching adds a second primer at the poly-A tail, allowing the full cDNA to be read. As such, the cDNA molecules will have both the cell barcode and the UMI from the bead [2].

Bioinformatics Analysis

Once sequencing of the cDNA is complete, researchers can determine three things: cell types within the population, gene expression within unique cell types, and gene expression of single cells. From the UMI’s and cellular barcodes, the gene expression of an individual cell can be determined. To determine the cell type, the gene expression is compared to the gene expression of different cell populations to determine cell type probability.

Characteristics or Properties

[edit]

The Drop-seq paradigm relies on uniquely barcoded beads and microfluidic droplet encapsulation to capture single cells and their mRNA for sequencing.

Barcoded Beads

One of the main components of Drop-seq is the use of microparticles functionalized with DNA primers that include four main elements for mRNA capture, quantification, and cell identification [2] .

PCR Barcode

Every DNA primer on every bead will have an identical constant region. This is used for downstream PCR and sequencing [2].

Cell Barcodes

A cellular barcode is a sequence found on DNA primers of any one bead, but is unique across beads (every bead has a unique cellular barcode). Synthesis of the cell barcode involves a process known as a “split-and-pool” DNA synthesis strategy [2]. Split-and-pool involves taking all of the microparticle beads and splitting them into four equal groups known as pools [2]. Each pool will then receive one of the four DNA bases [2]. The beads are then mixed together and the whole process is repeated 12 times, ensuring that each bead has a different sequence.

Unique Molecular Identifiers (UMIs)

The unique molecular identifier (UMI) is different on every primer to identify any PCR duplicates, ensuring proper quantification of mRNA [2]. To construct the UMIs, the beads after the initial twelve rounds of split-and-pool DNA synthesis then undergo eight rounds of degenerate oligonucleotide synthesis [2].

Oligo-dT Primers

The end of every oligonucleotide has a sequence of thirty thymines (T30) onto the 3’ end for enzymatic priming [2].

Microfluidic Channels

The microfluidic device is designed to allow co-flows of two aqueous solutions across an oil channel to form over 100,000 nanoliter sized droplets per minute [2].

Applications

[edit]

Advancing Breast Cancer Stratification

[edit]

Breast cancer patient stratification relies on receptor status and histological grading, but around 20% of patients lack actionable biomarkers, leading to unclear therapeutic intervention[17]. Tumours are composed of multiple subpopulations and display heterogeneity in the estrogen (ESR1), progesterone (PgR), and epidermal growth factor receptor 2 (HER2/ERBB2) biomarkers[18]. Drop-seq was used to measure the gene expression profiles of 35,276 cells from 32 breast cancer cell line subtypes, resulting in the breast cancer single-cell atlas[19]. This atlas can be used to automatically assign the cellular composition of patient tumour biopsies.

Detecting Variability in Response to Drugs

[edit]

Tyrosine kinase inhibitors are effective treatments for cancer patients harbouring mutations within receptor tyrosine kinases, yet resistance to these drugs often develops over time[20]. Variability in drug responses has been observed among individual cells within a clonal cell population, leading to varied responses to anticancer therapies[21]. Drop-seq was used to determine cell-to-cell differences in EGFR-mutated NSCLC, revealing that this variability influences treatment responses[22]. It was found that drug-tolerant states arise during treatment, and the distinct combinations of biomarkers identified through drop-seq could serve used as prognostic or therapeutic targets for small-molecule therapies.[22]

Male Meiotic Studies

[edit]

The ability of Drop-seq to generate large libraries from mixed cell suspensions has made it the most widely used scRNA-Seq method in male meiotic studies[23]. Spermatogenesis is a complex process in which spermatogonial stem cells undergo terminal differentiation into mature sperm within the testis, driving considerable efforts to understand germ cell differentiation programs [24]. Drop-seq was used to analyze 34,633 cells isolated from mouse testis, which enabled the generation of a detailed cellular and molecular atlas of cell types present in the testis, including 2 previously undescribed adult somatic cell populations[24]. This study further revealed the continuous nature of germ cell development and identified new candidate transcriptional regulators of germ cell differentiation.

References

[edit]
  1. ^ DUTCHEN, STEPHANIE. "Beyond Average". Harvard Medical School. Retrieved 22 February 2025.
  2. ^ a b c d e f g h i j k l m n o p q r s t u v w Macosko, Evan Z.; Basu, Anindita; Satija, Rahul; Nemesh, James; Shekhar, Karthik; Goldman, Melissa; Tirosh, Itay; Bialas, Allison R.; Kamitaki, Nolan; Martersteck, Emily M.; Trombetta, John J.; Weitz, David A.; Sanes, Joshua R.; Shalek, Alex K.; Regev, Aviv; McCarroll, Steven A. (2015). "Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets". Cell. 161 (5): 1202–1214. doi:10.1016/j.cell.2015.05.002. PMC 4481139. PMID 26000488.
  3. ^ Sim, G.; et al. (1979). "Use of a cDNA library for studies on evolution and developmental expression of the chorion multigene families". Cell. 18 (4): 1303–1316. doi:10.1016/0092-8674(79)90241-1.
  4. ^ Fiers, W.; Contreras, R.; Duerinck, F.; Haegeman, G.; Iserentant, D.; Merregaert, J.; Min Jou, W.; Molemans, F.; Raeymaekers, A.; Van den Berghe, A.; Volckaert, G.; Ysebaert, M. (1976-04-08). "Complete nucleotide sequence of bacteriophage MS2 RNA: primary and secondary structure of the replicase gene". Nature. 260 (5551): 500–507. doi:10.1038/260500a0. ISSN 0028-0836.
  5. ^ Lowe, Rohan; Shirley, Neil; Bleackley, Mark; Dolan, Stephen; Shafee, Thomas (2017-05-18). "Transcriptomics technologies". PLOS Computational Biology. 13 (5): e1005457. doi:10.1371/journal.pcbi.1005457. ISSN 1553-7358. PMC 5436640. PMID 28545146.
  6. ^ Zhang, Haotian (2019-10-01). "The review of transcriptome sequencing: principles, history and advances". IOP Conference Series: Earth and Environmental Science. 332 (4): 042003. doi:10.1088/1755-1315/332/4/042003. ISSN 1755-1307.
  7. ^ Adams, Mark D.; Kelley, Jenny M.; Gocayne, Jeannine D.; Dubnick, Mark; Polymeropoulos, Mihael H.; Xiao, Hong; Merril, Carl R.; Wu, Andrew; Olde, Bjorn; Moreno, Ruben F.; Kerlavage, Anthony R.; McCombie, W. Richard; Venter, J. Craig (1991-06-21). "Complementary DNA Sequencing: Expressed Sequence Tags and Human Genome Project". Science. 252 (5013): 1651–1656. doi:10.1126/science.2047873. ISSN 0036-8075.
  8. ^ Schena, Mark; Shalon, Dari; Davis, Ronald W.; Brown, Patrick O. (1995-10-20). "Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray". Science. 270 (5235): 467–470. doi:10.1126/science.270.5235.467. ISSN 0036-8075. Retrieved 2025-02-22.
  9. ^ Hoopes, Laura. "Genetic Diagnosis: DNA Microarrays and Cancer". Scitable by Nature Education. Retrieved 21 February 2025.
  10. ^ Draghici, S; Khatri, P; Eklund, A; Szallasi, Z (2006). "Reliability and reproducibility issues in DNA microarray measurements". Trends in Genetics. 22 (2): 101–109. doi:10.1016/j.tig.2005.12.005. Retrieved 2025-02-22.
  11. ^ Wang, Zhong; Gerstein, Mark; Snyder, Michael (2009). "RNA-Seq: a revolutionary tool for transcriptomics". Nature Reviews Genetics. 10 (1): 57–63. doi:10.1038/nrg2484. ISSN 1471-0056. Retrieved 2025-02-22.
  12. ^ Khatoon, Zainab; Figler, Bryan; Zhang, Hui; Cheng, Feng (2014). "Introduction to RNA ‐Seq and its Applications to Drug Discovery and Development". Drug Development Research. 75 (5): 324–330. doi:10.1002/ddr.21215. ISSN 0272-4391. Retrieved 2025-02-22.
  13. ^ Tang, Fuchou; Barbacioru, Catalin; Wang, Yangzhou; Nordman, Ellen; Lee, Clarence; Xu, Nanlan; Wang, Xiaohui; Bodeau, John; Tuch, Brian B; Siddiqui, Asim; Lao, Kaiqin; Surani, M Azim (2009). "mRNA-Seq whole-transcriptome analysis of a single cell". Nature Methods. 6 (5): 377–382. doi:10.1038/nmeth.1315. ISSN 1548-7091.
  14. ^ Yu, Xiaoqing; Abbas-Aghababazadeh, Farnoosh; Chen, Y. Ann; Fridley, Brooke L. (2021). "Statistical and Bioinformatics Analysis of Data from Bulk and Single-Cell RNA Sequencing Experiments". Translational Bioinformatics for Therapeutic Development. Vol. 2194. New York, NY: Springer US. doi:10.1007/978-1-0716-0849-4_9. ISBN 978-1-0716-0848-7. Retrieved 2025-02-22.
  15. ^ Hashimshony, Tamar; Wagner, Florian; Sher, Noa; Yanai, Itai (2012). "CEL-Seq: Single-Cell RNA-Seq by Multiplexed Linear Amplification". Cell Reports. 2 (3): 666–673. doi:10.1016/j.celrep.2012.08.003.
  16. ^ Jaitin, Diego Adhemar; Kenigsberg, Ephraim; Keren-Shaul, Hadas; Elefant, Naama; Paul, Franziska; Zaretsky, Irina; Mildner, Alexander; Cohen, Nadav; Jung, Steffen; Tanay, Amos; Amit, Ido (2014-02-14). "Massively Parallel Single-Cell RNA-Seq for Marker-Free Decomposition of Tissues into Cell Types". Science. 343 (6172): 776–779. doi:10.1126/science.1247651. ISSN 0036-8075. PMC 4412462. PMID 24531970. Retrieved 2025-02-22.
  17. ^ Akshata Desai, Kartik Aysola (2012). "Triple Negative Breast Cancer – An Overview". Hereditary Genetics. doi:10.4172/2161-1041.S2-001. PMC 4181680. PMID 25285241.
  18. ^ Harbeck, Nadia; Penault-Llorca, Frédérique; Cortes, Javier; Gnant, Michael; Houssami, Nehmat; Poortmans, Philip; Ruddy, Kathryn; Tsang, Janice; Cardoso, Fatima (2019-09-23). "Breast cancer". Nature Reviews Disease Primers. 5 (1). doi:10.1038/s41572-019-0111-2. ISSN 2056-676X.
  19. ^ Gambardella, G.; Viscido, G.; Tumaini, B.; Isacchi, A.; Bosotti, R.; di Bernardo, D. (2022-03-31). "A single-cell analysis of breast cancer cell lines to study tumour heterogeneity and drug response" (PDF). Nature Communications. 13 (1). doi:10.1038/s41467-022-29358-6. ISSN 2041-1723. PMC 8971486. PMID 35361816. Retrieved 2025-02-22.
  20. ^ Paez, J. Guillermo; Jänne, Pasi A.; Lee, Jeffrey C.; Tracy, Sean; Greulich, Heidi; Gabriel, Stacey; Herman, Paula; Kaye, Frederic J.; Lindeman, Neal; Boggon, Titus J.; Naoki, Katsuhiko; Sasaki, Hidefumi; Fujii, Yoshitaka; Eck, Michael J.; Sellers, William R.; Johnson, Bruce E.; Meyerson, Matthew (2004-06-04). "EGFR Mutations in Lung Cancer: Correlation with Clinical Response to Gefitinib Therapy" (PDF). Science. 304 (5676): 1497–1500. doi:10.1126/science.1099314. ISSN 0036-8075. Retrieved 2025-02-22.
  21. ^ Tyson, Darren R; Garbett, Shawn P; Frick, Peter L; Quaranta, Vito (2012). "Fractional proliferation: a method to deconvolve cell population dynamics from single-cell data". Nature Methods. 9 (9): 923–928. doi:10.1038/nmeth.2138. ISSN 1548-7091. PMC 3459330. PMID 22886092. Retrieved 2025-02-22.
  22. ^ a b Aissa, Alexandre F.; Islam, Abul B. M. M. K.; Ariss, Majd M.; Go, Cammille C.; Rader, Alexandra E.; Conrardy, Ryan D.; Gajda, Alexa M.; Rubio-Perez, Carlota; Valyi-Nagy, Klara; Pasquinelli, Mary; Feldman, Lawrence E.; Green, Stefan J.; Lopez-Bigas, Nuria; Frolov, Maxim V.; Benevolenskaya, Elizaveta V. (2021-03-12). "Single-cell transcriptional changes associated with drug tolerance and response to combination therapies in cancer" (PDF). Nature Communications. 12 (1). doi:10.1038/s41467-021-21884-z. ISSN 2041-1723. PMC 7955121. PMID 33712615. Retrieved 2025-02-22.
  23. ^ Peng, Yiheng; Qiao, Huanyu (2021-08-18). "The Application of Single-Cell RNA Sequencing in Mammalian Meiosis Studies". Frontiers in Cell and Developmental Biology. 9. doi:10.3389/fcell.2021.673642. ISSN 2296-634X. PMC 8416306. PMID 34485276.
  24. ^ a b Green, Christopher Daniel; Ma, Qianyi; Manske, Gabriel L.; Shami, Adrienne Niederriter; Zheng, Xianing; Marini, Simone; Moritz, Lindsay; Sultan, Caleb; Gurczynski, Stephen J.; Moore, Bethany B.; Tallquist, Michelle D.; Li, Jun Z.; Hammoud, Saher Sue (2018). "A Comprehensive Roadmap of Murine Spermatogenesis Defined by Single-Cell RNA-Seq". Developmental Cell. 46 (5): 651–667.e10. doi:10.1016/j.devcel.2018.07.025. PMC 6713459. PMID 30146481. Retrieved 2025-02-22.