- 27 lessons
- 0 quizzes
- 10 week duration
Overview
Module 1
Module 2
Module 3
Module 4
Module 5
Module 6
Module 7
Module 8
Module 9
RNASeq Pipeline using edgeR
why single cells?
Single-cell multi-omic analyses are potential enough to uncover actuality level of heterogeneity across gene expression levels inside cells, that can modify the pattern of the genome, transcriptome, and epigenome at once. Additionally, scNGS has the ability to resolve noise in bulk-NGS through the extra ability to trace generated reads back to their cell of origin ( Eberwine et al., 2014)
Since assessing cellular co-occurrence is that the main disadvantage of bulk-NGS, several studies have additionally been conducted to more elucidate clonal structures using single-cell DNAseq [including whole exome sequencing (WES) or WGS], bisulfite sequencing, and ATACseq (assay for exchangeable accessible chromatin, ATAC). Given the variability and importance of gene expression, sc-RNAseq is one in all the foremost used single-cell sequencing techniques.
Bulk-WGS will be directly used to assess the existence of subclonal mutations through the use of variant allele frequencies (VAFs). Through the modeling of VAFs and copy number changes, An understanding of the clonal design could also be inferred from such bulk-NGS knowledge. whereas several bulk-NGS ways have faith in mixture models of the VAFs to investigate tiny indels and purpose mutations, these ways typically have faith in the copy range of the gene in question, which may be altered in cancers, and are unable to relate multiple mutations that exist at low frequencies. in addition, bulk sequencing includes a tendency to report what an “average” cell in a population would appear as if and for that reason wouldn’t be usable within the analysis of an all-or-nothing response.
Traditional sequencing ways will solely get the average of the many cells, unable to investigate a small range of cells and lose cellular heterogeneity info. Compared with traditional
sequencing technology, single-cell technologies have the benefits of detecting heterogeneity among individual cells, distinguishing a small range of cells, and delineating cell maps. In mammalian systems, single-cell desoxyribonucleic acid sequencing has been widely applied to check traditional physiology and sickness. Single-cell resolution will uncover the roles of genetic condition or intra-tumor genetic heterogeneity in cancer development or treatment response (Wang, D. and Bodovitz, S., 2010)
Considerations
Isolation of single cells
There ar many ways in which to isolate individual cells before whole amplification and sequencing. Fluorescence-activated cell sorting (FACS) could be a wide used approach. Individual cells also can be collected by micromanipulation, as an example by serial dilution or by employing a patch pipet or carbon nanotube to reap one cell.
The advantages of micromanipulation are ease and low cost, however they’re hard and prone to misidentification of cell sorts underneath magnifier. Laser-capture microdissection (LCM) also can be used for grouping single cells. though LCM preserves the data of the spacial location of a sampled cell within a tissue, it’s hard to capture a whole single cell while not additionally grouping the materials from neighboring cells. High-throughput methods for single cell isolation also embody microfluidics. each FACS and microfluidics are correct, automatic and capable of analytic unbiased samples. However, each methods need detaching cells from their microenvironments first, thereby causing perturbation to the transcriptional profiles in ribonucleic acid expression analysis.
Number of cells to be analyzed
In General, for a typical bulk cell ribonucleic acid sequencing (RNA-seq) experiment, 10 million reads are generated and a gene with beyond the threshold of 50 reads per kb per million reads (RPKM) is taken into account expressed. For a gene that’s 1kb long, this corresponds to 500 reads and a minimum coefficient of variation (CV) of 4% under the idea of the distribution. For a typical mammalian cell containing 200,000 mRNA, sequencing data from a minimum of 50 single cells have to be compelled to be pooled so as to attain this minimum CV worth. However, thanks to the potency of reverse transcription and alternative noise introduced within the experiments, a lot of cells are needed for correct expression analyses and cell-type identification.
Overview of scRNA-Seq Experimental Platforms
Once a biological sample has been obtained, cells should be separated before they will be more analysed. for a few samples (e.g. blood or other liquids), analytic individual cells is simple, whereas for others (e.g. brain or heart), it will be difficult to extract intact cells without inducing a stress response. Dissociation protocols are typically tissue/organism- specific, and are mostly independent of the cell/transcript capture and library preparation technique. Most dissociation protocols can manufacture an answer of freely flowing individual cells (Hebenstreit, D., 2012)
Low-Throughput ways
In general at the time of market introduction majority of scRNA-seq experiments relied on capturing single cells using the Fluidigm C1 chip. Hence the method is ready to perform lysis, reverse transcription and amplification of cDNA expeditiously and in tiny reaction volumes, the high value and little size of the microfluidic chips and probably high doublet rates create this one of the smallest cost efficient protocols. One advantage of the C1 is that it allows imaging of cells to spot doublets and broken cells (Ziegenhain et al., 2017).
High-Throughput ways
Much higher throughput is achieved by pooling cells before library preparation. The reads will later be assigned to their cell of origin since they’re assigned a cell-specific barcode (i.e. a brief sequence of nucleotides) before combination. Today, the foremost standard high-throughput cell capture systems are supported by microfluidic drop systems. These embody the Chromium system by 10X genomics and therefore the ASCII text file in Drop-seq and InDrop systems. All of those ways turn out water droplets containing barcoded primers that may hybridize to the messenger RNA and lysis buffer. The Chromium and InDrop package the barcoded primers into gel ‘beads’ and perform reverse transcription within the individual droplets.By distinction, Drop-seq uses barcoded primers hooked up to solid beads that capture the messenger RNA through hybridizing. Droplets are then pooled and therefore the RNA is extracted before performing reverse transcription. it’s been noted that Drop-seq typically has lower messenger RNA capture potency than InDrop/Chromium (Svensson et al., 2017). One potential rationalization for this can be that drop formation may be a partly theoretical account as there’s variation within the size of droplets which may have consequences for the library sizes generated from the encapsulated cells. in contrast, cell capture systems with predetermined compartment sizes (e.g. chip- or plate-based methods) can have a a lot of uniform distribution of reaction efficiencies. Recently a micro-well-based various to droplet-based capture referred to as SeqWell has been planned. This methodology retains the benefits of high turnout capture of individual cells and little reaction volumes however additionally allows imaging of captured cells and therefore the application of extra reagents to the cells before lysing. SeqWell has similar performance to Drop-seq, that is unsurprising since it collects mRNAs through hybridizing to solid beads before reverse transcription kind of like Drop-seq. Another recently planned methodology explains that combinatorial categorization (i.e. iteratively attaching short nucleotides sequences to random subsets of cells to create up a novel barcode for every cell) is accustomed generate cell-specific barcoded libraries from unrelated cells while not the requirement to isolate individual cells. an extra advantage of the combinatorial categorization strategy over droplet-based ways is that it is applied to profile nuclei. Single-nuclei sequencing is a vital variant of scRNA-seq that is employed once learning tissues that are tough to dissociate (e.g. neurons). All high-throughput ways mentioned higher than assume that the amount of cells in every drop can follow a Poisson distribution (Gierahn et al., 2017). so there’s a trade-off between cell capture potency and jacket rate. The random capture of cells in drops creates a completely unique applied mathematics challenge to properly establish those droplet barcodes that correspond to droplets that contain a cell as against simply background RNA or multiple cells. Once cells are captured, RNA should be reverse transcribed into cDNA and so amplified before activity library preparation. High-throughput ways pool cells before library preparation, and so before transcript fragmentation. Due to pooling, only one end of the transcript can be retained and it is impossible to obtain coverage of the entire transcriptome using short read technologies. (Cao et al., 2017).