Spatial transcriptomics is a method for assigning cell types (identified by the mRNA readouts) to their locations in the histological sections. Recent work demonstrated that the subcellular localization of mRNA molecules, for example, in the nucleus can also be studied. [1]
The first widely-adopted method, described by Ståhl et al., in a landmark 2016 paper in Science [2] implies positioning individual tissue samples on the glass slides or arrays including "spots" of spatially barcoded oligo(dT) tails to capture the poly-adenylated mRNAs. [2] Besides oligo(dT) tail and spatial barcode, which indicates the x and y position on the arrayed slide, the probe contains an amplification and sequencing handle, and unique molecular identifier. [2] Commonly, histological samples are cut using cryotome, then fixed, stained, and put on the microarrays. [2] After that, it undergoes enzymatic permeabilization, so that RNA molecules can diffuse down to the slide, with further hybridization of polyadenylated mRNA molecules to the oligo (dT) probes. [2] Reverse transcription is then carried out in situ. [2] As a result, spatially marked complementary DNA (cDNA) is synthesized, providing information about gene expression in the exact location of the sample. [2] From the cDNA, libraries are synthesised for short-read sequencing. In summary, the first widely-adopted spatial transcriptomics protocol combines paralleled sequencing and staining of the same sample. [2] In the downstream analysis, bioinformatic tools allow to overlap the tissue image with the gene expression. The output is a map of the transcriptome capture the gene expression of individual cells within a tissue section. It is important to mention that the first generation of the arrayed slides comprised about 1,000 spots of the 100-μm diameter, limiting resolution to ~10-40 cells per spot. [2]
Spatial transcriptomics includes methods that can be divided into two modalities, those based in next-generation sequencing for gene detection, and those based in imaging. [3] Some common approaches to resolve spatial distribution of transcripts are microdisection techniques, Fluorescent in situ hybridization methods, in situ sequencing, in situ capture protocols and in silico approaches. [4]
Defining the spatial distribution of mRNA molecules allows for the detection of cellular heterogeneity in tissues, tumours, immune cells as well as determine the subcellular distribution of transcripts in various conditions. [4] This information provides a unique opportunity to decipher both the cellular and subcellular architecture in both tissues and individual cells. These methodologies provide crucial insights in the fields of embryology, oncology, immunology and histology. [4] The functioning of the individual cells in multicellular organisms can only be completely explained in the context of identifying their exact location in the body. [4] Spatial transcriptomics techniques sought to elucidate cells’ properties this way. Below, we look into the methods that connect gene expression to the spatial organization of cells. [4]
Laser capture microdissection enables capturing single cells without causing morphologic alterations. [6] [7] It exploits transparent ethylene vinyl acetate film apposed to the histological section and a low-power infrared laser beam. [6] Once such beam is directed at the cells of interest, film directly above the targeted area temporarily melts so that its long-chain polymers cover and tightly capture the cells. [6] Then, the section is removed and cells of interest remain embedded in the film. [6] This method allows further RNA transcript profiling and cDNA library generation of the retrieved cells.
RNA sequencing of the selected regions in individual cryosections is another method that can produce location-based genome-wide expression data. [8] This method is carried out without laser capture microdissection. It was first used to determine genome-wide spatial patterns of gene expression in cryo-sliced Drosophila embryos. [8] Essentially, it implies simple preparation of the library from the selected regions of the sample. This method had difficulties in obtaining high-quality RNA-seq libraries from every section due to the material loss as a result of the small amount of total RNA in each slice. [4] [8] This problem was resolved by adding RNA of a distantly related Drosophila species to each tube after initial RNA extraction. [8]
Transcriptome in vivo analysis (TIVA) is a technique that enables capturing mRNA in live single cells in intact live tissue sections. [9] It uses a photoactivatable tag. [4] [9] The TIVA tag has several functional groups and a trapped poly(U) oligonucleotide coupled to biotin. [9] A disulfide-linked peptide, which is adjacent to the tag, allows it to penetrate the cell membrane. [9] Once inside, laser photoactivation is used to unblock poly(U) oligonucleotide in the cells of interest, so that TIVA tag hybridizes to mRNAs within the cell. [9] Then, streptavidin capture of the biotin group is used to extract poly(A)-tailed mRNA molecules bound to unblocked tags, after which these mRNAs are analyzed by RNA sequencing. [9] This method is limited by low throughput, as only a few single cells can be processed at a time. [4]
An advanced alternative for RNA Sequencing of Individual Cryosections described above, RNA tomography (tomo-seq) features better RNA quantification and spatial resolution. [10] It is also based on tissue cryosectioning with further RNA sequencing of individual sections, yielding genome-wide expression data and preserving spatial information. [10] In this protocol, usage of carrier RNA is omitted due to linear amplification of cDNA in individual histological sections. [10] The identical sample is sectioned in different directions followed by 3D transcriptional construction using overlapping data. [10] Overall, this method implies using identical samples for each section and thus cannot be applied for processing clinical material. [4]
LCM-seq utilizes laser capture microdissection (LCM) coupled with Smart-Seq2 RNA sequencing and is applicable down to the single cell level and can even be used on partially degraded tissues. The workflow includes cryosectioning of tissues followed by laser capture microdissection, where cells are collected directly into lysis buffer and cDNA is generated without the need for RNA isolation, which both simplifies the experimental procedures as well as lowers technical noise. [11] [12] As the positional identity of each cell is recorded during the LCM procedure, the transcriptome of each cell after RNA sequencing of the corresponding cDNA library can be inferred to the position where it was isolated from. [11] LCM-seq has been applied to multiple cell types to understand their intrinsic properties, including oculomotor neurons, facial motor neurons, hypoglossal motor neurons, spinal motor neurons, red nucleus neurons, [13] interneurons, [14] dopamine neurons, [15] and chondrocytes. [16]
Geo-seq is a method that utilizes both laser capture microdissection and single-cell RNA sequencing procedures to determine the spatial distribution of the transcriptome in tissue areas approximately ten cells in size. [17] The workflow involves removal and cryosectioning of tissue followed by laser capture microdissection. [17] [18] The extracted tissue is then lysed, and the RNA is purified and reverse transcribed into a cDNA library. [17] [18] Library is sequenced, and the transcriptomic profile can be mapped to the original location of the extracted tissue. [17] [18] This technique allows the user to define regions of interest in a tissue, extract said tissue and map the transcriptome in a targeted approach.
The NICHE-seq method uses photoactivatable fluorescent markers and two-photon laser scanning microscopy to provide spatial data to the transcriptome generated. [19] The cells bound by the fluorescent marker are photoactivated, dissociated and sorted via fluorescence-activated cell sorting. [19] This provides sorting specificity to only labeled, photoactivated cells. [19] Following sorting, single-cell RNA sequencing generates the transcriptome of the visualized cells. [19] This method can process thousands of cells within a defined niche at the cost of losing spatial data between cells in the niche. [19]
ProximID is a methodology based on iterative micro digestion of extracted tissue to single cells. [20] Initial mild digestion steps preserve small interacting structures that are recorded prior to continued digestion. [20] The single cells are then separated from each structure and undergo sc-RNAseq and clustered using t-distributed stochastic neighbour embedding. [20] [21] The clustered cells can be mapped to physical interactions based on the interacting structures prior to the micro digestions. [20] While the throughput of this technique is relatively low it provides information on physical interaction between cells to the dataset. [4]
CosMx Spatial Molecular Imager is the first high-plex in situ analysis platform to provide spatial multiomics with formalin-fixed paraffin-embedded (FFPE) and fresh frozen (FF) tissue samples at cellular and subcellular resolution. It enables rapid quantification and visualization of up to 1,000 RNA and 64 validated protein analytes and is the flexible, spatial single-cell imaging platform for cell atlasing, tissue phenotyping, cell-cell interactions, cellular processes, and biomarker discovery.
One of the first techniques able to achieve spatially resolved RNA profiling of individual cells was single-molecule fluorescent in situhybridization (smFISH). [22] It implemented short (50 base pairs) oligonucleotide probes conjugated with 5 fluorophores which could bind to a specific transcript yielding bright spots in the sample. [22] Detection of these spots provides quantitative information about expression of certain genes in the cell. [22] However, usage of probes labeled with multiple fluorophores was challenged by self-quenching, altered hybridization characteristics, their synthesis and purification. [4]
Later, this method was changed [23] by substituting the above described probes with those of 20 bp length, coupled to only one fluorophore and complementary in tandem to an mRNA sequence of interest, meaning that those would collectively bind to the targeted mRNA. One such probe itself wouldn't produce a strong signal, but the cumulative fluorescence of the congregated probes would show a bright spot. Since single misbound probes are unlikely to co-localize, false-positive signals in this method are limited. [4]
Thus, this in situ hybridization (ISH) technique spots spatial localization of RNA expression via direct imaging of individual RNA molecules in single cells.
Another in situ hybridization technique termed RNAscope employs probes of the specific Z-shaped design to simultaneously amplify hybridization signals and suppress background noise. [24] It allows for the visualization of single RNA in a variety of cellular types. [25] Most steps of RNAScope are similar to the classic ISH protocol. [24] The tissue sample is fixed onto slides and then treated with RNAscope reagents that permeate the cells. Z-probes are designed in a way that they are only effective when bound in pairs to the target sequence. [24] This allows another element of this method (preamplifier) to connect to the top tails of Z-probes. [24] Once affixed, preamplifier serves as a binding site for other elements: amplifiers which in turn bind to another type of probes: label probes. [24] As a result, a bulky structure is formed on the target sequence. Most importantly, the preamplifier fails to bind to a singular Z-probe, thus, nonspecific binding wouldn't entail signal emission, thus, eliminating background noise mentioned in the beginning. [4] [24]
Sequential fluorescence in situhybridization (seqFISH) is another method that provides identification of mRNA directly in single cells with preservation of their spatial context. [26] [27] This method is carried out in multiple rounds; each of them includes fluorescent probe hybridization, imaging, and consecutive probe stripping. [26] Various genes are assigned different colors in every round, generating a unique temporal barcode. [26] Thus, seqFISH distinguishes mRNAs by a sequential color code, such as red-red-green. Nevertheless, this technique has its flaws featuring autofluorescent background and high costs due to the number of probes used in each round. [4]
Conventional FISH methods are limited by the small number of genes that can be simultaneously analyzed due to the small number of distinct color channels, so multiplexed error-robust FISH was designed to overcome this problem. [28] Multiplexed Error-Robust FISH (MERFISH) greatly increases the number of RNA species that can be simultaneously imaged in single cells employing binary code gene labeling in multiple rounds of hybridization. [29] This approach can measure 140 RNA species at a time using an encoding scheme that both detects and corrects errors. [29] The core principle lies in identification of genes by combining signals from several consecutive hybridization rounds and assigning N-bit binary barcodes to genes of interest. [29] The Code depends on specific probes and comprises “1” or “0” values and their combination is set differently for each gene. [29] Errors are avoided by using six-bit or longer codes with any two of them differing by at least 3 bits. [29] A specific probe is created for each RNA species. [29] Each probe is a target-specific oligonucleotide that consists of 20-30 base pairs and complementary binds to mRNA sequence after permeating the cell. [29] Then, multiple rounds of hybridization are conducted as follows: for each round, only a probe that includes “1” in the corresponding binary code position is added. [29] At the end of each round, fluorescent microscopy is used to locate each probe. [29] Expectedly, only those mRNAs which had “1” in the assigned position would be captured. [29] Photos are then photobleached and a new subset is added. [29] Thus, we retrieve combination of binary values which makes it possible to distinguish between numerous RNA species.
Single-molecule RNA detection at depth by hybridization chain reaction (smHCR) is an advanced seqFISH technique that can overcome typical complication of autofluorescent background in thick and opaque tissue samples. [30] In this method, multiple readout probes are bound with the target region of mRNA. [30] Target is detected by a set of short DNA probes which attach to it in defined subsequence. [30] Each DNA probe carries an initiator for the same HCR amplifier. [30] Then, fluorophore-labeled DNA HCR hairpins penetrate the sample and assemble into fluorescent amplification polymers attaching to initiating probes. [30] In multiplexed studies, the same two-stage protocol described above is used: all probe sets are introduced simultaneously, just as all HCR amplifiers are; spectrally distinct fluorophores are used for further imaging. [30]
Cyclic-ouroboros smFISH (osmFISH) is an adaptation of smFISH which aims to overcome the challenge of optical crowding. [31] In osmFISH, transcripts are visualized, and an image is acquired before the probe is stripped and a new transcript is visualized with a different fluorescent probe. [31] After successive rounds the images are compiled to view the spatial distribution of the RNA. [31] Due to transcripts being sequentially visualized it eliminates the issue of signals interfering with each other. [4] [31] This method allows the user to generate high resolution images of larger tissue sections than other related techniques. [4]
Expansion FISH (ExFISH) leverages expansion microscopy to allow for super-resolution imaging of RNA location, even in thick specimens such as brain tissue. [32] It supports both single-molecule and multiplexed readouts. [32]
Expansion-Assisted Iterative Fluorescence In Situ Hybridization (EASI-FISH) optimizes and builds on ExFISH with improved detection accuracy and robust multi-round processing across samples thicker (300 μm) than what was previously possible. [33] It also includes a turn-key computational analysis pipeline. [34]
SeqFISH+ resolved optical issues related to spatial crowding by subsequent rounds of fluorescence. [35] First, a primary probe anneals to targeted mRNA and then subsequent probes bind to flanking regions of the primary probe resulting in a unique barcode. [35] Each readout probe is captured as an image and collapsed into a super resolved image. [35] This method allows the user to target up to ten thousand genes at a time. [4]
DNA microscopy is a distinct imaging method for optics-free mapping of molecules’ positions with simultaneous preservation of sequencing data carried out in several consecutive in situ reactions. [36] First, cells are fixed and cDNA is synthesized. [36] Randomized nucleotides then tag target cDNAs in situ, providing unique labels for each molecule. [36] Tagged transcripts are amplified in the second in situ reaction, retrieved copies are concatenated, and new randomized nucleotides are added. [36] Each consecutive concatenation event is labeled, yielding unique event identifiers. [36] Algorithm then generates images of the original transcripts based on decoded molecular proximities from the obtained concatenated sequences, while target's single nucleotide information is being recorded as well. [36]
The ISS padlock method [37] is based on padlock probing, [38] rolling-circle amplification (RCA), [39] and sequencing by ligation chemistry. [40] Within intact tissue sections, mRNA is reversely transcribed to cDNA, which is followed by mRNA degradation by RNase H. [4] Then, there are two ways of how this method can be carried out. The first way, gap-targeted sequencing, involves padlock probe binding to cDNA with a gap between the ends of the probe which are targeted for sequencing by ligation. [4] DNA polymerization then fills this gap and a DNA circle is created by DNA ligation. [4] Another way, barcode-targeted sequencing, DNA circularization of a padlock probe with a barcode sequence is conducted by ligation only. [4] In both versions of the method, the ends are ligated forming a circle of DNA. [4] Target amplification is then performed by RCA, yielding micrometer-sized RCA products (RCPs). [4] RCAs consist of repeats of the padlock probe sequence. [4] These DNA molecules are then subjected to sequencing by ligation, decoding either a gap-filled sequence or an up to four-base-long barcode within the probe with adjacent ends, depending on the version. [4] No-gap variant claims higher sensitivity, while gap-filled one implies reading out the actual RNA sequence of the transcript. [4] Later, this method was improved by automatization on a microfluidic platform and substitution of sequencing by ligation with sequencing by hybridization technology. [4]
Fluorescent in situ sequencing (FISSEQ), [41] like ISS padlock, is a method that uses reverse transcription, rolling-circle amplification, and sequencing by ligation techniques. [4] It allows spatial transcriptome analysis in fixed cells. [4] RNA is first reverse transcribed into cDNA with regular and modified amine-bases and tagged random hexamer RT primers. [4] Amine-bases mediate the cross-linkage of cDNA to its cellular surrounding. [4] Then cDNA is circulated by ligation and amplified by RCA. [4] Single-stranded DNA nanoballs of 200–400 nm in diameter are obtained as a result. [4] Thus, these nanoballs comprise numerous tandem repeats of the cDNA sequence. Then sequencing is performed via SOLiD sequencing by ligation. [4] Positions of both product of reverse transcription and clonally amplified RCPs are maintained via cross-linkage to cellular matrix components mentioned previously, creating a 3D in situ RNA-seq library within the cell. [4] Once bound with fluorescent probes featuring different colors, amplicons become highly fluorescent which allows visual detection of the signal; however, the image-processing algorithm relies on read alignment to reference sequences rather than signal intensity. [4]
Barcode in situtargeted sequencing (Barista-seq) is an improvement on the gap padlock probe methodology boasting a fivefold increase in efficiency, an increased read length of fifteen bases and is compatible with illumina sequencing platforms. [4] [21] The method also uses padlock probes and rolling circle amplification, however this approach uses sequencing-by-synthesis and crosslinking unlike the gap padlock method. [21] The crosslinking to the cellular matrix in the same procedure is the same as FISSEQ. [4] [21]
Spatially-resolved transcript amplicon readout mapping (STARmap) utilizes a padlock probe with an additional primer which allows for direct amplification of mRNA, forgoing the need for reverse transcription. [4] [42] Similar to other padlock probe based methods amplification occurs via rolling circle amplification. [4] The DNA amplicons are chemically modified and embedded into a polymerized hydrogel within the cell. [4] Captured RNA can then be sequenced in situ providing three dimensional locations of the mRNA within each cell. [4]
STOmics is a pioneer in advancing spatially-resolved transcriptomic analysis through its proprietary SpaTial Enhanced REsolution Omics-Sequencing (Stereo-seq) technology. [43] It combines in situ capture with DNB-seq, DNB sequencing is based on lithographically etched chips (patterned arrays) for in situ sequencing. Unlike other um-level in situ capture technologies, standard DNB chips have spots with approximately 220 nm diameter and a center-to-center distance of 500 nm, providing up to 20000 spots for tissue RNA capture per 10mm linear distance, or 4x108 spots per 1cm2. Therefore, STOmics can show higher resolution and wider field of view than other in situ capture technologies. [44]
Spatial Transcriptomics is the ability to capture the positional context of transcriptional activity within intact tissue, either for regions or single cells.
When a tissue cryosection is attached to a spatial transcriptomic slide the barcoded primers bind and capture adjacent mRNAs from the tissue. While the tissue section is attached to the slide, reverse transcription of captured mRNA is initiated and the resulting cDNA incorporates the spatial barcode of the primer. Following mRNA capture and reverse transcription, sequencing libraries are prepared and analyzed with Illumina dye sequencing. The spatial barcode present within each generated sequence allows the data for each individual mRNA transcript to be mapped back to its point of origin within the tissue section.
NanoString's GeoMx Digital Spatial Profiler (DSP) allows the user to define a microscopic region of interest on an FFPE or frozen tissue slide due to a UV-photocleavable barcode engineered into the in-situ hybridization probes. [4] [45] The region of interest is specifically exposed to UV light, the barcodes are cleaved and used to identify the RNA or protein present in the tissue. [45] The defined regions of interest can vary in size between ten and six hundred micrometers allowing targeting of a wide variety of structures and cells in the histological sample. [4]
Slide-seq relies on the attachment of RNA binding, DNA-barcoded micro beads to a rubber coated glass coverslip. [4] [46] The microbeads are mapped to their spatial location via SOLiD sequencing. [46] Tissue sections are transferred to this coverslip to capture extracted RNA. Captured RNA is amplified and sequenced. [46] Transcript localization is determined by the barcode oligonucleotide sequence from the bead that captured it. [46]
APEX-seq allows the for assessment of the spatial transcriptome in different regions of a cell. [4] [47] The method utilizes the APEX2 gene, expressed in live cells which are incubated with biotin-phenol and hydrogen peroxide. [47] In these conditions the APEX2 enzymes catalyse the transfer of biotin groups to the RNA molecules and these can then be purified via streptavidin bead purification. [47] The purified transcripts are then sequenced to determine which molecules were in close proximity to the biotin tagging enzyme. [47]
High-Definition Spatial Transcriptomics (HDST) begins with decoding the location of mRNA capture beads in wells on a glass slide. [48] This is accomplished by sequential hybridization to the barcode oligonucleotide sequence of each bead. [48] Once the location of each bead is decoded, a tissue sample can be placed on the slide and permeabilized. [48] The captured transcripts are then sequenced. [48] HDST uses smaller beads than Slide-seq and thus can resolve at a spatial resolution of two micrometers compared to ten micrometers of Slide-seq. [4]
The 10X Genomics Visium assay is a newer and improved version of the Spatial Transcriptomics assay. It also utilizes spotted arrays of mRNA-capturing probes on the surface of glass slides but with increased spot number, minimized spot size and increased amount of capture probes per spot. Within each of the four capture areas of the Visium Spatial Gene Expression slides, there are approximately 5000 barcoded spots, which in turn contain millions of spatially barcoded capture oligonucleotides. Tissue mRNA is released upon permeabilization and binds to the barcoded oligos, enabling capture of gene expression information. Each barcoded spot is 55 μm in diameter, and the distance from the center of one spot to the center of another is approximately 100 μm. The spots are staggered to minimize the distance between them. On average, mRNA from anywhere between 1 and 10 cells are captured per spot which provides near single-cell resolution. [49]
in silico Spatial Reconstruction with ISH implies computational spatial reconstruction of cells’ locations according to their expression profiles. [4] Several similar methods of this principle exist. [50] [51] They co-analyze single-cell transcriptomics and available ISH-based gene expression atlases of the same cell type. [4] Based on these data, cells are then assigned to their positions in the tissue. Obviously, this method is limited by the factor of availability of ISH references. [4] Additionally, it becomes more complicated when assigning cells in complex tissues. This approach is not applicable for clinical samples due to the lack of paired references. [4] Reported success rate for the exact allocation of cells in brain tissue was 81%. [50]
Mapping the transcriptome using the Distmap algorithm requires high-throughput single cell sequencing and an existing in situ hybridization atlas for the tissue of interest. [4] [52] The Distmap algorithm generates a virtual 3D model of the tissue of interest using the transcriptomes of sequenced cells and said reference atlas. [4] [52] The transcriptomes can be clustered into cell types using t-distributed stochastic neighbour embedding and mapped to the 3D model using virtual in situ hybridization. [52] [53] Essentially, this algorithm takes data generated from single cells in a dissociated tissue and is able to map individual transcripts to where the cell type exists in the tissue using virtual in situ hybridization.
in situ hybridization was developed in the late 60's and saw major developments in the 80's (smFISH) and 10's (RNAscope, seqFISH, MERFISH and osmFISH) with the most recent additions being seqFISH+ and DNA microscopy. [17] Microdisecction techniques were first developed in the late 90's (Laser Capture Microdissection) and the most recent addition was in 2016 with the development of ProximID. [4] Spatial genomics/transcriptomics as a technique was invented and developed in 2000 by Michael Doyle (of Eolas), Maurice Pescitelli (of the University of Illinois at Chicago), Betsey Williams (of Harvard), and George Michaels (of George Mason University), [54] [55] [56] as part of the Visible Embryo Project [57] [58] It was later expanded upon in 2016 by Jonas Frisén, Joakim Lundeberg, Patrik Ståhl and their colleagues in Stockholm, Sweden. [2] In 2019 at the Broad Institute, the labs of Fei Chen and Evan Macosko developed Slide-seq, which used barcoded oligos on beads. [46]
Functional genomics is a field of molecular biology that attempts to describe gene functions and interactions. Functional genomics make use of the vast data generated by genomic and transcriptomic projects. Functional genomics focuses on the dynamic aspects such as gene transcription, translation, regulation of gene expression and protein–protein interactions, as opposed to the static aspects of the genomic information such as DNA sequence or structures. A key characteristic of functional genomics studies is their genome-wide approach to these questions, generally involving high-throughput methods rather than a more traditional "candidate-gene" approach.
The transcriptome is the set of all RNA transcripts, including coding and non-coding, in an individual or a population of cells. The term can also sometimes be used to refer to all RNAs, or just mRNA, depending on the particular experiment. The term transcriptome is a portmanteau of the words transcript and genome; it is associated with the process of transcript production during the biological process of transcription.
Fluorescence in situ hybridization (FISH) is a molecular cytogenetic technique that uses fluorescent probes that bind to only particular parts of a nucleic acid sequence with a high degree of sequence complementarity. It was developed by biomedical researchers in the early 1980s to detect and localize the presence or absence of specific DNA sequences on chromosomes. Fluorescence microscopy can be used to find out where the fluorescent probe is bound to the chromosomes. FISH is often used for finding specific features in DNA for use in genetic counseling, medicine, and species identification. FISH can also be used to detect and localize specific RNA targets in cells, circulating tumor cells, and tissue samples. In this context, it can help define the spatial-temporal patterns of gene expression within cells and tissues.
In situ hybridization (ISH) is a type of hybridization that uses a labeled complementary DNA, RNA or modified nucleic acid strand to localize a specific DNA or RNA sequence in a portion or section of tissue or if the tissue is small enough, in the entire tissue, in cells, and in circulating tumor cells (CTCs). This is distinct from immunohistochemistry, which usually localizes proteins in tissue sections.
SOLiD (Sequencing by Oligonucleotide Ligation and Detection) is a next-generation DNA sequencing technology developed by Life Technologies and has been commercially available since 2006. This next generation technology generates 108 - 109 small sequence reads at one time. It uses 2 base encoding to decode the raw data generated by the sequencing platform into sequence data.
RNA-Seq is a technique that uses next-generation sequencing to reveal the presence and quantity of RNA molecules in a biological sample, providing a snapshot of gene expression in the sample, also known as transcriptome.
In the field of cellular biology, single-cell analysis and subcellular analysis is the study of genomics, transcriptomics, proteomics, metabolomics and cell–cell interactions at the single cell level. The concept of single-cell analysis originated in the 1970s. Before the discovery of heterogeneity, single-cell analysis mainly referred to the analysis or manipulation of an individual cell in a bulk population of cells at a particular condition using optical or electronic microscope. To date, due to the heterogeneity seen in both eukaryotic and prokaryotic cell populations, analyzing a single cell makes it possible to discover mechanisms not seen when studying a bulk population of cells. Technologies such as fluorescence-activated cell sorting (FACS) allow the precise isolation of selected single cells from complex samples, while high throughput single cell partitioning technologies, enable the simultaneous molecular analysis of hundreds or thousands of single unsorted cells; this is particularly useful for the analysis of transcriptome variation in genotypically identical cells, allowing the definition of otherwise undetectable cell subtypes. The development of new technologies is increasing our ability to analyze the genome and transcriptome of single cells, as well as to quantify their proteome and metabolome. Mass spectrometry techniques have become important analytical tools for proteomic and metabolomic analysis of single cells. Recent advances have enabled quantifying thousands of protein across hundreds of single cells, and thus make possible new types of analysis. In situ sequencing and fluorescence in situ hybridization (FISH) do not require that cells be isolated and are increasingly being used for analysis of tissues.
Fluorescent in situ sequencing (FISSEQ) is a method of sequencing a cell's RNA while it remains in tissue or culture using next-generation sequencing.
Single-cell sequencing examines the nucleic acid sequence information from individual cells with optimized next-generation sequencing technologies, providing a higher resolution of cellular differences and a better understanding of the function of an individual cell in the context of its microenvironment. For example, in cancer, sequencing the DNA of individual cells can give information about mutations carried by small populations of cells. In development, sequencing the RNAs expressed by individual cells can give insight into the existence and behavior of different cell types. In microbial systems, a population of the same species can appear genetically clonal. Still, single-cell sequencing of RNA or epigenetic modifications can reveal cell-to-cell variability that may help populations rapidly adapt to survive in changing environments.
A transcriptome in vivo analysis tag is a multifunctional, photoactivatable mRNA-capture molecule designed for isolating mRNA from a single cell in complex tissues.
Perturb-seq refers to a high-throughput method of performing single cell RNA sequencing (scRNA-seq) on pooled genetic perturbation screens. Perturb-seq combines multiplexed CRISPR mediated gene inactivations with single cell RNA sequencing to assess comprehensive gene expression phenotypes for each perturbation. Inferring a gene’s function by applying genetic perturbations to knock down or knock out a gene and studying the resulting phenotype is known as reverse genetics. Perturb-seq is a reverse genetics approach that allows for the investigation of phenotypes at the level of the transcriptome, to elucidate gene functions in many cells, in a massively parallel fashion.
Single-cell transcriptomics examines the gene expression level of individual cells in a given population by simultaneously measuring the RNA concentration of hundreds to thousands of genes. Single-cell transcriptomics makes it possible to unravel heterogeneous cell populations, reconstruct cellular developmental pathways, and model transcriptional dynamics — all previously masked in bulk RNA sequencing.
Transcriptomics technologies are the techniques used to study an organism's transcriptome, the sum of all of its RNA transcripts. The information content of an organism is recorded in the DNA of its genome and expressed through transcription. Here, mRNA serves as a transient intermediary molecule in the information network, whilst non-coding RNAs perform additional diverse functions. A transcriptome captures a snapshot in time of the total transcripts present in a cell. Transcriptomics technologies provide a broad account of which cellular processes are active and which are dormant. A major challenge in molecular biology is to understand how a single genome gives rise to a variety of cells. Another is how gene expression is regulated.
CITE-Seq is a method for performing RNA sequencing along with gaining quantitative and qualitative information on surface proteins with available antibodies on a single cell level. So far, the method has been demonstrated to work with only a few proteins per cell. As such, it provides an additional layer of information for the same cell by combining both proteomics and transcriptomics data. For phenotyping, this method has been shown to be as accurate as flow cytometry by the groups that developed it. It is currently one of the main methods, along with REAP-Seq, to evaluate both gene expression and protein levels simultaneously in different species.
snRNA-seq, also known as single nucleus RNA sequencing, single nuclei RNA sequencing or sNuc-seq, is an RNA sequencing method for profiling gene expression in cells which are difficult to isolate, such as those from tissues that are archived or which are hard to be dissociated. It is an alternative to single cell RNA seq (scRNA-seq), as it analyzes nuclei instead of intact cells.
Deterministic Barcoding in Tissue for Spatial Omics Sequencing (DBiT-seq) was developed at Yale University by Rong Fan and colleagues in 2020 to create a multi-omics approach for studying spatial gene expression heterogenicity within a tissue sample. This method can be used for the co-mapping mRNA and protein levels at a near single-cell resolution in fresh or frozen formaldehyde-fixed tissue samples. DBiT-seq utilizes next generation sequencing (NGS) and microfluidics. This method allows for simultaneous spatial transcriptomic and proteomic analysis of a tissue sample. DBiT-seq improves upon previous spatial transcriptomics applications such as High-Definition Spatial Transcriptomics (HDST) and Slide-seq by increasing the number of detectable genes per pixel, increased cellular resolution, and ease of implementation.
Genome editing of synthetic target arrays for lineage tracing (GESTALT) is a method used to determine the developmental lineages of cells in multicellular systems. GESTALT involves introducing a small DNA barcode that contains regularly spaced CRISPR/Cas9 target sites into the genomes of progenitor cells. Alongside the barcode, Cas9 and sgRNA are introduced into the cells. Mutations in the barcode accumulate during the course of cell divisions and the unique combination of mutations in a cell's barcode can be determined by DNA or RNA sequencing to link it to a developmental lineage.
Bulk RNA barcoding and sequencing (BRB-seq) is an ultra-high-throughput bulk 3' mRNA-seq technology that uses early-stage sample barcoding and unique molecular identifiers (UMIs) to allow the pooling of up to 384 samples in one tube early in the sequencing library preparation workflow. The transcriptomic technology is compatible with both Illumina and MGI short-read sequencing instruments.
3' mRNA-seq is a quantitative, genome-wide transcriptomic technique based on the barcoding of the 3' untranslated region (UTR) of mRNA molecules. Unlike standard bulk RNA-seq, where short sequencing reads are generated along the entire length of mRNA transcripts, only the 3' end of polyadenylated RNAs are sequenced in 3' mRNA-seq. This approach results in a need for fewer reads to quantify the expression of a gene and reduces the sequencing depth required per sample while providing robust and reliable transcriptome-wide read-outs of gene expression levels comparable to full-length RNA-seq methods.