Physical mapping

Last updated

Physical map is a technique used in molecular biology to find the order and physical distance between DNA base pairs by DNA markers. [1] It is one of the gene mapping techniques which can determine the sequence of DNA base pairs with high accuracy. Genetic mapping, another approach of gene mapping, can provide markers needed for the physical mapping. However, as the former deduces the relative gene position by recombination frequencies, it is less accurate than the latter.

Contents

Physical mapping uses DNA fragments and DNA markers to assemble larger DNA pieces. With the overlapping regions of the fragments, [2] researchers can deduce the positions of the DNA bases. There are different techniques to visualize the gene location, including somatic cell hybridization, radiation hybridization and in situ hybridization. [3]

The different approaches to physical mapping are available for analyzing different sizes of genome and achieving different levels of accuracy. Low- and high-resolution mapping are two classes for various resolution of genome, particularly for the investigation of chromosomes. [4] The three basic varieties of physical mapping are fluorescent in situ hybridization (FISH), restriction site mapping and sequencing by clones. [5]

The goal of physical mapping, as a common mechanism under genomic analysis, is to obtain a complete genome sequence in order to deduce any association between the target DNA sequence and phenotypic traits. [6] If the actual positions of genes which control certain phenotypes are known, it is possible to resolve genetic diseases by providing advice on prevention and developing new treatments. [5]

Low-resolution mapping

Low-resolution physical mapping is typically capable of resolving DNA ranging from one base pair to several mega bases. In this category, most mapping methods involve generating a somatic cell hybrid panel, which is able to map any human DNA sequences, the gene of interest[ clarification needed ], to specific chromosomes of animal cells, such as those of mice and hamsters. [4] The hybrid cell panel is produced by collecting hybrid cell lines containing human chromosomes, identified by polymerase chain reaction (PCR) screening with primers specific to the human sequence of interest as the hybridization probe. The human chromosome would be presented[ clarification needed ] in all of the cell lines.

There are different approaches to producing low-resolution physical mapping, including chromosome-mediated gene transfer and irradiation fusion gene transfer which generate the hybrid cell panel. Chromosome-mediated gene transfer is a process that coprecipitates human chromosome fragments with calcium phosphate onto the cell line, leading to a stable transformation of recipient chromosomes retaining human chromosomes ranging in size from 1 to 50 mega base pairs. [4] Irradiation fusion gene transfer produces radiation hybrids which contain the human sequence of interest and a random set of other human chromosome fragments. Markers from fragments of human chromosome in radiation hybrids give cross-reactivity patterns, which are further analyzed to generate a radiation hybrid map by ordering the markers and breakpoints. [5] This provides evidence on whether the markers are located on the same human chromosome fragment, and hence the order of gene sequence.

High-resolution mapping

High-resolution physical mapping could resolve hundreds of kilobases to a single nucleotide of DNA. [4] A major technique to map such large DNA regions is high resolution FISH mapping, which could be achieved by the hybridization of probes to extended interphase chromosomes or artificially extended chromatin. Since their hierarchic structure is less condensed comparing to prometaphase and metaphase chromosomes, the standard in situ hybridization target, a high resolution of physical mapping could be produced. [5]

FISH mapping using interphase chromosome is a conventional in situ method to map DNA sequences from 50 to 500 kilobases, which are mainly syntenic DNA clones. However, naturally extended chromosomes might be folded back and produces alternative physical map orders. As a result, statistical analysis is necessary to generate the accurate map order of interphase chromosomes. [4]

If artificially stretched chromatin is used instead, mapping resolutions could be over 700 kilobases. In order to produce extended chromosomes on a slide, direct visual hybridization (DIRVISH) is often carried out, that cells are lysed by detergent to allow DNA released into the solution to flow to the other end of the slide. An example of high resolution FISH mapping using stretched chromatin is extended chromatin fiber (ECF) FISH. The method suggests the order of desired regions on the DNA sequence by analyzing the partial overlaps and gaps between yeast artificial chromosomes (YACs). [4] Eventually, the linear sequence of the interested DNA regions could be determined. One more to note is that if metaphase chromosome is used in FISH mapping, the resolution resulted will be very poor, which is to be classified to low-resolution mapping rather than a high-resolution mapping. [5]

Restriction site mapping

Restriction mapping is a top-down strategy that divides a chromosome target into finer regions. [7] Restriction enzymes are used to digest a chromosome and produce an ordered set of DNA fragments. It involves genomic fragments of the target rather than cloned fragments in the library. [8] They will be pinned to probes from the genomic library that are chosen randomly for detection purpose. The lengths of the fragments are measured by electrophoresis, which can be used to deduce their distance along the map according to the restriction site, the markers of a physical map. [8] The progress involves combinatorial algorithms. [9]

During the progress, a chromosome is obtained from a hybrid cell and cut at rare restriction site to produce large fragments. The fragments will be separated by size and undergo hybridization, forming the macrorestriction map and different contiguous blocks (i.e. contigs). To ensure the fragments are linked, linking clones with the same rare cutting sites at the large fragments can be used.

After producing the low-resolution map, the fragments can be cut into smaller sections by restriction nucleases for further analysis to produce a map with higher resolution. PFG fractionation can be used for separation and purification of the fragments generated for small genome.

Through different digestion approaches, different types of DNA fragments are produced. The variation in types of fragments might affect the calculation result.

Double digestion

This technique uses two restriction enzymes and a combination of the two enzymes for digestion separately. [10] It assumes that complete digestion occurs at each restriction site. The lengths of the DNA fragments are measured and used for ordering of fragments by computation. This approach has easier experimental handling, but more difficult in terms of the combinatorial problem required for mapping.

Partial digestion

This technique uses one restriction enzyme to digest the desired DNA in separated experiments with different durations of exposure. [10] The extent of digestion for the fragments differs. DNA methylation is a technique that prevents the reaction from being completed at cutting sites. This method must be done more carefully, but its mathematical problem can be easily solved by exponential algorithm.

Sequencing by clones

Using clones to generate a physical map is a bottom-up approach with fairly high resolution. [8] It uses the existing cloned fragments in genomic libraries to form contigs. Through cloning the partially digested fragments generated by bacterial transformation, the immortal clones with overlapping regions of the genome, which will be examined by fingerprinting methods and stored in the libraries, are produced. [11] During sequencing process, the clones are randomly selected and placed on a set of microtitre plates randomly. They will be fingerprinted by different methods. To ensure there is a minimum set of clones that form one config for a genome (i.e. tiling path), the library used will have five to ten times redundancy. However, such techniques might produce unknown gaps in the map produced or result in saturation in clones eventually.

Application

Physical mapping is a technique to complete the sequencing of a genome. Ongoing projects that determine DNA base pair sequences, namely the Human Genome Project, give knowledge on the order of nucleotide and allow further investigation to answer genetic questions, particularly the association between the target sequence and the development of traits. From the individual DNA sequence isolated and mapped in physical mapping, it could provide information on the transcription and translation process during development of organisms, hence identifying the specific function of the gene and associated traits produced. [6] As a result of understanding the expression and regulation of the genes, potential new treatments can be developed to alter protein expression patterns in specific tissues. Moreover, if the location and sequence of disease genes are identified, medical advice can be given to potential patients who are the carrier the disease gene, with reference to the knowledge of the gene function and products. [5]

Related Research Articles

<span class="mw-page-title-main">Yeast artificial chromosome</span> Genetically engineered chromosome derived from the DNA of yeast

Yeast artificial chromosomes (YACs) are genetically engineered chromosomes derived from the DNA of the yeast, Saccharomyces cerevisiae, which is then ligated into a bacterial plasmid. By inserting large fragments of DNA, from 100–1000 kb, the inserted sequences can be cloned and physically mapped using a process called chromosome walking. This is the process that was initially used for the Human Genome Project, however due to stability issues, YACs were abandoned for the use of bacterial artificial chromosome

Comparative genomic hybridization (CGH) is a molecular cytogenetic method for analysing copy number variations (CNVs) relative to ploidy level in the DNA of a test sample compared to a reference sample, without the need for culturing cells. The aim of this technique is to quickly and efficiently compare two genomic DNA samples arising from two sources, which are most often closely related, because it is suspected that they contain differences in terms of either gains or losses of either whole chromosomes or subchromosomal regions. This technique was originally developed for the evaluation of the differences between the chromosomal complements of solid tumor and normal tissue, and has an improved resolution of 5–10 megabases compared to the more traditional cytogenetic analysis techniques of giemsa banding and fluorescence in situ hybridization (FISH) which are limited by the resolution of the microscope utilized.

Chromosome jumping is a tool of molecular biology that is used in the physical mapping of genomes. It is related to several other tools used for the same purpose, including chromosome walking.

Genetics, a discipline of biology, is the science of heredity and variation in living organisms.

Fluorescence <i>in situ</i> hybridization Genetic testing technique

Fluorescence in situ hybridization (FISH) is a molecular cytogenetic technique that uses fluorescent probes that bind to only particular parts of a nucleic acid sequence with a high degree of sequence complementarity. It was developed by biomedical researchers in the early 1980s to detect and localize the presence or absence of specific DNA sequences on chromosomes. Fluorescence microscopy can be used to find out where the fluorescent probe is bound to the chromosomes. FISH is often used for finding specific features in DNA for use in genetic counseling, medicine, and species identification. FISH can also be used to detect and localize specific RNA targets in cells, circulating tumor cells, and tissue samples. In this context, it can help define the spatial-temporal patterns of gene expression within cells and tissues.

<span class="mw-page-title-main">Gene mapping</span> Process of locating specific genes

Gene mapping or genome mapping describes the methods used to identify the location of a gene on a chromosome and the distances between genes. Gene mapping can also describe the distances between different sites within a gene.

A genomic library is a collection of overlapping DNA fragments that together make up the total genomic DNA of a single organism. The DNA is stored in a population of identical vectors, each containing a different insert of DNA. In order to construct a genomic library, the organism's DNA is extracted from cells and then digested with a restriction enzyme to cut the DNA into fragments of a specific size. The fragments are then inserted into the vector using DNA ligase. Next, the vector DNA can be taken up by a host organism - commonly a population of Escherichia coli or yeast - with each cell containing only one vector molecule. Using a host cell to carry the vector allows for easy amplification and retrieval of specific clones from the library for analysis.

<span class="mw-page-title-main">ChIP-on-chip</span> Molecular biology method

ChIP-on-chip is a technology that combines chromatin immunoprecipitation ('ChIP') with DNA microarray ("chip"). Like regular ChIP, ChIP-on-chip is used to investigate interactions between proteins and DNA in vivo. Specifically, it allows the identification of the cistrome, the sum of binding sites, for DNA-binding proteins on a genome-wide basis. Whole-genome analysis can be performed to determine the locations of binding sites for almost any protein of interest. As the name of the technique suggests, such proteins are generally those operating in the context of chromatin. The most prominent representatives of this class are transcription factors, replication-related proteins, like origin recognition complex protein (ORC), histones, their variants, and histone modifications.

<span class="mw-page-title-main">Chromosome conformation capture</span>

Chromosome conformation capture techniques are a set of molecular biology methods used to analyze the spatial organization of chromatin in a cell. These methods quantify the number of interactions between genomic loci that are nearby in 3-D space, but may be separated by many nucleotides in the linear genome. Such interactions may result from biological functions, such as promoter-enhancer interactions, or from random polymer looping, where undirected physical motion of chromatin causes loci to collide. Interaction frequencies may be analyzed directly, or they may be converted to distances and used to reconstruct 3-D structures.

The following outline is provided as an overview of and topical guide to genetics:

<span class="mw-page-title-main">Tiling array</span>

Tiling arrays are a subtype of microarray chips. Like traditional microarrays, they function by hybridizing labeled DNA or RNA target molecules to probes fixed onto a solid surface.

Epigenomics is the study of the complete set of epigenetic modifications on the genetic material of a cell, known as the epigenome. The field is analogous to genomics and proteomics, which are the study of the genome and proteome of a cell. Epigenetic modifications are reversible modifications on a cell's DNA or histones that affect gene expression without altering the DNA sequence. Epigenomic maintenance is a continuous process and plays an important role in stability of eukaryotic genomes by taking part in crucial biological mechanisms like DNA repair. Plant flavones are said to be inhibiting epigenomic marks that cause cancers. Two of the most characterized epigenetic modifications are DNA methylation and histone modification. Epigenetic modifications play an important role in gene expression and regulation, and are involved in numerous cellular processes such as in differentiation/development and tumorigenesis. The study of epigenetics on a global level has been made possible only recently through the adaptation of genomic high-throughput assays.

Diversity Arrays Technology (DArT) is a high-throughput genetic marker technique that can detect allelic variations to provide comprehensive genome coverage without any DNA sequence information for genotyping and other genetic analysis. The general steps involve reducing the complexity of the genomic DNA with specific restriction enzymes, choosing diverse fragments to serve as representations for the parent genomes, amplify via polymerase chain reaction (PCR), inserting fragments into a vector to be placed as probes within a microarray, and then fluorescent targets from a reference sequence will be allowed to hybridize with probes and put through an imaging system. The objective is to identify and quantify various forms of DNA polymorphism within genomic DNA of sampled species.

Paired-end tags (PET) are the short sequences at the 5’ and 3' ends of a DNA fragment which are unique enough that they (theoretically) exist together only once in a genome, therefore making the sequence of the DNA in between them available upon search or upon further sequencing. Paired-end tags (PET) exist in PET libraries with the intervening DNA absent, that is, a PET "represents" a larger fragment of genomic or cDNA by consisting of a short 5' linker sequence, a short 5' sequence tag, a short 3' sequence tag, and a short 3' linker sequence. It was shown conceptually that 13 base pairs are sufficient to map tags uniquely. However, longer sequences are more practical for mapping reads uniquely. The endonucleases used to produce PETs give longer tags but sequences of 50–100 base pairs would be optimal for both mapping and cost efficiency. After extracting the PETs from many DNA fragments, they are linked (concatenated) together for efficient sequencing. On average, 20–30 tags could be sequenced with the Sanger method, which has a longer read length. Since the tag sequences are short, individual PETs are well suited for next-generation sequencing that has short read lengths and higher throughput. The main advantages of PET sequencing are its reduced cost by sequencing only short fragments, detection of structural variants in the genome, and increased specificity when aligning back to the genome compared to single tags, which involves only one end of the DNA fragment.

<span class="mw-page-title-main">Chromatin immunoprecipitation</span> Genomic technique

Chromatin immunoprecipitation (ChIP) is a type of immunoprecipitation experimental technique used to investigate the interaction between proteins and DNA in the cell. It aims to determine whether specific proteins are associated with specific genomic regions, such as transcription factors on promoters or other DNA binding sites, and possibly define cistromes. ChIP also aims to determine the specific location in the genome that various histone modifications are associated with, indicating the target of the histone modifiers. ChIP is crucial for the advancements in the field of epigenomics and learning more about epigenetic phenomena.

Radiation hybrid mapping is a technique for mapping mammalian chromosomes.

<span class="mw-page-title-main">Jumping library</span>

Jumping libraries or junction-fragment libraries are collections of genomic DNA fragments generated by chromosome jumping. These libraries allow the analysis of large areas of the genome and overcome distance limitations in common cloning techniques. A jumping library clone is composed of two stretches of DNA that are usually located many kilobases away from each other. The stretch of DNA located between these two "ends" is deleted by a series of biochemical manipulations carried out at the start of this cloning technique.

<span class="mw-page-title-main">Single cell epigenomics</span> Study of epigenomics in individual cells by single cell sequencing

Single cell epigenomics is the study of epigenomics in individual cells by single cell sequencing. Since 2013, methods have been created including whole-genome single-cell bisulfite sequencing to measure DNA methylation, whole-genome ChIP-sequencing to measure histone modifications, whole-genome ATAC-seq to measure chromatin accessibility and chromosome conformation capture.

<span class="mw-page-title-main">Vectorette PCR</span>

Vectorette PCR is a variation of polymerase chain reaction (PCR) designed in 1988. The original PCR was created and also patented during the 1980s. Vectorette PCR was first noted and described in an article in 1990 by John H. Riley and his team. Since then, multiple variants of PCR have been created. Vectorette PCR focuses on amplifying a specific sequence obtained from an internal sequence that is originally known until the fragment end. Multiple researches have taken this method as an opportunity to conduct experiments in order to uncover the potential uses that can be derived from Vectorette PCR.

<span class="mw-page-title-main">Hi-C (genomic analysis technique)</span> Genomic analysis technique

Hi-C is a high-throughput genomic and epigenomic technique to capture chromatin conformation (3C). In general, Hi-C is considered as a derivative of a series of chromosome conformation capture technologies, including but not limited to 3C, 4C, and 5C. Hi-C comprehensively detects genome-wide chromatin interactions in the cell nucleus by combining 3C and next-generation sequencing (NGS) approaches and has been considered as a qualitative leap in C-technology development and the beginning of 3D genomics.

References

  1. Brown TA (2002). "Mapping Genomes". Genomes. Oxford: Wiley-Liss via NCBI.
  2. Alizadeh, F.; Karp, R. M.; Weisser, D. K.; Zweig, G. (1995). "Physical Mapping of Chromosomes Using Unique Probes". Journal of Computational Biology. 2 (2): 159–184. doi:10.1089/cmb.1995.2.159. PMID   7497125. S2CID   13628080.
  3. Griffiths AJ, Miller JH, Suzuki DT, et al. (2000). "Mapping human genes by using human–rodent somatic cell hybrids".{{cite journal}}: Cite journal requires |journal= (help)
  4. 1 2 3 4 5 6 Strachan, T.; Read, Andrew P. (1999). Human molecular genetics 2 (2nd ed.). Singapore: J. Wiley. ISBN   978-0471330615. OCLC   154500424.
  5. 1 2 3 4 5 6 Reilly, Cavan. (2009). Statistics in human genetics and molecular biology. Boca Raton: Taylor & Francis. ISBN   9781420072631. OCLC   318585618.
  6. 1 2 Liu, Ben-Hui (1998). Statistical genomics: linkage, mapping, and QTL analysis. Boca Raton: CRC Press. ISBN   978-0849331664. OCLC   36423381.
  7. Glen A. Evans. "Physical mapping of complex genomes".
  8. 1 2 3 Genomics : the science and technology behind the Human Genome Project. Wiley. pp. 234–284. ISBN   978-0-471-22056-5.
  9. FARID ALIZADEH; RICHARD M. KARP; DEBORAH K. WEISSER; GEOFFREY ZWEIG (1995). "Physical Mapping of Chromosomes Using Unique Probes". Journal of Computational Biology. 2 (2): 159–84. doi:10.1089/cmb.1995.2.159. PMID   7497125. S2CID   13628080.
  10. 1 2 Böckenhauer, Hans-Joachim; Bongartz, Dirk (2007). Algorithmic Aspects of Bioinformatics. Springer Berlin Heidelberg. pp. 123–169. ISBN   9783540719120.
  11. MICHAEL FONSTEIN AND ROBERT HASELKORN (June 1995). "Physical Mapping of Bacterial Genomes". Journal of Bacteriology. 177 (12): 3361–3369. doi:10.1128/jb.177.12.3361-3369.1995. PMC   177037 . PMID   7768844.