Second Generation Multiplex Plus

Last updated

Second Generation Multiplex Plus (SGM Plus), is a DNA profiling system developed by Applied Biosystems. It is an updated version of Second Generation Multiplex. SGM Plus has been used by the UK National DNA Database since 1998.

Contents

An SGM Plus profile consists of a list of 10 number pairs, one number pair for each of 10 genetic markers, along with two letters (XX or XY) which show the result of the Amelogenin sex indicating test. Each number pair denotes the two allele values for the marker - one value is inherited from each of the subject's parents. If both alleles are the same, then only a single number, rather than a pair, is recorded.

Genetic markers

The genetic markers (or loci) used by SGM Plus are all short tandem repeats (STRs). The markers used are: VWA, D8S1179, D21S11, D18S51, TH01, FGA, D3S1358, D16S539, D2S1338 and D19S433. Where a marker's designation begins with D, the digits immediately following the D indicate the chromosome that contains the marker. For example, D21S11 is on chromosome 21. SGM Plus also uses the amelogenin (amelo) sex-indicating test.

SGM Plus differs from SGM in that SGM does not use the markers D3S1358, D16S539, D2S1338 and D19S433.

SGM Plus has eight markers in common with CODIS FGA, TH01, VWA, D3S1358, D8S1179, D16S539, D18S51, and D21S11. It differs from CODIS in that it uses the additional markers D2S1338 and D19S433 and does not use the five markers CSF1PO, TPOX, D5S818, D7S820, D13S317. [1]

Characteristics of alleles observed in the SGM Plus loci [1]
Locus
designation
Chromosome
location
Common sequence motifAllele
range
Size range
(bp)
Dye label
FGA4q28(TTTC)3TTTT TTCT (CTTT)n CTCC (TTCC)212.2-51.2215–353NED
TH0111p15.5(AATG)n3-14165–204NED
VWA12p12-pterTCTA(TCTG)3-4(TCTA)n10-25157–2095-FAM
D2S13382q35–37.1(TGCC)n(TTCC)n15-28289–3415-FAM
D3S13583pTCTA (TCTG)1-3 (TCTA)n8-21114–1425-FAM
D8S11798(TCTR)n7-20128–172JOE
D16S53916q24-qter(AGAT)n5-16234–2745-FAM
D18S5118q21.3(AGAA)n7-39.226–345JOE
D19S43319q12–13.1(AAGG)(AAAG)(AAGG)(TAGG)(AAGG)n9-17.2106–140NED
D21S1121q11.2–q21(TCTA)n(TCTG)n[(TCTA)3TA(TCTA)3TCA (TCTA)2TCCA TA] (TCTA)n12-41.2187–243JOE
AmelogeninX: p22.1–22.3
Y: p11.2
107 113JOE

Dye tags

The primers are tagged with the following fluorescent dyes for detection under electrophoresis:

The primers for each locus are arranged on the dyes in the following order, from low molecular weight to large molecular weight:

The dyes to which each primer is attached differ from those of the original SGM DNA profiling system.

Example SGM Plus profile

The SGM Plus profile of subject GT36865 from a National Institute of Standards and Technology paper is given below: [2]

SGM Plus profile of subject GT36865 [3]
LocusAllele values
FGA22,22
TH016,7
VWA14,16
D2S133819,24
D3S135817,17
D8S117913,14
D16S5399,13
D18S5113,16
D19S43314,15
D21S1130,30
AmelogeninXX

An SGM Plus profile retrieved from a DNA database would just list the allele values: [4]

15,18; 6,9; 11,13; 22,22; 31,32.2; 14,17; 17,20; 11,12; 13,16.3; 15,16; XY

Each value is the number of tandem repeats within the allele. A non-standard repeat is designated by the number of complete repeat units and the number of base pairs of the partial repeat, separated by a decimal point.

Probability of Identity

The probability of identify (also known as the random match probability) is the probability that two individuals selected at random will have an identical genetic profile.

Applied Biosystems estimates the probability of identity for SGM Plus to be approximately 1 in 13 trillion for African-Americans and 1 in 3.3 trillion Caucasian Americans. [5]

The Human Genetics Commission has reported that the random match probability is in the region of 1 in a trillion. However it stated "When the SGM Plus profiling system was first introduced, there was agreement within the scientific community that identifications with match probabilities lower than one in a billion would not be quoted in the courts of law, so as to avoid overstating the value of the DNA evidence to take into account that match probabilities are only estimates, and to make sure that the figure used was one that was meaningful to non-specialists." [6]

The UK Crown Prosecution Service states "SGM Plus DNA profiling is very discriminating between individuals. The probability of obtaining a match between the profiles of two unrelated individuals by chance is very low, of the order of 1 in a billion. However, it has not yet been possible to carry out the required statistical testing to be able to quote this match probability, and in practice a more conservative chance match figure of 1 in 1,000 million is used." [7]

See also

Related Research Articles

In molecular biology, restriction fragment length polymorphism (RFLP) is a technique that exploits variations in homologous DNA sequences, known as polymorphisms, populations, or species or to pinpoint the locations of genes within a sequence. The term may refer to a polymorphism itself, as detected through the differing locations of restriction enzyme sites, or to a related laboratory technique by which such differences can be illustrated. In RFLP analysis, a DNA sample is digested into fragments by one or more restriction enzymes, and the resulting restriction fragments are then separated by gel electrophoresis according to their size.

A microsatellite is a tract of repetitive DNA in which certain DNA motifs are repeated, typically 5–50 times. Microsatellites occur at thousands of locations within an organism's genome. They have a higher mutation rate than other areas of DNA leading to high genetic diversity. Microsatellites are often referred to as short tandem repeats (STRs) by forensic geneticists and in genetic genealogy, or as simple sequence repeats (SSRs) by plant geneticists.

<span class="mw-page-title-main">DNA profiling</span> Technique used to identify individuals via DNA characteristics

DNA profiling is the process of determining an individual's deoxyribonucleic acid (DNA) characteristics. DNA analysis intended to identify a species, rather than an individual, is called DNA barcoding.

An STR multiplex system is used to identify specific short tandem repeats (STRs). STR polymorphisms are genetic markers that may be used to identify a DNA sequence.

A minisatellite is a tract of repetitive DNA in which certain DNA motifs are typically repeated two to several hundred times. Minisatellites occur at more than 1,000 locations in the human genome and they are notable for their high mutation rate and high diversity in the population. Minisatellites are prominent in the centromeres and telomeres of chromosomes, the latter protecting the chromosomes from damage. The name "satellite" refers to the early observation that centrifugation of genomic DNA in a test tube separates a prominent layer of bulk DNA from accompanying "satellite" layers of repetitive DNA. Minisatellites are small sequences of DNA that do not encode proteins but appear throughout the genome hundreds of times, with many repeated copies lying next to each other.

<span class="mw-page-title-main">Haplotype</span> Group of genes from one parent

A haplotype is a group of alleles in an organism that are inherited together from a single parent.

<span class="mw-page-title-main">Variable number tandem repeat</span>

A variable number tandem repeat is a location in a genome where a short nucleotide sequence is organized as a tandem repeat. These can be found on many chromosomes, and often show variations in length among individuals. Each variant acts as an inherited allele, allowing them to be used for personal or parental identification. Their analysis is useful in genetics and biology research, forensics, and DNA fingerprinting.

A genetic marker is a gene or DNA sequence with a known location on a chromosome that can be used to identify individuals or species. It can be described as a variation that can be observed. A genetic marker may be a short DNA sequence, such as a sequence surrounding a single base-pair change, or a long one, like minisatellites.

Amelogenins are a group of protein isoforms produced by alternative splicing or proteolysis from the AMELX gene, on the X chromosome, and also the AMELY gene in males, on the Y chromosome. They are involved in amelogenesis, the development of enamel. Amelogenins are type of extracellular matrix protein, which, together with ameloblastins, enamelins and tuftelins, direct the mineralization of enamel to form a highly organized matrix of rods, interrod crystal and proteins.

A Y-STR is a short tandem repeat (STR) on the Y-chromosome. Y-STRs are often used in forensics, paternity, and genealogical DNA testing. Y-STRs are taken specifically from the male Y chromosome. These Y-STRs provide a weaker analysis than autosomal STRs because the Y chromosome is only found in males, which are only passed down by the father, making the Y chromosome in any paternal line practically identical. This causes a significantly smaller amount of distinction between Y-STR samples. Autosomal STRs provide a much stronger analytical power because of the random matching that occurs between pairs of chromosomes during the zygote making process.

Second Generation Multiplex is a DNA profiling system used in the United Kingdom to set up the UK National DNA Database in 1995. It is manufactured by ABI.

A surname DNA project is a genetic genealogy project which uses genealogical DNA tests to trace male lineage.

<span class="mw-page-title-main">STR analysis</span> Biological DNA analysis for allele repeats

Shorttandemrepeat (STR) analysis is a common molecular biology method used to compare allele repeats at specific loci in DNA between two or more samples. A short tandem repeat is a microsatellite with repeat units that are 2 to 7 base pairs in length, with the number of repeats varying among individuals, making STRs effective for human identification purposes. This method differs from restriction fragment length polymorphism analysis (RFLP) since STR analysis does not cut the DNA with restriction enzymes. Instead, polymerase chain reaction (PCR) is employed to discover the lengths of the short tandem repeats based on the length of the PCR product.

In genetic genealogy, a unique-event polymorphism (UEP) is a genetic marker that corresponds to a mutation that is likely to occur so infrequently that it is believed overwhelmingly probable that all the individuals who share the marker, worldwide, will have inherited it from the same common ancestor, and the same single mutation event.

A DNA database or DNA databank is a database of DNA profiles which can be used in the analysis of genetic diseases, genetic fingerprinting for criminology, or genetic genealogy. DNA databases may be public or private, the largest ones being national DNA databases.

In paternity testing, Paternity Index (PI) is a calculated value generated for a single genetic marker or locus and is associated with the statistical strength or weight of that locus in favor of or against parentage given the phenotypes of the tested participants and the inheritance scenario. Phenotype typically refers to physical characteristics such as body plan, color, behavior, etc. in organisms. However, the term used in the area of DNA paternity testing refers to what is observed directly in the laboratory. Laboratories involved in parentage testing and other fields of human identity employ genetic testing panels that contain a battery of loci each of which is selected due to extensive allelic variations within and between populations. These genetic variations are not assumed to bestow physical and/or behavioral attributes to the person carrying the allelic arrangement(s) and therefore are not subject to selective pressure and follow Hardy Weinberg inheritance patterns.

<span class="mw-page-title-main">Combined DNA Index System</span> United States national DNA database

The Combined DNA Index System (CODIS) is the United States national DNA database created and maintained by the Federal Bureau of Investigation. CODIS consists of three levels of information; Local DNA Index Systems (LDIS) where DNA profiles originate, State DNA Index Systems (SDIS) which allows for laboratories within states to share information, and the National DNA Index System (NDIS) which allows states to compare DNA information with one another.

<span class="mw-page-title-main">Forensic DNA analysis</span>

DNA profiling is the determination of a DNA profile for legal and investigative purposes. DNA analysis methods have changed countless times over the years as technology changes and allows for more information to be determined with less starting material. Modern DNA analysis is based on the statistical calculation of the rarity of the produced profile within a population.

This glossary of genetics is a list of definitions of terms and concepts commonly used in the study of genetics and related disciplines in biology, including molecular biology, cell biology, and evolutionary biology. It is split across two articles:

References

  1. 1 2 Core STR Review cstl.nist.gov [ dead link ]
  2. "For the Record" (PDF). NIST. 17 October 2019.
  3. JFS 2003 ID Results cstl.nist.gov
  4. "NPIA: Basic Facts - FAQs". Archived from the original on 2010-03-03. Retrieved 2010-03-02.
  5. AmpFlSTR SGM Plus PCR Amplification Kit User's Manual (PDF). pp. 14–12.
  6. Human Genetics Commission. Nothing to hide, nothing to fear? p.49 (PDF) (Report).
  7. The Crown Prosecution Service. "B4. Adventitious (chance) DNA Matches".