Combined DNA Index System

Last updated

CODIS logo CODIS logo.png
CODIS logo

The Combined DNA Index System (CODIS) is the United States national DNA database created and maintained by the Federal Bureau of Investigation. CODIS consists of three levels of information; Local DNA Index Systems (LDIS) where DNA profiles originate, State DNA Index Systems (SDIS) which allows for laboratories within states to share information, and the National DNA Index System (NDIS) which allows states to compare DNA information with one another.

Contents

The CODIS software contains multiple different databases depending on the type of information being searched against. Examples of these databases include, missing persons, convicted offenders, and forensic samples collected from crime scenes. Each state, and the federal system, has different laws for collection, upload, and analysis of information contained within their database. However, for privacy reasons, the CODIS database does not contain any personal identifying information, such as the name associated with the DNA profile. The uploading agency is notified of any hits to their samples and are tasked with the dissemination of personal information pursuant to their laws.

Establishment

The creation of a national DNA database within the U.S. was first mentioned by the Technical Working Group on DNA Analysis Methods (TWGDAM) in 1989. [1] The FBI's strategic goal was to maximize the voluntary participation of states and avoid what happened several years early, when eight western states frustrated with the progress creating a national Automated Fingerprint Identification System (AFIS) network formed the Western Identification Network (WIN). [2] The FBI's strategy to discourage states from creating systems that competed with CODIS was to develop DNA databasing software and provide it free of charge to state and local crime laboratories.This strategic decision--to provide software free of charge for the purpose of gaining market share--was innovative at that time and predated the browser wars. In 1990, the FBI began a pilot DNA databasing program with 14 state and local laboratories. [3]

In 1994, Congress passed the DNA Identification Act which authorized the FBI to create a national DNA database of convicted offenders as well as separate databases for missing persons and forensic samples collected from crime scenes. (Some in the Bureau believed the Act was not required to establish a national DNA database because the FBI's Criminal Justice Information Services Division was already using similar authorities to provide data-sharing solutions to federal, state, local, and tribal law enforcement agencies.) The DNA Identification Act also required that laboratories participating in the CODIS program maintain accreditation from an independent nonprofit organization that is actively involved in the forensic fields and that scientists processing DNA samples for submission into CODIS maintain proficiency and are routinely tested to ensure the quality of the profiles being uploaded into the database. [4] The national level of CODIS (NDIS) was implemented in October 1998. Today, all 50 states, the District of Columbia, federal law enforcement, the Army Laboratory, and Puerto Rico participate in the national sharing of DNA profiles. [5]

Database structure

The CODIS database contains several different indexes for the storage of DNA profile information. For assistance in criminal investigations three indexes exist: the offender index, which contains DNA profiles of those convicted of crimes; the arrestee index, which contains profiles of those arrested of crimes pursuant to the laws of the particular state; and the forensic index, which contains profiles collected from a crime scene. [6] Additional indexes, such as the unidentified human remain index, the missing persons index, and the biological relatives of missing persons index, are used to assist in identifying missing persons. [6] Specialty indexes also exist for other specimens that do not fall into the other categories. These indexes include the staff index, for profiles of employees who work with the samples, and the multi-allelic offender index, for single-source samples that have three or more alleles at two or more loci. [7]

Non-criminal indexes

While CODIS is generally used for linking crimes to other crimes and potentially to suspects there are non-criminal portions of the database such as the missing person indexes. The National Missing Person DNA Database, also known as CODIS(mp), is maintained by the FBI at the NDIS level of CODIS allowing all states to share information with one another. Created in 2000 using the existing CODIS infrastructure, this section of the database is designed to help identify human remains by collecting and storing DNA information on the missing or the relatives of missing individuals. [8] [9] Unidentified remains are processed for DNA by the University of North Texas Center for Human Identification which is funded by the National Institute of Justice. [10] Nuclear, Y-STR (for males only), and mitochondrial analysis can be performed on both unknown remains and on known relatives in order to maximize the chance of identifying remains. [8]

Statistics

As of September 2020, NDIS contained more than 14 million offender profiles, more than four million arrestee profiles and more than one million forensic profiles. [11] [12] The effectiveness of CODIS is measured by the number of investigations aided through database hits. As of September 2020, CODIS had aided in over 520 thousand investigations and produced more than 530 thousand hits. [11] [12] Each state has their own SDIS database and each state can set their own inclusionary standards that can be less strict than the national level. For this reason, a number of profiles that are present in state level databases are not in the national database and are not routinely searched across state lines. [13]

Scientific basis

The bulk of identifications using CODIS rely on short tandem repeats (STRs) that are scattered throughout the human genome and on statistics that are used to calculate the rarity of that specific profile in the population. [14] STRs are a type of copy-number variation and comprise a sequence of nucleotide base pairs that is repeated over and over again. At each location tested during DNA analysis, also known as a locus (plural loci), a person has two sets of repeats, one from the father and one from the mother. Each set is measured and the number of repeat copies is recorded. [15] If both strands, inherited from the parents, contain the same number of repeats at that locus the person is said to be homozygous at that locus. If the repeat numbers differ they are said to be heterozygous. Every possible difference at a locus is an allele. [16] This repeat determination is performed across a number of loci and the repeat values is the DNA profile that is uploaded to CODIS. As of January 1, 2017, requirements for upload to national level for known offender profiles is 20 loci. [5]

Alternatively, CODIS allows for the upload of mitochondrial DNA (mtDNA) information into the missing persons indexes. Since mtDNA is passed down from mother to offspring it can be used to link remains to still living relatives who have the same mtDNA. [5]

Loci

The original 13 core loci and their locations on the genome plus the sex determining locus amelogenin (AMEL) Codis profile.jpg
The original 13 core loci and their locations on the genome plus the sex determining locus amelogenin (AMEL)

Prior to January 1, 2017, the national level of CODIS required that known offender profiles have a set of 13 loci called the "CODIS core". Since then, the requirement has expanded to include seven additional loci. Partial profiles are also allowed in CODIS in separate indexes and are common in crime scene samples that are degraded or are mixtures of multiple individuals. Upload of these profiles to the national level of CODIS requires at least eight of the core loci to be present as well as a profile rarity of 1 in 10 million (calculated using population statistics). [5]

Loci that fall within a gene are named after the gene. For example, TPOX, is named after the human thyroid peroxidase gene. [17] Loci that do not fall within genes are given a standard naming scheme for uniformity. These loci are named D + the chromosome the locus is on + S + the order in which the location on that chromosome was described. For example, D3S1358 is on the third chromosome and is the 1358th location described. [18] The CODIS core are listed below; loci with asterisks are the new core and were added to the list in January 2017. [19]

  • CSF1PO
  • D3S1358
  • D5S818
  • D7S820
  • D8S1179
  • D13S317
  • D16S539
  • D18S51
  • D21S11
  • FGA
  • TH01
  • TPOX
  • vWA
  • D1S1656*
  • D2S441*
  • D2S1338*
  • D10S1248*
  • D12S391*
  • D19S433*
  • D22S1045*

The loci used in CODIS were chosen because they are in regions of noncoding DNA, sections that do not code for proteins. These sections should not be able to tell investigators any additional information about the person such as their hair or eye color, or their race. [20] However, new advancements in the understanding of genetic markers and ancestry have indicated that the CODIS loci may contain phenotypic information. [21] [22]

International use

While the U.S. database is not directly connected to any other country, the underlying CODIS software is used by other agencies around the world. As of April 2016, the CODIS software is used by 90 international laboratories in 50 countries. [3] [21] International police agencies that want to search the U.S. database can submit a request to the FBI for review. If the request is reasonable and the profile being searched would meet inclusionary standards for a U.S. profile, such as number of loci, the request can be searched at the national level or forwarded to any states where reasonable suspicion exists that they may be present in that level of the database. [5]

Controversies

Arrestee collection

Current laws governing DNA collection as of 2023.

Collection upon conviction only
Collection from some felony arrests
Collection from all felony arrests CODIS arrestee collections.svg
Current laws governing DNA collection as of 2023.

     Collection upon conviction only
     Collection from some felony arrests
     Collection from all felony arrests

The original purpose of the CODIS database was to build upon the sex offender registry through the DNA collection of convicted sex offenders. [23] Over time, that has expanded. Currently, all 50 states collect DNA from those convicted of felonies. A number of states also collect samples from juveniles as well as those who are arrested, but not yet convicted, of a crime. [12] [24] Note that even in states which limit collection of DNA retained in the state database only to those convicted of a crime, local databases, such as the forensic laboratory operated by New York City's Office of Chief Medical Examiner, may collect DNA samples of arrestees who have not been convicted. The collection of arrestee samples raised constitutional issues, specifically the Fourth Amendment prohibiting unreasonable search and seizure. It was argued that the collection of DNA from those that were not convicted of a crime, without an explicit order to collect, was considered a warrantless search and therefore unlawful. [25] In 2013, the United States Supreme Court ruled in Maryland v. King that the collection of DNA from those arrested for a crime, but not yet convicted, is part of the police booking procedure and is reasonable when that collection is used for identification purposes. [26]

Familial searching

The inheritance pattern of some DNA means that close relatives share a higher percentage of alleles between each other than with other, random, members of society. [27] This allows for the searching of close matches within CODIS when an exact match is not found. By focusing on close matches, investigators can potentially find a close relative whose profile is in CODIS narrowing their search to one specific family. Familial searching has led to several convictions after the exhaustion of all other leads including the Grim Sleeper serial killer. [28] This practice also raised Fourth Amendment challenges as the individual who ends up being charged with a crime was only implicated because someone else's DNA was in the CODIS database. [29] As of April 2018, twelve states have approved the use of familial searching in CODIS. [30]

See also

Related Research Articles

<span class="mw-page-title-main">DNA profiling</span> Technique used to identify individuals via DNA characteristics

DNA profiling is the process of determining an individual's deoxyribonucleic acid (DNA) characteristics. DNA analysis intended to identify a species, rather than an individual, is called DNA barcoding.

<span class="mw-page-title-main">Forensic science</span> Application of science to criminal and civil laws

Forensic science, also known as criminalistics, is the application of science principles and methods to support legal decision-making in matters of criminal and civil law.

Forensic identification is the application of forensic science, or "forensics", and technology to identify specific objects from the trace evidence they leave, often at a crime scene or the scene of an accident. Forensic means "for the courts".

The United Kingdom National DNA Database is a national DNA Database that was set up in 1995. In 2005 it had 3.1 million profiles and in 2020 it had 6.6 million profiles. 270,000 samples were added to the database in 2019–20, populated by samples recovered from crime scenes and taken from police suspects. 124,000 were deleted for those not charged or not found guilty. There were 731,000 matches of unsolved crimes between 2001 and 2020.

<span class="mw-page-title-main">Promega</span>

Promega Corporation is a Madison, Wisconsin–based manufacturer of enzymes and other products for biotechnology and molecular biology with a portfolio covering the fields of genomics, protein analysis and expression, cellular analysis, drug discovery, and genetic identity.

<span class="mw-page-title-main">Forensic biology</span> Forensic application of the study of biology

Forensic biology is the use of biological principles and techniques in the context of law enforcement investigations.

<span class="mw-page-title-main">STR analysis</span> Biological DNA analysis for allele repeats

Shorttandemrepeat (STR) analysis is a common molecular biology method used to compare allele repeats at specific loci in DNA between two or more samples. A short tandem repeat is a microsatellite with repeat units that are 2 to 7 base pairs in length, with the number of repeats varying among individuals, making STRs effective for human identification purposes. This method differs from restriction fragment length polymorphism analysis (RFLP) since STR analysis does not cut the DNA with restriction enzymes. Instead, polymerase chain reaction (PCR) is employed to discover the lengths of the short tandem repeats based on the length of the PCR product.

<span class="mw-page-title-main">DNA Analysis Backlog Elimination Act of 2000</span> United States Act of Congress

The DNA Analysis Backlog Elimination Act of 2000 is a United States Act of Congress that primarily allows US states to carry out DNA analyses for use in the FBI's Combined DNA Index System and to collect and analyse DNA samples.

A government database collects information for various reasons, including climate monitoring, securities law compliance, geological surveys, patent applications and grants, surveillance, national security, border control, law enforcement, public health, voter registration, vehicle registration, social security, and statistics.

A DNA database or DNA databank is a database of DNA profiles which can be used in the analysis of genetic diseases, genetic fingerprinting for criminology, or genetic genealogy. DNA databases may be public or private, the largest ones being national DNA databases.

<span class="mw-page-title-main">Debbie Smith Act</span> United States federal criminal legislation

The Debbie Smith Act of 2004 provides United States federal government grants to eligible states and units of local government to conduct DNA analyses of backlogged DNA samples collected from victims of crimes and criminal offenders. The Act expands the Combined DNA Index System (CODIS) and provides legal assistance to survivors of dating violence. Named after sexual assault survivor Debbie Smith, the Act was passed by the 108th Congress as part of larger legislation, the Justice for All Act of 2004, and signed into law by President George W. Bush on October 30, 2004. The Act amended the DNA Analysis Backlog Elimination Act of 2000, the DNA Identification Act of 1994, the Violence Against Women Act of 2000, and the Uniform Code of Military Justice. The Act was reauthorized in 2008, extending the availability of DNA backlog reduction program grants, DNA evidence training and education program grants, and sexual assault forensic exam program grants through fiscal year 2014.

In paternity testing, Paternity Index (PI) is a calculated value generated for a single genetic marker or locus and is associated with the statistical strength or weight of that locus in favor of or against parentage given the phenotypes of the tested participants and the inheritance scenario. Phenotype typically refers to physical characteristics such as body plan, color, behavior, etc. in organisms. However, the term used in the area of DNA paternity testing refers to what is observed directly in the laboratory. Laboratories involved in parentage testing and other fields of human identity employ genetic testing panels that contain a battery of loci each of which is selected due to extensive allelic variations within and between populations. These genetic variations are not assumed to bestow physical and/or behavioral attributes to the person carrying the allelic arrangement(s) and therefore are not subject to selective pressure and follow Hardy Weinberg inheritance patterns.

The National Missing and Unidentified Persons System (NamUs) is a national clearinghouse and resource center for missing, unidentified, and unclaimed person cases throughout the United States.

The University of North Texas Center for Human Identification is a national resource for the identification of missing and unidentified persons at the University of North Texas Health Science Center. UNTCHI combines the services of a forensic DNA laboratory, a forensic anthropology laboratory, and the Forensic Services Unit (FSU). The center operates the National Missing and Unidentified Persons System (NamUs) and Project EDAN which helps identify human remains.

Rapid DNA describes the fully automated process of developing a CODIS Core STR profile or other STR profile from a reference sample buccal swab. The “swab in – profile out” process consists of automated extraction, amplification, separation, detection and allele calling without human intervention. A machine designed to perform such rapid DNA analysis is called a DNA "magic box" by enforcement authorities.

Criminal records in the United States contain records of arrests, criminal charges and the disposition of those charges. Criminal records are compiled and updated on local, state, and federal levels by government agencies, most often law enforcement agencies. Their primary purpose is to present a comprehensive criminal history for a specific individual.

Maryland v. King, 569 U.S. 435 (2013), was a decision of the United States Supreme Court which held that a cheek swab of an arrestee's DNA is comparable to fingerprinting and therefore, a legal police booking procedure that is reasonable under the Fourth Amendment.

<span class="mw-page-title-main">National Forensic DNA Database of South Africa</span>

The National Forensic DNA Database of South Africa (NFDD) is a national DNA database used in law enforcement in South Africa. The Criminal Law Amendment Act No. 37 of 2013 provides for the expansion and administration of such a database in South Africa, enabling the South African Police Service (SAPS) to match forensic DNA profiles derived from samples collected at crime scenes with forensic DNA profiles of offenders convicted of, and suspects arrested for, offences listed in a new Schedule 8 of the amended Criminal Procedure Act of 1977.

<span class="mw-page-title-main">Investigative genetic genealogy</span> Application of genealogy in a legal setting

Investigative genetic genealogy, also known as forensic genetic genealogy, is the emerging practice of utilizing genetic information from direct-to-consumer companies for identifying suspects or victims in criminal cases. As of December 2023, the use of this technology has solved a total of 651 criminal cases, including 318 individual perpetrators who were brought to light. There have also been 464 decedents identified, as well as 4 living does. The investigative power of genetic genealogy revolves around the use of publicly accessible genealogy databases such as GEDMatch and FamilyTreeDNA. On GEDMatch, users are able to upload their genetic data from any direct-to-consumer company in an effort to identify relatives that have tested at companies other than their own.

<span class="mw-page-title-main">Forensic DNA analysis</span> Genetic analyses in crime analysis

DNA profiling is the determination of a DNA profile for legal and investigative purposes. DNA analysis methods have changed countless times over the years as technology changes and allows for more information to be determined with less starting material. Modern DNA analysis is based on the statistical calculation of the rarity of the produced profile within a population.

References

  1. Budowle, Bruce; Moretti, Tamyra R.; Niezgoda, Stephen J.; Brown, Barry L. "CODIS and PCR-Based Short Tandem Repeat Loci: Law Enforcement Tools" (PDF). Promega. Retrieved October 8, 2016.
  2. "What WIN Is". Western Identification Network. Retrieved September 27, 2023.
  3. 1 2 "Combined DNA Index System (CODIS)". Federal Bureau of Investigation. Retrieved May 8, 2018.
  4. 42 U.S.C.   § 14132 : Index to facilitate law enforcement exchange of DNA identification information
  5. 1 2 3 4 5 "Frequently Asked Questions on CODIS and NDIS". Federal Bureau of Investigation. Retrieved March 6, 2017.
  6. 1 2 "Data and Communication Flow in CODIS" (PDF). Sexual Assault Kit Initiative. Retrieved April 9, 2017.
  7. "Procedure for CODIS". 10. North Carolina Department of Justice. December 28, 2015. Retrieved May 13, 2017.
  8. 1 2 "Missing Person Comparison Request". Federal Bureau of Investigation. Retrieved May 8, 2018.
  9. Ritter, Nancy. "Missing Person and Unidentified Remains: The Nation's Silent Mass Disaster" (PDF). NIJ Journal. National Institute of Justice (256). Retrieved May 8, 2018.
  10. "Family DNA Collection Protocol" (PDF). National Missing and Unidentified Persons System. Retrieved May 8, 2018.
  11. 1 2 "CODIS – NDIS Statistics as of September 2020". Federal Bureau of Investigation. September 2020. Retrieved November 6, 2020.
  12. 1 2 3 "CODIS – NDIS Statistics". Federal Bureau of Investigation. Retrieved May 13, 2017.
  13. Goulka, Jeremiah; Matthies, Carl; Disley, Emma; Steinberg, Paul (2010). Toward a Comparison of DNA Profiling and Databases in the United States and England (PDF) (Report). RAND Corporation. p. 6. Retrieved May 14, 2017.
  14. "What are the 13 core CODIS loci?". The Biology Project. October 27, 2000. Retrieved March 6, 2017 via The University of Arizona.
  15. "Forensics: Fingering Criminals Using DNA". For Dummies. Retrieved March 6, 2017.
  16. "Genetics of STR Inheritance". The Biology Project. October 27, 2000. Retrieved March 7, 2017 via The University of Arizona.
  17. "TPOX". National Institute of Standards and Technology. Retrieved April 2, 2017.
  18. Butler, John M. (2005). Forensic DNA Typing: Biology, Technology, and Genetics of STR Markers (2nd ed.). Elsevier Academic Press. pp. 23–24. ISBN   9780080470610 . Retrieved April 2, 2017 via Google Books.
  19. "FBI CODIS Core STR Loci". National Institute of Standards and Technology. August 26, 2015. Retrieved December 29, 2017.
  20. Augenstein, Seth (March 27, 2017). "Germany Mulls DNA Database Identifiers of Hair, Eye Color, Ethnicity". Forensic Magazine. Retrieved April 1, 2017.
  21. 1 2 Augenstein, Seth (April 13, 2016). "Does CODIS Contain Untapped Ancestry Information?". Forensic Magazine. Retrieved April 1, 2017.
  22. Augenstein, Seth (May 15, 2017). "CODIS Has More ID Information Than Believed, Scientists Find". Forensic Magazine. Retrieved May 16, 2017.
  23. "Proposed Guidelines for the Jacob Wetterling Crimes Against Children and Sexually Violent Offender Registration Act, as Amended". Federal Register. Federal Register Online. 63 (118): 33696–33708. June 19, 1998. Retrieved May 21, 2017 via Government Publishing Office.
  24. Samuels, Julie E.; Dwyer, Allison M.; Halberstadt, Robin; Lachman, Pamela (2011). Collecting DNA from Juveniles (PDF) (Report). Urban Institute Justice Policy Center. Retrieved May 21, 2017.
  25. Grossman, Steven P. (2015). "Using the DNA Testing of Arrestees to Reevaluate Fourth Amendment Doctrine". Valparaiso University Law Review. 49: 659–721. Retrieved May 21, 2017 via University of Baltimore Law.
  26. Liptak, Adam (June 3, 2013). "Justices Allow DNA Collection After an Arrest". The New York Times. Retrieved May 21, 2017.
  27. Steinberger, Eva; Sims, Gary. "Finding Criminals Through the DNA of Their Relatives – Familial Searching of the California Offender DNA Database" (PDF). Prosecutor's Brief. California District Attorneys Association. XXXI (1–2): 28–32. Archived from the original (PDF) on May 13, 2016. Retrieved May 21, 2017 via District Attorney's Office Denver.
  28. Gerber, Marisa (October 25, 2016). "The controversial DNA search that helped nab the 'Grim Sleeper' is winning over skeptics". Los Angeles Times. Retrieved May 21, 2017.
  29. Kim, Joyce; Mammo, Danny; Siegel, Mami B; Katsanis, Sara H (November 1, 2011). "Policy implications for familial searching". Investigative Genetics. 2 (22): 22. doi: 10.1186/2041-2223-2-22 . PMC   3253037 . PMID   22040348.
  30. Augenstein, Seth (April 11, 2018). "Arizona Nabs Murder Suspect With Familial DNA Searching on First Try". Forensic Magazine. Retrieved April 11, 2018.