DNA database


A DNA database or DNA databank is a database of DNA profiles which can be used in the analysis of genetic diseases, genetic fingerprinting for criminology, or genetic genealogy. DNA databases may be public or private, the largest ones being national DNA databases.


DNA databases are often employed in forensic investigations. When a match is made from a national DNA database to link a crime scene to a person whose DNA profile is stored on a database, that link is often referred to as a cold hit. A cold hit is of particular value in linking a specific person to a crime scene, but is of less evidential value than a DNA match made without the use of a DNA database. [1] Research shows that DNA databases of criminal offenders reduce crime rates. [2] [3]

Types

Forensic

A forensic database is a centralized DNA database that stores DNA profiles of individuals so that DNA samples collected from a crime scene can be searched and compared against the stored profiles. Its most important function is to produce matches between a suspected individual and crime scene bio-markers, which provides evidence to support criminal investigations and also produces leads that help identify potential suspects. The majority of national DNA databases are used for forensic purposes. [4]

The Interpol DNA database is used in criminal investigations. Interpol maintains an automated DNA database called DNA Gateway that contains DNA profiles submitted by member countries, collected from crime scenes, missing persons, and unidentified bodies. [5] The DNA Gateway was established in 2002, and at the end of 2013 it held more than 140,000 DNA profiles from 69 member countries. Unlike other DNA databases, DNA Gateway is used only for information sharing and comparison: it does not link a DNA profile to any individual, and the physical or psychological conditions of an individual are not included in the database. [5]

Genealogical

A national or forensic DNA database is not available for non-police purposes. Because DNA profiles can also be used for genealogical purposes, separate genetic genealogy databases have been created to store the DNA profiles produced by genealogical DNA tests. GenBank is a public genetic genealogy database that stores genome sequences submitted by many genetic genealogists. GenBank contains a large number of DNA sequences obtained from more than 140,000 registered organizations, and is updated every day to ensure a uniform and comprehensive collection of sequence information. The sequences are mainly obtained from individual laboratories or large-scale sequencing projects. The files stored in GenBank are divided into groups, such as BCT (bacterial), VRL (viruses) and PRI (primates). GenBank can be accessed through NCBI's retrieval system, and the BLAST function can be used to identify a certain sequence within GenBank or to find similarities between two sequences. [6]

Medical

A medical DNA database is a DNA database of medically relevant genetic variations. It collects individuals' DNA, which can be related to their medical records and lifestyle details. By recording DNA profiles, scientists may discover interactions between the genetic environment and the occurrence of certain diseases (such as cardiovascular disease or cancer), and thus find new drugs or effective treatments to control these diseases. Such a database is often run in collaboration with the National Health Service. [7]

National

A national DNA database is a DNA database maintained by the government for storing DNA profiles of its population. Each DNA profile is generated by PCR-based analysis of short tandem repeats (STRs). These databases are generally used for forensic purposes, including searching and matching DNA profiles of potential criminal suspects. [8]

In 2009 Interpol reported 54 police national DNA databases in the world and 26 more countries planned to start one. [9] In Europe Interpol reported there were 31 national DNA databases and six more planned. [9] The European Network of Forensic Science Institutes (ENFSI) DNA working group made 33 recommendations in 2014 for DNA database management and guidelines for auditing DNA databases. [10] Other countries have adopted privately developed DNA databases, such as Qatar, which has adopted Bode dbSEARCH. [11]

Typically, a tiny subset of the individual's genome is sampled from 13 or 16 regions that have high individuation.
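As a rough illustration of what such a profile looks like as data, the following Python sketch assumes that a profile is stored as a mapping from STR locus name to a pair of repeat counts. The locus names are real STR loci, but the helper names and the comparison rule are hypothetical simplifications and do not describe the format or matching logic of any particular national database.

```python
def normalise(profile):
    """Sort each allele pair so that (17, 15) and (15, 17) compare equal."""
    return {locus: tuple(sorted(alleles)) for locus, alleles in profile.items()}


def profiles_match(a, b):
    """Return True if both profiles agree at every locus typed in both.

    Assumption for this sketch: loci missing from either profile are ignored,
    and two profiles with no locus in common never match.
    """
    a, b = normalise(a), normalise(b)
    shared = a.keys() & b.keys()
    return bool(shared) and all(a[locus] == b[locus] for locus in shared)


# Hypothetical allele values, chosen only for illustration.
crime_scene = {"D3S1358": (15, 17), "vWA": (16, 18), "FGA": (21, 24)}
person_a = {"D3S1358": (17, 15), "vWA": (18, 16), "FGA": (24, 21), "TH01": (6, 9.3)}
person_b = {"D3S1358": (15, 16), "vWA": (16, 18), "FGA": (21, 24)}

print(profiles_match(crime_scene, person_a))
print(profiles_match(crime_scene, person_b))
```

Storing only repeat counts at a handful of loci is what keeps each profile small compared with a full genome sequence.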

United Kingdom

The first national DNA database in the United Kingdom, the National DNA Database (NDNAD), was established in April 1995. By 2006, it contained 2.7 million DNA profiles (about 5.2% of the UK population), as well as other information from individuals and crime scenes. [12] In 2020 it had 6.6 million profiles (5.6 million individuals excluding duplicates). [13] [14] [15] The information is stored in the form of a digital code, which is based on the nomenclature of each STR. [16] In 1995 the database originally had 6 STR markers for each profile; from 1999 it had 10 markers, and from 2014, 16 core markers and a gender identifier. Scotland has used 21 STR loci, two Y-DNA markers and a gender identifier since 2014. [17] In the UK, police have wide-ranging powers to take DNA samples and retain them if the subject is convicted of a recordable offence. [18] [19] Because of the large number of DNA profiles stored in NDNAD, "cold hits" may occur during DNA matching: an unexpected match is found between an individual's DNA profile and an unsolved crime-scene DNA profile. This can introduce a new suspect into the investigation, thus helping to solve old cases. [20]

In England and Wales, anyone arrested on suspicion of a recordable offence must submit a DNA sample, the profile of which is then stored on the DNA database. Those not charged or not found guilty have their DNA data deleted within a specified period of time. [21] In Scotland, the law similarly requires the DNA profiles of most people who are acquitted be removed from the database.

New Zealand

New Zealand was the second country to set up a DNA database. [22] In 2019 the New Zealand DNA Profile Databank held 40,000 DNA profiles and 200,000 samples. [23] [24]

United States

The United States national DNA database is called the Combined DNA Index System (CODIS). It is maintained at three levels: national, state and local. Each level implements its own DNA index system. The national DNA index system (NDIS) allows DNA profiles to be exchanged and compared between participating laboratories nationally. Each state DNA index system (SDIS) allows DNA profiles to be exchanged and compared between the laboratories of various states, and the local DNA index system (LDIS) allows DNA profiles collected at local sites to be uploaded to SDIS and NDIS.

CODIS software integrates and connects all the DNA index systems at the three levels. CODIS is installed on each participating laboratory site and uses a standalone network known as Criminal Justice Information Systems Wide Area Network (CJIS WAN) [8] [25] to connect to other laboratories. In order to decrease the number of irrelevant matches at NDIS, the Convicted Offender Index requires all 13 CODIS STRs to be present for a profile upload. Forensic profiles only require 10 of the STRs to be present for an upload.
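The upload rule described above can be sketched as a simple completeness check. The thirteen locus names below are the original CODIS core STR loci; the function name, the index labels and the data layout are hypothetical and are not taken from the CODIS software itself.

```python
# The 13 original CODIS core STR loci.
CORE_LOCI = frozenset({
    "CSF1PO", "D3S1358", "D5S818", "D7S820", "D8S1179", "D13S317", "D16S539",
    "D18S51", "D21S11", "FGA", "TH01", "TPOX", "vWA",
})

# Minimum number of core loci required per index, as stated in the text.
# The keys "offender" and "forensic" are labels invented for this sketch.
MIN_CORE_LOCI = {"offender": 13, "forensic": 10}


def eligible_for_upload(typed_loci, index):
    """Return True if enough core loci were typed for the given index."""
    present = CORE_LOCI & set(typed_loci)
    return len(present) >= MIN_CORE_LOCI[index]


full_profile = set(CORE_LOCI)
partial_profile = sorted(CORE_LOCI)[:10]  # e.g. a degraded crime-scene sample

print(eligible_for_upload(full_profile, "offender"))
print(eligible_for_upload(partial_profile, "forensic"))
print(eligible_for_upload(partial_profile, "offender"))
```

The lower threshold for forensic profiles reflects that crime-scene samples are often degraded and cannot always be typed at every locus.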

As of 2011, over 9 million records were held within CODIS. [26] As of March 2011, 361,176 forensic profiles and 9,404,747 offender profiles had been accumulated, [27] making it the largest DNA database in the world. As of the same date, CODIS had produced over 138,700 matches to requests, assisting in more than 133,400 investigations. [28]

The growing public approval of DNA databases has seen the creation and expansion of many states' own DNA databases. Political measures such as California Proposition 69 (2004), which increased the scope of the DNA database, have already met with a significant increase in numbers of investigations aided. Forty-nine states in the USA, all apart from Idaho, store DNA profiles of violent offenders, and many also store profiles of suspects. [29] A 2017 study showed that DNA databases in U.S. states "deter crime by profiled offenders, reduce crime rates, and are more cost-effective than traditional law enforcement tools". [3]

CODIS is also used to help find missing persons and identify human remains. It is connected to the National Missing Persons DNA Database; samples provided by family members are sequenced by the University of North Texas Center for Human Identification, [30] which also runs the National Missing and Unidentified Persons System. UNTCHI can sequence both nuclear and mitochondrial DNA. [31]

The Department of Defense maintains a DNA database to identify the remains of service members. The Department of Defense Serum Repository maintains more than 50,000,000 records, primarily to assist in the identification of human remains. Submission of DNA samples is mandatory for US servicemen, but the database also includes information on military dependents. The National Defense Authorization Act of 2003 provided a means for federal courts or military judges to order the use of the DNA information collected to be made available for the purpose of investigation or prosecution of a felony, or any sexual offense, for which no other source of DNA information is reasonably available. [32]

Australia

The Australian national DNA database is called the National Criminal Investigation DNA Database (NCIDD). By July 2018, it contained 837,000+ DNA profiles. [33] [34] The database used nine STR loci and a sex gene for analysis, and this was increased to 18 core markers in 2013. [35] NCIDD combines all forensic data, including DNA profiles, advanced bio-metrics or cold cases.

Canada

The Canadian national DNA database is called the National DNA Data Bank (NDDB) which was established in 1998 but first used in 2000. [36] The legislation that Parliament enacted to govern the use of this technology within the criminal justice system has been found by Canadian courts to be respectful of the constitutional and privacy rights of suspects, and of persons found guilty of designated offences. [37]

On December 11, 1999, the Canadian Government agreed upon the DNA Identification Act, allowing a Canadian DNA data bank to be created and the Criminal Code to be amended. The Act provides a mechanism for judges to request that an offender provide blood, buccal swab, or hair samples for DNA profiling. The legislation became official on June 29, 2000. Canadian police have been using forensic DNA evidence for over a decade. It has become one of the most powerful tools available to law enforcement agencies for the administration of justice. [38]

NDDB consists of two indexes: the Convicted Offender Index (COI) and the National Crime Scene Index (CSI-nat). There is also the Local Crime Scene Index (CSI-loc), which is maintained by local laboratories rather than by NDDB, because local DNA profiles do not meet NDDB collection criteria. The National Crime Scene Index (CSI-nat) is a collection of profiles from three laboratories, operated by the Royal Canadian Mounted Police (RCMP), the Laboratoire de sciences judiciaires et de médecine légale (LSJML) and the Centre of Forensic Sciences (CFS).

Dubai

In 2017 Dubai announced an initiative called Dubai 10X, which was intended to bring 'disruptive innovation' to the country. [39] One of the projects in this initiative was a DNA database that would collect the genomes of all 3 million citizens of the country over a 10-year period. The database was intended to be used for finding genetic causes of diseases and creating personalised medical treatments. [40]

Germany

Germany set up its DNA database for the German Federal Police (BKA) in 1998. [41] [42] [43] [44] In late 2010, the database contained DNA profiles of over 700,000 individuals and in September 2016 it contained 1,162,304 entries. [45] On 23 May 2011 in the "Stop the DNA Collection Frenzy!" campaign various civil rights and data protection organizations handed an open letter [46] to the German minister of justice Sabine Leutheusser-Schnarrenberger asking her to take action in order to stop the "preventive expansion of DNA data-collection" and the "preemptive use of mere suspicions and of the state apparatus against individuals" and to cancel projects of international exchange of DNA data at the European and transatlantic level. [47]

Israel

The Israeli national DNA database is called the Israel Police DNA Index System (IPDIS), [48] which was established in 2007 and has a collection of more than 135,000 DNA profiles. The collection includes DNA profiles from suspected and accused persons and convicted offenders. The Israeli database also includes an “elimination bank” of profiles from laboratory staff and other police personnel who may have contact with the forensic evidence in the course of their work.

In order to handle high-throughput processing and analysis of DNA samples from FTA cards, the Israeli Police DNA database has established a semi-automated laboratory information management system (LIMS), which enables a small number of police staff to process a large number of samples in a relatively short period of time, and which is also responsible for the future tracking of samples.

Kuwait

The Kuwaiti government passed a law in July 2015 requiring all citizens and permanent residents (4.2 million people) to have their DNA taken for a national database. [49] The reason for this law was security concerns after the ISIS suicide bombing of the Imam Sadiq mosque. [50] They planned to finish collecting the DNA by September 2016 which outside observers thought was optimistic. [51] In October 2017 the Kuwait constitutional court struck down the law saying it was an invasion of personal privacy and the project was cancelled. [52]

Brazil

In 1998, the Forensic DNA Research Institute of Federal District Civil Police created DNA databases of sexual assault evidence. [53] In 2012, Brazil approved a national law establishing DNA databases at state and national levels regarding DNA typing of individuals convicted of violent crimes. [53] Following the decree of the Presidency of the Republic of Brazil in 2013, which regulates the 2012 law, Brazil began using CODIS in addition to the DNA databases of sexual assault evidence to solve sexual assault crimes in Brazil. [53]

France

France set up the DNA database called FNAEG in 1998. By December 2009, there were 1.27 million profiles on FNAEG. [54]

Russia

In Russia, scientific DNA testing is being actively carried out to study the genetic diversity of the peoples of Russia as part of a state task: to learn to determine from DNA the probable territory of a person's origin, based on data on the majority of the peoples of the country. On June 16, 2017, the Council of Ministers of the Union State of Belarus and Russia adopted Resolution No. 26, in which it approved the scientific and technical program of the Union State "Development of innovative genogeographic and genomic technologies for identification of personality and individual characteristics of a person based on the study of gene pools of the regions of the Union State" (DNA identification).

Within the framework of this program, it is also planned to include the peoples of neighboring countries, which are the main source of migration, into the genogeographic study on the basis of existing collections.

In accordance with the Federal Law of December 3, 2008 No. 242-FZ "On state genomic registration in the Russian Federation", voluntary state genomic registration of citizens of the Russian Federation, as well as of foreign citizens and stateless persons living or temporarily staying in the territory of the Russian Federation, is carried out on the basis of a written application and on a paid basis. Genomic information obtained as a result of state genomic registration is used, among other things, for the purpose of establishing family relationships of wanted (identified) persons. The form of keeping records of data on genomic registration of citizens is the Federal Genomic Information Database (FBDGI).

Articles 10 and 11 of the Federal Law of July 27, 2006 No. 152-FZ "On Personal Data" provide that the processing of special categories of personal data relating to race, nationality, political views, religious or philosophical beliefs, health status, intimate life is allowed if it is necessary in connection with the implementation of international agreements of the Russian Federation on readmission and is carried out in accordance with the legislation of the Russian Federation on citizenship of the Russian Federation. Information characterizing the physiological and biological characteristics of a person, on the basis of which it is possible to establish his identity (biometric personal data), can be processed without the consent of the subject of personal data in connection with the implementation of international agreements of the Russian Federation on readmission, administration of justice and execution of judicial acts, compulsory state fingerprinting registration, as well as in cases stipulated by the legislation of the Russian Federation on defense, security, anti-terrorism, transport security, anti-corruption, operational investigative activities, public service, as well as in cases stipulated by the criminal-executive legislation of Russia, the legislation of Russia on the procedure for leaving the Russian Federation and entering the Russian Federation, citizenship of the Russian Federation and notaries. [55]

Other European countries

In comparison with other European countries, the Netherlands is the largest collector of DNA profiles of its citizens. The DNA databank at the Netherlands Forensic Institute contains the DNA profiles of over 316,000 Dutch citizens. [56]

Contrary to the situation in most other European countries, the Dutch police have wide-ranging powers to take and retain DNA samples if a subject is convicted of a recordable offence, except when the conviction only involves paying a fine. If a subject refuses, for example because of privacy concerns, the Dutch police will use force.

In Sweden, only the DNA profiles of criminals who have spent more than two years in prison are stored. In Norway and Germany, court orders are required, and are only available, respectively, for serious offenders and for those convicted of certain offences and who are likely to reoffend. Austria started a criminal DNA database in 1997, [57] and Italy set one up in 2016. [58] [59] Switzerland started a temporary criminal DNA database in 2000 and confirmed it in law in 2005. [60]

In 2005 the incoming Portuguese government proposed to introduce a DNA database of the entire population of Portugal. [61] However, after informed debate including opinion from the Portuguese Ethics Council [62] the database introduced was of just the criminal population. [63]

Genuity Science (formerly Genomics Medicine Ireland) is an Irish life sciences company that was founded in 2015 to create a scientific platform to perform genomic studies and generate new disease prevention strategies and treatments. The company was founded by a group of life science entrepreneurs, investors and researchers and its scientific platform is based on work by Amgen’s Icelandic subsidiary, deCODE genetics, which has pioneered genomic population health studies. [64] The company is building a genomic database which will include data from about 10 per cent of the Irish population, including patients with various diseases and healthy people. [65] The idea of a private company owning public DNA data has raised concerns, with an Irish Times editorial stating: "To date, Ireland seems to have adopted an entirely commercial approach to genomic medicine. This approach places at risk the free availability of genomic data for scientific research that could benefit patients." [66] The paper's editorial pointed out that this is in stark contrast to the approach the U.K. has taken, which is the publicly and charitably funded 100,000 Genomes Project being carried out by Genomics England.

China

By 2020, Chinese police had collected 80 million DNA profiles. [67] [68] There have been concerns that China may be using DNA data not just for crime solving, but for tracking activists, including Uyghurs. [69]

China has begun a $9 billion program for genetic science research, and Fire-Eye has DNA labs in over 20 countries. [70]

India

India announced that it would launch its genomic database by fall 2019. [71] In the first phase of "Genome India", the genomic data of 10,000 Indians will be catalogued. The Department of Biotechnology (DBT) has initiated the project. The first private DNA bank in India is in Lucknow, [72] the capital of the Indian state of Uttar Pradesh. Unlike a research center, it is available to the public, who can store their DNA by paying a minimum amount and providing four drops of blood.

Corporate

Compression

DNA databases occupy more storage than other, non-DNA databases because of the enormous size of each DNA sequence. [82] Every year DNA databases grow exponentially. This poses a major challenge to the storage, data transfer, retrieval and search of these databases. To address these challenges, DNA databases are compressed to save storage space and bandwidth during data transfers, and are decompressed during search and retrieval. Various compression algorithms are used to compress and decompress. The efficiency of a compression algorithm depends on how well and how fast it compresses and decompresses. How well it compresses is generally measured by the compression ratio, the size of the original data divided by the size of the compressed data: the greater the compression ratio, the better the efficiency of the algorithm. The speed of compression and decompression is also considered in evaluation.
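The compression ratio can be illustrated with the simplest possible scheme, which is not one of the algorithms named in this article: because there are only four bases, each base fits in 2 bits instead of the 8 bits of a text character. The Python sketch below assumes the input contains only the letters A, C, G and T; all function names are hypothetical.

```python
CODE = {"A": 0, "C": 1, "G": 2, "T": 3}
BASES = "ACGT"


def pack(seq):
    """Pack a string of A/C/G/T into bytes, four bases per byte."""
    out = bytearray()
    for i in range(0, len(seq), 4):
        chunk = seq[i:i + 4]
        byte = 0
        for base in chunk:
            byte = (byte << 2) | CODE[base]
        byte <<= 2 * (4 - len(chunk))  # left-align a short final chunk
        out.append(byte)
    return bytes(out)


def unpack(data, length):
    """Recover the first `length` bases from packed bytes."""
    bases = []
    for byte in data:
        for shift in (6, 4, 2, 0):
            bases.append(BASES[(byte >> shift) & 3])
    return "".join(bases[:length])


def compression_ratio(original_size, compressed_size):
    """Original size divided by compressed size; higher is better."""
    return original_size / compressed_size


seq = "ACGTACGTAC" * 100          # 1000 bases, one byte each as plain text
packed = pack(seq)                # 250 bytes
assert unpack(packed, len(seq)) == seq
print(compression_ratio(len(seq), len(packed)))  # 4.0
```

Fixed 2-bit packing always gives a ratio of about 4 relative to one-byte-per-base text; the specialised algorithms discussed in this section aim to do better by exploiting repetition within and between sequences.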

DNA sequences contain palindromic repetitions of A, C, T, G. Compression of these sequences involves locating and encoding these repetitions and decoding them during decompression.

Some approaches used to encode and decode are:

  1. Huffman Encoding
  2. Adaptive Huffman Encoding
  3. Arithmetic coding
  4. Context tree weighting (CTW) method
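As an illustration of the first approach in the list above, the following is a minimal Python sketch of static Huffman coding applied to a short DNA string. It is a textbook construction using the standard library `heapq` module, not the implementation used by any of the DNA compressors named in this article, and the function names are chosen for this example only.

```python
import heapq
from collections import Counter


def huffman_codes(text):
    """Build a prefix code in which frequent symbols get shorter bit strings."""
    freq = Counter(text)
    if not freq:
        return {}
    if len(freq) == 1:  # degenerate case: only one distinct symbol
        return {next(iter(freq)): "0"}
    # Heap entries are (weight, tie_breaker, {symbol: code_so_far}).
    # The unique tie_breaker keeps Python from ever comparing the dicts.
    heap = [(w, i, {s: ""}) for i, (s, w) in enumerate(freq.items())]
    heapq.heapify(heap)
    counter = len(heap)
    while len(heap) > 1:
        w1, _, left = heapq.heappop(heap)
        w2, _, right = heapq.heappop(heap)
        merged = {s: "0" + c for s, c in left.items()}
        merged.update({s: "1" + c for s, c in right.items()})
        heapq.heappush(heap, (w1 + w2, counter, merged))
        counter += 1
    return heap[0][2]


def encode(text, codes):
    return "".join(codes[s] for s in text)


def decode(bits, codes):
    inverse = {c: s for s, c in codes.items()}
    out, current = [], ""
    for bit in bits:
        current += bit
        if current in inverse:  # prefix-free, so the first hit is the symbol
            out.append(inverse[current])
            current = ""
    return "".join(out)


seq = "AAAAAAAACCCCGGT"            # skewed base frequencies: A=8, C=4, G=2, T=1
codes = huffman_codes(seq)
bits = encode(seq, codes)
assert decode(bits, codes) == seq
print(len(bits), "bits versus", 2 * len(seq), "bits with a fixed 2-bit code")
```

Huffman coding only helps when base frequencies are skewed; for a sequence in which all four bases are equally common it cannot beat a fixed 2-bit code, which is one reason DNA-specific compressors also model repeats.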

The compression algorithms listed below may use one of the above encoding approaches to compress and decompress DNA databases:

  1. Compression using Redundancy of DNA sets (COMRAD) [83] [84]
  2. Relative Lempel-Ziv (RLZ) [84]
  3. GenCompress
  4. BioCompress
  5. DNACompress
  6. CTW+LZ

In 2012, a team of scientists from Johns Hopkins University published the first genetic compression algorithm that does not rely on external genetic databases for compression. HAPZIPPER was tailored for HapMap data and achieves over 20-fold compression (95% reduction in file size), providing 2- to 4-fold better compression than leading general-purpose compression utilities while running much faster. [85]

Genomic sequence compression algorithms, also known as DNA sequence compressors, exploit the fact that DNA sequences have characteristic properties, such as inverted repeats. The most successful compressors are XM and GeCo. [86] For eukaryotes XM is slightly better in compression ratio, though for sequences larger than 100 MB its computational requirements are impractical.
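An inverted repeat is a stretch of sequence that reappears elsewhere as its reverse complement. The Python sketch below shows one naive way to locate such repeats of a fixed length `k`; it is an illustration of the property only, under the assumption that the input contains just A, C, G and T, and it is not how XM or GeCo work internally.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Complement every base, then reverse the string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_inverted_repeats(seq, k):
    """Return (i, j) pairs where the k-mer at i recurs at j >= i + k
    as its reverse complement (non-overlapping occurrences only)."""
    positions = {}
    for i in range(len(seq) - k + 1):
        positions.setdefault(seq[i:i + k], []).append(i)
    hits = []
    for i in range(len(seq) - k + 1):
        target = reverse_complement(seq[i:i + k])
        for j in positions.get(target, []):
            if j >= i + k:
                hits.append((i, j))
    return hits


seq = "GATTACA" + "CCC" + reverse_complement("GATTACA")
print(seq)                            # GATTACACCCTGTAATC
print(find_inverted_repeats(seq, 7))  # [(0, 10)]
```

A compressor that finds such a pair can store the second occurrence as a short reference to the first instead of spelling it out again.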

Medicine

Many countries collect newborn blood samples to screen for diseases, mainly those with a genetic basis. In most cases the samples are destroyed soon after testing. In some countries the dried blood (and the DNA) is retained for later testing.

In Denmark the Danish Newborn Screening Biobank at Statens Serum Institut keeps a blood sample from people born after 1981. The purpose is to test for phenylketonuria and other diseases. [87] However, it is also used for DNA profiling to identify deceased persons and suspected criminals. [88] Parents can request that the blood sample of their newborn be destroyed after the result of the test is known.

Privacy issues

Critics of DNA databases warn that the various uses of the technology can pose a threat to individual civil liberties. [89] [90] Personal information included in genetic material, such as markers that identify various genetic diseases and physical and behavioral traits, could be used for discriminatory profiling, and its collection may constitute an invasion of privacy. [91] [92] [93] DNA can also be used to establish paternity and whether or not a child is adopted. The privacy and security of DNA databases have attracted considerable attention. Some people fear that their personal DNA information could easily be leaked; others feel that having their DNA profile recorded in a database marks them as a "criminal", and that being falsely accused of a crime can lead to having a "criminal" record for the rest of their lives.

UK laws in 2001 and 2003 allowed DNA profiles to be taken immediately after a person was arrested and kept in a database even if the suspect was later acquitted. [94] In response to public unease at these provisions, [94] the UK later changed this by passing the Protection of Freedoms Act 2012, which required that suspects not charged or found not guilty have their DNA data deleted from the database. [21]

European countries that have established a DNA database use some measures to protect the privacy of individuals, in particular criteria for removing DNA profiles from the databases. Among the 22 European countries that have been analyzed, most record the DNA profiles of suspects or of those who have committed serious crimes. Some countries (such as Belgium and France) may remove a criminal's profile after 30–40 years, because these “criminal investigation” records are no longer needed. Most countries delete a suspect's profile after the suspect is acquitted. All the countries have comprehensive legislation intended to largely avoid the privacy issues which may occur during the use of a DNA database. [4] Public discussion around the introduction of advanced forensic techniques (such as genetic genealogy using public genealogy databases and DNA phenotyping approaches) has been limited, disjointed, and unfocused, and raises issues of privacy and consent that may warrant additional legal protections to be established. [95]

Privacy concerns surrounding DNA databases arise not only in collecting and analyzing DNA samples, but also in protecting and storing this important personal information. Because DNA profiles can be stored indefinitely in a DNA database, concerns have been raised that DNA samples could be used for new and unidentified purposes. [96] As the number of users with access to DNA databases increases, people worry that their information may be leaked or shared inappropriately; for example, their DNA profile may be shared with others, such as law enforcement agencies or other countries, without individual consent. [97]

The application of DNA databases has been expanded into two controversial areas: arrestees and familial searching. An arrestee is a person who has been arrested for a crime but has not yet been convicted of that offense. Currently, 21 states in the United States have passed legislation that allows law enforcement to take DNA from an arrestee and enter it into the state's CODIS DNA database to see if that person has a criminal record or can be linked to any unsolved crimes. In familial searching, the DNA database is used to look for partial matches that would be expected between close family members. This technology can be used to link crimes to the family members of suspects and thereby help identify a suspect when the perpetrator has no DNA sample in the database. [98] [99]
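The idea of a partial match can be sketched by counting, locus by locus, how many alleles two profiles have in common. The Python example below uses a deliberately crude, hypothetical screening rule (at least one shared allele at every common locus, as expected between a parent and child barring mutation); real familial searches rely on kinship statistics such as likelihood ratios, and siblings in particular can share no allele at some loci. The profile layout and allele values are invented for illustration.

```python
def shared_allele_counts(a, b):
    """For each locus typed in both profiles, count alleles in common (0, 1 or 2)."""
    counts = {}
    for locus in a.keys() & b.keys():
        remaining = list(b[locus])
        shared = 0
        for allele in a[locus]:
            if allele in remaining:
                remaining.remove(allele)  # each allele copy can be matched once
                shared += 1
        counts[locus] = shared
    return counts


def is_candidate_relative(a, b):
    """Hypothetical rule: a partial match, not a full match and not a mismatch."""
    counts = shared_allele_counts(a, b)
    if not counts:
        return False
    every_locus_shares = min(counts.values()) >= 1
    not_identical = any(c < 2 for c in counts.values())
    return every_locus_shares and not_identical


crime_scene = {"D3S1358": (15, 17), "vWA": (16, 18), "FGA": (21, 24)}
relative = {"D3S1358": (15, 16), "vWA": (18, 19), "FGA": (21, 24)}
unrelated = {"D3S1358": (14, 16), "vWA": (18, 19), "FGA": (20, 22)}

print(is_candidate_relative(crime_scene, relative))
print(is_candidate_relative(crime_scene, unrelated))
```

Because unrelated people can share alleles by chance, a rule this loose would return many false leads on a large database, which is part of why familial searching is controversial.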

Furthermore, DNA databases could fall into the wrong hands due to data breaches or data sharing.

DNA collection and human rights

In a judgement in December 2008, the European Court of Human Rights ruled that two British men should not have had their DNA and fingerprints retained by police saying that retention "could not be regarded as necessary in a democratic society". [100]

The DNA fingerprinting pioneer Professor Sir Alec Jeffreys condemned UK government plans to keep the genetic details of hundreds of thousands of innocent people in England and Wales for up to 12 years. Jeffreys said he was "disappointed" with the proposals, which came after a European court ruled that the current policy breaches people's right to privacy. Jeffreys said "It seems to be as about as minimal a response to the European court of human rights judgment as one could conceive. There is a presumption not of innocence but of future guilt here … which I find very disturbing indeed". [101]

Effects on crime

A 2021 study found that registration of Danish criminal offenders in a DNA database substantially reduced the probability of re-offending, as well as increased the likelihood that re-offenders were identified if they committed future crimes. [2]

A 2017 study in the American Economic Journal: Applied Economics showed that databases of criminal offenders' DNA profiles in US states "deter crime by profiled offenders, reduce crime rates, and are more cost-effective than traditional law enforcement tools." [3]

Monozygotic twins

Monozygotic twins share around 99.99% of their DNA, while other siblings share around 50%. Some next generation sequencing tools are capable of detecting rare de novo mutations in only one of the twins (detectable in rare single nucleotide polymorphisms). [102] Most DNA testing tools would not detect these rare SNPs in most twins.

Each person's DNA is unique, with the slight exception of identical (monozygotic and monospermotic) twins, who start out from the same genetic line of DNA but acquire very small mutations during the twinning event. These tiny differences can now be detected by next generation sequencing (as of 2014). With currently affordable testing, "identical" twins cannot easily be differentiated by the most common DNA tests, but it has been shown to be possible. Compared with all other humans, and even with theoretical clones (who would not share the same uterus nor experience the same mutations before the twinning event), identical twins have DNA that is more alike than is probably possible between any other two humans. Beyond these more recently discovered twinning-event mutations, it has been known since 2008 that identical twins also each have their own set of copy number variants, that is, the number of copies each twin carries of certain sections of DNA. [103]

See also

Related Research Articles

<span class="mw-page-title-main">DNA profiling</span> Technique used to identify individuals via DNA characteristics

DNA profiling is the process of determining an individual's deoxyribonucleic acid (DNA) characteristics. DNA analysis intended to identify a species, rather than an individual, is called DNA barcoding.

<span class="mw-page-title-main">Forensic science</span> Application of science to criminal and civil laws

Forensic science, also known as criminalistics, is the application of science principles and methods to support legal decision-making in matters of criminal and civil law.

Genetic genealogy is the use of genealogical DNA tests, i.e., DNA profiling and DNA testing, in combination with traditional genealogical methods, to infer genetic relationships between individuals. This application of genetics came to be used by family historians in the 21st century, as DNA tests became affordable. The tests have been promoted by amateur groups, such as surname study groups or regional genealogical groups, as well as research projects such as the Genographic Project.

Forensic identification is the application of forensic science, or "forensics", and technology to identify specific objects from the trace evidence they leave, often at a crime scene or the scene of an accident. Forensic means "for the courts".

The United Kingdom National DNA Database is a national DNA database that was set up in 1995. In 2005 it had 3.1 million profiles and in 2020 it had 6.6 million profiles. It is populated by samples recovered from crime scenes and taken from police suspects; 270,000 samples were added in 2019–20, and 124,000 were deleted because they belonged to people who were not charged or not found guilty. There were 731,000 matches to unsolved crimes between 2001 and 2020.

A government database collects information for various reasons, including climate monitoring, securities law compliance, geological surveys, patent applications and grants, surveillance, national security, border control, law enforcement, public health, voter registration, vehicle registration, social security, and statistics.

DNAPrint Genomics

DNAPrint Genomics was a genetics company with a wide range of products related to genetic profiling. They were the first company to introduce forensic and consumer genomics products, which were developed immediately upon the publication of the first complete draft of the human genome in the early 2000s. They researched, developed, and marketed the first ever consumer genomics product, based on "Ancestry Informative Markers" which they used to correctly identify the BioGeographical Ancestry (BGA) of a human based on a sample of their DNA. They also researched, developed and marketed the first ever forensic genomics product - DNAWITNESS - which was used to create a physical profile of donors of crime scene DNA. The company reached a peak of roughly $3M/year revenues but ceased operations in February 2009.

Personal genomics or consumer genetics is the branch of genomics concerned with the sequencing, analysis and interpretation of the genome of an individual. The genotyping stage employs different techniques, including single-nucleotide polymorphism (SNP) analysis chips, or partial or full genome sequencing. Once the genotypes are known, the individual's variations can be compared with the published literature to determine likelihood of trait expression, ancestry inference and disease risk.

S and Marper v United Kingdom [2008] ECHR 1581 is a case decided by the European Court of Human Rights which held that holding DNA samples of individuals arrested but who are later acquitted or have the charges against them dropped is a violation of the right to privacy under the European Convention on Human Rights.

Forensic profiling: Study of trace evidence in criminal investigations

Forensic profiling is the study of trace evidence in order to develop information which can be used by police authorities. This information can be used to identify suspects and convict them in a court of law.

Combined DNA Index System: United States national DNA database

The Combined DNA Index System (CODIS) is the United States national DNA database created and maintained by the Federal Bureau of Investigation. CODIS consists of three levels of information: Local DNA Index Systems (LDIS), where DNA profiles originate; State DNA Index Systems (SDIS), which allow laboratories within a state to share information; and the National DNA Index System (NDIS), which allows states to compare DNA information with one another.

Rapid DNA describes the fully automated process of developing a CODIS Core STR profile or other STR profile from a reference sample buccal swab. The “swab in – profile out” process consists of automated extraction, amplification, separation, detection and allele calling without human intervention. A machine designed to perform such rapid DNA analysis is called a DNA "magic box" by enforcement authorities.

Maryland v. King, 569 U.S. 435 (2013), was a decision of the United States Supreme Court which held that a cheek swab of an arrestee's DNA is comparable to fingerprinting and is therefore a legitimate police booking procedure that is reasonable under the Fourth Amendment.

National Forensic DNA Database of South Africa

The National Forensic DNA Database of South Africa (NFDD) is a national DNA database used in law enforcement in South Africa. The Criminal Law Amendment Act No. 37 of 2013 provides for the expansion and administration of such a database in South Africa, enabling the South African Police Service (SAPS) to match forensic DNA profiles derived from samples collected at crime scenes with forensic DNA profiles of offenders convicted of, and suspects arrested for, offences listed in a new Schedule 8 of the amended Criminal Procedure Act of 1977.

Genetic privacy involves the concept of personal privacy concerning the storing, repurposing, provision to third parties, and displaying of information pertaining to one's genetic information. This concept also encompasses privacy regarding the ability to identify specific individuals by their genetic sequence, and the potential to gain information on specific characteristics about that person via portions of their genetic information, such as their propensity for specific diseases or their immediate or distant ancestry.

Investigative genetic genealogy: Application of genealogy in a legal setting

Investigative genetic genealogy, also known as forensic genetic genealogy, is the emerging practice of using genetic information from direct-to-consumer companies to identify suspects or victims in criminal cases. As of December 2023, the use of this technology has solved a total of 651 criminal cases, including 318 individual perpetrators who were identified. There have also been 464 decedents identified, as well as 4 living Does. The investigative power of genetic genealogy revolves around the use of publicly accessible genealogy databases such as GEDmatch and FamilyTreeDNA. On GEDmatch, users are able to upload their genetic data from any direct-to-consumer company in an effort to identify relatives who have tested at companies other than their own.

DNA encryption is the process of hiding or obscuring genetic information by a computational method in order to improve genetic privacy in DNA sequencing processes. The human genome is long and complex, but important and identifying information can be interpreted from small variations without reading the entire genome. A whole human genome is a string of about 3.2 billion base-paired nucleotides, the building blocks of life, but individuals differ in only about 0.5% of it, a fraction that accounts for all of human diversity, the pathology of different diseases, and ancestral history. Emerging strategies incorporate different methods, such as randomization algorithms and cryptographic approaches, to de-identify the genetic sequence from the individual and, fundamentally, to isolate only the necessary information while protecting the rest of the genome from unnecessary inquiry. The priority now is to ascertain which methods are robust, and how policy should ensure the ongoing protection of genetic privacy.

GEDmatch: Genetic genealogy website

GEDmatch is an online service to compare autosomal DNA data files from different testing companies. It is owned by Qiagen.

The rape and murder of Angie Dodge occurred in Idaho Falls, Idaho on June 13, 1996. The true perpetrator was apprehended in May 2019, nearly 23 years after the crime was committed.

The National DNA Data Bank of Canada (NDDB) is a national DNA database that was set up in 2000. Managed by the RCMP, it provides matches to convicted offenders and offers a memory repository for cold cases. The database held 642,758 DNA profiles as of December 31, 2022.

References

  1. Rose & Goos: DNA - A Practical Guide (Carswell Publications, Toronto).
  2. Anker, Anne Sofie Tegner; Doleac, Jennifer L.; Landersø, Rasmus (2021). "The Effects of DNA Databases on the Deterrence and Detection of Offenders". American Economic Journal: Applied Economics. 13 (4): 194–225. doi:10.1257/app.20190207. ISSN 1945-7782. S2CID 239235452.
  3. Doleac, Jennifer L. (2017-01-01). "The Effects of DNA Databases on Crime". American Economic Journal: Applied Economics. 9 (1): 165–201. CiteSeerX 10.1.1.269.6210. doi:10.1257/app.20150043. ISSN 1945-7782.
  4. Santos, Filipe; Machado, Helena; Silva, Susana (3 December 2013). "Forensic DNA databases in European countries: is size linked to performance?". Life Sciences, Society and Policy. 9 (1): 12. doi:10.1186/2195-7819-9-12. PMC 4513018.
  5. "DNA / Forensics / INTERPOL expertise / Internet / Home - INTERPOL".
  6. Benson, Dennis A.; Cavanaugh, Mark; Clark, Karen; Karsch-Mizrachi, Ilene; Lipman, David J.; Ostell, James; Sayers, Eric W. (27 November 2012). "GenBank". Nucleic Acids Res. 41 (Database issue): D36–42. doi:10.1093/nar/gks1195. PMC 3531190. PMID 23193287 via nar.oxfordjournals.org.
  7. Hagmann, M (2000). "UK plans major medical DNA database". Science. 287 (5456): 1184b–1184. doi:10.1126/science.287.5456.1184b. PMID 10712143. S2CID 70954894.
  8. Butler, John M. (27 July 2011). Advanced Topics in Forensic DNA Typing: Methodology. Academic Press. ISBN 978-0-12-387823-6.
  9. "Global DNA Profiling Survey; Results and Analysis" (PDF). Interpol DNA Unit. 2009. p. Appendix 1. Retrieved 12 October 2015.
  10. ENFSI DNA Working Group. (2010). DNA database management: Review and Recommendation. The Hague (The Netherlands): ENFSI. Archived 2014-09-11 at the Wayback Machine
  11. http://www.bodetech.com/bodedbsearch%5B%5D
  12. Linacre, A (2003). "The UK National DNA Database". The Lancet. 361 (9372): 1841–1842. doi:10.1016/s0140-6736(03)13539-8. PMID 12788567. S2CID 31070032.
  13. National DNA Database Strategy Board Biennial Report 2018–2020 (PDF). UK Home Office; Her Majesty's Stationery Office. September 2020. p. 10. ISBN 978-1-5286-1916-5. Retrieved 6 November 2020.
  14. "National DNA Database statistics, Q1 2015 to 2016". National DNA Database statistics. UK Government Home Office. Retrieved 11 October 2015.
  15. Gav Ireland; Simon Lewis; Dan Fookes. "Statistics". npia.police.uk. NPIA. Archived from the original on 2012-06-17. Retrieved 2012-08-04.
  16. Gill, P. (February 2002). "Role of Short Tandem Repeat DNA in Forensic Casework in the UK—Past, Present, and Future Perspectives" (PDF). BioTechniques. 32 (2): 366–385. doi:10.2144/02322rv01. PMID 11848414.
  17. Forensic DNA analysis: a primer for courts. London: Royal Society. 2017. ISBN 978-1-78252-301-7. OCLC 1039675621.
  18. Bowcott, Owen (13 May 2015). "Retention of offenders' DNA profiles not illegal, supreme court rules". The Guardian. Retrieved 11 October 2015.
  19. "Identification by body samples and impressions—4.4 Section 82: Restrictions on use and destruction of fingerprints and samples". WikiCrimeLine. Archived from the original on 2007-02-23.
  20. Wallace, Helen (1 July 2006). "The UK National DNA Database". EMBO Reports. 7 (1S): S26–S30. doi:10.1038/sj.embor.7400727. PMC 1490298. PMID 16819445.
  21. "Protection of Freedoms Act 2012: DNA and fingerprint provisions". Protection of Freedoms Act 2012: how DNA and fingerprint evidence is protected in law. UK Government Home Office. 4 April 2014. Retrieved 11 October 2015.
  22. Forensic Science
  23. "About the DNA databank ESR". Institute of Environmental Science and Research, ESR, New Zealand Government. Retrieved 2020-11-07.
  24. "Reviewing the DNA Database" (PDF). ESR NZ Institute of Environmental Science and Research - Crime Scene Intelligence Newsletter. November 2019. p. 3.
  25. CODIS Brochure
  26. "Laboratory Services".
  27. "CODIS - NDIS Statistics".
  28. Investigations Aided Archived April 6, 2009, at the Wayback Machine
  29. "Supreme Court says police can take DNA swabs after arrest". CBS News.
  30. "Family DNA Collection Protocol" (PDF). Archived from the original (PDF) on 2010-12-16. Retrieved 2018-05-01.
  31. Missing Persons Unit
  32. http://www.councilforresponsiblegenetics.org/geneticprivacy/DNA_mil.html
  33. Australian Criminal Intelligence Commission (18 July 2018). "National Criminal Investigation DNA Database".
  34. Mobbs, Jonathan D. (2001). "Crimtrac-technology and detection". 4th National Outlook Symposium on Crime in Australia, New Crimes or New Responses. Canberra.
  35. Curtis, Caitlin; Hereward, James (August 29, 2017). "From the crime scene to the courtroom: the journey of a DNA sample". The Conversation. Retrieved October 14, 2017.
  36. Milot, E; Lecomte, MM; Germain, H; Crispino, F (2013). "The National DNA Data Bank of Canada: a Quebecer perspective". Front Genet. 4: 249. doi:10.3389/fgene.2013.00249. PMC 3834530. PMID 24312124.
  37. "National DNA Data Bank" Archived 2013-06-25 at the Wayback Machine
  38. "National DNA Data Bank". Royal Canadian Mounted Police. 2001-04-22.
  39. Sutton, Mark (2017-02-14). "HH Sheikh Mohammed launches 10x initiative". ITP.net. Retrieved 2018-03-02.
  40. Treviño, Julissa (2018-03-20). "Dubai Wants to DNA Test Its Millions of Residents to Prevent Genetic Disease". Smithsonian. Retrieved 2018-03-02.
  41. "GeneWatch UK - Germany". genewatch.org. Retrieved 29 December 2016.
  42. "Germany's DNA database". Archived from the original on 29 December 2016. Retrieved 29 December 2016.
  43. "National DNA Intelligence Databases in Europe – Report on the Current Situation" (PDF). Retrieved 29 December 2016.
  44. Peerenboom, E. (1 June 1998). "Central criminal DNA database created in Germany". Nature Biotechnology. 16 (6): 510–511. doi:10.1038/nbt0698-510. ISSN 1087-0156. PMID 9624672. S2CID 28662677.
  45. Käppner, Joachim (8 December 2016). "Justiz: Verräterische Proben" (in German). Süddeutsche Zeitung. Retrieved 29 December 2016.
  46. "Open Letter Stop the police's DNA collection frenzy!" (PDF). Retrieved 29 December 2016.
  47. Schultz, Susanne. "'Stop the DNA Collection Frenzy!': Expansion of Germany's DNA Database". Forensic Genetics Policy Initiative. Retrieved 29 December 2016.
  48. Zamir, Ashira; Dell’Ariccia-Carmon, Aviva; Zaken, Neomi; Oz, Carla (1 March 2012). "The Israel DNA database—The establishment of a rapid, semi-automated analysis system". Forensic Science International: Genetics. 6 (2): 286–289. doi:10.1016/j.fsigen.2011.06.003. PMID 21727053.
  49. Visser, Nick (14 July 2015). "Kuwait To Institute Mandatory DNA Testing For All Residents". Huffington Post. Retrieved 10 October 2015.
  50. "ISIL claims responsibility for Kuwait Shia mosque blast". Al Jazeera. 27 June 2015. Retrieved 10 October 2015.
  51. Field, Dawn (3 September 2015). "Kuwait's war on ISIS and DNA". Oxford University Press Blog. Retrieved 10 October 2015.
  52. Coghlan, Andy (2017-10-09). "Kuwait's plans for mandatory DNA database have been cancelled". New Scientist. Retrieved 2018-03-02.
  53. Ferreira, Samuel T.G.; Paula, Karla A.; Maia, Flávia A.; Svidizinski, Arthur E.; Amaral, Marinã R.; Diniz, Silmara A.; Siqueira, Maria E.; Moraes, Adriana V. (2015). "The use of DNA database of biological evidence from sexual assaults in criminal investigations: A successful experience in Brasília, Brazil". Forensic Science International: Genetics Supplement Series. 5: 595–597. doi:10.1016/j.fsigss.2015.09.235.
  54. Raoult, Eric (2010-01-12). "Question No: 68468" (in French). 13th legislature. Response 2010-04-06.
  55. Mirolyubova, Svetlana (2021). "Проблемы применения ДНК-теста в целях воссоединения семьи и репатриации" [Problems of using DNA tests for family reunification and repatriation] (in Russian). Surgut State University Journal. 2021. № 1(31): 91–100. doi:10.34822/2312-3419-2021-1-91-100.
  56. Ministerie van Justitie en Veiligheid (2013-05-14). "Home - Nederlandse DNA-databank". dnadatabank.forensischinstituut.nl (in Dutch). Retrieved 2019-09-22.
  57. Hindmarsh, Richard; Prainsack, Barbara, eds. (2010-08-12). Genetic Suspects: Global Governance of Forensic DNA Profiling and Databasing. Cambridge University Press. p. 154. ISBN 978-0521519434.
  58. Negri, Giovanni (2016-03-26). "Italy approves DNA database to fight crime". Il Sole 24 Ore, English edition. Retrieved 2017-05-11.
  59. "Italy creates national DNA database to enhance anti-terror fight". Jamaica Observer. 2016-03-26. Retrieved 2017-05-11.
  60. Haas, C.; Voegeli, P.; Hess, M.; Kratzer, A.; Bär, W. (2006-04-01). "A new legal basis and communication platform for the Swiss DNA database". International Congress Series. Progress in Forensic Genetics 11: Proceedings of the 21st International ISFG Congress held in Ponta Delgada, The Azores, Portugal between 13 and 16 September 2005. 1288: 734–736. doi:10.1016/j.ics.2005.11.040.
  61. "Newropeans Magazine - The European Perspective. Preparing for the world of tomorrow".
  62. "CNECV - Conselho Nacional de Ética para as Ciências da Vida".
  63. Skinner, David (14 July 2010). "Sociology 52: 13: Machado and Silva: Forensic DNA in Portugal".
  64. "Genuity Science | Genomic Data Insights to Power Discovery". Genuity Science. Retrieved 27 May 2021.
  65. "Genomics: Exploring new horizons". The Irish Times. Retrieved 2020-11-07.
  66. McConnell, David; Hardiman, Orla. "Ireland putting profit before people with genomic medicine strategy". The Irish Times. Retrieved 2020-11-07.
  67. Wee, Sui-Lee (2020-07-30). "China Is Collecting DNA From Tens of Millions of Men and Boys, Using U.S. Equipment". The New York Times. ISSN 0362-4331. Retrieved 2020-11-07.
  68. Fan, Wenxin; Khan, Natasha; Lin, Liza (2017-12-27). "China Snares Innocent and Guilty Alike to Build World's Biggest DNA Database". Wall Street Journal. ISSN 0099-9660. Retrieved 2020-11-07.
  69. Wee, Sui-Lee (21 February 2019). "China Uses DNA to Track Its People, with the Help of American Expertise". The New York Times.
  70. Warrick, Joby; Brown, Cate. "China's quest for human genetic data spurs fears of a DNA arms race". Washington Post. Retrieved 2023-10-27.
  71. Rajagopal, Divya. "India to launch its 1st human genome cataloguing project". The Economic Times. Retrieved 2020-11-07.
  72. "Uttar Pradesh - Google Search". www.google.com. Retrieved 2022-04-06.
  73. Bursztynsky, Jessica (2019-02-12). "More than 26 million people shared their DNA with ancestry firms, allowing researchers to trace relationships between virtually all Americans: MIT". CNBC. Retrieved 2020-11-07.
  74. Regalado, Antonio (2019-02-11). "More than 26 million people have taken an at-home ancestry test". MIT Technology Review. Retrieved 2020-11-07.
  75. "23andMe - Ancestry". 23andme.com. Retrieved 29 December 2016.
  76. Potenza, Alessandra (13 July 2016). "23andMe wants researchers to use its kits, in a bid to expand its collection of genetic data". The Verge. Retrieved 29 December 2016.
  77. "This Startup Will Sequence Your DNA, So You Can Contribute To Medical Research". Fast Company. 23 December 2016. Retrieved 29 December 2016.
  78. Seife, Charles. "23andMe Is Terrifying, but Not for the Reasons the FDA Thinks". Scientific American. Retrieved 29 December 2016.
  79. Zaleski, Andrew (22 June 2016). "This biotech start-up is betting your genes will yield the next wonder drug". CNBC. Retrieved 29 December 2016.
  80. Regalado, Antonio. "How 23andMe turned your DNA into a $1 billion drug discovery machine". MIT Technology Review. Retrieved 29 December 2016.
  81. "23andMe reports jump in requests for data in wake of Pfizer depression study | FierceBiotech". fiercebiotech.com. 22 August 2016. Retrieved 29 December 2016.
  82. Ateet Mehta & Bankim Patel, et al., 2010, "DNA Compression using Hash Based Data Structure", International Journal of Information Technology and Knowledge Management July–December 2010, Volume 2, No. 2, pp. 383–386
  83. Biji, C.L.; Madhu, M.K.; Vishnu, V. (May 28, 2015). "Compression of Large genomic datasets using COMRAD on Parallel Computing Platform". Bioinformation. 11 (5): 267–271. doi:10.6026/97320630011267. PMC 4464544. PMID 26124572.
  84. Kuruppu, S.S. (January 2012). Compression of Large DNA Databases (PDF) (PhD). The University of Melbourne.
  85. Chanda, P.; Elhaik, E.; Bader, J.S. (2012). "HapZipper: sharing HapMap populations just got easier". Nucleic Acids Res. 40 (20): 1–7. doi:10.1093/nar/gks709. PMC 3488212. PMID 22844100.
  86. Pratas, D.; Pinho, A. J.; Ferreira, P. J. S. G. (2016). Efficient compression of genomic sequences. Data Compression Conference. Snowbird, Utah.
  87. [dead link]
  88. "Blodbank som forbryderalbum". 16 September 2007.
  89. Jeffries, Stuart (27 October 2006). "Suspect nation". The Guardian.
  90. Lemieux, Scott (March 23, 2012). "Are Police Building a Massive DNA Database?". AlterNet.
  91. "DNA database 'breach of rights'". BBC News. 4 December 2008.
  92. Curtis, Caitlin; Hereward, James (May 2, 2018). "DNA facial prediction could make protecting your privacy more difficult". The Conversation. Retrieved May 21, 2018.
  93. Curtis, Caitlin; Hereward, James (December 4, 2017). "It's time to talk about who can access your digital genomic data". The Conversation. Retrieved May 21, 2018.
  94. Wallace, H. M.; Jackson, A. R.; Gruber, J.; Thibedeau, A. D. (1 September 2014). "Forensic DNA databases–Ethical and legal standards: A global review". Egyptian Journal of Forensic Sciences. 4 (3): 57–63. doi:10.1016/j.ejfs.2014.04.002.
  95. Curtis, Caitlin; Hereward, James; Mangelsdorf, Marie; Hussey, Karen; Devereux, John (18 December 2018). "Protecting trust in medical genetics in the new era of forensics". Genetics in Medicine. 21 (7): 1483–1485. doi:10.1038/s41436-018-0396-7. PMC 6752261. PMID 30559376.
  96. Roman-Santos, Candice (2010). "Concerns Associated with Expanding DNA Databases". Hastings Science and Technology Law Journal. 2: 267.
  97. Marten Youssef (October 2, 2009). "DNA databank proposal raises privacy concerns". The National.
  98. "DNA Forensics".
  99. Compulsory DNA Collection: A Fourth Amendment Analysis Congressional Research Service
  100. "UK | DNA database 'breach of rights'". BBC News. 2008-12-04. Retrieved 2012-08-04.
  101. James Sturcke (2009-05-07). "DNA pioneer condemns plans to retain data on innocent | Politics | guardian.co.uk". London: Guardian. Retrieved 2012-08-04.
  102. Weber-Lehmann, Jacqueline; Schilling, Elmar; Gradl, Georg; Richter, Daniel C.; Wiehler, Jens; Rolf, Burkhard (2014). "Finding the needle in the haystack: Differentiating "identical" twins in paternity testing and forensics by ultra-deep next generation sequencing". Forensic Science International: Genetics. 9: 42–46. doi:10.1016/j.fsigen.2013.10.015. ISSN 1872-4973. PMID 24528578.
  103. "Phenotypically concordant and discordant monozygotic twins display different DNA copy-number-variation profiles". Am J Hum Genet. 82 (3): 763–771. March 2008 (Epub 14 February 2008). doi:10.1016/j.ajhg.2007.12.011.