GestaltMatcher

Last updated
GestaltMatcher is a blend word of Gestalt and Match with a camel case M. Gestalt is a professional term in dysmorphology for a recognizable pattern and match is a person resembling another person in some respect. Since match is also a polyseme for a slender piece of wood with a flammable tip, this was used for the illustration of algorithm GestaltMatcher. The two matches with blue tips indicate the individuals affected by a shared rare disorder that need to be found. The AI can support with that task by computing the similarity between portraits. GestaltMatcher.png
GestaltMatcher is a blend word of Gestalt and Match with a camel case M. Gestalt is a professional term in dysmorphology for a recognizable pattern and match is a person resembling another person in some respect. Since match is also a polyseme for a slender piece of wood with a flammable tip, this was used for the illustration of algorithm GestaltMatcher. The two matches with blue tips indicate the individuals affected by a shared rare disorder that need to be found. The AI can support with that task by computing the similarity between portraits.

GestaltMatcher is a continuously updated collection of medical images of individuals with rare diseases and open-source AIs for the interpretation of such data. [1] [2] As of March 2023, GestaltMatcher DataBase (GMDB) contained approximately 10,000 case reports with a molecular diagnosis and clinical features annotated with HPO terminology. [3] Medical images include, for example, facial photographs of patients with genetic syndromes manifesting with facial dysmorphic features, as well as radiographs from those with skeletal dysplasias.

Contents

GestaltMatcher allows users to find and publish case reports, including medical images, if that option is chosen in the dynamic consent module. By that means, GMDB complements medRxiv and can also be used as a repository for re-identifiable images in preprints.

In a prospective three year multi center study, GestaltMatcher showed clinical utility as an artificial expert opinion in a multidisciplinary team. [4]

History

The GestaltMatcher project started in April 2021 during the revision of the manuscript from Hsieh, et al. [5] with funding from University of Bonn and the German Research Foundation (DFG). The reviewers and editors of Nature Genetics asked for FAIR data in order to reproduce the algorithmic results described in that work. Since then, the database (GMDB) has grown by contributions from its community. Since January 2022, GMDB can be used as repository for medical imaging data for preprints submitted to medRxiv. In February 2023, at the 14th ICHG meeting in Cape Town, Prof. Shahida Moosa (Stellenbosch University) reported the 10,000 case, which is a patient from South Africa with Mabry syndrome. Prof. Peter Krawitz also announced at the conference that AGD e.V., a German non-profit organization, will oversee the GMDB from this point forward. In January 2024 the GestaltMatcher project received a donation from the Eva Luise und Horst Köhler Stiftung, which is a charity of the former German president Horst Köhler and his wife, Eva Köhler, to improve the medica care for people with rare diseases.

Related Research Articles

<span class="mw-page-title-main">Mutation</span> Alteration in the nucleotide sequence of a genome

In biology, a mutation is an alteration in the nucleic acid sequence of the genome of an organism, virus, or extrachromosomal DNA. Viral genomes contain either DNA or RNA. Mutations result from errors during DNA or viral replication, mitosis, or meiosis or other types of damage to DNA, which then may undergo error-prone repair, cause an error during other forms of repair, or cause an error during replication. Mutations may also result from substitution, insertion or deletion of segments of DNA due to mobile genetic elements.

<span class="mw-page-title-main">Pelizaeus–Merzbacher disease</span> X-linked leukodystrophy

Pelizaeus–Merzbacher disease is an X-linked neurological disorder that damages oligodendrocytes in the central nervous system. It is caused by mutations in proteolipid protein 1 (PLP1), a major myelin protein. It is characterized by a decrease in the amount of insulating myelin surrounding the nerves (hypomyelination) and belongs to a group of genetic diseases referred to as leukodystrophies.

<span class="mw-page-title-main">Limb–girdle muscular dystrophy</span> Muscular degenerative disorder primarily of the hip and shoulders

Limb–girdle muscular dystrophy (LGMD) is a genetically heterogeneous group of rare muscular dystrophies that share a set of clinical characteristics. It is characterised by progressive muscle wasting which affects predominantly hip and shoulder muscles. LGMD usually has an autosomal pattern of inheritance. It currently has no known cure or treatment.

<span class="mw-page-title-main">Harlequin-type ichthyosis</span> Genetic skin disease

Harlequin-type ichthyosis is a genetic disorder that results in thickened skin over nearly the entire body at birth. The skin forms large, diamond/trapezoid/rectangle-shaped plates that are separated by deep cracks. These affect the shape of the eyelids, nose, mouth, and ears and limit movement of the arms and legs. Restricted movement of the chest can lead to breathing difficulties. These plates fall off over several weeks. Other complications can include premature birth, infection, problems with body temperature, and dehydration. The condition is the most severe form of ichthyosis, a group of genetic disorders characterised by scaly skin.

<span class="mw-page-title-main">Single-nucleotide polymorphism</span> Single nucleotide in genomic DNA at which different sequence alternatives exist

In genetics and bioinformatics, a single-nucleotide polymorphism is a germline substitution of a single nucleotide at a specific position in the genome. Although certain definitions require the substitution to be present in a sufficiently large fraction of the population, many publications do not apply such a frequency threshold.

Miller–Dieker syndrome, also called Miller–Dieker lissencephaly syndrome (MDLS) or chromosome 17p13.3 deletion syndrome, is a micro deletion syndrome characterized by congenital malformations. Congenital malformations are physical defects detectable in an infant at birth which can involve many different parts of the body, including the brain, heart, lungs, liver, bones, or intestinal tract. MDS is a contiguous gene syndrome – a disorder due to the deletion of multiple gene loci adjacent to one another. The disorder arises from the deletion of part of the small arm of chromosome 17p, leading to partial monosomy. There may be unbalanced translocations, or the presence of a ring chromosome 17.

Phakomatoses, also known as neurocutaneous syndromes, are a group of multisystemic diseases that most prominently affect structures primarily derived from the ectoderm such as the central nervous system, skin and eyes. The majority of phakomatoses are single-gene disorders that may be inherited in an autosomal dominant, autosomal recessive or X-linked pattern. Presentations may vary dramatically between patients with the same particular syndrome due to mosaicism, variable expressivity, and penetrance.

<span class="mw-page-title-main">T-box transcription factor T</span> Protein-coding gene in the species Homo sapiens

T-box transcription factor T, also known as Brachyury protein, is encoded for in humans and other apes by the TBXT gene. Brachyury functions as a transcription factor within the T-box family of genes. Brachyury homologs have been found in all bilaterian animals that have been screened, as well as the freshwater cnidarian Hydra.

<span class="mw-page-title-main">Beta thalassemia</span> Blood disorder

Beta thalassemias are a group of inherited blood disorders. They are forms of thalassemia caused by reduced or absent synthesis of the beta chains of hemoglobin that result in variable outcomes ranging from severe anemia to clinically asymptomatic individuals. Global annual incidence is estimated at one in 100,000. Beta thalassemias occur due to malfunctions in the hemoglobin subunit beta or HBB. The severity of the disease depends on the nature of the mutation.

<span class="mw-page-title-main">Dysmorphic feature</span> Abnormal difference in body structure

A dysmorphic feature is an abnormal difference in body structure. It can be an isolated finding in an otherwise normal individual, or it can be related to a congenital disorder, genetic syndrome or birth defect. Dysmorphology is the study of dysmorphic features, their origins and proper nomenclature. One of the key challenges in identifying and describing dysmorphic features is the use and understanding of specific terms between different individuals. Clinical geneticists and pediatricians are usually those most closely involved with the identification and description of dysmorphic features, as most are apparent during childhood.

<span class="mw-page-title-main">Whole genome sequencing</span> Determining nearly the entirety of the DNA sequence of an organisms genome at a single time

Whole genome sequencing (WGS) is the process of determining the entirety, or nearly the entirety, of the DNA sequence of an organism's genome at a single time. This entails sequencing all of an organism's chromosomal DNA as well as DNA contained in the mitochondria and, for plants, in the chloroplast.

1q21.1 deletion syndrome is a rare aberration of chromosome 1. A human cell has one pair of identical chromosomes on chromosome 1. With the 1q21.1 deletion syndrome, one chromosome of the pair is not complete, because a part of the sequence of the chromosome is missing. One chromosome has the normal length and the other is too short.

<span class="mw-page-title-main">1q21.1 duplication syndrome</span> Medical condition

1q21.1 duplication syndrome, also known as 1q21.1 microduplication, is an uncommon copy number variant associated with several congenital abnormalities, including developmental delay, dysmorphic traits, autism spectrum disorder, and congenital cardiac defects. Common facial features include frontal bossing, hypertelorism, and macrocephaly. Around 18 and 29% of patients with 1q21.1 microduplications have congenital cardiac abnormalities. 1q21.1 duplication syndrome is caused by microduplications of the BP3-BP4 region. 18-50% are de novo deletions and 50-82% inherited from parents. The 1q21.1 area, one of the largest regions in the human genome, is highly susceptible to copy number variation due to its frequent low-copy duplications. Whole exon sequencing and quantitative polymerase chain reaction can provide a precise molecular diagnosis for children with 1q21.1 microduplication syndrome.

<span class="mw-page-title-main">Hyperphosphatasia with mental retardation syndrome</span> Medical condition

Hyperphosphatasia with mental retardation syndrome, HPMRS, also known as Mabry syndrome, has been described in patients recruited on four continents world-wide. Mabry syndrome was confirmed to represent an autosomal recessive syndrome characterized by severe mental retardation, considerably elevated serum levels of alkaline phosphatase, hypoplastic terminal phalanges, and distinct facial features that include: hypertelorism, a broad nasal bridge and a rectangular face.

Single nucleotide polymorphism annotation is the process of predicting the effect or function of an individual SNP using SNP annotation tools. In SNP annotation the biological information is extracted, collected and displayed in a clear form amenable to query. SNP functional annotation is typically performed based on the available information on nucleic acid and protein sequences.

<span class="mw-page-title-main">Topologically associating domain</span> Self-interacting genomic region

A topologically associating domain (TAD) is a self-interacting genomic region, meaning that DNA sequences within a TAD physically interact with each other more frequently than with sequences outside the TAD. The average size of a topologically associating domain (TAD) is 1000 kb in humans, 880 kb in mouse cells, and 140 kb in fruit flies. Boundaries at both side of these domains are conserved between different mammalian cell types and even across species and are highly enriched with CCCTC-binding factor (CTCF) and cohesin. In addition, some types of genes appear near TAD boundaries more often than would be expected by chance.

<span class="mw-page-title-main">Alcohol intolerance</span> Medical condition

Alcohol intolerance is due to a genetic polymorphism of the aldehyde dehydrogenase enzyme, which is responsible for the metabolism of acetaldehyde. This polymorphism is most often reported in patients of East Asian descent. Alcohol intolerance may also be an associated side effect of certain drugs such as disulfiram, metronidazole, or nilutamide. Skin flushing and nasal congestion are the most common symptoms of intolerance after alcohol ingestion. It may also be characterized as intolerance causing hangover symptoms similar to the "disulfiram-like reaction" of aldehyde dehydrogenase deficiency or chronic fatigue syndrome. Severe pain after drinking alcohol may indicate a more serious underlying condition.

<span class="mw-page-title-main">Polygenic score</span> Numerical score aimed at predicting a trait based on variation in multiple genetic loci

In genetics, a polygenic score (PGS) is a number that summarizes the estimated effect of many genetic variants on an individual's phenotype. The PGS is also called the polygenic index (PGI) or genome-wide score; in the context of disease risk, it is called a polygenic risk score or genetic risk score. The score reflects an individual's estimated genetic predisposition for a given trait and can be used as a predictor for that trait. It gives an estimate of how likely an individual is to have a given trait based only on genetics, without taking environmental factors into account; and it is typically calculated as a weighted sum of trait-associated alleles.

<span class="mw-page-title-main">Bainbridge–Ropers syndrome</span> Human genetic disorder

Bainbridge–Ropers syndrome was first identified in 2013 and is characterized by failure to thrive, feeding problems, hypotonia, intellectual disabilities, autism, postnatal growth delay, abnormal facial features such as arched eyebrows, anteverted nares, and delays in language acquisition. BRPS is extremely rare worldwide; more than thirty cases of BRPS have been reported abroad, and four cases have been reported in China.

<span class="mw-page-title-main">Du Pan syndrome</span> Medical condition

Du Pan syndrome, also known as fibular aplasia-complex brachydactyly syndrome, is an extremely rare genetic condition. Unlike other rare genetic conditions, Du Pan syndrome does not affect brain function or the appearance of the head and trunk. This condition is associated with alterations to the GDF5 gene. The way that this condition is passed on from generation to generation varies, but it is most commonly inherited in an autosomal recessive manner, meaning two copies of the same version of the gene are required to show this condition. Rare cases exist where the mode of inheritance is autosomal dominant, which means having only one version of the gene is enough to cause this condition.

References

  1. Hustinx, Alexander; Hellmann, Fabio; Sümer, Ömer; Javanmardi, Behnam; André, Elisabeth; Krawitz, Peter; Hsieh, Tzung-Chien (2023). "Improving Deep Facial Phenotyping for Ultra-rare Disorder Verification Using Model Ensembles". 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV). pp. 5018–5028. arXiv: 2211.06764 . doi:10.1109/WACV56688.2023.00499. ISBN   978-1-6654-9346-8. S2CID   253510768.
  2. Sümer, Ömer; Hellmann, Fabio; Hustinx, Alexander; Hsieh, Tzung-Chien; André, Elisabeth; Krawitz, Peter (2023). "Few-Shot Meta-Learning for Recognizing Facial Phenotypes of Genetic Disorders". Caring is Sharing – Exploiting the Value in Data for Health and Innovation. Studies in Health Technology and Informatics. Vol. 302. pp. 932–936. arXiv: 2210.12705 . doi:10.3233/SHTI230312. ISBN   978-1-64368-388-1. PMID   37203539.
  3. Lesmann, Hellen; Hustinx, Alexander; Moosa, Shahida; Klinkhammer, Hannah; Marchi, Elaine; Caro, Pilar; Abdelrazek, Ibrahim M.; Pantel, Jean Tori; Hagen, Merle ten (2024-05-21), "GestaltMatcher Database - A global reference for facial phenotypic variability in rare human diseases", medRxiv, pp. 2023.06.06.23290887, doi:10.1101/2023.06.06.23290887, PMC   10371103 , PMID   37503210 , retrieved 2024-07-30
  4. Schmidt, Axel; Danyel, Magdalena; Grundmann, Kathrin; Brunet, Theresa; Klinkhammer, Hannah; Hsieh, Tzung-Chien; Engels, Hartmut; Peters, Sophia; Knaus, Alexej; Moosa, Shahida; Averdunk, Luisa; Boschann, Felix; Sczakiel, Henrike Lisa; Schwartzmann, Sarina; Mensah, Martin Atta (2024-07-22). "Next-generation phenotyping integrated in a national framework for patients with ultrarare disorders improves genetic diagnostics and yields new molecular findings". Nature Genetics. 56 (8): 1644–1653. doi: 10.1038/s41588-024-01836-1 . ISSN   1546-1718. PMC   11319204 . PMID   39039281.
  5. Hsieh, Tzung-Chien; Bar-Haim, Aviram; Moosa, Shahida; Ehmke, Nadja; Gripp, Karen W.; Pantel, Jean Tori; Danyel, Magdalena; Mensah, Martin Atta; Horn, Denise; Rosnev, Stanislav; Fleischer, Nicole; Bonini, Guilherme; Hustinx, Alexander; Schmid, Alexander; Knaus, Alexej (March 2022). "GestaltMatcher facilitates rare disease matching using facial phenotype descriptors". Nature Genetics. 54 (3): 349–357. doi:10.1038/s41588-021-01010-x. ISSN   1061-4036. PMC   9272356 . PMID   35145301.