C13orf42

Last updated
C13orf42
Identifiers
Aliases C13orf42 , LINC00372, LINC00371, long intergenic non-protein coding RNA 371, chromosome 13 open reading frame 42
External IDs GeneCards: C13orf42
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001351589

n/a

RefSeq (protein)

n/a

n/a

Location (UCSC) Chr 13: 51.08 – 51.2 Mb n/a
PubMed search [2] n/a
Wikidata
View/Edit Human

C13orf42 is a protein which, in humans, is encoded by the gene chromosome 13 open reading frame 42 (C13orf42). RNA sequencing data shows low expression of the C13orf42 gene in a variety of tissues. [3] The C13orf42 protein is predicted to be localized in the mitochondria, nucleus, and cytosol. [4] Tertiary structure predictions for C13orf42 indicate multiple alpha helices. [5]

Contents

Gene

Summary

C13orf42 is a protein encoding gene containing 4 exons. C13orf42 is also known by aliases LINC00371 and LINC00372. [3] RNA sequencing shows the gene's expression at low levels in various tissues. [3] [6]

Location

C13orf42 is located on the minus strand of chromosome 13 at 13q14.3 in humans. [3] [7] C13orf42 is located from 51.08 Mb to 51.20 Mb on chromosome 13 and spans 118 kilobases. [8]

Neighborhood

The genomic neighborhood of C13orf42 consists of several pseudogenes along with ribonuclease H2 subunit B (RNASEH2B), uncharacterized LOC107984554, and family with sequence similarity 124 member A (FAM124A). [3]

A depiction of exons to scale on the C13orf42 gene. Image was created using Illustrator for Biological Sciences (IBS) from biocuckoo and gene information from UCSC BLAT. C13orf42 exon diagram to scale.png
A depiction of exons to scale on the C13orf42 gene. Image was created using Illustrator for Biological Sciences (IBS) from biocuckoo and gene information from UCSC BLAT.

Exons

The C13orf42 gene contains 4 exons. [10]

Expression

RNA sequencing of C13orf42 shows expression in a variety of tissues including the spleen, kidney, heart, brain, testis, skin, esophagus, colon, small intestine, stomach, lung, placenta, salivary gland, thymus, and adipose. [3] RNA sequencing of human fetal tissue shows C13orf42 expression starting at 20 weeks in the intestine, 16 weeks in the kidney, 10 weeks in the lung, and expression in the stomach is seen at 16 weeks but not 10, 18, or 20 weeks. [3] Recorded RNA expression is very low, with all results being lower than 0.5 reads per kilobase of transcript per million reads mapped (RPKM). Microarray data from NCBI geo (GDS425) shows expression in additional tissues including bone marrow, liver, skeletal muscle, spinal cord, and pancreas. [6]

Transcript

Variants

C13orf42 produces four known transcript variants, variant 1, variant 2, variant 3, and variant X1. Transcript variant 3 (accession number: NM_001351589.3) is the longest high-quality mRNA at 3075 nucleotides. [10] Transcript variant 3 contains 4 exons and encodes a 325 amino acid protein.

Transcript variants 1, 2, and X1 all lack the first exon but align with exons 2, 3, and 4 of transcript variant 3. Variants 1 and 2 are not protein encoding, while variants 3 and X1 are protein coding. Variant X1 is 2717 nucleotides long and encodes a 189 amino acid protein which aligns with the last 187 amino acids of the longer protein encoded by transcript variant 3 and differs in its first two amino acids. [11] [12]

Protein

Isoforms

There are two known proteins encoded by the isoforms of C13orf42. [13] Transcript variant 3 encodes the longest protein at 325 amino acids long. [13] Transcript variant X1 encodes a 189 amino acid long protein. [13] This protein aligns with exons 2, 3, and 4 of the 325 amino acid protein, but is missing exon 1. [12]

Protein Composition

C13orf42 has a predicted isoelectric point of 9.3 and a predicted molecular weight of 37.4 kDa. [14] Human C13orf42 is a serine rich and positively charged amino acid (lysine and arginine) rich protein. [15] This composition is partially conserved in orthologs.

Tertiary Structure

The C13orf42 tertiary structure of the highest confidence predicted by I-Tasser is predicted to have many alpha helices. [5] In the structure below, residues indicated to be present in C13orf42 in higher amounts (serine, lysine and arginine) are annotated. [16] A space filling model and a charge model is also shown for C13orf42.

Tertiary structure of C13orf42 predicted by I-Tasser and annotated in NCBI iCn3D. The protein is shown in space filling form and highly conserved amino acids are labeled in yellow. Predicted C13orf42 tertiary structure with conserved amino acids labeled.png
Tertiary structure of C13orf42 predicted by I-Tasser and annotated in NCBI iCn3D. The protein is shown in space filling form and highly conserved amino acids are labeled in yellow.
Tertiary structure of C13orf42 predicted by I-Tasser and annotated in NCBI iCn3D. Charge is indicated with blue being positive and red being negative residues. Highly conserved amino acids are labeled in yellow. Predicted C13orf42 structure with charge and conserved amino acids.png
Tertiary structure of C13orf42 predicted by I-Tasser and annotated in NCBI iCn3D. Charge is indicated with blue being positive and red being negative residues. Highly conserved amino acids are labeled in yellow.
C13orf42 structure predicted by I-Tasser and annotated in Chimera with residues indicated to be present in higher-than-average amounts highlighted. (Serine = blue, Lysine = green, Arginine = orange) C13orf42 structure predicted by I-Tasser and annotated in Chimera.png
C13orf42 structure predicted by I-Tasser and annotated in Chimera with residues indicated to be present in higher-than-average amounts highlighted. (Serine = blue, Lysine = green, Arginine = orange)

Subcellular Localization

Human C13orf42 is predicted to be localized to the mitochondria, nucleus, cytosol, and endoplasmic reticulum with the ER predicted at a low percentage (<5%). [4] Orthologs show similar predicted subcellular localization with mitochondria, nucleus, and cytosol being the top predicted locations, however, predicted percentages vary. [4]

Immunohistochemistry

C13orf42 antibody B-4 (catalog number: sc-376095) shows cytoplasmic and nuclear staining in seminiferous ducts and Lyedig cells of testis tissue. [18] C13orf42 antibody E-3 (catalog number: sc-374567) shows cytoplasmic staining in seminiferous ducts and Lyedig cells of testis tissue, and cytoplasmic and nucleolar localization in HeLa cells. [19]

Post translational Modifications

Predicted post translational modifications shown on human C13orf42. Diagram was created using Illustrator for Biological Sciences. Human C13orf42 predicted post translational modification illustration.png
Predicted post translational modifications shown on human C13orf42. Diagram was created using Illustrator for Biological Sciences.

C13orf42 is predicted to have 10 highly conserved (in over 70% of analyzed orthologs from table below) phosphorylation sites. [20] Phosphorylation sites include one CK2 phosphorylation, one TYR phosphorylation, two cAMP phosphorylation sites, and six PKC phosphorylation sites. There are three predicted O-β-GlcNAc sites and two predicted yin-yang sites in C13orf42 which are fully conserved in orthologs. [21] A yin-yang site occurs when O-β-GlcNAc and phosphorylation are predicted for the same site. C13orf42 is not predicted to have myristylation sites as it does not contain an N-terminal glycine. [22]

Domains

C13orf42 has no identified domains with high confidence or conservation in orthologs.

Homology and evolution

Corrected Divergence vs. Date of Divergence (MYA) of C13orf42 compared to Cytochrome C and Fibrinogen alpha. Graph of C13orf42 evolution compared to Cytochrome C and Fibrinogen Alpha.jpg
Corrected Divergence vs. Date of Divergence (MYA) of C13orf42 compared to Cytochrome C and Fibrinogen alpha.

Orthologs

C13orf42 has orthologs in mammals, birds, reptiles, amphibians, bony fish, and cartilaginous fish as shown in the ortholog table below. [23] No orthologs were found in jawless fish, invertebrates, plants, fungi, viruses, or bacteria. [23] All mammals contain the same 4 exons as the human C13orf42 protein, and nonmammals are missing exon 4. Mammalian orthologs have a high percent identity to human C13orf42, each having over 62% identity. The furthest orthologs (cartilaginous fish) have sequence identities around 33%. Human C13orf42 does not have paralogs. [8]

Ortholog Table

Table depicting genus and species, common name, taxonomic class, date of divergence, accession number, length, percent identity, and percent similarity of C13orf42 and its orthologs. Default sorting is by date of divergence then percent sequence identity. [13] [12]
Genus and SpeciesCommon NameTaxonomic ClassDate of Divergence (MYA) [24] Accession NumberLength (amino acids)Percent Identity to Homo sapiensPercent Similarity to Homo sapiens
Homo sapiens HumanPrimates0NP_001338518.1325100100
Mus musculus MouseDasyuromorphia87XP_030104110.131876.984
Tursiops truncatus Common bottlenose dolphinCetacea94XP_033699576.131982.887.7
Equus caballus HorsePerissodactyla94XP_023477317.132682.690.2
Mustela putorius European polecatCarnivora94XP_004775284.132679.485
Pipistrellus kuhlii Kuhl's pipistrelle (bat)Chiroptera94XP_036312978.132579.186.8
Ursus maritimus Polar bearUrsidae94XP_040478472.132877.282.7
Elephas maximus indicus Indian elephantProboscidea99XP_049709344.132679.486.2
Dasypus novemcinctus Nine-banded armadilloCingulata99XP_023446856.132872.881.4
Dromiciops gliroides Monito del monteMicrobiotheria160XP_043849658.132667.979.5
Trichosurus vulpecula Common brushtail possumDiprotodontia160XP_036599801.132666.779.5
Sarcophilus harrisii Tasmanian devilDasyuromorphia160XP_023355407.132666.480.1
Ornithorhynchus anatinus PlaytpusMonotremes180XP_028904285.133062.373
Alligator sinensis Chinese alligatorCrocodilian319XP_025062978.126648.661.7
Dromaius novaehollandiae EmuCasuariiformes319XP_025964173.126848.259.8
Gallus gallus ChickenGalliformes319XP_004938779.126847.661.3
Camarhynchus parvulus Small tree finchThraupidae319XP_030802909.126447.160.9
Pelodiscus sinensis Chinese softshell turtleTestudines319XP_014429996.12674760.6
Phasianus colchicus Common pheasantGalliformes319XP_031471701.127246.459.9
Varanus komodoensis Komodo dragonSquamata319XP_044300138.126945.960.1
Python bivittatus Burmese pythonSquamata319XP_025026122.126944.158
Pantherophis guttatus Corn snakeSquamata319XP_034287884.126742.756.4
Rhinatrema bivittatum Two-lined caecilianRhinatrematidae353XP_029459031.127345.459.5
Xenopus tropicalis Western clawed frogAnura353XP_031752228.126043.557.4
Bufo gargarizans Asiatic toadAnura353XP_044141363.126240.756.8
Bufo bufo Common toadAnura353XP_040278366.139130.942
Protopterus annectens West african lungfishLepidosireniformes408XP_043927617.126937.853.5
Polyodon spathula PaddlefishAcipenseriformes431XP_041127510.127533.852.4
Danio rerio ZebrafishCypriniformes431XP_021329868.128730.244.7
Clupea harengus Atlantic herringClupeiformes431XP_042559186.13062941.7
Callorhinchus milii Australian ghostsharkChimaeriformes464XP_042189981.126735.553.3
Amblyraja radiata Thorny skateRajiformes464XP_032889397.12673347.5
Scyliorhinus canicula Small-spotted catsharkCarcharhiniformes464XP_038658253.126432.550.4

Phylogeny

A phylogenetic tree shows human C13orf42 is most related its mammalian orthologs, and most distantly related to cartilaginous fish orthologs. [25]

A phylogenetic tree of the C13orf42 protein made using NCBI protein to collect sequences and phylogeny.fr to generate the tree. Phylogenetic Tree of C13orf42.pdf
A phylogenetic tree of the C13orf42 protein made using NCBI protein to collect sequences and phylogeny.fr to generate the tree.

Function

Clinical significance

Kanagal-Shamanna et. al identified an ATM fusion with C13orf42 in a patient with chronic lymphocytic leukemia which lead to ATM inactivation. [26]

Xiong et. al indicated SNP rs7325564 to be significantly associated with nasion and pronasale face shape in humans. [27]

Related Research Articles

<span class="mw-page-title-main">TMEM98</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 98 is a single-pass membrane protein that in humans is encoded by the TMEM98 gene. The function of this protein is currently unknown. TMEM98 is also known as UNQ536/PRO1079.

<span class="mw-page-title-main">C20orf27</span> Protein-coding gene in the species Homo sapiens

UPF0687 protein C20orf27 is a protein that in humans is encoded by the C20orf27 gene. It is expressed in the majority of the human tissues. One study on this protein revealed its role in regulating cell cycle, apoptosis, and tumorigenesis via promoting the activation of NFĸB pathway.

TMEM143 is a protein that in humans is encoded by TMEM143 gene. TMEM143, a dual-pass protein, is predicted to reside in the mitochondria and high expression has been found in both human skeletal muscle and the heart. Interaction with other proteins indicate that TMEM143 could potentially play a role in tumor suppression/expression and cancer regulation.

<span class="mw-page-title-main">SHLD1</span> Protein-coding gene in the species Homo sapiens

SHLD1 or shieldin complex subunit 1 is a gene on chromosome 20. The C20orf196 gene encodes an mRNA that is 1,763 base pairs long, and a protein that is 205 amino acids long.

<span class="mw-page-title-main">C1orf112</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 open reading frame 112, is a protein that in humans is encoded by the C1orf112 gene, and is located at position 1q24.2. C1orf112 encodes for seventeen variants of mRNA, fifteen of which are functional proteins. C1orf112 has a determined precursor molecular weight of 96.6 kDa and an isoelectric point of 5.62. C1orf112 has been experimentally determined to localize to the mitochondria, although it does not contain a mitochondrial targeting sequence.

C11orf42 is an uncharacterized protein in Homo sapiens that is encoded by the C11orf42 gene. It is also known as chromosome 11 open reading frame 42 and uncharacterized protein C11orf42, with no other aliases. The gene is mostly conserved in mammals, but it has also been found in rodents, reptiles, fish and worms.

LOC101928193 is a protein which in humans is encoded by the LOC101928193 gene. There are no known aliases for this gene or protein. Similar copies of this gene, called orthologs, are known to exist in several different species across mammals, amphibians, fish, mollusks, cnidarians, fungi, and bacteria. The human LOC101928193 gene is located on the long (q) arm of chromosome 9 with a cytogenic location at 9q34.2. The molecular location of the gene is from base pair 133,189,767 to base pair 133,192,979 on chromosome 9 for an mRNA length of 3213 nucleotides. The gene and protein are not yet well understood by the scientific community, but there is data on its genetic makeup and expression. The LOC101928193 protein is targeted for the cytoplasm and has the highest level of expression in the thyroid, ovary, skin, and testes in humans.

<span class="mw-page-title-main">WD Repeat and Coiled Coil Containing Protein</span> Protein-coding gene in humans

WD Repeat and Coiled-coiled containing protein (WDCP) is a protein which in humans is encoded by the WDCP gene. The function of the protein is not completely understood, but WDCP has been identified in a fusion protein with anaplastic lymphoma kinase found in colorectal cancer. WDCP has also been identified in the MRN complex, which processes double-stranded breaks in DNA.

TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.

C2orf74, also known as LOC339804, is a protein encoding gene located on the short arm of chromosome 2 near position 15 (2p15). Isoform 1 of the gene is 19,713 base pairs long. C2orf74 has orthologs in 135 different species, including primarily placental mammals and some marsupials.

<span class="mw-page-title-main">TMEM101</span>

Transmembrane protein 101 (TMEM101) is a protein that in humans is encoded by the TMEM101 gene. The TMEM101 protein has been demonstrated to activate the NF-κB signaling pathway. High levels of expression of TMEM101 have been linked to breast cancer.

<span class="mw-page-title-main">FAM98C</span> Gene

Family with sequence 98, member C or FAM98C is a gene that encodes for FAM98C has two aliases FLJ44669 and hypothetical protein LOC147965. FAM98C has two paralogs in humans FAM98A and FAM98B. FAM98C can be characterized for being a Leucine-rich protein. The function of FAM98C is still not defined. FAM98C has orthologs in mammals, reptiles, and amphibians and has a distant orhtologs in Rhinatrema bivittatum and Nanorana parkeri.

<span class="mw-page-title-main">C5orf24</span> Protein-coding gene in the species Homo sapiens

C5orf24 is a protein encoded by the C5orf24 gene (5q31.1) in humans. C5orf24 is primarily localized to the nucleus and is highly conserved with orthologs in mammals, birds, reptiles, amphibians, and fish.

<span class="mw-page-title-main">C4orf19</span> Human C4orf19 gene

C4orf19 is a protein which in humans is encoded by the C4orf19 gene.

<span class="mw-page-title-main">TMEM212</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 212 is a protein that in humans is encoded by the TMEM212 gene. The protein consists of five transmembrane domains and localizes in the plasma membrane and endoplasmic reticulum. TMEM212 has orthologs in vertebrates but not invertebrates. TMEM212 has been associated with sporadic Parkinson's disease, facial processing, and adiposity in African Americans.

<span class="mw-page-title-main">C4orf36</span> Draft for page on C4orf36 gene/protein

C4orf36 is a protein that in humans is encoded by the c4orf36 gene.

<span class="mw-page-title-main">CXorf65</span> Gene on human chromosome X

Human uncharacterized protein CXorf65 is encoded by the gene CXorf65, which is located on the minus strand of chromosome X. Its transcript is 834 nucleotides long and consists of 6 exons. The translated protein is 183 amino acids in length. with a molecular weight of 21.3 kDa

<span class="mw-page-title-main">THAP3</span> Protein in Humans

THAP domain-containing protein 3 (THAP3) is a protein that, in Homo sapiens (humans), is encoded by the THAP3 gene. The THAP3 protein is as known as MGC33488, LOC90326, and THAP domain-containing, apoptosis associated protein 3. This protein contains the Thanatos-associated protein (THAP) domain and a host-cell factor 1C binding motif. These domains allow THAP3 to influence a variety of processes, including transcription and neuronal development. THAP3 is ubiquitously expressed in H. sapiens, though expression is highest in the kidneys.

<span class="mw-page-title-main">C13orf46</span> C13of46 Gene and Protein

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

<span class="mw-page-title-main">Chromosome 5 open reading frame 47</span> Human C5ORF47 Gene

Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000226792 - Ensembl, May 2017
  2. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  3. 1 2 3 4 5 6 7 "C13orf42 chromosome 13 open reading frame 42 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov.
  4. 1 2 3 "PSORT II Prediction". psort.hgc.jp.
  5. 1 2 3 4 5 Yang J, Zhang Y (July 2015). "I-TASSER server: new development for protein structure and function predictions". Nucleic Acids Research. 43 (W1): W174–W181. doi: 10.1093/nar/gkv342 . PMC   4489253 . PMID   25883148.
  6. 1 2 Yanai I, Benjamin H, Shmoish M, Chalifa-Caspi V, Shklar M, Ophir R, et al. (March 2005). "Genome-wide midrange transcription profiles reveal expression level relationships in human tissue specification". Bioinformatics. 21 (5): 650–659. doi: 10.1093/bioinformatics/bti042 . PMID   15388519.
  7. "C13orf42 Gene - GeneCards | CM042 Protein | CM042 Antibody". www.genecards.org.
  8. 1 2 3 Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, Haussler D (June 2002). "The human genome browser at UCSC". Genome Research. 12 (6): 996–1006. doi: 10.1101/gr.229102 . PMC   186604 . PMID   12045153.
  9. 1 2 Liu W, Xie Y, Ma J, Luo X, Nie P, Zuo Z, et al. (October 2015). "IBS: an illustrator for the presentation and visualization of biological sequences". Bioinformatics. 31 (20): 3359–3361. doi:10.1093/bioinformatics/btv362. PMC   4595897 . PMID   26069263.
  10. 1 2 "Homo sapiens chromosome 13 open reading frame 42 (C13orf42), transcript variant 3, mRNA". NCBI nucleotide. 11 June 2022.
  11. "PREDICTED: Homo sapiens chromosome 13 open reading frame 42 (C13orf42), transcript variant X1, mRNA". 5 April 2022.
  12. 1 2 3 Needleman SB, Wunsch CD (March 1970). "A general method applicable to the search for similarities in the amino acid sequence of two proteins". Journal of Molecular Biology. 48 (3): 443–53. doi:10.1016/0022-2836(70)90057-4. PMID   5420325.
  13. 1 2 3 4 5 "Home - Protein - NCBI". www.ncbi.nlm.nih.gov.
  14. Wilkins MR, Gasteiger E, Bairoch A, Sanchez JC, Williams KL, Appel RD, Hochstrasser DF (24 September 1998). "Protein identification and analysis tools in the ExPASy server". 2-D Proteome Analysis Protocols. Methods in Molecular Biology. Vol. 112. pp. 531–552. doi:10.1385/1-59259-584-7:531. ISBN   1-59259-584-7. PMID   10027275.
  15. 1 2 Brendel V, Bucher P, Nourbakhsh IR, Blaisdell BE, Karlin S (March 1992). "Methods and algorithms for statistical analysis of protein sequences". Proceedings of the National Academy of Sciences of the United States of America. 89 (6): 2002–2006. Bibcode:1992PNAS...89.2002B. doi: 10.1073/pnas.89.6.2002 . PMC   48584 . PMID   1549558.
  16. 1 2 Pettersen EF, Goddard TD, Huang CC, Couch GS, Greenblatt DM, Meng EC, Ferrin TE (October 2004). "UCSF Chimera--a visualization system for exploratory research and analysis". Journal of Computational Chemistry. 25 (13): 1605–1612. doi:10.1002/jcc.20084. PMID   15264254. S2CID   8747218.
  17. 1 2 Wang J, Youkharibache P, Zhang D, Lanczycki CJ, Geer RC, Madej T, et al. (January 2020). "iCn3D, a web-based 3D viewer for sharing 1D/2D/3D representations of biomolecular structures". Bioinformatics. 36 (1): 131–135. doi:10.1093/bioinformatics/btz502. PMC   6956771 . PMID   31218344.
  18. "C13orf42 Antibody (B-4)". www.scbt.com.
  19. "C13orf42 Antibody (E-3)". www.scbt.com.
  20. "Motif Scan". myhits.sib.swiss.
  21. Blom N, Sicheritz-Pontén T, Gupta R, Gammeltoft S, Brunak S (June 2004). "Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence". Proteomics. 4 (6): 1633–1649. doi:10.1002/pmic.200300771. PMID   15174133. S2CID   18810164.
  22. Bologna G, Yvon C, Duvaud S, Veuthey AL (June 2004). "N-Terminal myristoylation predictions by ensembles of neural networks". Proteomics. 4 (6): 1626–1632. doi:10.1002/pmic.200300783. PMID   15174132. S2CID   20289352.
  23. 1 2 "C13orf42 BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov.
  24. Kumar S, Suleski M, Craig JM, Kasprowicz AE, Sanderford M, Li M, et al. (August 2022). "TimeTree 5: An Expanded Resource for Species Divergence Times". Molecular Biology and Evolution. 39 (8): msac174. doi:10.1093/molbev/msac174. PMC   9400175 . PMID   35932227.
  25. 1 2 Dereeper A, Guignon V, Blanc G, Audic S, Buffet S, Chevenet F, et al. (July 2008). "Phylogeny.fr: robust phylogenetic analysis for the non-specialist". Nucleic Acids Research. 36 (Web Server issue): W465–W469. doi: 10.1093/nar/gkn180 . PMC   2447785 . PMID   18424797.
  26. Kanagal-Shamanna R, Bao H, Kearney H, Smoley S, Tang Z, Luthra R, et al. (April 2022). "Molecular characterization of Novel ATM fusions in chronic lymphocytic leukemia and T-cell prolymphocytic leukemia". Leukemia & Lymphoma. 63 (4): 865–875. doi:10.1080/10428194.2021.2010061. PMID   34898335. S2CID   245131979.
  27. Xiong Z, Dankova G, Howe LJ, Lee MK, Hysi PG, de Jong MA, et al. (November 2019). McCarthy MI, Morris AP, Cha S (eds.). "Novel genetic loci affecting facial shape variation in humans". eLife. 8: e49898. doi: 10.7554/eLife.49898 . PMC   6905649 . PMID   31763980.