ANKRD24

Last updated
ANKRD24
Identifiers
Aliases ANKRD24 , ankyrin repeat domain 24
External IDs MGI: 1890394 HomoloGene: 49885 GeneCards: ANKRD24
Gene location (Human)
Ideogram human chromosome 19.svg
Chr. Chromosome 19 (human) [1]
Human chromosome 19 ideogram.svg
HSR 1996 II 3.5e.svg
Red rectangle 2x18.png
Band 19p13.3Start4,183,354 bp [1]
End4,224,814 bp [1]
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_133475

NM_027480

RefSeq (protein)

NP_597732

NP_081756

Location (UCSC) Chr 19: 4.18 – 4.22 Mb Chr 10: 81.63 – 81.65 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse
Predicted tertiary structure of ANKRD24 using I-TASSER ANKRD24.jpg
Predicted tertiary structure of ANKRD24 using I-TASSER

Ankyrin repeat domain-containing protein 24 is a protein in humans that is coded for by the ANKRD24 gene. [6] The gene is also known as KIAA1981 [7] . The protein's function in humans is currently unknown. ANKRD24 is in the protein family that contains ankyrin-repeat domains.

Ankyrin repeat InterPro Repeat

The ankyrin repeat is a 33-residue motif in proteins consisting of two alpha helices separated by loops, first discovered in signaling proteins in yeast Cdc10 and Drosophila Notch. Domains consisting of ankyrin tandem repeats mediate protein–protein interactions and are among the most common structural motifs in known proteins. They appear in bacterial, archaeal, and eukaryotic proteins, but are far more common in eukaryotes. Ankyrin repeat proteins, though absent in most viruses, are common among poxviruses. Most proteins that contain the motif have four to six repeats, although its namesake ankyrin contains 24, and the largest known number of repeats is 34, predicted in a protein expressed by Giardia lamblia.

Contents

Gene

Locus

Cytochromatic location of ANKRD24 gene Locus of ANKRD24.png
Cytochromatic location of ANKRD24 gene
ANKRD24 gene neighborhood ANKRD24 gene neighborhood.png
ANKRD24 gene neighborhood

The gene is located on chromosome 19 at p13.3 on the forward strand. [10] [11] The gene is 4041 base pairs in length and contains 29 exons. [9] [12] The gene is neighbored by the gene SIRT6 that encodes for the Sirtuin-6 protein and the EBI3 gene that encodes for the Epstein-Barr virus induced gene 3 protein. [13]

Sirtuin enzyme

Sirtuins are a class of proteins that possess either mono-ADP-ribosyltransferase, or deacylase activity, including deacetylase, desuccinylase, demalonylase, demyristoylase and depalmitoylase activity. The name Sir2 comes from the yeast gene 'silent mating-type information regulation 2', the gene responsible for cellular regulation in yeast.

EBI3 protein-coding gene in the species Homo sapiens

Epstein-Barr virus induced gene 3, also known as interleukin-27 subunit beta or IL-27B, is a protein which in humans is encoded by the EBI3 gene.

Expression

The expression pattern of ANKRD24 is uncharacterized. Under conditions of cell growth and proliferation, the expression levels increase. In germ line tumors, glioma, and prostate cancer, the expression is elevated relative to other disease states. During development, the expression level is elevated in the blastocyst stage. In adults, there are elevated levels of expression in the placenta, stomach, kidneys, and eye relative to other tissues. [14] However, the results of experimental gene expression profiles are inconsistent relative to ANKRD24 expression, suggesting redundancy of the gene and its protein product.

Glioma A type of tumor that starts in the brain or spine

A glioma is a type of tumor that starts in the glial cells of the brain or the spine. Gliomas comprise about 30 percent of all brain tumors and central nervous system tumors, and 80 percent of all malignant brain tumors.

Prostate cancer male reproductive cancer

Prostate cancer is the development of cancer in the prostate, a gland in the male reproductive system. Most prostate cancers are slow growing; however, some grow relatively quickly. The cancer cells may spread from the prostate to other areas of the body, particularly the bones and lymph nodes. It may initially cause no symptoms. In later stages, it can lead to difficulty urinating, blood in the urine or pain in the pelvis, back, or when urinating. A disease known as benign prostatic hyperplasia may produce similar symptoms. Other late symptoms may include feeling tired due to low levels of red blood cells.

Blastocyst

The blastocyst is a structure formed in the early development of mammals. It possesses an inner cell mass (ICM) which subsequently forms the embryo. The outer layer of the blastocyst consists of cells collectively called the trophoblast. This layer surrounds the inner cell mass and a fluid-filled cavity known as the blastocoel. The trophoblast gives rise to the placenta. The name "blastocyst" arises from the Greek βλαστός blastos and κύστις kystis.

mRNA

Alternative expression

13 transcription splice variants of ANKRD24 mRNA have been predicted. [15]

Messenger RNA large family of RNA molecules that convey genetic information from DNA to the ribosome, where they specify the amino acid sequence of the protein products of gene expression

Messenger RNA (mRNA) is a large family of RNA molecules that convey genetic information from DNA to the ribosome, where they specify the amino acid sequence of the protein products of gene expression. RNA polymerase transcribes primary transcript mRNA into processed, mature mRNA. This mature mRNA is then translated into a polymer of amino acids: a protein, as summarized in the central dogma of molecular biology.

Protein

General features

The ANKRD24 protein is 1146 amino acids in length, has a molecular weight of 124kDa, and has an isoelectric point of 4.98. [16] The secondary structure is predicted to consist of all alpha helices and to not contain any beta strands. [17] The tertiary structure of the protein is predicted to be a helical twist. [5]

The isoelectric point (pI, pH(I), IEP), is the pH at which a molecule carries no net electrical charge or is electrically neutral in the statistical mean. The standard nomenclature to represent the isoelectric point is pH(I), although pI is also commonly seen, and is used in this article for brevity. The net charge on the molecule is affected by pH of its surrounding environment and can become more positively or negatively charged due to the gain or loss, respectively, of protons (H+).

Alpha helix type of secondary structure

The alpha helix (α-helix) is a common motif in the secondary structure of proteins and is a right hand-helix conformation in which every backbone N−H group donates a hydrogen bond to the backbone C=O group of the amino acid located three or four residues earlier along the protein sequence.

Composition

ANKRD24 has a relatively high composition of alanine (15.0%), glutamic acid (13.5%), and leucine (11.0%) and a relatively low composition of cysteine (1.5%), phenylalanine (0.7%), tryptophan (0.2%), and tyrosine (0.8%). The protein contains positive run clusters that could be nuclear localization signals. The protein does not have any significant negative charge clusters and no significant charge patterns. [18]

Alanine chemical compound

Alanine (symbol Ala or A) is an α-amino acid that is used in the biosynthesis of proteins. It contains an amine group and a carboxylic acid group, both attached to the central carbon atom which also carries a methyl group side chain. Consequently, its IUPAC systematic name is 2-aminopropanoic acid, and it is classified as a nonpolar, aliphatic α-amino acid. Under biological conditions, it exists in its zwitterionic form with its amine group protonated (as −NH3+) and its carboxyl group deprotonated (as −CO2). It is non-essential to humans as it can be synthesised metabolically and does not need to be present in the diet. It is encoded by all codons starting with GC (GCU, GCC, GCA, and GCG).

Glutamic acid amino acid

Glutamic acid is an α-amino acid that is used by almost all living beings in the biosynthesis of proteins. It is non-essential in humans, meaning the body can synthesize it. It is also an excitatory neurotransmitter, in fact the most abundant one, in the vertebrate nervous system. It serves as the precursor for the synthesis of the inhibitory gamma-aminobutyric acid (GABA) in GABA-ergic neurons.

Leucine chemical compound

Leucine (symbol Leu or L) is an essential amino acid that is used in the biosynthesis of proteins. Leucine is an α-amino acid, meaning it contains an α-amino group (which is in the protonated −NH3+ form under biological conditions), an α-carboxylic acid group (which is in the deprotonated −COO form under biological conditions), and a side chain isobutyl group, making it a non-polar aliphatic amino acid. It is essential in humans, meaning the body cannot synthesize it: it must be obtained from the diet. Human dietary sources are foods that contain protein, such as meats, dairy products, soy products, and beans and other legumes. It is encoded by the codons UUA, UUG, CUU, CUC, CUA, and CUG.

Subcellular localization

The ANKRD24 protein is predicted to localize in the nucleus of cells. [19]

Domains

ANKRD24 is in the protein family that contains ankyrin-repeat domains. Ankyrin repeats are known for mediating protein-protein interactions. [20] The protein also contains two coiled-coil regions. [21]

Post-translational modification

ANKRD24 is predicted to undergo C-mannosylation. [22]

Interacting proteins

ANKRD24 is predicted to interact with disks large homolog 4 (DLG4), eukaryotic translation elongation factor 1-alpha 1 (EEF1A1), unc-119 homolog A (UNC119), replication timing regulatory factor 1 (RIF1), protein kinase C and casein kinase substrate in neurons 1 (PACSIN1), nuclear factor NF-kappa-B p105 subunit (NFKB1), cholest-5-ene-3β,7α-diol 3β-dehydrogenase (HSD3B7), lethal giant larvae homolog 2 (L2GL2), and glucocorticoid induced 1 (GLC)CI1. [23] [24] [25] [26] No characterization of these interactions has yet to be observed. [27]

Homology

ANKRD24 unrooted phylogenetic tree ANKRD24 unrooted phylogenetic tree.png
ANKRD24 unrooted phylogenetic tree

ANKRD24 has no human paralogs. Orthologous proteins are found in other organisms. The following table represents some of the orthologs found using searches in BLAST [29] and BLAT. [30] However, this list is not exhaustive for the orthologs of ANKRD24 and is only meant to display the wide diversity of species for which orthologs of ANKRD24 can be found.

Genus speciesCommon NameDate of Divergence (MYA) [31] Accession # (Nucleotide)Accession # (Protein)Sequence Length (bp)Sequence Length (aa)E valueSequence Identity (%)Sequence Similarity (%)
1 Homo sapiens Humans0 NM_133475.1 NP_597732.140411146100100
2Nomascus leucogenysNorthern white-cheeked gibbon19.9XM_012503370.1XP_012358824.13675116509596
3Otolemur garnettiiSmall-eared galago75.9 XM_012804592.1 XP_012660046.1 4181103508387
4Trichechus manatus latirostrisFlorida Manatee105 XM_004378459.1 ]3354111708186
5Eptesicus fuscusBig brown bat97.5 XM_008150610.1 XP_008148832.1 3369112208084
6Ailuropoda melanoleucaGiant panda97.5XM_002928471.2 XP_002928517.1 3729111608084
7Loxodonta africanaAfrican savanna elephant105XM_003421925.2 XP_003421973.1 4049134507985
8Octodon degusDegu90.9 XM_004632746.1 XP_004632803.1 3381112607984
9Chrysochloris asiaticaCape golden mole105XM_006869037.1 XP_006869099.1 3747124807885
10Orcinus orcaKiller Whale97.5 XM_004277648.2 XP_004277696.1 3482107907782
11Chelonia mydasGreen sea turle320.5XM_007072230.1XP_007072292305110169.00E-1786377
12Ficedula albicollisCollared flycatcher320.5XM_005060250.1XP_00506030718856212.00E-1716074
13Gekko JaponicusJapanese gecko320.5XM_015410656.1XP_015266142305710193.00E-1755875
14Python bivittatusBurmese python320.5XM_007434390XP_007434452319210631.00E-1675875
15Falco cherrugSaker falcon320.5XM_014277718.1XP_014133193143147605771
16Stegastes partitusBicolor damselfish429.6 XM_008295305.1 XP_008293527.1362910952.00E-904155
17Trichomonas vaginalis G3Trichomonas vaginalis2040 XM_001329224.1 XP_0013292597742572.00E-243454
18Ciona intestinalisVase tunicate733 XM_009863371.1 XP_00986167331569569.00E-243248
19Talaromyces stipitatusFungi1302.5 XM_002486774.1 XP_002486819.1321610713.00E-183251
20Cryptococcus neoformansYeast1302.5 XM_012197089.1 XP_012052479.19432363.00E-163251

Related Research Articles

Chromosome 16 open reading frame 13 protein-coding gene in the species Homo sapiens

Chromosome 16 open reading frame 13, also called C16orf13, is a protein-coding gene of unknown function, also known as JFP2. Though the function of this gene is unknown, various data have revealed that it is expressed at high levels in various cancerous tissues. Underexpression of this gene has also been linked to disease consequences in humans.

POTEB protein-coding gene in the species Homo sapiens

POTE ankyrin domain family, member B is a protein in humans that is encoded by the POTEB gene.(Prostate, Ovary, Testes Expressed ankyrin domain family member B).It is most likely involved in mediating protein-protein interaction via its 5 ankyrin domains. POTEB is most probably aids in intracellular signaling, but is not likely to be a secreted or nuclear protein. POTEB's function is likely to be regulated via 17 potential phosphorylation sites. There is currently no evidence to suggest that POTEB has nuclear localization signals.

C8orf48 protein-coding gene in the species Homo sapiens

C8orf48 is a protein that in humans is encoded by the C8orf48 gene. C8orf48 is a nuclear protein specifically predicted to be located in the nuclear lamina. C8orf48 has been found to interact with proteins that are involved in the regulation of various cellular responses like gene expression, protein secretion, cell proliferation, and inflammatory responses. This protein has been linked to breast cancer and papillary thyroid carcinoma.

LOC105377021 is a protein which in humans is encoded by the LOC105377021 gene. LOC105377021 exhibits expressional pathology related to breast cancer, specifically triple negative breast cancer. LOC105377021 contains a serine rich region in addition to predicted alpha helix motifs.

Chromosome 16 open reading frame 95 (C16orf95) is a gene which in humans encodes the protein C16orf95. It has orthologs in mammals, and is expressed at a low level in many tissues. C16orf95 evolves quickly compared to other proteins.

LRRC24 protein-coding gene in the species Homo sapiens

Leucine rich repeat containing 24 is a protein that, in humans, is encoded by the LRRC24 gene. The protein is represented by the official symbol LRRC24, and is alternatively known as LRRC14OS. The function of LRRC24 is currently unknown. It is a member of the leucine-rich repeat (LRR) superfamily of proteins.

PRR29 protein-coding gene in the species Homo sapiens

PRR29 is a protein located on human chromosome 17 that in humans is encoded by the PRR29 gene.

OCC-1 is a protein, which in humans is encoded by the gene C12orf75. The gene is approximately 40,882 bp long and encodes 63 amino acids. OCC-1 is ubiquitously expressed throughout the human body. OCC-1 has shown to be overexpressed in various colon carcinomas. Novel splice variant of this gene was also detected in various human cancer types; in addition to encoding a novel smaller protein, OCC-1 gene produces a non-protein coding RNA splice variant lncRNA.

Glutamate rich 5 protein-coding gene in the species Homo sapiens

Glutamate Rich Protein 5 is a protein in humans encoded by the ERICH5 gene, also known as Chromosome 8 open reading frame 47 (C8orf47).

C10orf67 protein-coding gene in the species Homo sapiens

Chromosome 10 open reading frame 67 (C10orf67), also known as C10orf115, LINC01552, and BA215C7.4, is an un-characterized human protein-coding gene. Several studies indicate a possible link between genetic polymorphisms of this and several other genes to chronic inflammatory barrier diseases such as Crohn’s Disease and sarcoidosis.

Uncharacterized protein Chromosome 16 Open Reading Frame 71 is a protein in humans, encoded by the C16orf71 gene. The gene is expressed in epithelial tissue of the respiratory system, adipose tissue, and the testes. Predicted associated biological processes of the gene include regulation of the cell cycle, cell proliferation, apoptosis, and cell differentiation in those tissue types. 1357 bp of the gene are antisense to spliced genes ZNF500 and ANKS3, indicating possibility of regulated alternate expression.

RTL6 mammalian protein found in Homo sapiens

Retrotransposon Gag Like 6 is a protein encoded by the RTL6 gene in humans. RTL6 is a member of the Mart family of genes, which are related to Sushi-like retrotransposons and were derived from fish and amphibians. The RTL6 protein is localized to the nucleus and has a predicted leucine zipper motif that is known to bind nucleic acids in similar proteins, such as LDOC1.

C17orf53 protein-coding gene in the species Homo sapiens

C17orf53 is a gene in humans that encodes a protein known as C17orf53, uncharacterized protein C17orf53. It has been shown to target the nucleus, with minor localization in the cytoplasm. Based on current findings C17orf53 is predicted to perform functions of transport, however further research into the protein could provide more specific evidence regarding its function.

C21orf58 protein-coding gene in the species Homo sapiens

Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.

CFAP157

Cilia and flagella associated protein 157 (CFAP157) also known as chromosome 9 open reading frame 117 (c9orf117) is a protein that in humans is encoded by the CFAP157 gene.

C15orf39

C15orf39 is a protein that in humans is encoded by the Chromosome 15 open reading frame 15 (C15orf39) gene.

Transmembrane protein 44 mammalian protein found in Homo sapiens

Transmembrane protein 44 is a protein that in humans is encoded by the TMEM44 gene.

C19orf44 (gene) protein-coding gene in the species Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

C9orf50 protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 50 is a protein that in humans is encoded by the C9orf50 gene. C9orf50 has one other known alias, FLJ35803. In humans the gene coding sequence is 10,051 base pairs long, transcribing an mRNA of 1,624 bases that encodes a 431 amino acid protein.

SMCO3 protein-coding gene in the species Homo sapiens

Single-pass membrane and coiled-coil domain-containing protein 3 is a protein that is encoded in humans by the SMCO3 gene.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000089847 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000054708 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. 1 2 "I-TASSER server for protein structure and function prediction". zhanglab.ccmb.med.umich.edu. Retrieved 2016-04-26.
  6. "ankyrin repeat domain-containing protein 24 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2016-02-07.
  7. "ANKRD24 - Ankyrin repeat domain-containing protein 24 - Homo sapiens (Human) - ANKRD24 gene & protein". www.uniprot.org. Retrieved 2016-02-07.
  8. "Gene Card entry for ANKRD24". Gene Card. Retrieved 2016-05-09.
  9. 1 2 "ANKRD24 ankyrin repeat domain 24 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2016-05-09.
  10. "Gene Card Entry on ANKRD24". www.genecards.org. Retrieved 2016-02-07.
  11. "Gene: ANKRD24 (ENSG00000089847) - Summary - Homo sapiens - Ensembl genome browser 84". useast.ensembl.org. Retrieved 2016-05-09.
  12. "Homo sapiens ankyrin repeat domain 24 (ANKRD24), mRNA - Nucleotide - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2016-04-26.
  13. "ANKRD24 ankyrin repeat domain 24 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2016-04-27.
  14. "EST Profile - Hs.197872". www.ncbi.nlm.nih.gov. Retrieved 2016-04-27.
  15. "ankyrin repeat domain-containing protein 24 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2016-04-24.
  16. "SDSC Biology Workbench". workbench.sdsc.edu. Retrieved 2016-05-09.
  17. "Phyre 2 Results for My_protein". www.sbg.bio.ic.ac.uk. Retrieved 2016-04-27.[ permanent dead link ]
  18. "SAPS submission on ANKRD24". Biology WorkBench.
  19. "Gene Card Entry on ANKRD24". www.genecards.org. Retrieved 2016-04-24.
  20. Mosavi LK, Cammett TJ, Desrosiers DC, Peng ZY (June 2004). "The ankyrin repeat as molecular architecture for protein recognition". Protein Science. 13 (6): 1435–48. doi:10.1110/ps.03554604. PMC   2279977 . PMID   15152081.
  21. "ANKRD24 - Ankyrin repeat domain-containing protein 24 - Homo sapiens (Human) - ANKRD24 gene & protein". www.uniprot.org. Retrieved 2016-02-08.
  22. "NetCGlyc 1.0 Server". www.cbs.dtu.dk. Retrieved 2016-05-09.
  23. "8 binary interactions found for search term ANKRD24". IntAct Molecular Interaction Database. EMBL-EBI. Retrieved 2018-08-25.
  24. al., David Lynn et. "InnateDB: Systems Biology of the Innate Immune Response". www.innatedb.com. Retrieved 2016-04-24.
  25. "mentha: the interactome browser". mentha.uniroma2.it. Retrieved 2016-04-24.
  26. "mint database". mint.bio.uniroma2.it. Archived from the original on 2006-05-06. Retrieved 2016-04-24.
  27. "STRING: functional protein association networks". string-db.org. Retrieved 2016-02-08.
  28. "DRAWTREE". Biology WorkBench.
  29. "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2016-05-09.
  30. "UCSC Genome Bioinformatics: FAQ". genome.ucsc.edu. Retrieved 2016-05-09.
  31. Hedges SB, Marin J, Suleski M, Paymer M, Kumar S (April 2015). "Tree of life reveals clock-like speciation and diversification". Molecular Biology and Evolution. 32 (4): 835–45. arXiv: 1412.4312 . doi:10.1093/molbev/msv037. PMC   4379413 . PMID   25739733.