CFAP299

Identifiers
Aliases: CFAP299, chromosome 4 open reading frame 22, C4orf22, cilia and flagella associated protein 299
External IDs: MGI: 1916571; HomoloGene: 51893; GeneCards: CFAP299; OMA: CFAP299 - orthologs
Orthologs
Species: Human, Mouse
Entrez
Ensembl
UniProt
RefSeq (mRNA): NM_001206997, NM_152770 (human); NM_001024614 (mouse)
RefSeq (protein): NP_001193926, NP_689983 (human); NP_001019785 (mouse)
Location (UCSC): Chr 4: 80.34 – 80.96 Mb (human); Chr 5: 98.48 – 98.95 Mb (mouse)
PubMed search: [3] (human); [4] (mouse)

Cilia- and flagella-associated protein 299 (CFAP299) is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis. [5]

Gene

Location

The CFAP299 gene is located on chromosome 4 at 4q21.21, spanning 642,492 bases from position 80,321,265 to position 80,963,756 on the plus strand. It is also known as C4orf22, chromosome 4 open reading frame 22 and uncharacterized protein C4orf22. [6] The gene lies near MRPS25P1 and BMP3 and has 13 exons. [7]
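
The quoted span can be checked against the two coordinates. The sketch below assumes the positions are 1-based and inclusive, which is the convention under which the quoted figures agree; the helper name `span_length` is illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length in bases of a 1-based, inclusive genomic interval (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates quoted in the text for CFAP299 on chromosome 4 (plus strand).
CFAP299_START = 80_321_265
CFAP299_END = 80_963_756

print(span_length(CFAP299_START, CFAP299_END))  # 642492
```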

CFAP299 gene location on chromosome 4

Expression

CFAP299 is expressed in a wide variety of normal tissues in Homo sapiens, with the highest expression in testis, trachea, lung, fetal lung and epididymis. [8] By health state, CFAP299 expression is decreased in glioma, germ cell tumors and chondrosarcoma and elevated in soft tissue and muscle tissue tumors. By developmental stage, expression is reported only in the fetus and the adult. [9]

CFAP299 expression in 42 normal human tissues

Promoter

The promoter of the CFAP299 gene is predicted to lie within the 1000 base pairs upstream of the transcription start site. A variety of transcription factors, such as CCAAT-binding factors, X-box-binding factors and AT-rich interactive domain factors, are predicted to bind the promoter and regulate transcription. [10]

mRNA

Splice variants

CFAP299 has 9 alternatively spliced variants and 1 unspliced form. [11]

Protein

General features

The CFAP299 protein is 233 amino acids long. The Homo sapiens protein has a molecular weight of 26,869 Da and a predicted isoelectric point of 5.28. It contains 39 negatively charged residues and 33 positively charged residues. [12] Aspartic acid occurs at a higher frequency in CFAP299 than is typical of human proteins. [13]
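
These figures can be cross-checked with simple arithmetic. The sketch below uses only the values quoted above; the function names are illustrative, and the residue balance is a crude count (positive minus negative residues), not a pH-dependent charge calculation.

```python
# Values quoted in the text for human CFAP299 (ProtParam output).
LENGTH_AA = 233
MOLECULAR_WEIGHT_DA = 26869
NEGATIVE_RESIDUES = 39  # Asp + Glu
POSITIVE_RESIDUES = 33  # Arg + Lys


def residue_balance(positive: int, negative: int) -> int:
    """Count of positively charged residues minus negatively charged ones."""
    return positive - negative


def mean_residue_mass(molecular_weight: float, length: int) -> float:
    """Average mass per residue in Da."""
    return molecular_weight / length


print(residue_balance(POSITIVE_RESIDUES, NEGATIVE_RESIDUES))        # -6
print(f"{mean_residue_mass(MOLECULAR_WEIGHT_DA, LENGTH_AA):.1f}")    # 115.3
```

An excess of six acidic residues is in line with a predicted isoelectric point below 7.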

Isoforms

The CFAP299 protein has two main isoforms. Cilia- and flagella-associated protein 299 isoform 1 is the longest isoform, [7] while isoform 2 is designated the canonical sequence [14] and is the isoform described in this article.

Domains

The CFAP299 protein contains a single conserved domain, DUF4464, from position 13 to position 232. [7] The domain belongs to the DUF4464 family, which is found in eukaryotes; proteins in this family are 224 to 241 amino acids long. [15] BLAST indicates that the domain is conserved across the orthologs of CFAP299. [16]
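
To make the extent of the domain concrete, the sketch below computes how much of the 233-residue protein DUF4464 covers. It assumes 1-based, inclusive residue positions; `domain_coverage` is a hypothetical helper name.

```python
def domain_coverage(dom_start: int, dom_end: int, protein_length: int) -> tuple[int, float]:
    """Return (domain length, fraction of the protein covered) for 1-based inclusive positions."""
    if not 1 <= dom_start <= dom_end <= protein_length:
        raise ValueError("domain must lie within the protein")
    dom_length = dom_end - dom_start + 1
    return dom_length, dom_length / protein_length


# Positions and length quoted in the text.
length, fraction = domain_coverage(13, 232, 233)
print(length)              # 220
print(f"{fraction:.1%}")   # 94.4%
```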

Secondary structure

The secondary structure of CFAP299, as predicted by GOR4, is dominated by alpha helices and random coils. [17]

Secondary structure of CFAP299 predicted by GOR4

Tertiary structure

The tertiary structure of CFAP299 predicted by I-TASSER consists of alpha helices and coils. [18]

I-TASSER result of tertiary structure of CFAP299 protein

Post-translational modifications

CFAP299 is predicted to be phosphorylated at multiple sites, as shown in the NetPhos prediction. [19] It is also predicted to have sumoylation sites at positions 58, 137 and 232 and two SUMO-interaction motifs at positions 45-49 and 212-216. [20]

CFAP299 phosphorylation sites predicted by NetPhos
Sumoylation sites and SUMO-interaction motifs of the CFAP299 protein

Subcellular localization

The CFAP299 protein is predicted to localize to the cytoplasm. [21]

Interacting proteins

The CFAP299 protein is believed to interact with amyloid beta (A4) precursor protein (APP) [22] and BCL2-associated athanogene 3 (BAG3). [23]

Evolution

Orthologs

Orthologs of the CFAP299 protein exist in mammals, reptiles, birds, amphibians, fish, sponges, sea urchins, insects, fungi and plants. Its most distant relatives appear in plants. The table below shows orthologs found by BLAST. [16]

| Genus and species | Common name | Taxonomic group | Date of divergence (million years ago) | Accession number | Sequence length (amino acids) | Sequence identity | Sequence similarity |
|---|---|---|---|---|---|---|---|
| Homo sapiens | Human | Mammalia | 0 | NP_689983.2 | 233 | 100% | 100% |
| Ochotona princeps | American pika | Lagomorpha | 88 | XP_004590671.1 | 233 | 85% | 93% |
| Mus musculus | House mouse | Rodentia | 88 | NP_001019785 | 233 | 85% | 91% |
| Eumetopias jubatus | Steller sea lion | Carnivora | 94 | XP_027980031 | 233 | 86% | 93% |
| Erinaceus europaeus | European hedgehog | Soricomorpha | 94 | XP_007518562 | 233 | 83% | 93% |
| Ornithorhynchus anatinus | Platypus | Monotremata | 169 | XP_007659769 | 164 | 74% | 88% |
| Pogona vitticeps | Central bearded dragon | Reptilia | 320 | XP_020658829 | 236 | 72% | 85% |
| Anolis carolinensis | Green anole | Reptilia | 320 | XP_008118093 | 193 | 71% | 85% |
| Dromaius novaehollandiae | Emu | Aves | 320 | XP_025959155 | 226 | 64% | 81% |
| Anas platyrhynchos | Mallard | Aves | 320 | XP_027312784.1 | 243 | 58% | 75% |
| Xenopus laevis | African clawed frog | Amphibia | 353 | NP_001088722 | 233 | 73% | 89% |
| Nanorana parkeri | Xizang Plateau frog | Amphibia | 353 | XP_018414504.1 | 233 | 73% | 88% |
| Danio rerio | Zebrafish | Actinopterygii | 432 | NP_001108596 | 239 | 60% | 77% |
| Callorhinchus milii | Australian ghostshark | Chondrichthyes | 465 | XP_007895157 | 235 | 68% | 82% |
| Strongylocentrotus purpuratus | Pacific purple sea urchin | Echinoidea | 627 | XP_011663002 | 236 | 66% | 80% |
| Nematostella vectensis | Starlet sea anemone | Anthozoa | 685 | XP_001619741.1 | 199 | 61% | 70% |
| Drosophila melanogaster | Fruit fly | Insecta | 794 | NP_650260.1 | 233 | 31% | 46% |
| Amphimedon queenslandica | Sponge | Demospongiae | 951.8 | XP_003382446 | 235 | 64% | 80% |
| Batrachochytrium dendrobatidis | Amphibian chytrid fungus | Chytridiomycetes | 1150 | XP_006681372 | 238 | 61% | 78% |
| Physcomitrella patens | Spreading earthmoss | Bryopsida | 1624 | XP_024379106 | 255 | 50% | 65% |
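
The relationship between divergence and identity in the table can be explored with a short script. This is a minimal sketch: the species, divergence and identity values are transcribed from the table above, the divergence figures are assumed to be in millions of years, and the helper names `most_divergent` and `least_identical` are illustrative rather than part of any tool cited here.

```python
# (species, divergence date from the table, sequence identity in percent)
ORTHOLOGS = [
    ("Homo sapiens", 0, 100),
    ("Ochotona princeps", 88, 85),
    ("Mus musculus", 88, 85),
    ("Eumetopias jubatus", 94, 86),
    ("Erinaceus europaeus", 94, 83),
    ("Ornithorhynchus anatinus", 169, 74),
    ("Pogona vitticeps", 320, 72),
    ("Anolis carolinensis", 320, 71),
    ("Dromaius novaehollandiae", 320, 64),
    ("Anas platyrhynchos", 320, 58),
    ("Xenopus laevis", 353, 73),
    ("Nanorana parkeri", 353, 73),
    ("Danio rerio", 432, 60),
    ("Callorhinchus milii", 465, 68),
    ("Strongylocentrotus purpuratus", 627, 66),
    ("Nematostella vectensis", 685, 61),
    ("Drosophila melanogaster", 794, 31),
    ("Amphimedon queenslandica", 951.8, 64),
    ("Batrachochytrium dendrobatidis", 1150, 61),
    ("Physcomitrella patens", 1624, 50),
]


def most_divergent(rows):
    """Row with the largest divergence date."""
    return max(rows, key=lambda row: row[1])


def least_identical(rows):
    """Row with the lowest sequence identity to the human protein."""
    return min(rows, key=lambda row: row[2])


print(most_divergent(ORTHOLOGS)[0])   # Physcomitrella patens
print(least_identical(ORTHOLOGS)[0])  # Drosophila melanogaster
```

On these figures the earliest-diverging ortholog is the moss Physcomitrella patens, while the lowest sequence identity belongs to Drosophila melanogaster, so identity does not fall strictly with divergence date.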

Paralogs

CFAP299 has no known paralogs. [6] [16]

Clinical significance

CFAP299 expression is reduced in people with teratozoospermia, a condition characterized by abnormal sperm morphology and decreased fertility. [24]

In airway epithelial cells with excessive mucus secretion, a model of chronic lung disease, CFAP299 expression was reduced. [25]

References

  1. GRCh38: Ensembl release 89: ENSG00000197826. Ensembl, May 2017.
  2. GRCm38: Ensembl release 89: ENSMUSG00000057816. Ensembl, May 2017.
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. Li H, Dai Y, Luo Z, Nie D (April 2019). "Cloning of a new testis-enriched gene C4orf22 and its role in cell cycle and apoptosis in mouse spermatogenic cells". Molecular Biology Reports. 46 (2): 2029–2038. doi:10.1007/s11033-019-04651-8. PMID 30820741. S2CID 71147966.
  6. "GeneCards CFAP299". www.genecards.org. Retrieved 2019-05-05.
  7. "CFAP299 cilia and flagella associated protein 299 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2019-02-26.
  8. "Home - GEO Profiles - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2019-05-02.
  9. "Home - Nucleotide - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2019-05-05.
  10. "Genomatix - NGS Data Analysis & Personalized Medicine". www.genomatix.de. Archived from the original on 2001-02-24. Retrieved 2019-05-03.
  11. "AceView: Gene:FGF5andC4orf22, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTs". www.ncbi.nlm.nih.gov. Retrieved 2019-05-02.
  12. "ExPASy - ProtParam tool". web.expasy.org. Retrieved 2019-05-03.
  13. "SAPS Results". www.ebi.ac.uk. Retrieved 2019-05-03.
  14. "CFAP299 - Cilia- and flagella-associated protein 299 - Homo sapiens (Human) - CFAP299 gene & protein". www.uniprot.org. Retrieved 2019-05-02.
  15. "NCBI Conserved Domain Search". www.ncbi.nlm.nih.gov. Retrieved 2019-05-03.
  16. "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2019-02-26.
  17. "NPS@ : GOR4 secondary structure prediction". npsa-prabi.ibcp.fr. Retrieved 2019-05-05.
  18. "I-TASSER server for protein structure and function prediction". zhanglab.ccmb.med.umich.edu. Retrieved 2019-05-03.
  19. "NetPhos 3.1 Server - prediction results". www.cbs.dtu.dk. Retrieved 2019-05-05.
  20. "GPS-SUMO: Prediction of SUMOylation Sites & SUMO-interaction Motifs". sumosp.biocuckoo.org. Archived from the original on 2018-05-06. Retrieved 2019-05-05.
  21. "PSORT II Prediction". psort.hgc.jp. Retrieved 2019-05-03.
  22. Oláh J, Vincze O, Virók D, Simon D, Bozsó Z, Tőkési N, Horváth I, Hlavanda E, Kovács J, Magyar A, Szűcs M, Orosz F, Penke B, Ovádi J (September 2011). "Interactions of pathological hallmark proteins: tubulin polymerization promoting protein/p25, beta-amyloid, and alpha-synuclein". The Journal of Biological Chemistry. 286 (39): 34088–100. doi:10.1074/jbc.m111.243907. PMC 3190826. PMID 21832049.
  23. Chen Y, Yang LN, Cheng L, Tu S, Guo SJ, Le HY, Xiong Q, Mo R, Li CY, Jeong JS, Jiang L, Blackshaw S, Bi LJ, Zhu H, Tao SC, Ge F (October 2013). "Bcl2-associated athanogene 3 interactome analysis reveals a new role in modulating proteasome activity". Molecular & Cellular Proteomics. 12 (10): 2804–19. doi:10.1074/mcp.m112.025882. PMC 3790292. PMID 23824909.
  24. Platts AE, Dix DJ, Chemes HE, Thompson KE, Goodrich R, Rockett JC, Rawe VY, Quintana S, Diamond MP, Strader LF, Krawetz SA (April 2007). "Success and failure in human spermatogenesis as revealed by teratozoospermic RNAs". Human Molecular Genetics. 16 (7): 763–73. doi:10.1093/hmg/ddm012. PMID 17327269.
  25. Alevy YG, Patel AC, Romero AG, Patel DA, Tucker J, Roswit WT, Miller CA, Heier RF, Byers DE, Brett TJ, Holtzman MJ (December 2012). "IL-13-induced airway mucus production is attenuated by MAPK13 inhibition". The Journal of Clinical Investigation. 122 (12): 4555–68. doi:10.1172/jci64896. PMC 3533556. PMID 23187130.