CFAP299

Last updated
CFAP299
Identifiers
Aliases CFAP299 , chromosome 4 open reading frame 22, C4orf22, cilia and flagella associated protein 299
External IDs MGI: 1916571 HomoloGene: 51893 GeneCards: CFAP299
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001206997
NM_152770

NM_001024614

RefSeq (protein)

NP_001193926
NP_689983

NP_001019785

Location (UCSC) Chr 4: 80.34 – 80.96 Mb Chr 5: 98.48 – 98.95 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis. [5]

Contents

Gene

Location

CFAP299 gene is located at chromosome 4, 4q21.21 spanning 642,492 bases from position 80,321,265 to position 80,963,756 on the plus strand. CFAP299 gene is also known as C4orf22, chromosome 4 Open Reading Frame 22 and Uncharacterized Protein C4orf22. [6] CFAP299 gene is located near MRPS25P1 and BMP3 and it has 13 exons. [7]

CFAP299 gene location on chromosome 4 Geneneighb.fcgi-2.gif
CFAP299 gene location on chromosome 4

Expression

CFAP299 is widely expressed in a variety of normal tissue in Homo sapiens . CFAP299 is highly expressed in testis, trachea, lung, fetal lung and epididymis. [8] In terms of health state, CFAP299 has a decreased expression level in glioma, germ cell tumors and chondrosarcoma. An even higher expression of CFAP299 is shown in condition of soft tissue tumor and muscle tissue tumor. CFAP299 is only exist in fetus and adult. [9]

CFAP299 expression in 42 multiple normal tissues in human ProfileGraph.cgi-5.png
CFAP299 expression in 42 multiple normal tissues in human

Promoter

The promoter of CFAP299 gene is predicted to present 1000 base pairs upstream of the start of transcription. A variety of transcription factors such as CCAAT binding factors, X-box binding factors and AT rich interactive domain factor bind to promoter to regulate the sequence. [10]

mRNA

Splice variants

CFAP299 has 9 alternatively spliced variants and 1 unspliced form. [11]

Protein

General feature

CFAP299 protein contains 233 amino acids in length. The molecular weight of Homo sapiens CFAP299 protein is 26869 Da and the predicted isoelectric point is 5.28. Total number of negatively charged residues is 39 and total number of positively charged residues is 33. [12] Aspartic acid has a higher frequency in CFAP299 protein than in other human proteins. [13]

Isoforms

CFAP299 protein has two important isoforms. Cilia- and flagella-associated protein 299 isoform 1 is the longest isoform [7] and cilia- and flagella-associated protein 299 isoform 2 is chosen as canonical sequence, [14] which is also the target for this article.

Domains

There is only one conserved domain DUF4464 from position 13 to position 232 in CFAP299 protein. [7] This domain belongs to DUF4464 family, which is found in eukaryotes and the proteins in this family has a length of 224 to 241 amino acids. [15] This domain is conserved through the orthologs of CFAP299 as indicated by BLAST. [16]

Secondary structure

CFAP299 proteins secondary structure is dominated by alpha helix and random coil as predicted by GOR4. [17]

Secondary structure of CFAP299 predicted by GOR4 Cfap299 second.jpg
Secondary structure of CFAP299 predicted by GOR4

Tertiary structure

Tertiary structure of CFAP299 protein predicted by I-TASSER showed that the protein is comprised by alpha helix and coils. [18]

I-TASSER result of tertiary structure of CFAP299 protein Cfap299-3rd.gif
I-TASSER result of tertiary structure of CFAP299 protein

Post-translational modifications

CFAP299 is predicted to undergo phosphorylation in various site as shown in graph. [19] CFAP299 also predicted to have sumoylation site in position 58, 137 and 232 and two SUMO-interaction Motifs in position 45-49 and 212-216. [20]

CFAP299 phosphorylation site predicted by Netphos Cfap299 phos.gif
CFAP299 phosphorylation site predicted by Netphos
Sumoylation site and Sumoylation interaction motifs of CFAP299 protein Cfap299 sumo.png
Sumoylation site and Sumoylation interaction motifs of CFAP299 protein

Subcellular localization

CFAP299 protein is targeted to cytoplasm. [21]

Interacting proteins

CFAP299 protein is believed to interact with amyloid beta (A4) precursor protein (APP) [22] and BCL2-associated athanogene 3 (BCL2). [23]

Evolution

OrthologS

CFAP299 protein orthologs exists in mammals, reptiles, birds, amphibians, fish, sponges, sea urchins, insects, fungi and plants. Its most distant relative appear in plants. The table below shows orthologs found by BLAST. [16]

Genus and speciesCommon nameTaxonomic GroupDate of divergenceaccession numbersequence lengthsequence identitysequence similarity
Homo SapiensHumanMammalia0NP_689983.2233100%100%
Ochotona princepsAmerican pikaLagomorpha88XP_004590671.123385%93%
Mus musculusHouse mouseRodentia88NP_00101978523385%91%
Eumetopias jubatusSteller sea lionCarnivora94XP_02798003123386%93%
Erinaceus europaeusEuropean hedgehogSoricomorpha94XP_00751856223383%93%
Ornithorhynchus anatinusplatypusMonotremata169XP_00765976916474%88%
Pogona vitticepsCentral bearded dragonReptilia320XP_02065882923672%85%
Anolis carolinensisGreen anoleReptilia320XP_00811809319371%85%
Dromaius novaehollandiaeEmuAves320XP_02595915522664%81%
Anas platyrhynchosMallardAves320XP_027312784.124358%75%
Xenopus laevisAfrican clawed frogAmphibia353NP_00108872223373%89%
Nanorana parkeriXizang Plateau frogAmphibia353XP_018414504.123373%88%
Danio rerioZebrafishActinopterygii432NP_00110859623960%77%
Callorhinchus miliiAustralian ghostsharkChondrichthyes465XP_00789515723568%82%
Strongylocentrotus purpuratusPacific purple sea urchinEchinoidea627XP_01166300223666%80%
Nematostella vectensisStarlet sea anemoneAnthozoa685XP_001619741.119961%70%
Drosophila melanogasterFruit flyInsecta794NP_650260.123331%46%
Amphimedon queenslandicaSpongeDemospongiae951.8XP_00338244623564%80%
Batrachochytrium dendrobatidisChytridiomycetesAmphibian chytrid fungus1150XP_00668137223861%78%
Physcomitrella patensSpreading earthmossBryopsida1624XP_02437910625550%65%

Paralog

There are no paralogs for CFAP299. [6] [16]

Clinical significance

CFAP299 expression is lowered in people with teratozoospermia,  a condition that causes abnormal morphology of sperm and decreased fertility. [24]

In airway epithelial cells that had excessive mucous secretion, a condition that simulated chronic lung disease, CFAP299 showed a reduced expression. [25]

Related Research Articles

<span class="mw-page-title-main">Interferon-inducible GTPase 5</span> Protein-coding gene in the species Homo sapiens

Interferon-inducible GTPase 5 also known as immunity-related GTPase cinema 1 (IRGC1) is an enzyme that in humans is coded by the IRGC gene. It is predicted to behave like other proteins in the p47-GTPase-like and IRG families. It is most expressed in the testis.

<span class="mw-page-title-main">METTL26</span> Protein-coding gene in the species Homo sapiens

METTL26, previously designated C16orf13, is a protein-coding gene for Methyltransferase Like 26, also known as JFP2. Though the function of this gene is unknown, various data have revealed that it is expressed at high levels in various cancerous tissues. Underexpression of this gene has also been linked to disease consequences in humans.

<span class="mw-page-title-main">C8orf48</span> Protein-coding gene in the species Homo sapiens

C8orf48 is a protein that in humans is encoded by the C8orf48 gene. C8orf48 is a nuclear protein specifically predicted to be located in the nuclear lamina. C8orf48 has been found to interact with proteins that are involved in the regulation of various cellular responses like gene expression, protein secretion, cell proliferation, and inflammatory responses. This protein has been linked to breast cancer and papillary thyroid carcinoma.

TMEM156 is a gene that encodes the transmembrane protein 156 (TMEM156) in Homo sapiens. It has the clone name of FLJ23235.

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

<span class="mw-page-title-main">Glutamate rich 5</span> Protein-coding gene in the species Homo sapiens

Glutamate rich protein 5 is a protein in humans encoded by the ERICH5 gene, also known as chromosome 8 open reading frame 47 (C8orf47).

<span class="mw-page-title-main">C21orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.

<span class="mw-page-title-main">C15orf39</span>

C15orf39 is a protein that in humans is encoded by the Chromosome 15 open reading frame 15 (C15orf39) gene.

<span class="mw-page-title-main">TMEM44</span> Protein-coding gene in the species Homo sapiens

TMEM44 is a protein that in humans is encoded by the TMEM44 gene. DKFZp686O18124 is a synonym of TMEM44.

<span class="mw-page-title-main">SMCO3</span> Protein-coding gene in the species Homo sapiens

Single-pass membrane and coiled-coil domain-containing protein 3 is a protein that is encoded in humans by the SMCO3 gene.

Proline-rich protein 16 (PRR16) is a protein coding gene in Homo sapiens. The protein is known by the alias Largen.

<span class="mw-page-title-main">C1orf122</span> Protein-coding gene in the species Homo sapiens

C1orf122 is a gene in the human genome that encodes the cytosolic protein ALAESM.. ALAESM is present in all tissue cells and highly up-regulated in the brain, spinal cord, adrenal gland and kidney. This gene can be expressed up to 2.5 times the average gene in its highly expressed tissues. Although the function of C1orf122 is unknown, it is predicted to be used for mitochondria localization.

<span class="mw-page-title-main">Fam89A</span> Human protein and gene

ProteinFAM89A is a protein which in humans is encoded by the FAM89A gene. It is also known as chromosome 1 open reading frame 153 (C1orf153). Highest FAM89A gene expression is observed in the placenta and adipose tissue. Though its function is largely unknown, FAM89A is found to be differentially expressed in response to interleukin exposure, and it is implicated in immune responses pathways and various pathologies such as atherosclerosis and glioma cell expression.

TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.

<span class="mw-page-title-main">FAM214B</span> Protein-coding gene in the species Homo sapiens

The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.

<span class="mw-page-title-main">C6orf136</span> Protein-coding gene in the species Homo sapiens

C6orf136 is a protein in humans encoded by the C6orf136 gene. The gene is conserved in mammals, mollusks, as well some porifera. While the function of the gene is currently unknown, C6orf136 has been shown to be hypermethylated in response to FOXM1 expression in Head Neck Squamous Cell Carcinoma (HNSCC) tissue cells. Additionally, elevated expression of C6orf136 has been associated with improved survival rates in patients with bladder cancer. C6orf136 has three known isoforms.

<span class="mw-page-title-main">FAM98C</span> Gene

Family with sequence 98, member C or FAM98C is a gene that encodes for FAM98C has two aliases FLJ44669 and hypothetical protein LOC147965. FAM98C has two paralogs in humans FAM98A and FAM98B. FAM98C can be characterized for being a Leucine-rich protein. The function of FAM98C is still not defined. FAM98C has orthologs in mammals, reptiles, and amphibians and has a distant orhtologs in Rhinatrema bivittatum and Nanorana parkeri.

<span class="mw-page-title-main">THAP3</span> Protein in Humans

THAP domain-containing protein 3 (THAP3) is a protein that, in Homo sapiens (humans), is encoded by the THAP3 gene. The THAP3 protein is as known as MGC33488, LOC90326, and THAP domain-containing, apoptosis associated protein 3. This protein contains the Thanatos-associated protein (THAP) domain and a host-cell factor 1C binding motif. These domains allow THAP3 to influence a variety of processes, including transcription and neuronal development. THAP3 is ubiquitously expressed in H. sapiens, though expression is highest in the kidneys.

<span class="mw-page-title-main">C13orf46</span> C13of46 Gene and Protein

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

<span class="mw-page-title-main">Chromosome 5 open reading frame 47</span> Human C5ORF47 Gene

Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000197826 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000057816 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. Li H, Dai Y, Luo Z, Nie D (April 2019). "Cloning of a new testis-enriched gene C4orf22 and its role in cell cycle and apoptosis in mouse spermatogenic cells". Molecular Biology Reports. 46 (2): 2029–2038. doi:10.1007/s11033-019-04651-8. PMID   30820741. S2CID   71147966.
  6. 1 2 "GeneCards CFAP299". www.genecards.org. Retrieved 2019-05-05.
  7. 1 2 3 "CFAP299 cilia and flagella associated protein 299 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2019-02-26.
  8. "Home - GEO Profiles - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2019-05-02.
  9. "Home - Nucleotide - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2019-05-05.
  10. "Genomatix - NGS Data Analysis & Personalized Medicine". www.genomatix.de. Archived from the original on 2001-02-24. Retrieved 2019-05-03.
  11. "AceView: Gene:FGF5andC4orf22, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView". www.ncbi.nlm.nih.gov. Retrieved 2019-05-02.
  12. "ExPASy - ProtParam tool". web.expasy.org. Retrieved 2019-05-03.
  13. "SAPS Results". www.ebi.ac.uk. Retrieved 2019-05-03.
  14. "CFAP299 - Cilia- and flagella-associated protein 299 - Homo sapiens (Human) - CFAP299 gene & protein". www.uniprot.org. Retrieved 2019-05-02.
  15. "NCBI Conserved Domain Search". www.ncbi.nlm.nih.gov. Retrieved 2019-05-03.
  16. 1 2 3 "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2019-02-26.
  17. "NPS@ : GOR4 secondary structure prediction". npsa-prabi.ibcp.fr. Retrieved 2019-05-05.
  18. "I-TASSER server for protein structure and function prediction". zhanglab.ccmb.med.umich.edu. Retrieved 2019-05-03.
  19. "NetPhos 3.1 Server - prediction results". www.cbs.dtu.dk. Retrieved 2019-05-05.
  20. "GPS-SUMO: Prediction of SUMOylation Sites & SUMO-interaction Motifs". sumosp.biocuckoo.org. Retrieved 2019-05-05.
  21. "PSORT II Prediction". psort.hgc.jp. Retrieved 2019-05-03.
  22. Oláh J, Vincze O, Virók D, Simon D, Bozsó Z, Tõkési N, Horváth I, Hlavanda E, Kovács J, Magyar A, Szũcs M, Orosz F, Penke B, Ovádi J (September 2011). "Interactions of pathological hallmark proteins: tubulin polymerization promoting protein/p25, beta-amyloid, and alpha-synuclein". The Journal of Biological Chemistry. 286 (39): 34088–100. doi: 10.1074/jbc.m111.243907 . PMC   3190826 . PMID   21832049.
  23. Chen Y, Yang LN, Cheng L, Tu S, Guo SJ, Le HY, Xiong Q, Mo R, Li CY, Jeong JS, Jiang L, Blackshaw S, Bi LJ, Zhu H, Tao SC, Ge F (October 2013). "Bcl2-associated athanogene 3 interactome analysis reveals a new role in modulating proteasome activity". Molecular & Cellular Proteomics. 12 (10): 2804–19. doi:10.1074/mcp.m112.025882. PMC   3790292 . PMID   23824909.
  24. Platts AE, Dix DJ, Chemes HE, Thompson KE, Goodrich R, Rockett JC, Rawe VY, Quintana S, Diamond MP, Strader LF, Krawetz SA (April 2007). "Success and failure in human spermatogenesis as revealed by teratozoospermic RNAs". Human Molecular Genetics. 16 (7): 763–73. doi: 10.1093/hmg/ddm012 . PMID   17327269.
  25. Alevy YG, Patel AC, Romero AG, Patel DA, Tucker J, Roswit WT, Miller CA, Heier RF, Byers DE, Brett TJ, Holtzman MJ (December 2012). "IL-13-induced airway mucus production is attenuated by MAPK13 inhibition". The Journal of Clinical Investigation. 122 (12): 4555–68. doi:10.1172/jci64896. PMC   3533556 . PMID   23187130.