CXorf49 is a protein, which in humans is encoded by the gene chromosome X open reading frame 49(CXorf49).
The CXorf49 gene has one alias CXorf49B. [1] The recname A8MYA2 also refers to the protein coded by CXorf49 or CXorf49B. [2]
CXorf49 is located on the X chromosome at Xq13.1. It is 3912 base pairs long and the gene sequence has 6 exons. [3] CXorf49 has one protein coding transcript. [4]
The protein has 514 amino acids and a molecular mass of 54.4 kDa. [5] The isoelectric point is 9.3. Compared to other human proteins CXorf49 is glycine- and proline-rich, but the protein has lower levels of asparagine, isoleucine, tyrosine and threonine(Statistical Analysis of Protein Sequences, SAPS [6] ).
The domain of unknown function, DUF4641, is almost the entire protein. It is 433 amino acids long, from amino acid 80 until amino acid number 512. [7] DUF4641 is a part of pfam15483. [8] The domain is proline- and arginine-rich, but DUF4641 has lower levels of isoleucine, tyrosine and threonine compared to other proteins in human (Analysis of Protein Sequences, SAPS [6] ). DUF4641 has an unusual spacing between lysine residues and positive charged amino acids (Analysis of Protein Sequences, SAPS [6] ).
CXorf49 is predicted to have several post-translational sites. This include sites for N-acetyltransferase (NetAcet 1- [9] ), glycation of ε amino groups of lysines (NetGlycate 1.0 [10] ), mucin type GalNAc O-glycosylation (NetOglyc 4.0 [11] ), phosphorylation (NetPhos 2.0 [12] ), sumoylation (SUMOplot Analysis Program [13] ) and O-ß-GlcNAc attachment(YinOYang WWW [14] ).
The CXorf49 protein has been predicted to be located in the cell nucleus (PSORT II [15] ).
The promoter region of CXorf49 is located between base pair 71718051 and 71718785 on the minus strand of the X chromosome and it is 735 bp long (Genomatix’s ElDorado program [16] ). One of the most frequent transcription factor binding-sites in the promoter region are sites for Y-box binding factor.
Though expression of CXorf49 is very low in human cells, is it somewhat higher in connective tissues, testis and uterus(NCBI-Unigene [17] ).
The protein CXorf49 has not yet been shown to interact with other proteins (PSICQUIC [18] ).
CXorf49 is found to be one of the components of a small group of the HL-60 cell proteome that were most prone to form 4-Hydroxy-2-nonenal(HNE) adducts, upon exposure to nontoxic (10 μM) HNE concentrations, along with heat shock 60 kDa protein 1. [19]
Using BLAST [20] no orthologs for CXorf49 are found in single celled organisms, fungi or plants whose genomes have been sequenced. For multicellular organisms orthologs are found in mammals. The table below show a selection of the mammal orthologs. They are listed after time of divergence from human.
Genus and species name | Common name | Accession Number | Sequence length | Identity to human protein |
---|---|---|---|---|
Pan troglodytes | Chimpanzee | XP_001137982 | 514 aa | 98 % |
Callithrix jacchus | Common marmoset | XP_008987719 | 487 aa | 65 % |
Galeopterus variegatus | Malayan flying lemur | XP_008574823 | 525 aa | 54 % |
Tupaia chinensis | Chinese tree shrew | XP_006168003 | 527 aa | 35 % |
Chinchilla lanigera | Long-tailed chinchilla | XP_013358263 | 307 aa | 49 % |
Mus musculus | House mouse | NP_081944 | 513 aa | 36 % |
Canis lupus familiaris | Dog | XP_850392 | 526 aa | 54 % |
Odobenus rosmarus divergens | Pacific walrus | XP_012422579 | 530 aa | 51 % |
Mustela putorius furo | Ferret | XP_004777306 | 544 aa | 50 % |
Lipotes vexillifer | Chinese river dolphin | XP_007452050 | 529 aa | 45 % |
Ovis areis | Sheep | XP_004022229 | 536 aa | 45 % |
Capra hircus | Goat | XP_005700711 | 538 aa | 44 % |
Myotis lucifugus | Little brown bat | XP_006083036 | 500 aa | 42 % |
Myotis davidii | David's myotis | XP_006759573 | 495 aa | 42 % |
Bos taurus | Cattle | NP_001092664 | 534 aa | 42 % |
Equus asinus | Asinus | XP_014707878 | 723 aa | 42 % |
Trichechus manatus latirostris | Florida manatee | XP_012415455 | 505 aa | 44 % |
Dasypus novemcinctus | Nine-banded armadillo | XP_004475873 | 497 aa | 44 % |
Orycteropus afer afer | Aardvark | XP_007957133 | 477 aa | 38 % |
CXorf49 has developed from aardvarks, to the human protein over 105.0 million years.