LOC101059915 | |
---|---|
Identifiers | |
Aliases | chromosome X open reading frame 49-like |
External IDs | HomoloGene: 131525 |
LOC101059915 is a protein that in humans is encoded by the LOC101059915 gene. The gene is located on the X chromosome, and its expression is largely restricted to the testis.
The LOC101059915 gene has two aliases: chromosome X open reading frame 49-like and BX276092.6. [3]
LOC101059915 is located on the X chromosome at locus Xq13.1. The gene is 1,831 base pairs long and contains 6 exons. [4] It has one protein-coding transcript.
The promoter region of LOC101059915 is located on the sense strand of DNA, between base pairs 71666098 and 71667904 on the X chromosome, and spans 1,806 bp. [5] Expression of LOC101059915 is relatively low in human cells and is primarily limited to the testis. [6]
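The promoter length follows directly from the two coordinates quoted above. The short Python sketch below checks that arithmetic; the helper name `span_bp` and the `inclusive` flag are illustrative assumptions, since the text does not say whether the end coordinate is counted.

```python
# Promoter coordinates on the X chromosome, as quoted in the text.
PROMOTER_START = 71_666_098
PROMOTER_END = 71_667_904


def span_bp(start: int, end: int, inclusive: bool = False) -> int:
    """Return the distance in base pairs between two coordinates.

    `inclusive=True` counts both end points (hypothetical option; the
    source does not state which convention it uses).
    """
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + (1 if inclusive else 0)


promoter_span = span_bp(PROMOTER_START, PROMOTER_END)
print(f"Promoter span: {promoter_span} bp")
```

With the simple difference the span comes out at 1806 bp; counting both end points would give one base pair more.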
The protein has 518 amino acids and a molecular mass of 55.1 kDa. [7] Its isoelectric point is 8.15. Compared to other human proteins, LOC101059915 is rich in glycine, proline, and serine but relatively poor in tyrosine. [8]
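Statements such as "glycine-rich" rest on the amino acid composition of the sequence. A minimal sketch of that calculation is given below. The sequence `TOY_SEQUENCE` is a made-up 20-residue string used only for illustration; it is not the LOC101059915 sequence, and the function name is an assumption.

```python
from collections import Counter
from typing import Dict


def composition_percent(sequence: str) -> Dict[str, float]:
    """Return the percentage of each residue in a one-letter protein sequence."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    counts = Counter(seq)
    return {residue: 100.0 * n / len(seq) for residue, n in counts.items()}


# Hypothetical toy sequence, NOT the real protein.
TOY_SEQUENCE = "MGPSGPPSSGGPLAKSPGGY"

composition = composition_percent(TOY_SEQUENCE)
for residue in ("G", "P", "S", "Y"):
    print(f"{residue}: {composition.get(residue, 0.0):.1f}%")
```

Comparing such percentages against average values for the human proteome is what allows a residue to be called enriched or depleted; the reference averages are not reproduced here.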
The domain of unknown function DUF4641 covers most of the protein (about 79% of its 518 residues). It is part of pfam15483. [9] The domain is 410 amino acids long, running from amino acid 85 to amino acid 495. [10]
The secondary structure of LOC101059915 is predicted to consist primarily of alpha helices, based on models generated with I-TASSER and analysis with ExPASy tools. [11]
LOC101059915 is predicted to contain several types of post-translational modification. These include sites for phosphorylation (NetPhos 2.0 [12] ) and sumoylation (SUMOplot Analysis Program [13] ).
The LOC101059915 protein has been predicted to be located in the cell nucleus (PSORT II). [14]
CXorf49 and CXorf49B are paralogs of LOC101059915. They share upwards of 78% similarity with LOC101059915. The high degree of conservation among all three sequences suggests that they arose from a gene duplication event that was relatively recent in evolutionary terms.
CXorf49 is of particular interest because it belongs to a small group of proteins in the HL-60 cell proteome that are most prone to forming 4-hydroxy-2-nonenal (HNE) adducts upon exposure to nontoxic (10 μM) HNE concentrations. This group also includes heat shock 60 kDa protein 1. [15]
A BLAST [16] search finds no orthologs of LOC101059915 in single-celled organisms, fungi, or plants whose genomes have been sequenced. Among multicellular organisms, orthologs are found in mammals, excluding monotremes. The table below shows a representative sample of 20 orthologs of LOC101059915. It is ordered by estimated time of divergence from the human lineage, in millions of years ago (MYA). Orthologs with the same divergence time are sorted by sequence identity (%).
Genus and Species Name | Common name | Divergence from Human Lineage (MYA) | Accession Number | Sequence length (aa) | Sequence Identity to Human Protein | Sequence Similarity to Human Protein |
---|---|---|---|---|---|---|
Pan paniscus | Bonobo | 6.65 | XP_003820175.1 | 518 | 97% | 97% |
Papio anubis | Olive Baboon | 29.4 | XP_003919408.1 | 514 | 70% | 77% |
Piliocolobus tephrosceles | Ugandan Red Colobus | 29.4 | XP_023050488.1 | 514 | 68% | 76% |
Cebus capucinus imitator | White-Headed Capuchin | 43 | XP_017372264.1 | 472 | 68% | 72% |
Callithrix jacchus | Common Marmoset | 43 | XP_008987720.1 | 474 | 67% | 72% |
Aotus nancymaae | Nancy Ma's Night Monkey | 43 | XP_010822786.1 | 432 | 63% | 66% |
Saimiri boliviensis boliviensis | Black-capped Squirrel Monkey | 43 | XP_003944512.1 | 521 | 58% | 68% |
Rattus norvegicus | Brown Rat | 90 | XP_008756683.2 | 136 | 46% | 55% |
Ictidomys tridecemlineatus | Thirteen-Lined Ground Squirrel | 90 | XP_021576972.1 | 565 | 45% | 57% |
Canis lupus familiaris | Dog | 96 | XP_850392.2 | 526 | 52% | 64% |
Panthera pardus | Leopard | 96 | XP_019284263.1 | 472 | 47% | 58% |
Enhydra lutris kenyoni | Sea Otter | 96 | XP_022348355.1 | 495 | 47% | 59% |
Ovis aries musimon | Mouflon | 96 | XP_01208848.1 | 539 | 45% | 56% |
Capra hircus | Goat | 96 | XP_017899210.1 | 538 | 44% | 55% |
Pantholops hodgsonii | Tibetan Antelope | 96 | XP_005965061.1 | 533 | 44% | 55% |
Myotis lucifugus | Little Brown Bat | 96 | XP_006083036.1 | 500 | 42% | 55% |
Trichechus manatus latirostris | Florida manatee | 105 | XP_012415455.1 | 505 | 44% | 55% |
Orycteropus afer afer | Aardvark | 105 | XP_007957133.1 | 477 | 39% | 50% |
Phascolarctos cinereus | Koala | 159 | XP_020834608.1 | 474 | 37% | 50% |
Sarcophilus harrisii | Tasmanian Devil | 159 | XP_023362890.1 | 576 | 33% | 50% |
The most distant ortholog of LOC101059915 in the table is from Sarcophilus harrisii, commonly known as the Tasmanian devil, whose lineage diverged from the human lineage an estimated 159 million years ago.
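The ordering rule used for the table (divergence time first, then sequence identity from highest to lowest) can be sketched in Python as follows. The `Ortholog` type and its field names are illustrative assumptions; the six rows are copied from the table above and deliberately listed out of order.

```python
from typing import List, NamedTuple


class Ortholog(NamedTuple):
    species: str
    divergence_mya: float  # estimated divergence from the human lineage
    identity_pct: int      # sequence identity to the human protein


# A subset of rows from the table, in arbitrary order.
orthologs: List[Ortholog] = [
    Ortholog("Sarcophilus harrisii", 159, 33),
    Ortholog("Canis lupus familiaris", 96, 52),
    Ortholog("Piliocolobus tephrosceles", 29.4, 68),
    Ortholog("Phascolarctos cinereus", 159, 37),
    Ortholog("Pan paniscus", 6.65, 97),
    Ortholog("Papio anubis", 29.4, 70),
]

# Sort by divergence time ascending; break ties by identity descending.
ranked = sorted(orthologs, key=lambda o: (o.divergence_mya, -o.identity_pct))

# The last entry is the most distant ortholog with the lowest identity.
most_distant = ranked[-1]

for o in ranked:
    print(f"{o.species:28s} {o.divergence_mya:>6} MYA  {o.identity_pct}%")
print("Most distant:", most_distant.species)
```

Negating the identity in the sort key is one simple way to get a descending tie-break while keeping the primary key ascending.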
NBEAL1 is a protein that in humans is encoded by the NBEAL1 gene. It is found on chromosome 2q33.2 of Homo sapiens.
C8orf48 is a protein that in humans is encoded by the C8orf48 gene. C8orf48 is a nuclear protein specifically predicted to be located in the nuclear lamina. C8orf48 has been found to interact with proteins that are involved in the regulation of various cellular responses like gene expression, protein secretion, cell proliferation, and inflammatory responses. This protein has been linked to breast cancer and papillary thyroid carcinoma.
CXorf49 is a protein that in humans is encoded by the gene chromosome X open reading frame 49 (CXorf49).
C9orf135 is a gene that encodes a 229 amino acid protein. It is located on Chromosome 9 of the Homo sapiens genome at 9q12.21. The protein has a transmembrane domain from amino acids 124-140 and a glycosylation site at amino acid 75. C9orf135 is part of the GRCh37 gene on Chromosome 9 and is contained within the domain of unknown function superfamily 4572. Also, c9orf135 is known by the name of LOC138255 which is a description of the gene location on Chromosome 9.1.
FAM210B is a gene that in Homo sapiens encodes the protein FAM210B. It has been conserved throughout evolutionary history and is highly expressed in multiple tissues within the human body. FAM210B's primary location is the endoplasmic reticulum.
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
Coiled-coil domain containing protein 180 (CCDC180) is a protein that in humans is encoded by the CCDC180 gene. This protein is known to localize to the nucleus and is thought to be involved in regulation of transcription as are many proteins containing coiled-coil domains. As it is expressed most highly in the testes and is regulated by SRY and SOX transcription factors, it could be involved in sex determination.
Glutamate Rich Protein 2 is a protein in humans encoded by the gene ERICH2. This protein is expressed heavily in male tissues specifically in the testes, and proteins are specifically found in the nucleoli fibrillar center and the vesicles of these testicular cells. The protein has multiple protein interactions which indicate that it may play a role in histone modification and proper histone functioning.
Uncharacterized protein C2orf73 is a protein that in humans is encoded by the C2orf73 gene. The protein is predicted to be localized to the nucleus.
C17orf53 is a gene in humans that encodes a protein known as C17orf53, uncharacterized protein C17orf53. It has been shown to target the nucleus, with minor localization in the cytoplasm. Based on current findings C17orf53 is predicted to perform functions of transport, however further research into the protein could provide more specific evidence regarding its function.
Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.
Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.
Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.
Cilia- and flagella-associated protein 299 (CFAP299) is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.
Chromosome 1 open reading frame 141, or C1orf141, is a protein which, in humans, is encoded by the gene C1orf141. It is a precursor protein that becomes active after cleavage. Its function is not yet well understood, but it is suggested to be active during development.
Chromosome 1 open reading frame 167 (C1orf167) is a protein which in humans is encoded by the C1orf167 gene. The NCBI accession number is NP_001010881. The protein is 1468 amino acids in length with a molecular weight of 162.42 kDa. The mRNA sequence was found to be 4689 base pairs in length.
Proline-rich protein 16 (PRR16) is a protein coding gene in Homo sapiens. The protein is known by the alias Largen.
C12orf24 is a gene in humans that encodes a protein known as FAM216A. This gene is primarily expressed in the testis and brain, but has constitutive expression in 25 other tissues. FAM216A is an intracellular protein that has been predicted to reside within the nucleus of cells. The exact function of C12orf24 is unknown. FAM216A is highly expressed in Sertoli cells of the testis as well as different stage spermatids.
Protein FAM89A is a protein which in humans is encoded by the FAM89A gene. It is also known as chromosome 1 open reading frame 153 (C1orf153). The highest FAM89A gene expression is observed in the placenta and adipose tissue. Though its function is largely unknown, FAM89A is differentially expressed in response to interleukin exposure, and it is implicated in immune response pathways and various pathologies such as atherosclerosis and glioma cell expression.
Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcripts, which produce five different protein isoforms.