LOC101059915

Last updated
LOC101059915
Identifiers
Aliases chromosome X open reading frame 49-like
External IDs HomoloGene: 131525 GeneCards:
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001323075

n/a

RefSeq (protein)

n/a

n/a

Location (UCSC) Chr X: 71.67 – 71.67 Mb n/a
PubMed search [2] n/a
Wikidata
View/Edit Human

LOC101059915 is a protein, which in humans is encoded by the LOC101059915 gene. It is located on the X chromosome and has restricted expression in the testis.

Contents

Gene

The LOC101059915 gene has two aliases known as chromosome X open reading frame 49-like and BX276092.6. [3]

Locus and Structure

LOC101059915 is located on the X chromosome at locus Xq13.1. It is 1,831 base pairs long and the gene sequence has 6 exons. [4] LOC101059915 also has one protein coding transcript.

Promoter Region and Expression

The promoter region of LOC101059915 is located on the sense strand of DNA, and between base pair 71666098 and 71667904 on the X chromosome. It spans up to 1.806 bp. [5] Expression of LOC101059915, however, is relatively low in human cells, and is primarily limited to the testis. [6]

Protein

General Features and Compositional Analysis

The protein has 518 amino acids and a molecular mass of 55.1 kDa. [7] The isoelectric point is 8.15. Compared to other human proteins LOC101059915 is glycine-, proline-rich, and serine-rich but the protein has lower levels of tyrosine. [8]

Domains

The domain of unknown function, DUF4641 covers almost the entire protein. It is a part of pfam15483. [9] It is 410 amino acids long, from amino acid 85 until amino acid number 495. [10]

Schematic illustration made using DOG software showing the domain of unknown function (DUF4641) as well as the location of secondary structures such as alpha helices, and post-translation modifications such as SUMO sites. LOC101059915 Diagram.png
Schematic illustration made using DOG software showing the domain of unknown function (DUF4641) as well as the location of secondary structures such as alpha helices, and post-translation modifications such as SUMO sites.

Secondary Structure

The secondary structure of LOC101059915 has been shown to consist of primarily alpha helices as determined by models made on I-TASSER and analysis using ExPASy tools. [11]

Displays the modeled secondary structures of LOC101059915 with the red indicating alpha helices, and the yellow indicating possible beta sheets. LOC101059915 Secondary Structure Model.gif
Displays the modeled secondary structures of LOC101059915 with the red indicating alpha helices, and the yellow indicating possible beta sheets.

Post-translation modifications

LOC101055915 is predicted to contain many different post-translational modifications. This include sites for phosphorylation (NetPhos 2.0 [12] ) and sumoylation (SUMOplot Analysis Program [13] ).

Subcellular localization

The LOC101059915 protein has been predicted to be located in the cell nucleus (PSORT II). [14]

Homology and Evolution

Paralogs

CXorf49 and CXorf49B are paralogs of LOC101059915. They share upwards of 78% similarity with LOC101059915 and likely went through a gene duplication event relatively recently, in evolutionary terms, due to the high degree of conservation between all three sequences.

CXorf49 is especially interesting due it being shown to be involved as one of the components of a small group of the HL-60 cell proteome that are most prone to form 4-Hydroxy-2-nonenal(HNE) adducts, upon exposure to nontoxic (10 μM) HNE concentrations, along with heat shock 60 kDa protein 1. [15]

Orthologs

Using BLAST [16] no orthologs for LOC101059915 are found in single celled organisms, fungi or plants whose genomes have been sequenced. For multi-cellular organisms, orthologs are found in mammals, excluding Monotremes. The table below shows a representative sample of 20 of the orthologs for LOC101059915. The table is organized based on the time of divergence from humans in millions of years (MYA). In cases where the divergence time-frame is the same the orthologs are sorted by identity (%).

Genus and Species NameCommon nameDivergence from Human Lineage (MYA)Accession NumberSequence length (aa)Sequence Identity to Human ProteinSequence Similarity to Human Protein
Pan paniscusChimpanzee6.65XP_003820175.151897%97%
Papio

anubis

Olive Baboon29.4XP_003919408.151470 %77%
Piliocolobus tephroscelesUgandan Red

Colobus

29.4XP_023050488.151468 %76%
Cebus capucinus imitatorWhite-Headed

Capuchin

43XP_017372264.147268 %72%
Callithrix jacchusCommon

Marmoset

43XP_008987720.147467%72%
Aotus nancymaaeNancy Ma's

Night Monkey

43XP_010822786.143263%66%
Saimiri boliviensis

boliviensis

Black-capped

Squirrel Monkey

43XP_003944512.152158%68%
Rattus norvegicusBrown Rat90XP_008756683.213646%55%
Ictidomys

tridecemlineatus

Thirteen-Lined

Ground Squirrel

90XP_021576972.156545%57%
Canis lupus familiarisDog96XP_850392.252652%64%
Panthera pardusLeopard96XP_019284263.147247%58%
Enhydra lutris kenyoniSea Otter96XP_022348355.149547%59%
Ovis aries musimonMouflon96XP_01208848.153945%56%
Capra hircusGoat96XP_017899210.153844%55%
Pantholops hodgsoniiTibetan Antelope96XP_005965061.153344%55%
Myotis lucifugusLittle brown bat96XP_006083036.150042 %55%
Trichechus manatus latirostrisFlorida manatee105XP_012415455.150544%55%
Orycteropus afer aferAardvark105XP_007957133.147739%50%
Phascolarctos cinereusKoala159XP_020834608.147437%50%
Sarcophilus harrisiiTasmanian Devil159XP_023362890.157633%50%

Phylogeny

Shows the unrooted branching of select orthologs for LOC10105519. LOC101055915 Unrooted Phylogenetic Tree.png
Shows the unrooted branching of select orthologs for LOC10105519.

The most distant ortholog for LOC101059915 is from the species Sarcrophilus harrisii which is commonly known as the Tasmanian Devil dating from more than 159.0 million years ago.

Related Research Articles

<span class="mw-page-title-main">NBEAL1</span> Protein-coding gene in the species Homo sapiens

NBEAL1 is a protein that in humans is encoded by the NBEAL1 gene. It is found on chromosome 2q33.2 of Homo sapiens.

<span class="mw-page-title-main">C8orf48</span> Protein-coding gene in the species Homo sapiens

C8orf48 is a protein that in humans is encoded by the C8orf48 gene. C8orf48 is a nuclear protein specifically predicted to be located in the nuclear lamina. C8orf48 has been found to interact with proteins that are involved in the regulation of various cellular responses like gene expression, protein secretion, cell proliferation, and inflammatory responses. This protein has been linked to breast cancer and papillary thyroid carcinoma.

CXorf49 is a protein, which in humans is encoded by the gene chromosome X open reading frame 49(CXorf49).

<span class="mw-page-title-main">C9orf135</span> Mammalian protein found in Homo sapiens

C9orf135 is a gene that encodes a 229 amino acid protein. It is located on Chromosome 9 of the Homo sapiens genome at 9q12.21. The protein has a transmembrane domain from amino acids 124-140 and a glycosylation site at amino acid 75. C9orf135 is part of the GRCh37 gene on Chromosome 9 and is contained within the domain of unknown function superfamily 4572. Also, c9orf135 is known by the name of LOC138255 which is a description of the gene location on Chromosome 9.1.

<span class="mw-page-title-main">FAM210B</span> Protein-coding gene in the species Homo sapiens

FAM210B is a gene that which in Homo sapiens encodes the protein FAM210B. It has been conserved throughout evolutionary history, and is highly expressed in multiple tissues within the human body. FAM210B's primary location is the endoplasmic reticulum.

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

Coiled-coil domain containing protein 180 (CCDC180) is a protein that in humans is encoded by the CCDC180 gene. This protein is known to localize to the nucleus and is thought to be involved in regulation of transcription as are many proteins containing coiled-coil domains. As it is expressed most highly in the testes and is regulated by SRY and SOX transcription factors, it could be involved in sex determination.

<span class="mw-page-title-main">ERICH2</span> Protein-coding gene in the species Homo sapiens

Glutamate Rich Protein 2 is a protein in humans encoded by the gene ERICH2. This protein is expressed heavily in male tissues specifically in the testes, and proteins are specifically found in the nucleoli fibrillar center and the vesicles of these testicular cells. The protein has multiple protein interactions which indicate that it may play a role in histone modification and proper histone functioning.

<span class="mw-page-title-main">C2orf73</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C2orf73 is a protein that in humans is encoded by the C2orf73 gene. The protein is predicted to be localized to the nucleus.

<span class="mw-page-title-main">C17orf53</span>

C17orf53 is a gene in humans that encodes a protein known as C17orf53, uncharacterized protein C17orf53. It has been shown to target the nucleus, with minor localization in the cytoplasm. Based on current findings C17orf53 is predicted to perform functions of transport, however further research into the protein could provide more specific evidence regarding its function.

<span class="mw-page-title-main">C16orf46</span> Human gene

Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.

<span class="mw-page-title-main">C19orf44</span> Mammalian protein found in Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

<span class="mw-page-title-main">C4orf51</span> Protein-coding gene in the species Homo sapiens

Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.

<span class="mw-page-title-main">CFAP299</span> Protein-coding gene in the species Homo sapiens

Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.

Chromosome 1 open reading frame 141, or C1orf141 is a protein which, in humans, is encoded by gene C1orf141. It is a precursor protein that becomes active after cleavage. The function is not yet well understood, but it is suggested to be active during development

Chromosome 1 open reading frame (C1orf167) is a protein which in humans is encoded by the C1orf167 gene. The NCBI accession number is NP_001010881. The protein is 1468 amino acids in length with a molecular weight of 162.42 kDa. The mRNA sequence was found to be 4689 base pairs in length.

Proline-rich protein 16 (PRR16) is a protein coding gene in Homo sapiens. The protein is known by the alias Largen.

<span class="mw-page-title-main">C12orf24</span> Protein-coding gene in humans

C12orf24 is a gene in humans that encodes a protein known as FAM216A. This gene is primarily expressed in the testis and brain, but has constitutive expression in 25 other tissues. FAM216A is an intracellular protein that has been predicted to reside within the nucleus of cells. The exact function of C12orf24 is unknown. FAM216A is highly expressed in Sertoli cells of the testis as well as different stage spermatids.

<span class="mw-page-title-main">Fam89A</span> Human protein and gene

ProteinFAM89A is a protein which in humans is encoded by the FAM89A gene. It is also known as chromosome 1 open reading frame 153 (C1orf153). Highest FAM89A gene expression is observed in the placenta and adipose tissue. Though its function is largely unknown, FAM89A is found to be differentially expressed in response to interleukin exposure, and it is implicated in immune responses pathways and various pathologies such as atherosclerosis and glioma cell expression.

<span class="mw-page-title-main">C13orf46</span> C13of46 Gene and Protein

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000283599 - Ensembl, May 2017
  2. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  3. "LOC101055915". GeneCards. Retrieved 2018-02-05.
  4. "LOC101055915 [Homo sapiens (human)] - Gene - NCBI". Ncbi.nlm.nih.gov. Retrieved 2018-04-28.
  5. "Genomatix's ElDorado". Archived from the original on 2021-04-03. Retrieved 2018-05-14.
  6. "EST Profile - Hs.632817". Ncbi.nlm.nih.gov. Retrieved 2018-04-28.
  7. "LOC101059915". ExPASy pI/mW tool. Retrieved 2018-04-28.
  8. "SDSC Biology Workbench". seqtool.sdsc.edu. Archived from the original on 2003-08-11. Retrieved 2018-05-06.
  9. "NCBI CDD Conserved Protein Domain DUF4641". www.ncbi.nlm.nih.gov. Retrieved 2018-05-06.
  10. "uncharacterized protein LOC101059915 [Homo sapiens] - Protein - NCBI". Ncbi.nlm.nih.gov. Retrieved 2018-01-28.
  11. "LOC101059915 Secondary Structure Model". Zhang Lab I-TASSER. Retrieved 2018-02-05.
  12. "NetPhos 2.0 Server". Cbs.dtu.dk. Retrieved 2018-04-28.
  13. "SUMOplot Analysis Program". Abgent. Retrieved 2018-04-28.
  14. "PSORT II Server" . Retrieved 2018-05-04.
  15. Arcaro, Alessia; Daga, Martina; Cetrangolo, Giovanni Paolo; Ciamporcero, Eric Stefano; Lepore, Alessio; Pizzimenti, Stefania; Petrella, Claudia; Graf, Maria; Uchida, Koji; Mamone, Gianfranco; Ferranti, Pasquale; Ames, Paul R. J.; Palumbo, Giuseppe; Barrera, Giuseppina; Gentile, Fabrizio (2015). "Generation of Adducts of 4-Hydroxy-2-nonenal with Heat Shock 60 kDa Protein 1 in Human Promyelocytic HL-60 and Monocytic THP-1 Cell Lines". Oxidative Medicine and Cellular Longevity. 2015: 296146. doi: 10.1155/2015/296146 . PMC   4452872 . PMID   26078803.
  16. Protein BLAST