C11orf91

Last updated
C11orf91
Identifiers
Aliases C11orf91 , chromosome 11 open reading frame 91
External IDs MGI: 1915493; HomoloGene: 41690; GeneCards: C11orf91; OMA:C11orf91 - orthologs
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001166692

NM_026634

RefSeq (protein)

NP_001160164

NP_080910

Location (UCSC) Chr 11: 33.7 – 33.7 Mb Chr 2: 103.95 – 103.96 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Chromosome 11 open reading frame 91, or C11orf91 is a protein which in humans is encoded by the C11orf91 gene.

Contents

Gene

Conceptual translation of human C11orf91 gene/protein. Conceptual Translation of Human C11orf91.png
Conceptual translation of human C11orf91 gene/protein.

The C11orf91 gene consists of 5159 nucleotides with an mRNA of approximately 836 base pairs. [5] [6] There is one exon found in the C11orf91 gene.

mRNA

The cytogenetic band location of C11orf91 is 11p13 and is located on the minus strand of the DNA . [7]

Conceptual translation

Annotated depiction of the C11orf91 mRNA and amino acid protein sequences.

Protein

Tertiary Structure of human C11orf91 protein. Created with AlphaFold Protein Structure Database. Tertiary Structure of C11orf91.png
Tertiary Structure of human C11orf91 protein. Created with AlphaFold Protein Structure Database.

The C11orf91 gene encodes a protein that is 193 amino acids in length. [9] [10] The C11orf91 protein contains a domain of unknown function, DUF5529, that spans nearly the entire protein. RBMX protein binding sites were found to be highly conserved in several structures of human C11orf91 3'UTR and 5' UTR. [11] [12] C11orf91 is rich in serine and proline and poor in valine and asparagine. [13] There is a proline rich region found in the middle of the C11orf91. [14] The human C11orf91 protein is approximately 20 kDal and has an isoelectric point around 9. [15]

Localization

Human C11orf91 protein is predicted to be localized in vesicles. [16] [17]

Structure

C11orf91 has two helices located near the C-terminus and no beta sheets. [18] [19] [20] [21] [22] [23] [24] [ excessive citations ]

Post-translational modifications

C11orf91 has a predicted Protein kinase C (PKC) phosphorylation site, Casein kinase 2 (CK2) phosphorylation site, amidation site, and two predicted serine phosphorylation sites, see Conceptual Translation for post-translational modification site locations. [25] [26] [27] [28]

Evolution

Corrected Sequence Divergence vs. Median Date of Divergence graph for human C11orf91, Fibrinogen Alpha, and Cytochrome C. C11orf91 Cytochrome C Fibrinogen Alpha Graph.png
Corrected Sequence Divergence vs. Median Date of Divergence graph for human C11orf91, Fibrinogen Alpha, and Cytochrome C.

There are no paralogs of the human C11orf91 protein. [29] [30] The human C11orf91 protein has several orthologs found across eight categories of jawed vertebrates including: aves, testudines, alligators, reptiles, mammals, amphibians, lungfishes, and cartilaginous fishes. [31]

Select Orthologs of C11orf91
Genus and SpeciesCommon NameTaxonomic GroupMedian Date of Divergence (MYA)Accession NumberSequence Length (aa)Sequence Identity to Human Protein (%)Sequence Similarity to Human Protein (%)
Homo sapiens HumanPrimates0 XP_016872542.1 193100100
Mus musculus MouseRodentia87 NP_080910.1 19481.784.3
Equus caballus HorseOdd-toed ungulates94 XP_023509601.1 1918789.1
Choloepus didactylus Southern two-toed slothPilosa99 XP_037697448.1 19186.287.2
Phascolarctos cinereus KoalaDiprotodontia160 XP_020819588.1 21267.170.8
Gopherus evgoodei Goode's thornscrub tortoiseTestudines319 XP_030417986.1 16346.555.5
Pogona vitticeps Central bearded dragonSquamata319 XP_020663275.1 16844.354.7
Sphaerodactylus townsendi Townsend's least geckoSquamata319 XP_048340539.1 1684351.7
Alligator mississippiensis American alligatorCrocodilia319 XP_059577366.1 15741.451.2
Tyto alba Barn owlStrigiformes319 XP_042641164.1 14334.541.5
Calypte anna HummingbirdApodiformes319 XP_030307391.1 14632.344.6
Gymnogyps californianus California condorAccipitriformes319 XP_050754406.1 14232.238.8
Dryobates pubescens Downy woodpeckerPiciformes319 XP_054027854.1 14332.143.5
Dromaius novaehollandiae EmuStruthioniformes319 XP_025967391.1 20929.635.8
Rhinatrema bivittatum Two-lined caeciliansCaecilidae352 XP_029438087.1 14445.157
Pleurodeles waltl Iberian ribbed newtCaudata352 KAJ1177600.1 14040.152.3
Hyla sarda Sardinian tree frogAnura352 XP_056384394.1 15327.437
Protopterus annectens West African lungfishLepidosireniformes408 XP_043915714.1 17328.537.4
Pygocentrus nattereri Red-bellied piranhaCharaciformes429 XP_017546298.1 14725.937.1
Amblyraja radiata Thorny skateRajiformes462 XP_032894439.1 13637.746.2

Related Research Articles

<span class="mw-page-title-main">C6orf201</span> Protein-coding gene in the species Homo sapiens

Chromosome 6 open reading frame 201, C6orf201, is a protein that in humans is encoded by the C6orf201 gene. In humans this gene encodes for a nuclear protein that is primarily expressed in the testis.

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

<span class="mw-page-title-main">C2orf73</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C2orf73 is a protein that in humans is encoded by the C2orf73 gene. The protein is predicted to be localized to the nucleus.

<span class="mw-page-title-main">C8orf58</span> Protein-coding gene in the species Homo sapiens

Chromosome 8 open reading frame 58 is an uncharacterised protein that in humans is encoded by the C8orf58 gene. The protein is predicted to be localized in the nucleus.

<span class="mw-page-title-main">C9orf25</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 25 (C9orf25) is a domain that encodes the FAM219A gene. The terms FAM219A and C9orf25 are aliases and can be used interchangeably. The function of this gene is not yet completely understood.

<span class="mw-page-title-main">C19orf44</span> Mammalian protein found in Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

<span class="mw-page-title-main">C22orf23</span> Protein-coding gene in the species Homo sapiens

C22orf23 is a protein which in humans is encoded by the C22orf23 gene. Its predicted secondary structure consists of alpha helices and disordered/coil regions. It is expressed in many tissues and highest in the testes and it is conserved across many orthologs.

<span class="mw-page-title-main">C1orf122</span> Protein-coding gene in the species Homo sapiens

C1orf122 is a gene in the human genome that encodes the cytosolic protein ALAESM.. ALAESM is present in all tissue cells and highly up-regulated in the brain, spinal cord, adrenal gland and kidney. This gene can be expressed up to 2.5 times the average gene in its highly expressed tissues. Although the function of C1orf122 is unknown, it is predicted to be used for mitochondria localization.

<span class="mw-page-title-main">C17orf78</span> Mammalian protein found in Homo sapiens

Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.

<span class="mw-page-title-main">CCDC121</span> Protein found in humans

Coiled-coil domain containing 121 (CCDC121) is a protein encoded by the CCDC121 gene in humans. CCDC121 is located on the minus strand of chromosome 2 and encodes three protein isoforms. All isoforms of CCDC121 contain a domain of unknown function referred to as DUF4515 or pfam14988.

<span class="mw-page-title-main">C11orf98</span> Protein-coding gene in the species Homo sapiens

C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.

<span class="mw-page-title-main">C12orf50</span> Protein-coding gene in humans

Chromosome 12 Open Reading Frame 50 (C12orf50) is a protein-encoding gene which in humans encodes for the C12orf50 protein. The accession id for this gene is NM_152589. The location of C12orf50 is 12q21.32. It covers 55.42 kb, from 88429231 to 88373811, on the reverse strand. Some of the neighboring genes to C12orf50 are RPS4XP15, LOC107984542, and C12orf29. RPS4XP15 is upstream C12orf50 and is on the same strand. LOC107984542 and C12orf29 are both downstream. LOC107984542 is on the opposite strand while C12orf29 is on the same strand. C12orf50 has six isoforms. This page is focusing on isoform X1. C12orf50 isoform X1 is 1711 nucleotides long and has a protein with a length of 414 aa.

<span class="mw-page-title-main">C4orf19</span> Human C4orf19 gene

C4orf19 is a protein which in humans is encoded by the C4orf19 gene.

<span class="mw-page-title-main">C22orf15</span> Protein-coding gene in the species Homo sapiens

C22orf15 is a protein which, in humans, is encoded by the C22orf15 gene.

Chromosome 4 open reading frame 54 is a protein that in humans is coded by the c4orf54 gene. This gene is also known as FOPV and LOC285556. This protein is mostly expressed in the nucleus of muscle cells. Orthologs are found in vertebrates but not invertebrates.

<span class="mw-page-title-main">C17orf75</span> Protein-coding gene in the species Homo sapiens

Protein Njmu-R1 is a protein that in humans is encoded by the C17orf75 gene. C17orf75 is also known as SRI2 and is a human protein encoding gene located at 17q11.2 on the complementary strand. The C17orf75 gene is ubiquitously expressed at medium-low levels throughout the body and at slightly higher levels in the brain and testes. This protein is thought to be part of a complex associated with Golgi-mediated vesicle capture.

<span class="mw-page-title-main">C13orf42</span> C13orf42 gene page

C13orf42 is a protein which, in humans, is encoded by the gene chromosome 13 open reading frame 42 (C13orf42). RNA sequencing data shows low expression of the C13orf42 gene in a variety of tissues. The C13orf42 protein is predicted to be localized in the mitochondria, nucleus, and cytosol. Tertiary structure predictions for C13orf42 indicate multiple alpha helices.

<span class="mw-page-title-main">THAP3</span> Protein in Humans

THAP domain-containing protein 3 (THAP3) is a protein that, in Homo sapiens (humans), is encoded by the THAP3 gene. The THAP3 protein is as known as MGC33488, LOC90326, and THAP domain-containing, apoptosis associated protein 3. This protein contains the Thanatos-associated protein (THAP) domain and a host-cell factor 1C binding motif. These domains allow THAP3 to influence a variety of processes, including transcription and neuronal development. THAP3 is ubiquitously expressed in H. sapiens, though expression is highest in the kidneys.

<span class="mw-page-title-main">C20orf144</span> Human protein-encoding gene

Chromosome 20 open reading frame 144 (c20orf144) is a human protein-encoding gene. The human c20orf144 protein consists of 153 amino acids, with the first 150 amino acids being characterized as part of the Bcl-2 like protein of testis (Bclt) family.

<span class="mw-page-title-main">C11ORF97</span> Protein which in humans is encoded by the C11ORF97 gene

C11ORF97, or Chromosome 11 Open Reading Frame 97, is a protein which in humans is encoded by the C11ORF97 gene. It is hypothesized to localize to the cytoplasm, and plays a role in the ciliary basal body. Based on its protein interactions, it is thought to have a role in Lemierre's Syndrome and Hepatic Coma.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000205177 Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000032671 Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. Human chromosome 11 open reading frame 91 (C11orf91), mRNA. NCBI Nucleotide. Retrieved 23 September 2023.
  6. Uncharacterized protein C11orf91 [Homo sapiens]. NCBI Protein. Retrieved 23 September 2023.
  7. C11orf91 Gene. Gene Cards. Retrieved 23 September 2023.
  8. AlphaFold Protein Structure Database. AlphaFold Protein Structure Database. Retrieved 5 December 2023.
  9. Homo sapiens chromosome 11 open reading frame 91 (C11orf91), mRNA. NCBI Nucleotide. Retrieved 23 September 2023.
  10. Uncharacterized protein C11orf91 [Homo sapiens]. NCBI Protein. Retrieved 23 September 2023.
  11. RNA Folding Form Tool. The UNAFold Web Server. Retrieved 5 December 2023.
  12. Predict Binding Sites from PWMs Tool. RBPDB.Retrieved 5 December 2023.
  13. Statistical Analysis of Protein Sequences Tool. European Bioinformatics Institute. Retrieved 5 December 2023.
  14. Homo sapiens chromosome 11 open reading frame 91 (C11orf91), mRNA. NCBI Nucleotide. Retrieved 5 December 2023.
  15. PhosphoSitePlus Tool. PhosphoSitPlus. Retrieved 15 December 2023.
  16. C11orf91 Polyclonal Antibody. ThermoFisher Scientific. Retrieved 5 December 2023.
  17. The Human Protein Atlas entry on C11orf91. Human Protein Atlas. Retrieved 5 December 2023.
  18. Chou & Fasman Secondary Structure Prediction Server. Biogem. Retrieved 5 December 2023.
  19. JPred 4: A Protein Secondary Structure Prediction Server. JPred 4. Retrieved 5 December 2023.
  20. Predict Protein Tool. Predictprotein. Retrieved 5 December 2023.
  21. Ali2D Tool. MPI Bioinformatics Toolkit. Retrieved 5 December 2023.
  22. I-TASSER Tool. The Yang Zhang Lab. Retrieved 5 December 2023.
  23. AlphaFold Protein Structure Database. AlphaFold Protein Structure Database. Retrieved 5 December 2023.
  24. NCBI Structure iCN3D. iCn3D: Web-based 3D Structure Viewer. Retrieved 5 December 2023.
  25. MyHits Motif Scan Tool. SIB Swiss Institute of Bioinformatics. Retrieved 5 December 2023.
  26. DTU Health Tech NetPhos 3.1 Tool. DTU Health Tech Department of Health Technology. Retrieved 5 December 2023.
  27. GPS 6.0. Group Based Prediction System. Retrieved 5 December 2023.
  28. PhosphoSitePlus Tool. Phosphosite. Retrieved 5 December 2023.
  29. (National Center for Biotechnology Information) Basic Local Alignment Search Tool. NCBI BLAST. Retrieved 5 December 2023.
  30. UCSC Genome Browser entry on Human GRCh38/hg38. UCSC Genome Browser. Retrieved 5 December 2023.
  31. NCBI (National Center for Biotechnology Information) entry on C11orf91 - chromosome 11 open reading frame 91 NCBI Orthologs . NCBI Protein. Retrieved 19 October 2023.