C11orf91 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | C11orf91 , chromosome 11 open reading frame 91 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1915493; HomoloGene: 41690; GeneCards: C11orf91; OMA:C11orf91 - orthologs | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Chromosome 11 open reading frame 91, or C11orf91 is a protein which in humans is encoded by the C11orf91 gene.
The C11orf91 gene consists of 5159 nucleotides with an mRNA of approximately 836 base pairs. [5] [6] There is one exon found in the C11orf91 gene.
The cytogenetic band location of C11orf91 is 11p13 and is located on the minus strand of the DNA . [7]
Annotated depiction of the C11orf91 mRNA and amino acid protein sequences.
The C11orf91 gene encodes a protein that is 193 amino acids in length. [9] [10] The C11orf91 protein contains a domain of unknown function, DUF5529, that spans nearly the entire protein. RBMX protein binding sites were found to be highly conserved in several structures of human C11orf91 3'UTR and 5' UTR. [11] [12] C11orf91 is rich in serine and proline and poor in valine and asparagine. [13] There is a proline rich region found in the middle of the C11orf91. [14] The human C11orf91 protein is approximately 20 kDal and has an isoelectric point around 9. [15]
Human C11orf91 protein is predicted to be localized in vesicles. [16] [17]
C11orf91 has two helices located near the C-terminus and no beta sheets. [18] [19] [20] [21] [22] [23] [24] [ excessive citations ]
C11orf91 has a predicted Protein kinase C (PKC) phosphorylation site, Casein kinase 2 (CK2) phosphorylation site, amidation site, and two predicted serine phosphorylation sites, see Conceptual Translation for post-translational modification site locations. [25] [26] [27] [28]
There are no paralogs of the human C11orf91 protein. [29] [30] The human C11orf91 protein has several orthologs found across eight categories of jawed vertebrates including: aves, testudines, alligators, reptiles, mammals, amphibians, lungfishes, and cartilaginous fishes. [31]
Genus and Species | Common Name | Taxonomic Group | Median Date of Divergence (MYA) | Accession Number | Sequence Length (aa) | Sequence Identity to Human Protein (%) | Sequence Similarity to Human Protein (%) |
---|---|---|---|---|---|---|---|
Homo sapiens | Human | Primates | 0 | XP_016872542.1 | 193 | 100 | 100 |
Mus musculus | Mouse | Rodentia | 87 | NP_080910.1 | 194 | 81.7 | 84.3 |
Equus caballus | Horse | Odd-toed ungulates | 94 | XP_023509601.1 | 191 | 87 | 89.1 |
Choloepus didactylus | Southern two-toed sloth | Pilosa | 99 | XP_037697448.1 | 191 | 86.2 | 87.2 |
Phascolarctos cinereus | Koala | Diprotodontia | 160 | XP_020819588.1 | 212 | 67.1 | 70.8 |
Gopherus evgoodei | Goode's thornscrub tortoise | Testudines | 319 | XP_030417986.1 | 163 | 46.5 | 55.5 |
Pogona vitticeps | Central bearded dragon | Squamata | 319 | XP_020663275.1 | 168 | 44.3 | 54.7 |
Sphaerodactylus townsendi | Townsend's least gecko | Squamata | 319 | XP_048340539.1 | 168 | 43 | 51.7 |
Alligator mississippiensis | American alligator | Crocodilia | 319 | XP_059577366.1 | 157 | 41.4 | 51.2 |
Tyto alba | Barn owl | Strigiformes | 319 | XP_042641164.1 | 143 | 34.5 | 41.5 |
Calypte anna | Hummingbird | Apodiformes | 319 | XP_030307391.1 | 146 | 32.3 | 44.6 |
Gymnogyps californianus | California condor | Accipitriformes | 319 | XP_050754406.1 | 142 | 32.2 | 38.8 |
Dryobates pubescens | Downy woodpecker | Piciformes | 319 | XP_054027854.1 | 143 | 32.1 | 43.5 |
Dromaius novaehollandiae | Emu | Struthioniformes | 319 | XP_025967391.1 | 209 | 29.6 | 35.8 |
Rhinatrema bivittatum | Two-lined caecilians | Caecilidae | 352 | XP_029438087.1 | 144 | 45.1 | 57 |
Pleurodeles waltl | Iberian ribbed newt | Caudata | 352 | KAJ1177600.1 | 140 | 40.1 | 52.3 |
Hyla sarda | Sardinian tree frog | Anura | 352 | XP_056384394.1 | 153 | 27.4 | 37 |
Protopterus annectens | West African lungfish | Lepidosireniformes | 408 | XP_043915714.1 | 173 | 28.5 | 37.4 |
Pygocentrus nattereri | Red-bellied piranha | Characiformes | 429 | XP_017546298.1 | 147 | 25.9 | 37.1 |
Amblyraja radiata | Thorny skate | Rajiformes | 462 | XP_032894439.1 | 136 | 37.7 | 46.2 |
Chromosome 6 open reading frame 201, C6orf201, is a protein that in humans is encoded by the C6orf201 gene. In humans this gene encodes for a nuclear protein that is primarily expressed in the testis.
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
Uncharacterized protein C2orf73 is a protein that in humans is encoded by the C2orf73 gene. The protein is predicted to be localized to the nucleus.
Chromosome 8 open reading frame 58 is an uncharacterised protein that in humans is encoded by the C8orf58 gene. The protein is predicted to be localized in the nucleus.
Chromosome 9 open reading frame 25 (C9orf25) is a domain that encodes the FAM219A gene. The terms FAM219A and C9orf25 are aliases and can be used interchangeably. The function of this gene is not yet completely understood.
Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.
C22orf23 is a protein which in humans is encoded by the C22orf23 gene. Its predicted secondary structure consists of alpha helices and disordered/coil regions. It is expressed in many tissues and highest in the testes and it is conserved across many orthologs.
C1orf122 is a gene in the human genome that encodes the cytosolic protein ALAESM.. ALAESM is present in all tissue cells and highly up-regulated in the brain, spinal cord, adrenal gland and kidney. This gene can be expressed up to 2.5 times the average gene in its highly expressed tissues. Although the function of C1orf122 is unknown, it is predicted to be used for mitochondria localization.
Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.
Coiled-coil domain containing 121 (CCDC121) is a protein encoded by the CCDC121 gene in humans. CCDC121 is located on the minus strand of chromosome 2 and encodes three protein isoforms. All isoforms of CCDC121 contain a domain of unknown function referred to as DUF4515 or pfam14988.
C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.
Chromosome 12 Open Reading Frame 50 (C12orf50) is a protein-encoding gene which in humans encodes for the C12orf50 protein. The accession id for this gene is NM_152589. The location of C12orf50 is 12q21.32. It covers 55.42 kb, from 88429231 to 88373811, on the reverse strand. Some of the neighboring genes to C12orf50 are RPS4XP15, LOC107984542, and C12orf29. RPS4XP15 is upstream C12orf50 and is on the same strand. LOC107984542 and C12orf29 are both downstream. LOC107984542 is on the opposite strand while C12orf29 is on the same strand. C12orf50 has six isoforms. This page is focusing on isoform X1. C12orf50 isoform X1 is 1711 nucleotides long and has a protein with a length of 414 aa.
C4orf19 is a protein which in humans is encoded by the C4orf19 gene.
C22orf15 is a protein which, in humans, is encoded by the C22orf15 gene.
Chromosome 4 open reading frame 54 is a protein that in humans is coded by the c4orf54 gene. This gene is also known as FOPV and LOC285556. This protein is mostly expressed in the nucleus of muscle cells. Orthologs are found in vertebrates but not invertebrates.
Protein Njmu-R1 is a protein that in humans is encoded by the C17orf75 gene. C17orf75 is also known as SRI2 and is a human protein encoding gene located at 17q11.2 on the complementary strand. The C17orf75 gene is ubiquitously expressed at medium-low levels throughout the body and at slightly higher levels in the brain and testes. This protein is thought to be part of a complex associated with Golgi-mediated vesicle capture.
C13orf42 is a protein which, in humans, is encoded by the gene chromosome 13 open reading frame 42 (C13orf42). RNA sequencing data shows low expression of the C13orf42 gene in a variety of tissues. The C13orf42 protein is predicted to be localized in the mitochondria, nucleus, and cytosol. Tertiary structure predictions for C13orf42 indicate multiple alpha helices.
THAP domain-containing protein 3 (THAP3) is a protein that, in Homo sapiens (humans), is encoded by the THAP3 gene. The THAP3 protein is as known as MGC33488, LOC90326, and THAP domain-containing, apoptosis associated protein 3. This protein contains the Thanatos-associated protein (THAP) domain and a host-cell factor 1C binding motif. These domains allow THAP3 to influence a variety of processes, including transcription and neuronal development. THAP3 is ubiquitously expressed in H. sapiens, though expression is highest in the kidneys.
Chromosome 20 open reading frame 144 (c20orf144) is a human protein-encoding gene. The human c20orf144 protein consists of 153 amino acids, with the first 150 amino acids being characterized as part of the Bcl-2 like protein of testis (Bclt) family.
C11ORF97, or Chromosome 11 Open Reading Frame 97, is a protein which in humans is encoded by the C11ORF97 gene. It is hypothesized to localize to the cytoplasm, and plays a role in the ciliary basal body. Based on its protein interactions, it is thought to have a role in Lemierre's Syndrome and Hepatic Coma.