Transmembrane protein 241 (aka C18orf45, hVVT) is a ubiquitous sugar transporter protein which in humans is encoded by the TMEM241 gene. [1]
In humans, TMEM241 is a 142,188 bp gene located at 18q11.2 which contains 24 exons. [2]
TMEM241 is located near CABLES1, RIOK3, and NPC1 on chromosome 18. [3]
The primary mRNA for human TMEM241, isoform 1, [3] contains a 5' UTR hairpin loop conserved in primates. The primary mRNA for human TMEM241 isoform 1 contains binding sites in its 3' UTR for the miRNAs 520f-5p, 378a-5p, and 6866-5p. [4]
There are over 10 transcript variants predicted for the human TMEM241 gene found on BLAST. TMEM241 Isoform 1 is approximately 31 kDa in mass. The protein has an isoelectric point of 8.7. and is particularly rich in the amino acid phenylalanine, containing twice the normal proportion of this amino acid. [5]
TMEM241 is composed of 9 transmembrane domains forming a hydrophobic integral component of the membrane [2] composed primarily of alpha helices. [6] [7] [8] TMEM241 contains a VRG4 (Vanadate Resistant Glycosylation [9] ) domain with homology to the sugar transporter domain VRG4 from Saccharomyces cerevisiae (yeast). [10]
TMEM241 is predicted to undergo various phosphorylations, [11] glycation, [11] palmitoylation. [12] For example, TMEM241 isoform 1 [13] has a phosphorylation sites on S6, 64, 170, 177, 291, 295 and 296; [11] glycation sites on K125, 169 and 172; [11] palmitoylation sites on C13, 15, 221. [12]
There is some evidence that this protein may interact with keratin filament based on a two hybrid screen with the keratin protein KRT40. [14]
TMEM241 is likely to be expressed in all tissues at varying levels from basal to moderate expression. [15] Some studies have found changes in the expression of TMEM241. For instance, in cases of acute megakaryoblastic leukemia, TMEM241 was found to be one of the most upregulated genes. [16] In another case TMEM241 was found to be upregulated during the unfolded protein response following the overexpression of Ero1α (Endoplasmic Reticulum oxidoreduclin 1α). [10]
TMEM241 is conserved throughout eukaryotes.
TMEM241 is conserved across all animals and homologs are found throughout eukaryotes. TMEM241 has 19% global identity and 60% identity to GDP-mannose transporter from S. cerevisiae, which contains the VRG4 domain. It is likely that TMEM241 is a GDP-mannose transporter due to this similarity. The graph on the right shows the relative level of conservation of TMEM241 across many species of organism using the principle of a Molecular Clock.
Scientific name | Common name | Divergence from H. sapiens (m.y.a) [17] | Protein Length (amino acids) | % Identity to H. sapiens [18] [19] | Accession number |
---|---|---|---|---|---|
Homo sapiens | Human | 0 | 296 | 100 | NP_116322 |
Mus musculus | Mouse | 90.1 | 297 | 84 | NP_001276595 |
Calidris pugnax | Ruff (bird) | 320.5 | 296 | 80 | XP_003219697 |
Python bivittatus | Python | 320.5 | 296 | 74 | XP_014793906 |
Xenopus laevis | Frog | 354.4 | 293 | 69 | NP_001091222 |
Salmo salar | Salmon | 436.8 | 296 | 63 | XP_014036453 |
Octopus bimaculoides | Octopus | 903 | 291 | 35 | XP_014776992 |
Halyomorpha halys | Stink bug | 903 | 257 | 32 | XP_014284006 |
Saccharomyces cerevisiae | Yeast | 1335.5 | 216 | 25 | AJS02580 |
Coccomyxa subellipsoidea | Green Algae | 1570.5 | 296 | 28 | XP_005651133 |
TMEM241 has two paralogs in humans which have homologs throughout eukaryotes, UGTREL8 [20] and UGTREL7. [21] TMEM241, UGTREL8 and UGTREL7 are a family of sugar transport proteins with close identity to the GDP-mannose transporter identified in S. cerevisiae.
% Identity [18] [19] | TMEM241 (H.sapiens) | UGTREL7 (H.sapiens) | UGTREL8 (H.sapiens) | GDP Mannose Transporter (S. cerevisiae) |
---|---|---|---|---|
TMEM241 (H. sapiens) | 100% | 19.9% | 23.9% | 19.0% |
UGTREL7 (H. sapiens) | 19.9% | 100% | 52.2% | 18.8% |
UGTREL8 (H. sapiens) | 23.9% | 52.2% | 100% | 19.9% |
GDP Mannose Transporter (S. cerivisiae) | 19.0% | 18.8% | 19.9% | 100% |
Interferon-inducible GTPase 5 also known as immunity-related GTPase cinema 1 (IRGC1) is an enzyme that in humans is coded by the IRGC gene. It is predicted to behave like other proteins in the p47-GTPase-like and IRG families. It is most expressed in the testis.
TMEM156 is a gene that encodes the transmembrane protein 156 (TMEM156) in Homo sapiens. It has the clone name of FLJ23235.
Chromosome 16 open reading frame 95 (C16orf95) is a gene which in humans encodes the protein C16orf95. It has orthologs in mammals, and is expressed at a low level in many tissues. C16orf95 evolves quickly compared to other proteins.
The coiled-coil domain containing 142 (CCDC142) is a gene which in humans encodes the CCDC142 protein. The CCDC142 gene is located on chromosome 2, spans 4339 base pairs and contains 9 exons. The gene codes for the coiled-coil domain containing protein 142 (CCDC142), whose function is not yet well understood. There are two known isoforms of CCDC142. CCDC142 proteins produced from these transcripts range in size from 743 to 665 amino acids and contain signals suggesting protein movement between the cytosol and nucleus. Homologous CCDC142 genes are found in many animals including vertebrates and invertebrates but not fungus, plants, protists, archea, or bacteria. Although the function of this protein is not well understood, it contains a coiled-coil domain and a RINT1_TIP1 motif located within the coiled-coil domain.
WD repeat containing protein 53 (WDR53) is a protein encoded by the WDR53 gene that has been identified in the human genome by the Human Genome Project but has, at the moment, lacked experimental procedures to understand the function. It is located on chromosome 3 at location 3q29 in Homo sapiens. It has short up and down stream untranslated regions as well as WD40 repeat regions which have been linked to various functions.
Transmembrane protein 44 is a protein that in humans is encoded by the TMEM44 gene.
Neuroblastoma breakpoint family member 19, or NBPF19, is a protein that in humans is encoded by the NBPF19 gene. This protein is included in the neuroblastoma breakpoint family of proteins.
Transmembrane protein 151A, also known as TMEM151A, is a protein that is encoded by the TMEM151A gene.
TMEM128, also known as Transmembrane Protein 128, is a protein that in humans is encoded by the TMEM128 gene. TMEM128 has three variants, varying in 5' UTR's and start codon location. TMEM128 contains four transmembrane domains and is localized in the Endoplasmic Reticulum membrane. TMEM128 contains a variety of regulation at the gene, transcript, and protein level. While the function of TMEM128 is poorly understood, it interacts with several proteins associated with the cell cycle, signal transduction, and memory.
WD Repeat and Coiled-coiled containing protein (WDCP) is a protein which in humans is encoded by the WDCP gene. The function of the protein is not completely understood, but WDCP has been identified in a fusion protein with anaplastic lymphoma kinase found in colorectal cancer. WDCP has also been identified in the MRN complex, which processes double-stranded breaks in DNA.
C16orf90 or chromosome 16 open reading frame 90 produces uncharacterized protein C16orf90 in homo sapiens. C16orf90's protein has four predicted alpha-helix domains and is mildly expressed in the testes and lowly expressed throughout the body. While the function of C16orf90 is not yet well understood by the scientific community, it has suspected involvement in the biological stress response and apoptosis based on expression data from microarrays and post-translational modification data.
C1orf122 is a gene in the human genome that encodes the cytosolic protein ALAESM.. ALAESM is present in all tissue cells and highly up-regulated in the brain, spinal cord, adrenal gland and kidney. This gene can be expressed up to 2.5 times the average gene in its highly expressed tissues. Although the function of C1orf122 is unknown, it is predicted to be used for mitochondria localization.
Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.
Chromosome 1 Opening Reading Frame 94 or C1orf94 is a protein in human coded by the C1orf94 gene. The function of this protein is still poorly understood.
Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.
Transmembrane protein 221 (TMEM221) is a protein that in humans is encoded by the TMEM221 gene. The function of TMEM221 is currently not well understood.
TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.
THAP domain-containing protein 3 (THAP3) is a protein that, in Homo sapiens (humans), is encoded by the THAP3 gene. The THAP3 protein is as known as MGC33488, LOC90326, and THAP domain-containing, apoptosis associated protein 3. This protein contains the Thanatos-associated protein (THAP) domain and a host-cell factor 1C binding motif. These domains allow THAP3 to influence a variety of processes, including transcription and neuronal development. THAP3 is ubiquitously expressed in H. sapiens, though expression is highest in the kidneys.
NADP-dependent oxidoreductase domain-containing protein 1 is a protein that in humans is encoded by the NOXRED1 gene. An alias of this gene is Chromosome 14 Open Reading Frame 148 (c14orf148). This gene is located on chromosome 14, at 14q24.3. NOXRED1 is predicted to be involved in pyrroline-5-carboxylate reductase activity as part of the L-proline biosynthetic pathway. It is expressed in a wide variety of tissues at a relatively low level, including the testes, thyroid, skin, small intestine, brain, kidney, colon, and more.
Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.