TMEM44 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | TMEM44 , transmembrane protein 44 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | MGI: 1924489 HomoloGene: 26702 GeneCards: TMEM44 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
TMEM44 (Transmembrane protein 44) is a protein that in humans is encoded by the TMEM44 gene. [5] DKFZp686O18124 is a synonym of TMEM44.
TMEM44 gene has 14 transcripts (splice variants). The whole span of the gene is 46,016 base pairs long, while the mRNA sequence of TMEM44 is 1,483 base pairs long, with 13 exons. Exon 1 and 2 partial are part of 5'-UTR, and the partial exon 2 is only highly conserved in primates. [6]
There are 5 experimentally verified promoters, and 4 predicted ones. Promoter GXP_232172, which is promoter set 5 is the longest with 1,276 base pairs and a total of 11 coding transcripts. [7]
There is an overall low level expression of TMEM44 gene throughout the body parts and throughout the developmental stages of humans. Some parts where TMEM44 expression is detected are in bone, brain, eye, ovary, pancreas and uterus. Some expression was also detected under certain health conditions including gastrointestinal tumor, glioma, ovarian tumor, pancreatic tumor, muscle tissue tumor and uterine tumor. [8]
TMEM44 gene is located near the end of the long arm of chromosome 3 (3q29) in humans (Homo sapiens). [9]
TMEM44 is 428 amino acids in length. The molecular weight of the protein is 47.1kDa, and its formula is C2086H3315N585O611S22, with a total of 6,619 atoms. [10] The theoretical isoelectric point (pI) of TMEM44 is 8.12. [11] The instability index (II) of TMEM44 is 47.96, which classifies the protein as unstable. There are 12 isoforms of TMEM44, with isoform c being the longest. [9] The function of TMEM44 is currently unknown.
The C-terminus of TMEM44 is found in the cytoplasm, and the protein is predicted to be integrated within the membrane of the endoplasmic reticulum. [12]
TMEM44 has 41.12% of alpha helix, 15.65% of extended strand and 43.22% of random coil. [13]
There are seven predicted transmembrane domains in TMEM44 protein.
GSK3B (Glycogen synthase kinase 3 beta), KAT6B (Histone acetyltransferase KAT6B/Histone acetyltransferase MYST4), TMEM31 (Transmembrane protein 31), SPAG9 (sperm associated antigen 9) and TNKS (tankyrase-1) are predicted to interact with TMEM44. [14] [15] [16]
TMEM44 undergoes threonine, tyrosine and serine phosphorylations. [17] Many serine phosphorylation takes place near the C-terminus, causing it to be negatively charged.
The Glycine (G) found nearest from the C-terminus is predicted to have glycosylphosphatidylinositol (GPI) attached, which anchors the protein to the cellular plasma membrane. [18]
The first 45 amino acids serve as a signal peptide cleavage site. [19]
Orthologs with the TMEM44 protein include amphibians, birds, fish, and mammals. The closest ortholog from human with TMEM44 is common chimpanzee (Pan troglodytes) with 98% identity, and the most distantly related ortholog is common carp (Cyprinus carpio) with 27% identity. [20]
12 selected orthologs of TMEM44 are shown below.
sequence number | genus | species | common name | date of divergence/MYA | NCBI [21] accession number | identity/% |
---|---|---|---|---|---|---|
Homo | sapiens | human | 0 | AAI44160.1 | 100 | |
1 | Macaca | fascicularis | crab-eating macaque | 29 | XP_005545405.1 | 94 |
2 | Rhinolophus | sinicus | Chinese rufous horseshoe bat | 96 | XP_019578895.1 | 80 |
3 | Condylura | cristata | star-nosed mole | 96 | XP_012585115.1 | 79 |
4 | Enhydra | lutris kenyoni | sea otter | 96 | XP_022370817.1 | 78 |
5 | Sorex | araneus | common shrew | 96 | XP_012791655.1 | 63 |
6 | Chrysemys | picta bellii | painted turtle | 312 | XP_023967126.1 | 51 |
7 | Nipponia | nippon | crested ibis | 312 | XP_009466798.1 | 42 |
8 | Xenopus | tropicalis | western clawed frog | 353 | XP_012818195.1 | 39 |
9 | Salvelinus | alpinus | arctic char | 435 | XP_023859379.1 | 35 |
10 | Hippocampus | comes | tiger tail seahorse | 435 | XP_019735697.1 | 33 |
11 | Oreochromis | niloticus | Nile tilapia | 435 | XP_013119610.1 | 30 |
12 | Monopterus | albus | Asian swamp eel | 435 | XP_020452501.1 | 29 |
TMEM44 is generally fast evolving, with about 0.310 changes of amino acids per 100 over a million year.
Predicted paralogous proteins of TMEM44 are C9IZ85, F8WCY1, F8WE47, H7C3X7, J3KQW3, Q6PL43, and Q96I73. [22]
TMEM156 is a gene that encodes the transmembrane protein 156 (TMEM156) in Homo sapiens. It has the clone name of FLJ23235.
FAM210B is a gene that which in Homo sapiens encodes the protein FAM210B. It has been conserved throughout evolutionary history, and is highly expressed in multiple tissues within the human body. FAM210B's primary location is the endoplasmic reticulum.
Transmembrane Protein 176B, or TMEM176B is a transmembrane protein that in humans is encoded by the TMEM176B gene. It is thought to play a role in the process of maturation of dendritic cells.
Glutamate Rich Protein 2 is a protein in humans encoded by the gene ERICH2. This protein is expressed heavily in male tissues specifically in the testes, and proteins are specifically found in the nucleoli fibrillar center and the vesicles of these testicular cells. The protein has multiple protein interactions which indicate that it may play a role in histone modification and proper histone functioning.
Transmembrane and coiled-coil domains 4, TMCO4, is a protein in humans that is encoded by the TMCO4 gene. Currently, its function is not well defined. It is transmembrane protein that is predicted to cross the endoplasmic reticulum membrane three times. TMCO4 interacts with other proteins known to play a role in cancer development, hinting at a possible role in the disease of cancer.
Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.
Transmembrane protein 44 is a protein that in humans is encoded by the TMEM44 gene.
Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.
Cilia- and flagella-associated protein 299 (CFAP299), is a protein that in humans is encoded by the CFAP299 gene. CFAP299 is predicted to play a role in spermatogenesis and cell apoptosis.
Single-pass membrane and coiled-coil domain-containing protein 3 is a protein that is encoded in humans by the SMCO3 gene.
Transmembrane protein 155 is a protein that in humans is encoded by the TMEM155 gene. It is located on human chromosome 4, spanning 6,497 bases. It is also referred to as FLJ30834 and LOC132332. This protein is known to be expressed mainly in the brain, placenta, and lymph nodes and is conserved throughout most placental mammals. The function and structure of this protein is still not well understood, but its level of expression has been studied pertaining to various pathologies.
C5orf46 is a protein coding gene located on chromosome 5 in humans. It is also known as sssp1, or skin and saliva secreted protein 1. There are two known isoforms known in humans, with isoform 2 being the longer of the two. The protein encoded is predicted to have one transmembrane domain, and has a predicted molecular weight of 9,692 Da, and a basal isoelectric point of 4.67.
Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.
Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.
Transmembrane protein 221 (TMEM221) is a protein that in humans is encoded by the TMEM221 gene. The function of TMEM221 is currently not well understood.
TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.
Transmembrane protein 169 (TMEM169) in humans is encoded by TMEM169 gene. The aliases of TMEM169 include FLJ34263, DKFZp781L2456, and LOC92691. TMEM169 has the highest expression in the brain, particularly the fetal brain. TMEM169 has homologs mammals, reptiles, amphibians, birds, fish, chordates and invertebrates. The most distantly related homolog of TMEM169 is Anopheles albimanus.
Transmembrane protein 212 is a protein that in humans is encoded by the TMEM212 gene. The protein consists of 5 transmembrane domains and localizes in the plasma membrane and endoplasmic reticulum. TMEM212 has orthologs in vertebrates but not invertebrates. TMEM212 has been associated with sporadic Parkinson's disease, facial processing, and adiposity in African Americans.
Transmembrane Protein 144 (TMEM144) is a protein in humans encoded by the TMEM144 gene.
Transmembrane protein 248, also known as C7orf42, is a gene that in humans encodes the TMEM248 protein. This gene contains multiple transmembrane domains and is composed of seven exons.TMEM248 is predicted to be a component of the plasma membrane and be involved in vesicular trafficking. It has low tissue specificity, meaning it is ubiquitously expressed in tissues throughout the human body. Orthology analyses determined that TMEM248 is highly conserved, having homology with vertebrates and invertebrates. TMEM248 may play a role in cancer development. It was shown to be more highly expressed in cases of colon, breast, lung, ovarian, brain, and renal cancers.