TMEM143 (Transmembrane protein 143) is a protein that in humans is encoded by the TMEM143 gene. [1] TMEM143, a dual-pass protein (two transmembrane domains), is predicted to reside in the mitochondria, [2] [3] and high expression has been found in both human skeletal muscle and the heart. [4] [5] Interactions with other proteins indicate that TMEM143 could potentially play a role in tumor suppression/expression and cancer regulation. [6]
Located on the negative strand of human DNA, TMEM143 spans 31,882 base pairs on human chromosome 19 (19q13.33), neighbored by genes Coiled-coil domain containing 114 (CCDC114) and ER lumen protein-retaining receptor 1 (KDELR1). [1]
In humans, the TMEM143 gene encodes five transcript variants (1-5). Variant 1 is the longest mRNA transcript, at 2577 nucleotides (nt) with a total of 8 exons, [1] [7] and is possibly the most indicative of function. Compared to variant 1, variant 2 (2472 nt, 424 amino acid protein) and variant 3 (2382 nt, 394 amino acid protein) each lack an in-frame exon in the 5' coding region, while variant 4 (2277 nt, 359 amino acid protein) lacks two in-frame exons in the 5' coding region; all three encode an N-terminally truncated protein. [1] Transcript variant 5 is a non-coding RNA, approximately 2231 nt long, and is a candidate for nonsense-mediated mRNA decay.
There are four protein isoforms, [1] each corresponding to one of the coding transcript variants. Variant 1 codes for isoform a (the longest protein), and variants 2, 3, and 4 code for isoforms b, c, and d, respectively. TMEM143 isoform a is 459 amino acids in length and has a molecular weight of 51.6 kDa and an isoelectric point of 9.7 in humans. [2] It contains a domain of unknown function (DUF3754), within which reside two transmembrane domains, 24 and 16 amino acids in length, both helical in nature. [7]
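As a rough plausibility check on the isoform figures above, the coding-sequence length and approximate mass of each isoform can be estimated from its residue count. The sketch below is illustrative only: the average residue mass of about 110 Da is a general rule of thumb rather than a value from the cited sources, and the function names are hypothetical.

```python
# Rule-of-thumb constants; assumptions for illustration, not values from the cited sources.
AVG_RESIDUE_MASS_DA = 110.0  # approximate average mass of one amino acid residue
NT_PER_CODON = 3

# Isoform lengths in amino acids, as stated in the text (isoforms a-d map to variants 1-4).
ISOFORM_LENGTHS = {"a": 459, "b": 424, "c": 394, "d": 359}


def cds_length_nt(n_residues: int) -> int:
    """Nucleotides needed to encode n_residues plus one stop codon."""
    return (n_residues + 1) * NT_PER_CODON


def approx_mass_kda(n_residues: int) -> float:
    """Crude mass estimate in kDa from the residue count alone."""
    return n_residues * AVG_RESIDUE_MASS_DA / 1000.0


if __name__ == "__main__":
    for name, length in ISOFORM_LENGTHS.items():
        print(f"isoform {name}: {length} aa, "
              f"CDS ~{cds_length_nt(length)} nt, "
              f"mass ~{approx_mass_kda(length):.1f} kDa")
```

Under these assumptions, the 459 residues of isoform a correspond to 1380 nt of coding sequence and a mass of roughly 50 kDa, which is close to the reported 51.6 kDa. A sequence-based calculation would be needed for an exact figure.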
The transmembrane domains encompass the uncharged region present at amino acids 278 to 302. A predicted mitochondrial targeting peptide resides at the N-terminus, spanning 52 amino acids, with the cleavage site between amino acids M-51 and G-52. [8] In addition, phosphorylation sites, both general and kinase-specific, are predicted throughout the protein, which may give an indication of the protein's location inside the cell. [9] [10]
Orthologs have been identified in more than 85 vertebrate species. [1] No TMEM143 orthologs have yet been identified in birds.
There are currently 85 species with known orthologs, all of them vertebrates (with the exception of birds), the most distant being Latimeria chalumnae (coelacanth). [1] DUF3754, a generally conserved region, appears in a majority of the orthologs with slight amino acid alterations to the sequence. This domain has been found in organisms as diverse as bacteria and archaea; however, there are no known TMEM143 orthologs in either organismal domain. [7]
There are no known paralogs for the human TMEM143 sequence.
Possible human expression of TMEM143 protein occurs in Jurkat cells (T lymphocytes). [1] Organelle association places TMEM143 in the mitochondria as an integral membrane protein, [2] with predicted presence also in the plasma membrane, endoplasmic reticulum, extracellular matrix, and Golgi apparatus. [3]
High expression has been found in the heart and skeletal muscle, as indicated by human expression profiling. [5] Microarray expression data from normal human tissues [4] also indicate expression in the heart and skeletal muscle, at a 95th to 97th percentile rank (among the tissues tested for normal human expression of TMEM143).
Through text mining, TMEM143 is shown to have interactions with seven different proteins in humans: Zinc finger protein 541 (ZNF541), DNA-damage inducible 1 homolog 2 (DDI2), Paraneoplastic Ma antigen family-like 2 (PNMAL2), Kelch-like 31 (KLHL31), Chromosome 14 open reading frame 28 (C14orf28), Tripartite motif containing 71 (TRIM71), and Cytoplasmic polyadenylation element binding protein 2 (CPEB2). [11] [12]
ZNF541 and PNMAL2, in relation to TMEM143, have been documented as having a role in the allelic loss of chromosome 19q13.3. This loss has been documented in cases of malignant gliomas, neuroblastomas, and ovarian carcinomas, suggesting a tumor suppressor gene or genes in this region. [6] While TMEM143 is not directly referred to in research in this area, it is present in this region of the chromosome, indicating a potential functional role in humans.
Interactions between TMEM143 and DDI2, KLHL31, C14orf28, TRIM71, and CPEB2 (in humans) have been documented through microarray data. [13] In an analysis of predicted gene regulation by different microRNAs (miRNAs) under ionizing radiation conditions, TMEM143 shares predicted regulatory miRNAs with DDI2, KLHL31, C14orf28, TRIM71, and CPEB2. TMEM143 has also been found to be associated with adipocyte differentiation. Along with other genes, TMEM143 has been documented as a PPARγ (peroxisome proliferator-activated receptor gamma) target. This indicates the possibility of TMEM143 participation in lipid metabolic pathways and lipid cell differentiation. [14]
HIKESHI is a protein important in lung and multicellular organismal development that, in humans, is encoded by the HIKESHI gene. HIKESHI is found on chromosome 11 in humans and chromosome 7 in mice. Similar sequences (orthologs) are found in most animal and fungal species. The mouse homolog, lethal gene on chromosome 7 Rinchik 6 protein, is encoded by the l7Rn6 gene.
Transmembrane protein 131-like, alternatively named uncharacterized protein KIAA0922, is an integral transmembrane protein encoded by the human gene KIAA0922 that is significantly conserved in eukaryotes, at least through protists. Although the function of this gene is not yet fully elucidated, initial microarray evidence suggests that it may be involved in immune responses. Furthermore, its paralog prolyl endopeptidase (PREP), whose function is known, provides clues as to the function of TMEM131L.
Transmembrane protein 33 is a protein that in humans is encoded by the TMEM33 gene, also known as SHINC3. Another name for the TMEM33 protein is DB83.
DEP Domain Containing Protein 1B, also known as XTP1, XTP8, HBV XAg-Transactivated Protein 8, [formerly referred to as BRCC3], is a human protein encoded by a gene of similar name located on chromosome 5.
Transmembrane protein 134 is a protein encoded by the TMEM134 gene. TMEM134 does not have any other known aliases. There are two transmembrane domains and a domain of unknown function (DUF872). Evolutionarily, the majority of the organisms that have this gene are primates and other mammals, although it is also found in organisms as distant as Drosophila and C. elegans. Current research has not confirmed any function for TMEM134.
Transmembrane protein 251, also known as C14orf109 or UPF0694, is a protein that in humans is encoded by the TMEM251 gene. One notable feature of this protein is the presence of proline residues in one of its predicted transmembrane domains, which is a determinant of the intramitochondrial sorting of inner membrane proteins.
The coiled-coil domain containing 142 (CCDC142) is a gene which in humans encodes the CCDC142 protein. The CCDC142 gene is located on chromosome 2, spans 4339 base pairs and contains 9 exons. The gene codes for the coiled-coil domain containing protein 142 (CCDC142), whose function is not yet well understood. There are two known isoforms of CCDC142. CCDC142 proteins produced from these transcripts range in size from 665 to 743 amino acids and contain signals suggesting protein movement between the cytosol and nucleus. Homologous CCDC142 genes are found in many animals, including vertebrates and invertebrates, but not in fungi, plants, protists, archaea, or bacteria. The protein contains a coiled-coil domain and a RINT1_TIP1 motif located within the coiled-coil domain.
Transmembrane Protein 217 is a protein encoded by the gene TMEM217. TMEM217 has been found to have expression correlated with the lymphatic system and endothelial tissues and has been predicted to have a function linked to the cytoskeleton.
Chromosome 8 open reading frame 58 is an uncharacterised protein that in humans is encoded by the C8orf58 gene. The protein is predicted to be localized in the nucleus.
TMEM44 is a protein that in humans is encoded by the TMEM44 gene. DKFZp686O18124 is a synonym of TMEM44.
Chromosome 9 open reading frame 50 is a protein that in humans is encoded by the C9orf50 gene. C9orf50 has one other known alias, FLJ35803. In humans the gene coding sequence is 10,051 base pairs long, transcribing an mRNA of 1,624 bases that encodes a 431 amino acid protein.
Chromosome 1 open reading frame 141, or C1orf141, is a protein which, in humans, is encoded by the C1orf141 gene. It is a precursor protein that becomes active after cleavage. The function is not yet well understood, but it is suggested to be active during development.
Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.
Family with Sequence Similarity 155 Member B is a protein in humans that is encoded by the FAM155B gene. It belongs to a family of proteins whose function is not yet well understood by the scientific community. It is a transmembrane protein that is highly expressed in the heart, thyroid, and brain.
TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two highly conserved helical transmembrane regions. It is predicted to reside within the plasma membrane or the membrane of the endoplasmic reticulum.
Transmembrane protein 39B (TMEM39B) is a protein that in humans is encoded by the gene TMEM39B. TMEM39B is a multi-pass membrane protein with eight transmembrane domains. The protein localizes to the plasma membrane and vesicles. The precise function of TMEM39B is not yet well understood, but differential expression is associated with survival of B cell lymphoma, and knockdown of TMEM39B is associated with decreased autophagy in cells infected with the Sindbis virus. Furthermore, the TMEM39B protein has been found to interact with the SARS-CoV-2 ORF9C protein. TMEM39B is expressed at moderate levels in most tissues, with higher expression in the testis, placenta, white blood cells, adrenal gland, thymus, and fetal brain.
C2orf74, also known as LOC339804, is a protein-encoding gene located on the short arm of chromosome 2 near position 15 (2p15). Isoform 1 of the gene is 19,713 base pairs long. C2orf74 has orthologs in 135 different species, primarily placental mammals along with some marsupials.
Transmembrane protein 101 (TMEM101) is a protein that in humans is encoded by the TMEM101 gene. The TMEM101 protein has been demonstrated to activate the NF-κB signaling pathway. High levels of expression of TMEM101 have been linked to breast cancer.
Transmembrane protein 212 is a protein that in humans is encoded by the TMEM212 gene. The protein consists of 5 transmembrane domains and localizes in the plasma membrane and endoplasmic reticulum. TMEM212 has orthologs in vertebrates but not invertebrates. TMEM212 has been associated with sporadic Parkinson's disease, facial processing, and adiposity in African Americans.
Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcripts, which produce five different protein isoforms.