TEDDM1 | |
---|---|
Identifiers | |
Aliases | TEDDM1, EDDM9, Epdd1, HE9, TMEM45C, HEL-S-45e, transmembrane epididymal protein 1 |
External IDs | MGI: 3646829; HomoloGene: 131186; GeneCards: TEDDM1 |

Transmembrane epididymal protein 1 is a transmembrane protein encoded by the TEDDM1 gene. TEDDM1, also commonly known as TMEM45C, encodes a 273-amino-acid protein that contains six alpha-helical transmembrane regions. The protein contains a 118-amino-acid domain of unknown function. While the exact function of TEDDM1 is not understood, it is predicted to be an integral component of the plasma membrane. [5]
TEDDM1 is located on the minus strand of chromosome 1 at 1q25.3 and is composed of a single exon. The gene neighborhood of TEDDM1 includes glutamate-ammonia ligase (GLUL), long intergenic non-protein coding RNA 272 (LINC00272), and Sharpr-MPRA regulatory region 13543 (LOC122149321). [6] [7] [8]
TEDDM1 is expressed in most major tissues. Relative to the other tissues in which it is expressed, the gene is expressed at a high level in the testis and at a low level in the heart. [9] In the brain, TEDDM1 is expressed in the amygdala, the hippocampus, and the striatum. In the mouse brain, an RNA probe detected generalized expression of the gene. [10]
Within the cell, the TEDDM1 protein is primarily localized to the plasma membrane. Like other membrane-bound proteins, TEDDM1 contains a predicted signal peptide, although the type of signal peptide is unknown. [11] TEDDM1 shares many characteristics with other transmembrane proteins, which suggests that it may have a role in cell processes such as signal transduction and protein trafficking. [12]
The expression pattern of TEDDM1 is ubiquitous, and TEDDM1 mRNA abundance in humans is moderate to low. [13]
The TEDDM1 protein has a molecular weight of approximately 31.3 kDa and a theoretical isoelectric point (pI) of 8.02. This is higher than the pI of most proteins, which falls in the pH range of 4 to 7. [14] The elevated pI suggests that TEDDM1 contains an excess of basic amino acids relative to most proteins. [15]
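The link between basic residues and a high pI can be illustrated with a minimal sketch that finds the pH at which a peptide's net charge is zero. The article does not state which tool produced the value of 8.02, so the method here (Henderson-Hasselbalch charges plus bisection) and the pKa values are assumptions; published pKa scales differ slightly. The test peptides are hypothetical and are not the TEDDM1 sequence.

```python
# Approximate pKa values for ionizable groups (an assumed, commonly used set;
# other published scales give slightly different numbers).
POSITIVE = {"K": 10.8, "R": 12.5, "H": 6.5}
NEGATIVE = {"D": 3.9, "E": 4.1, "C": 8.5, "Y": 10.1}
N_TERM, C_TERM = 8.6, 3.6


def net_charge(sequence, ph):
    """Estimate the net charge of a peptide at a given pH."""
    # Free amino terminus (positive when protonated).
    charge = 1.0 / (1.0 + 10 ** (ph - N_TERM))
    # Free carboxyl terminus (negative when deprotonated).
    charge -= 1.0 / (1.0 + 10 ** (C_TERM - ph))
    for aa in sequence:
        if aa in POSITIVE:
            charge += 1.0 / (1.0 + 10 ** (ph - POSITIVE[aa]))
        elif aa in NEGATIVE:
            charge -= 1.0 / (1.0 + 10 ** (NEGATIVE[aa] - ph))
    return charge


def isoelectric_point(sequence, tol=1e-4):
    """Bisect on pH: net charge decreases monotonically as pH rises."""
    low, high = 0.0, 14.0
    while high - low > tol:
        mid = (low + high) / 2
        if net_charge(sequence, mid) > 0:
            low = mid
        else:
            high = mid
    return (low + high) / 2


# Hypothetical peptides: one rich in basic residues, one rich in acidic residues.
print(isoelectric_point("AKKRA"))
print(isoelectric_point("ADDEA"))
```

Under these assumptions, the peptide with extra lysine and arginine residues yields a pI above 7 and the peptide with extra aspartate and glutamate yields a pI below 7, which is the reasoning behind the statement about TEDDM1.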
TEDDM1 has a single major domain of unknown function, named DUF716, which spans amino acid positions 95 to 219 of the protein. DUF716 contains most of a leucine zipper, a motif typically associated with protein dimerization. DUF716 also contains three of the six transmembrane regions in the protein, as well as highly conserved amino acid segments, suggesting that DUF716 likely plays an important role in TEDDM1 protein function. [16]
TEDDM1 is predicted to undergo several post-translational modifications. There are eighteen predicted phosphorylation sites. Other predicted sites and sequence patterns include an N-glycosylation site, a casein kinase II phosphorylation site, a leucine zipper pattern, two N-myristoylation sites, and three protein kinase C phosphorylation sites. N-glycosylation is a typical feature of membrane proteins and is involved in secretion and cytoskeletal organization. [17] Casein kinase II is involved in protein stability and the cell signaling response. [18] Leucine zippers are sequence motifs that facilitate protein-protein interactions. [19] N-myristoylation sites are involved in ligand binding and protein dynamics. [20] Protein kinase C phosphorylation is involved in the regulation of cell proliferation and gene expression. [21]
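Site predictions of this kind are often made by scanning a sequence for short consensus patterns. The sketch below uses PROSITE-style patterns rewritten as regular expressions; the article does not say which tool produced its predictions, so this choice is an assumption, and such short patterns are known to produce many false positives. The peptide `toy` is hypothetical and is not the TEDDM1 sequence.

```python
import re

# PROSITE-style consensus patterns written as regular expressions
# (assumed here; the article does not name its prediction tool).
MOTIFS = {
    "N-glycosylation": r"N[^P][ST][^P]",
    "Casein kinase II phosphorylation": r"[ST].{2}[DE]",
    "Protein kinase C phosphorylation": r"[ST].[RK]",
    "Leucine zipper": r"L.{6}L.{6}L.{6}L",
}


def scan_motifs(sequence):
    """Return {motif name: [1-based start positions]}, including overlapping hits."""
    hits = {}
    for name, pattern in MOTIFS.items():
        # A zero-width lookahead reports matches that overlap one another.
        regex = re.compile(f"(?=({pattern}))")
        hits[name] = [m.start() + 1 for m in regex.finditer(sequence)]
    return hits


# Hypothetical toy peptide, NOT the TEDDM1 sequence.
toy = "MNGSATSKELAAAAAALGGGGGGLVVVVVVLK"
for motif, positions in scan_motifs(toy).items():
    print(motif, positions)
```

A pattern match only marks a candidate site; whether the residue is actually modified depends on structural context and cellular location, which is why the article describes these sites as predicted.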
The secondary structure of TEDDM1 begins with free random coils at the N-terminus and a single alpha helix. The six transmembrane regions in TEDDM1 are characterized as structurally similar alpha helices. This region of the protein is quite condensed relative to the N-terminus. [22] Transmembrane alpha helices typically consist mainly of hydrophobic residues, which is consistent with these regions being located within the plasma membrane. Alpha helices are also commonly associated with DNA binding motifs. [23]
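The connection between hydrophobic residues and membrane-spanning helices can be sketched with a sliding-window hydropathy profile. This example uses the Kyte-Doolittle scale with a 19-residue window and a cutoff of 1.6, which are conventional choices for that method but are assumptions here, since the article does not name the tool behind its structure prediction. The input peptide is hypothetical.

```python
# Kyte-Doolittle hydropathy values; positive means more hydrophobic.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence, window=19):
    """Mean hydropathy of every full window along the sequence."""
    scores = [KD[aa] for aa in sequence]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


def candidate_tm_windows(sequence, window=19, threshold=1.6):
    """1-based start positions of windows hydrophobic enough to span a membrane."""
    profile = hydropathy_profile(sequence, window)
    return [i + 1 for i, mean in enumerate(profile) if mean >= threshold]


# Hypothetical peptide: a leucine-only core flanked by charged residues.
toy = "KKDD" + "L" * 19 + "DDKK"
print(candidate_tm_windows(toy))
```

Real predictors merge neighboring windows into segments and apply further rules, so this sketch only shows why a run of hydrophobic residues stands out as a candidate transmembrane region.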
TEDDM1 has two known paralogs, TMEM45A and TMEM45B. [24] [25]
Over 100 orthologs exist for the human gene TEDDM1. The known orthologs were found only in mammals, amphibians, and fish. The divergence dates of all orthologs were determined relative to Homo sapiens. Fish orthologs are the most distantly related to human TEDDM1, with the furthest median date of divergence being 464 million years ago. While orthologs of TEDDM1 exist in bony fish, as evidenced by the ortholog in Protopterus annectens, orthologs of the gene were found far more frequently in cartilaginous fish. [26]
Seq # | Group | Genus, Species | Common Name | Taxonomic Order | Divergence Date (MYA) | Accession Number | Query Cover (%) | Sequence Length (aa) | Sequence Identity (%) | Sequence Similarity (%) |
1 | Mammals | Homo sapiens | human | Primates | 0 | NP_741997.3 | 100 | 273 | 100 | 100 |
2 | Mammals | Nannospalax galili | Middle East blind mole-rat | Rodentia | 89 | XP_008836723.1 | 100 | 305 | 71 | 74 |
3 | Mammals | Gracilinanus agilis | agile gracile opossum | Didelphimorphia | 160 | XP_044531579.1 | 80 | 295 | 35 | 49 |
4 | Mammals | Tachyglossus aculeatus | short-beaked echidna | Tachyglossidae | 180 | XP_038601041.1 | 82 | 295 | 37 | 50 |
5 | Amphibians | Xenopus tropicalis | western clawed frog | Pipidae | 353 | XP_017949911.1 | 82 | 284 | 32 | 43 |
6 | Amphibians | Geotrypetes seraphini | Gaboon caecilian | Dermophiidae | 353 | XP_033814670.1 | 70 | 308 | 36 | 28 |
7 | Amphibians | Rhinatrema bivittatum | two-lined caecilian | Gymnophiona | 353 | XP_029473035.1 | 77 | 305 | 36 | 47 |
8 | Amphibians | Nanorana parkeri | High Himalaya frog | Dicroglossidae | 353 | XP_018432032.1 | 80 | 264 | 30 | 44 |
9 | Fish | Protopterus annectens | West African lungfish | Protopteridae | 408 | XP_043932144.1 | 86 | 308 | 34 | 44 |
10 | Fish | Amblyraja radiata | thorny skate | Rajidae | 464 | XP_032887195.1 | 58 | 314 | 47 | 47 |
11 | Fish | Callorhinchus milii | elephant shark | Holocephali | 464 | XP_007900619.1 | 75 | 300 | 36 | 46 |
12 | Fish | Carcharodon carcharias | great white shark | Chondrichthyes | 464 | XP_041060943.1 | 81 | 306 | 39 | 48 |
The relative rate of molecular evolution of TEDDM1 is quite similar to that of fibrinogen alpha and much more rapid than that of cytochrome c.
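One common way to make such a rate comparison is to convert percent identity into a corrected number of substitutions per site and compare it with divergence time. The sketch below applies the Poisson correction, d = -ln(q), where q is the fraction of identical sites, to a subset of rows from the ortholog table. The article does not state which correction was used, so this method is an assumption; the species, dates, and identities are taken from the table.

```python
import math

# (species, median divergence from human in MYA, percent identity to human TEDDM1)
# Values copied from the ortholog table.
ORTHOLOGS = [
    ("Nannospalax galili", 89, 71),
    ("Gracilinanus agilis", 160, 35),
    ("Tachyglossus aculeatus", 180, 37),
    ("Xenopus tropicalis", 353, 32),
    ("Protopterus annectens", 408, 34),
    ("Callorhinchus milii", 464, 36),
]


def corrected_divergence(percent_identity):
    """Poisson-corrected substitutions per site: d = -ln(fraction identical)."""
    return -math.log(percent_identity / 100.0)


for species, mya, identity in ORTHOLOGS:
    d = corrected_divergence(identity)
    print(f"{species:24s} {mya:4d} MYA  d = {d:.2f}")
```

Plotting d against divergence date for TEDDM1 alongside the same quantity for a slowly evolving protein and a rapidly evolving one would give the kind of comparison described above; a steeper slope indicates faster evolution.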
Predicted functional partners of the TEDDM1 protein, identified via STRING. [27]
Protein | Description | Function |
SBK2 | Serine/threonine-protein kinase 2 | Predicted to be involved in MAPK cascade and protein phosphorylation |
METTL7B | Methyltransferase-like protein 7B | Probable methyltransferase |
PATE2 | Prostate and testis expressed 2 | Contains an LY6/PLAUR domain; located in the extracellular space |
ODF3L1 | Outer dense fiber protein 3-like protein 1 | Active in cytoskeleton |
DEFB30 | Beta-defensin 130a; Defensin, beta 130 | Antimicrobial host-defense peptide, antiplasmodial activity |
SPINT3 | Serine peptidase inhibitor, Kunitz type 3 | Enables receptor antagonist activity and beta binding activity |
TSPAN1 | Tetraspanin-1 | Mediates signal transduction events; has a role in the regulation of cell development and growth |
WFDC10A | Wap four-disulfide core domain protein 10a | Functions as a protease inhibitor |
EDDM3B | Epididymal secretory protein E3-beta | Possible function in sperm maturation |
In cancer research, gene-based association analyses found that TEDDM1 and GLUL are the two genes most significantly correlated with hepatitis B virus-related hepatocellular carcinoma, the most common form of liver cancer. [28] TEDDM1 may also be clinically significant because of its involvement with keratinocytes and the inhibition of microRNA-31, an important regulator of embryonic implantation and development. [29]