SMCO4 | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | SMCO4 , C11orf75, FN5, single-pass membrane protein with coiled-coil domains 4 | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | OMIM: 609477 HomoloGene: 137340 GeneCards: SMCO4 | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
Single-pass membrane and coiled-coil domain-containing protein 4 is a single-pass membrane protein with a colloid-coil domain containing protein 4. In humans it is encoded by the SMCO4 gene. A single-pass transmembrane protein, means that one end of the protein will remain in the cytoplasm, while the other end is exposed to the cell exterior.
The SMCO4 gene is located on chromosome 11, with specific chromosomal coordinates of 11q13.3-q23.3 in Homo sapiens . [3] SMCO4 has two common aliases: C11orf75 (Chromosome 11 Open Reading Frame 75) and FN5. [4]
SMCO4 has 7 isoforms in Homo sapiens. [5] The SMCO4 isoform 1 has 9 exons, while the rest of the isoforms have varying amounts of exons. These isoforms are listed in the table below, with their accession numbers for reference.
Isoform | SMCO4 isoform x1 | SMCO4 isoform x2 | SMCO4 isoform x3 | SMCO4 isoform x4 | SMCO4 isoform x5 | SMCO4 isoform x6 | SMCO4 isoform x6 |
---|---|---|---|---|---|---|---|
NCBI accession # | XM_011542907.1 | XM_011542908.2 | XM_011542909.1 | XM_011542910.2 | XM_017018019.1 | XM_017018020.1 | XM_011542911.2 |
Sequence Length (bp) | Protein Sequence Length (aa) [3] | Molecular Weight (kD) [6] | Isoelectric Point [6] |
---|---|---|---|
979 | 59 | 6.7 | 10.75 |
MRQLKGKPKKETSKDKKERKQAMQEARQQITTVVLPTLAVVVLLIVVFVYVATRPTITE
The protein that is encoded by SMCO4 is rich in phosphorylable amino acids and has a high pH. There are no predicted protein repeats. There is evidence that the protein is present in the outer membrane of the mitochondria. [7] The absence of an N-terminal signal peptide shows that the protein does not have the ability to leave the organelle it is in, which correlates with the knowledge that it is a transmembrane protein.
The structure of the FN5 protein is not yet known, but the protein qualities are useful in predicting the protein's structure. There is no evidence that the protein would take on any Beta-turns; its secondary structure would be composed of only alpha-helixes. There are two separate helixes: the transmembrane helix and the alpha helix.
SMCO4 expression in humans was predicted to be highest in the salivary glands. [8] [9] Overall, there was RNA tissue expression shown in 37 human tissues and expression of the protein in 71 cell types. [9]
SMCO4 is not ubiquitously expressed throughout humans. There is evidence that the expression of SMCO4 is easily manipulated by many different factors, making constant expression unlikely. One example of this expression regulation comes from an experiment that showed increased SMCO4 expression in the presence of Gamma-tocotrienol. [10] [11]
SMCO4 is predicted to interact with the following proteins: [12]
BNIP3L | BCL2/adenovirus E1B 19kDa protein-interacting protein 3-like |
---|---|
SYNE4 | Spectrin Repeat Containing Nuclear Envelope Family Member 4 |
CCDC155 | Coiled-Coil Domain Containing 155 |
Two microRNAs, miR-124-3p.1 and miR-183-5p.1, are thought to be highly conserved throughout variety of vertebrates. TheseicroR io RNAs are both neuronal in nature. [13]
Orthologs of SMCO4 were found in a variety of species, including mammals, birds, reptiles, amphibians, insects and parasites. The primates were consistently found to have the highest sequence identity with the human SMCO4 sequence, followed closely by an extinct fish species called Latimeria chalumnae. There was no evidence of SMCO4 presence in yeast species.
Proline-rich 12 (PRR12) is a protein of unknown function encoded by the gene PRR12.
Coiled-coil domain containing 94 (CCDC94), is a protein that in humans is encoded by the CCDC94 gene. The CCDC94 protein contains a coiled-coil domain, a domain of unknown function (DUF572), an uncharacterized conserved protein (COG5134), and lacks a transmembrane domain.
Coiled-coil domain-containing protein 138, also known as CCDC138, is a human protein encoded by the CCDC138 gene. The exact function of CCDC138 is unknown.
TMEM143 is a protein that in humans is encoded by TMEM143 gene. TMEM143, a dual-pass protein, is predicted to reside in the mitochondria and high expression has been found in both human skeletal muscle and the heart. Interaction with other proteins indicate that TMEM143 could potentially play a role in tumor suppression/expression and cancer regulation.
C12orf40, also known as Chromosome 12 Open Reading Frame 40, HEL-206, and Epididymis Luminal Protein 206 is a protein that in humans is encoded by the C12orf40 gene.
TMEM156 is a gene that encodes the transmembrane protein 156 (TMEM156) in Homo sapiens. It has the clone name of FLJ23235.
FAM221B is a protein that in humans is encoded by the FAM221B gene . FAM221B is also known by the alias C9orf128, is expressed at low level, and is defined by 17 GenBank accessions . It is predicted to function in transcription regulation as a transcription factor.
Transmembrane and coiled-coil domains 4, TMCO4, is a protein in humans that is encoded by the TMCO4 gene. Currently, its function is not well defined. It is transmembrane protein that is predicted to cross the endoplasmic reticulum membrane three times. TMCO4 interacts with other proteins known to play a role in cancer development, hinting at a possible role in the disease of cancer.
TMEM44 is a protein that in humans is encoded by the TMEM44 gene. DKFZp686O18124 is a synonym of TMEM44.
Single-pass membrane and coiled-coil domain-containing protein 3 is a protein that is encoded in humans by the SMCO3 gene.
Transmembrane protein 155 is a protein that in humans is encoded by the TMEM155 gene. It is located on human chromosome 4, spanning 6,497 bases. It is also referred to as FLJ30834 and LOC132332. This protein is known to be expressed mainly in the brain, placenta, and lymph nodes and is conserved throughout most placental mammals. The function and structure of this protein is still not well understood, but its level of expression has been studied pertaining to various pathologies.
TMEM128, also known as Transmembrane Protein 128, is a protein that in humans is encoded by the TMEM128 gene. TMEM128 has three variants, varying in 5' UTR's and start codon location. TMEM128 contains four transmembrane domains and is localized in the Endoplasmic Reticulum membrane. TMEM128 contains a variety of regulation at the gene, transcript, and protein level. While the function of TMEM128 is poorly understood, it interacts with several proteins associated with the cell cycle, signal transduction, and memory.
TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.
SMIM19, also known as Small Integral Membrane Protein 19, encodes the SMIM19 protein. SMIM19 is a confirmed single-pass transmembrane protein passing from outside to inside, 5' to 3' respectively. SMIM19 has ubiquitously high to medium expression with among varied tissues or organs. The validated function of SMIM19 remains under review because of on sub-cellular localization uncertainty. However, all linked proteins research to interact with SMIM19 are associated with the endoplasmic reticulum (ER), presuming SMIM19 ER association
Coiled-Coil Domain Containing 190, also known as C1orf110, the Chromosome 1 Open Reading Frame 110, MGC48998 and CCDC190, is found to be a protein coding gene widely expressed in vertebrates. RNA-seq gene expression profile shows that this gene selectively expressed in different organs of human body like lung brain and heart. The expression product of c1orf110 is often called Coiled-coil domain-containing protein 190 with a size of 302 aa. It may get the name because a coiled-coil domain is found from position 14 to 72. At least 6 spliced variants of its mRNA and 3 isoforms of this protein can be identified, which is caused by alternative splicing in human.
Chromosome 5 open reading frame 22 (c5orf22) is a protein-coding gene of poorly characterized function in Homo sapiens. The primary alias is unknown protein family 0489 (UPF0489).
C1orf159 is a protein that in human is encoded by the C1orf159 gene located on chromosome 1. This gene is also found to be an unfavorable prognosis marker for renal and liver cancer, and a favorable prognosis marker for urothelial cancer.
Transmembrane Protein 144 (TMEM144) is a protein in humans encoded by the TMEM144 gene.
Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.
Transmembrane protein 248, also known as C7orf42, is a gene that in humans encodes the TMEM248 protein. This gene contains multiple transmembrane domains and is composed of seven exons.TMEM248 is predicted to be a component of the plasma membrane and be involved in vesicular trafficking. It has low tissue specificity, meaning it is ubiquitously expressed in tissues throughout the human body. Orthology analyses determined that TMEM248 is highly conserved, having homology with vertebrates and invertebrates. TMEM248 may play a role in cancer development. It was shown to be more highly expressed in cases of colon, breast, lung, ovarian, brain, and renal cancers.