TMEM275 (Transmembrane protein 275 [1] ) is a protein that in humans is encoded by the TMEM275 gene. [2] TMEM275 has two, highly-conserved, helical trans-membrane regions. [3] [4] It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane. [5]
In humans, the gene is located on chromosomal band 1p33 on the minus strand. [2] [6] It specifically resides on chromosome 1 at chr1:46,532,166-46,543,969. [6] TMEM275 is 11,804bp long. [6]
Upstream, on the minus strand, of TMEM275 is another gene; KNCN. [8]
The TMEM275 gene encodes four exons, only three of which are included within the final mRNA transcript. [9] Two of those three are within the 5’ UTR and the coding sequence codes for 177 amino acids. [2] [6] TMEM275 is found two have two potential isoforms outside of its reference transcript. The reference isoform does not include exon 1, while isoform X1 has a shortened 5' UTR and isoform X2 has a shortened 3' UTR. [10] [11] [12] However, all versions of mRNA transcripts yield the same 177 amino acid sequence.
The TMEM275 protein contains 177 amino acids with a predicted molecular mass of 17.2 kDa and a pI of 8.13. [13] TMEM275 was found to have a higher presence of alanine and proline amino acid residues than most proteins. [14] When looking at orthologous proteins sequences within other species, the alanine presence was conserved throughout, but the same could not be said of the proline presence.
TMEM275's protein consists of 177 amino acids. [7] The protein, or polypeptide chain, that is encoded by the coding sequence is made through the process of translation and is shown below among other regions of interest. There is also a poly-A, or polyadenylation signal, towards the end of the 3'UTR.
TMEM275 has two, highly-conserved, helical trans-membrane regions. [3] [4] The regions can be seem within the amino acid sequence above within the conceptual translation in purple. Evolution ary analysis showed that these trans-membrane domains are highly conserved across all ortholog taxa.
Many programs were used to analyze the predicted secondary structure for TMEM275. It was found to have a highly varied structure. However, prediction data supports the alpha helical structure of the two transmembrane domains. [16] [17] [18] [19] [20]
The predicted promoter region is 1116 bp long and located on chromosome 1 on the minus strand and extends from 46535401- 46536515. [21]
ElDorado through Genomatix was used to analyze the top 20 transcription factor binding sites within the promoter. [22]
TMEM275's RNA levels are very high at around 11 weeks of gestation within the intestines. [23] Some other notable peaks include the lungs, kidneys, and adrenal tissues at 10, 16, and 20 weeks, respectively. [24] It has also been found that of the 20 human tissues tested, RNA was notably present within fetal brain tissue. [25] Further testing on tissue types lead to the discovery that TMEM275 may have tissue-specificity with the testis, the brain, and the prostate. [26] Along with the thyroid and ovaries. [27]
TMEM275 is predicted to be within the plasma membrane or the endoplasmic reticulum's membrane. [5]
Various PTMs were analyzed for association with TMEM275. This includes looking for the presence of a SUMO-motif, acetylation sites, the presence of signal peptides, and any O-GlcNAc site and N-myristylation predictions. [28] [29] [30]
There are no known paralogs for TMEM275.
No homologs or homologous domains exist within TMEM275.
A total of 105 organisms are found to have orthologs with the human TMEM275 genes, all of which are a part of the Teleostomi, or jawed vertebrates, clade. [31] Of the 105 organisms that have an orthologous TMEM275: 49 are mammals, 27 are birds, 3 are turtles, 3 are lizards, 3 are amphibians, 19 are bony fishes, and 1 is an alligator. [31]
The group with the most similar sequences by percent identity was unsurprisingly Mammalia. The percent identities of those orthologous proteins in relation to humans ranged from 52.1% to 70.9%. The group that had the orthologous TMEM275, but was the least similar was Actinopterygii or Bony Fishes. Their percent similarities ranged from 38.5% to 47.6%. Percent similarities were found by conducting a pairwise analysis of each orthologous protein within each species against the human protein. [32]
TMEM275 seems to have appeared 435 MYA within the Bony fishes or Actinopterygii.
There were no known protein interactions for TMEM275.
TMEM275 has no known link to medical disease.
Interferon-inducible GTPase 5 also known as immunity-related GTPase cinema 1 (IRGC1) is an enzyme that in humans is coded by the IRGC gene. It is predicted to behave like other proteins in the p47-GTPase-like and IRG families. It is most expressed in the testis.
The coiled-coil domain containing 142 (CCDC142) is a gene which in humans encodes the CCDC142 protein. The CCDC142 gene is located on chromosome 2, spans 4339 base pairs and contains 9 exons. The gene codes for the coiled-coil domain containing protein 142 (CCDC142), whose function is not yet well understood. There are two known isoforms of CCDC142. CCDC142 proteins produced from these transcripts range in size from 743 to 665 amino acids and contain signals suggesting protein movement between the cytosol and nucleus. Homologous CCDC142 genes are found in many animals including vertebrates and invertebrates but not fungus, plants, protists, archea, or bacteria. Although the function of this protein is not well understood, it contains a coiled-coil domain and a RINT1_TIP1 motif located within the coiled-coil domain.
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.
TMEM44 is a protein that in humans is encoded by the TMEM44 gene. DKFZp686O18124 is a synonym of TMEM44.
Chromosome 9 open reading frame 50 is a protein that in humans is encoded by the C9orf50 gene. C9orf50 has one other known alias, FLJ35803. In humans the gene coding sequence is 10,051 base pairs long, transcribing an mRNA of 1,624 bases that encodes a 431 amino acid protein.
Single-pass membrane and coiled-coil domain-containing protein 3 is a protein that is encoded in humans by the SMCO3 gene.
C20orf202 is a protein that in humans is encoded by the C20orf202 gene. In humans, this gene encodes for a nuclear protein that is primarily expressed in the lung and placenta.
Uncharacterized protein C17orf78 is a protein encoded by the C17orf78 gene in humans. The name denotes the location of the parent gene, being at the 78th open reading frame, on the 17th human chromosome. The protein is highly expressed in the small intestine, especially the duodenum. The function of C17orf78 is not well defined.
Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.
SMIM15(small integral membrane protein 15) is a protein in humans that is encoded by the SMIM15 gene. It is a transmembrane protein that interacts with PBX4. Deletions where SMIM15 is located have produced mental defects and physical deformities. The gene has been found to have ubiquitous but variable expression in many tissues throughout the body.
C3orf56 is a protein encoding gene found on chromosome 3. Although, the structure and function of the protein is not well understood, it is known that the C3orf56 protein is exclusively expressed in metaphase II of oocytes and degrades as the oocyte develops towards the blastocyst stage. Degradation of the C3orf56 protein suggests that this gene plays a role in the progression from maternal to embryonic genome and in embryonic genome activation.
SMIM19, also known as Small Integral Membrane Protein 19, encodes the SMIM19 protein. SMIM19 is a confirmed single-pass transmembrane protein passing from outside to inside, 5' to 3' respectively. SMIM19 has ubiquitously high to medium expression with among varied tissues or organs. The validated function of SMIM19 remains under review because of on sub-cellular localization uncertainty. However, all linked proteins research to interact with SMIM19 are associated with the endoplasmic reticulum (ER), presuming SMIM19 ER association
Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.
The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.
Family with Sequence Similarity 166, member C (FAM166C), is a protein encoded by the FAM166C gene. The protein FAM166C is localized in the nucleus. It has a calculated molecular weight of 23.29 kDa. It also contains DUF2475, a protein of unknown function from amino acid 19–85. The FAM166C protein is nominally expressed in the testis, stomach, and thyroid.
C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.
Major facilitator superfamily domain containing 6 like (MFSD6L) is a protein encoded by the MFSD6L gene in humans. The MFSD6L protein is a transmembrane protein that is part of the major facilitator superfamily (MFS) that uses chemiosmotic gradients to facilitate the transport of small solutes across cell membranes.
Transmembrane protein 212 is a protein that in humans is encoded by the TMEM212 gene. The protein consists of five transmembrane domains and localizes in the plasma membrane and endoplasmic reticulum. TMEM212 has orthologs in vertebrates but not invertebrates. TMEM212 has been associated with sporadic Parkinson's disease, facial processing, and adiposity in African Americans.
Proline-Rich Protein 23A is a protein that is encoded by the Proline-Rich 23A (PRR23A) gene.
{{cite journal}}
: Cite journal requires |journal=
(help){{cite journal}}
: Cite journal requires |journal=
(help){{cite journal}}
: Cite journal requires |journal=
(help)