TMEM156 is a gene that encodes the transmembrane protein 156 (TMEM156) in Homo sapiens. It has the clone name of FLJ23235. [1]
TMEM156 is located on the short arm of chromosome 4, at position 4p14. It has four known transcripts, only two of which encode proteins. [2]
Cytogenetic band: 4p14 [2]
The image above shows chromosome four and the various gene locations on it. TMEM156 can be seen at the thin red band that has been placed at p14.
The TMEM156 gene is 66,499 bases long, and the protein it encodes is 296 amino acids in length. [4] The gene spans from 38,966,744 to 39,032,922 bp. [5]
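As a quick arithmetic check, the length of an interval given by inclusive start and end coordinates is end − start + 1. The sketch below applies this to the coordinates quoted above. The function name `span_length` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption. The stated gene length and the coordinates come from different cited sources, which may use different assemblies or annotation boundaries, so the two figures need not agree.

```python
def span_length(start: int, end: int) -> int:
    """Length of an interval whose start and end are both inclusive."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates quoted in the text (assumed 1-based and inclusive).
START, END = 38_966_744, 39_032_922
STATED_LENGTH = 66_499  # gene length quoted in the text

computed = span_length(START, END)
print(computed)                  # 66179
print(STATED_LENGTH - computed)  # 320
```

The 320-base difference is small relative to the gene and would be expected if the two sources annotate the gene boundaries slightly differently.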
TMEM156 has one known splice variant, TMEM156-003. It has one transmembrane region and is 179 amino acids in length, resulting in a lower molecular mass of 20.9 kDa. [6] In addition, the isoelectric point is 7.0633, and the gene has four transcripts. [7]
Type | Length | Gene Coordinates |
---|---|---|
Alternative exon 1 | 64 bp | 1 to 64 |
Alternative exon 2 | 912 bp | 4 to 915 |
Exon 3 | 270 bp | 11439 to 11708 |
Exon 4 | 261 bp | 15807 to 16067 |
Exon 5 | 120 bp | 17760 to 17879 |
Exon 6 | 84 bp | 19305 to 19388 |
Exon 7 | 80 bp | 33260 to 33339 |
Exon 8 | 95 bp | 33907 to 34001 |
Exon 9 | 37 bp | 37247 to 37283 |
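The lengths in the table can be cross-checked against the coordinates, since an exon with inclusive coordinates has length end − start + 1. The following is a minimal sketch of that check; the `Exon` type and the `coordinate_length` helper are hypothetical names, and the inclusive-coordinate convention is an assumption.

```python
from typing import NamedTuple


class Exon(NamedTuple):
    name: str
    length: int  # length in bp as listed in the table
    start: int   # gene coordinate of the first base
    end: int     # gene coordinate of the last base


# Rows copied from the exon table.
EXONS = [
    Exon("Alternative exon 1", 64, 1, 64),
    Exon("Alternative exon 2", 912, 4, 915),
    Exon("Exon 3", 270, 11439, 11708),
    Exon("Exon 4", 261, 15807, 16067),
    Exon("Exon 5", 120, 17760, 17879),
    Exon("Exon 6", 84, 19305, 19388),
    Exon("Exon 7", 80, 33260, 33339),
    Exon("Exon 8", 95, 33907, 34001),
    Exon("Exon 9", 37, 37247, 37283),
]


def coordinate_length(exon: Exon) -> int:
    """Length implied by inclusive start and end coordinates."""
    return exon.end - exon.start + 1


# Names of exons whose listed length disagrees with their coordinates.
mismatches = [e.name for e in EXONS if coordinate_length(e) != e.length]
print(mismatches)  # []
```

Every listed length matches its coordinates. Note that the two alternative first exons overlap (1 to 64 and 4 to 915), so their lengths should not be summed together when estimating transcript length.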
There is one known isoform of TMEM156. This particular isoform uses an alternate in-frame splice site in the 3' coding region. The resulting isoform is shorter than the original. [9]
The TMEM156 protein is 296 amino acids in length. It has a molecular weight of 34.323 kDa and an isoelectric point of 7.98. [10] The protein spans the membrane three times, as seen in the figure below.
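The relationship between chain length and molecular mass can be illustrated with a rough estimate. The sketch below assumes an average residue mass of about 110 Da, a common rule of thumb that does not come from the cited sources; `estimate_mass_kda` is a hypothetical helper. The reported values are sequence-based and are expected to differ somewhat from this approximation.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0  # rule-of-thumb average per residue (assumption)


def estimate_mass_kda(n_residues: int) -> float:
    """Approximate protein mass in kDa from the number of residues."""
    return n_residues * AVERAGE_RESIDUE_MASS_DA / 1000.0


# (label, length in amino acids, mass in kDa as reported in the text)
proteins = [
    ("full-length TMEM156", 296, 34.323),
    ("TMEM156-003 variant", 179, 20.9),
]

for label, length, reported in proteins:
    estimate = estimate_mass_kda(length)
    print(f"{label}: ~{estimate:.1f} kDa estimated, {reported} kDa reported")
```

Both estimates fall slightly below the reported masses, but they reproduce the main point: the 179-residue variant is substantially lighter than the 296-residue protein.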
The secondary structure of TMEM156 is predicted to consist primarily of α-helices. Phyre was used to create this structure, which can be viewed on the right-hand side of this article. The N terminus is located at the top of the image and the C terminus is at the bottom. [12]
This structure was predicted by analyzing the amino acid sequence using I-TASSER. The final result can be seen below. [13]
Glycosylation occurs at Asn 45 and Asn 156, and N-glycosylation is also seen in the portion of the protein that is found in the cytoplasm. [14]
The k-NN tool predicts that TMEM156 is located in the endoplasmic reticulum with 44.4% certainty. The following locations were each predicted with 11.1% certainty: vacuolar, vesicles of secretory system, extracellular, plasma membrane, and mitochondrial. [15]
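The reported percentages are what a vote among nine nearest neighbors would produce (4/9 ≈ 44.4% and 1/9 ≈ 11.1%). This is an inference from the numbers, not something the source states. In the sketch below, k = 9, the neighbor labels, and the helper `vote_percentages` are all assumptions chosen to illustrate how a k-NN classifier turns neighbor votes into percentages.

```python
from collections import Counter

# Hypothetical labels of the nine nearest neighbors (k = 9 is assumed),
# chosen so that the vote fractions match the percentages in the text.
neighbor_labels = ["endoplasmic reticulum"] * 4 + [
    "vacuolar",
    "vesicles of secretory system",
    "extracellular",
    "plasma membrane",
    "mitochondrial",
]


def vote_percentages(labels: list[str]) -> dict[str, float]:
    """Share of neighbors voting for each location, in percent."""
    counts = Counter(labels)
    total = len(labels)
    return {loc: round(100 * n / total, 1) for loc, n in counts.items()}


percentages = vote_percentages(neighbor_labels)
for location, pct in percentages.items():
    print(f"{location}: {pct}%")
```

Under this reading, the "certainty" is the fraction of neighbors that share a label rather than a calibrated probability.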
TMEM156 is expressed in several tissues, including ascites, bone marrow, salivary glands, and vascular tissue. The gene is not ubiquitously expressed, but it is detected in many tissues. It is predominantly expressed in adults, with some expression in fetuses. [16]
Transmembrane protein 156 has one known paralog. It also has various orthologs within eukaryotes. The table below compares a representative sample of the known orthologs and the one paralog. [18] The specific lineage of TMEM156 is: Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. [1]
Organism | Common name | Accession number | Sequence identity | Sequence similarity | Notes |
---|---|---|---|---|---|
Homo sapiens | human | XP_011512056 | 100% | 100% | Mammal |
Phycicoccus sp. Root101 | | WP_056919608 | 36% | 56% | Bacteria |
Medicago truncatula | barrel medic | XP_003600988 | 52% | 73% | Plantae |
Acanthisitta chloris | rifleman | KFP78611 | 28% | 49% | Aves |
Struthio camelus australis | | KFV80462 | 30% | 48% | Aves |
Chaetura pelagica | chimney swift | KFU95974 | 31% | 50% | Aves |
Tauraco erythrolophus | red-crested turaco | XP_009989935 | 32% | 48% | Aves |
Chlamydotis macqueenii | MacQueen's bustard | XP_010113687 | 33% | 50% | Aves |
Chelonia mydas | green sea turtle | XP_007059764 | 35% | 54% | Reptile |
Alligator mississippiensis | American alligator | XP_006273714 | 36% | 56% | Reptile |
Dasypus novemcinctus | nine-banded armadillo | XP_004459440 | 73% | 81% | Mammal |
Felis catus | domestic cat | XP_006931228 | 68% | 77% | Mammal |
Panthera tigris altaica | Amur tiger | XP_007098927 | 69% | 78% | Mammal |
Bubalus bubalis | water buffalo | XP_006066188 | 72% | 83% | Mammal |
Capra hircus | goat | XP_005681569 | 72% | 83% | Mammal |
Leptonychotes weddellii | Weddell seal | XP_006732368 | 72% | 82% | Mammal |
Ovis aries | sheep | XP_012035193 | 73% | 84% | Mammal |
Bison bison bison | bison | XP_010831883.1 | 74% | 84% | Mammal |
Camelus bactrianus | Bactrian camel | XP_010960972 | 74% | 85% | Mammal |
Vicugna pacos | alpaca | XP_015099523 | 75% | 86% | Mammal |
Pteropus vampyrus | large flying fox | XP_011364299 | 79% | 89% | Mammal |
Homo sapiens | human | NP_061028.3 | 47% | 58% | Mammal |
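One way to summarize the table is to average sequence identity within each group in the Notes column. The sketch below does this for the bird, reptile, and non-human mammal rows; the two Homo sapiens rows and the single-row groups are left out, and the variable and function names are hypothetical.

```python
from collections import defaultdict
from statistics import mean

# (organism, sequence identity in percent, group), copied from the table.
# Human rows and groups with a single row are omitted.
ROWS = [
    ("Acanthisitta chloris", 28, "Aves"),
    ("Struthio camelus australis", 30, "Aves"),
    ("Chaetura pelagica", 31, "Aves"),
    ("Tauraco erythrolophus", 32, "Aves"),
    ("Chlamydotis macqueenii", 33, "Aves"),
    ("Chelonia mydas", 35, "Reptile"),
    ("Alligator mississippiensis", 36, "Reptile"),
    ("Dasypus novemcinctus", 73, "Mammal"),
    ("Felis catus", 68, "Mammal"),
    ("Panthera tigris altaica", 69, "Mammal"),
    ("Bubalus bubalis", 72, "Mammal"),
    ("Capra hircus", 72, "Mammal"),
    ("Leptonychotes weddellii", 72, "Mammal"),
    ("Ovis aries", 73, "Mammal"),
    ("Bison bison bison", 74, "Mammal"),
    ("Camelus bactrianus", 74, "Mammal"),
    ("Vicugna pacos", 75, "Mammal"),
    ("Pteropus vampyrus", 79, "Mammal"),
]


def mean_identity_by_group(rows):
    """Average sequence identity (percent) for each group."""
    grouped = defaultdict(list)
    for _organism, identity, group in rows:
        grouped[group].append(identity)
    return {group: mean(values) for group, values in grouped.items()}


means = mean_identity_by_group(ROWS)
for group, value in means.items():
    print(f"{group}: {value:.1f}% mean identity")
```

In this sample the mammalian sequences are much closer to the human protein than the bird and reptile sequences are.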