Patched Domain-Containing Protein 4 (PTCHD4) is a protein, which in humans is encoded by the Patched Domain-Containing Protein 4 gene. It is otherwise known by the aliases: [1] PTCHD4, C6orf138, dj402H5.2, and by the accession number BC137364. [2]
Sequenced distant orthologs of human PTCHD4 have been found as far back in evolution as mold, which shows a conservation of 16% identity. [3] Strict orthologs include boneless fish, but not echinoderms, insects, or plants.
(ORTHOLOG TABLE IMAGE) Figure 1: Table of example orthologs
(SCATTER PLOT IMAGE) Figure 2: Scatter plot of conservation with human PTCHD4 versus date of divergence in millions of years ago
PTCHD4 has three other genes in its family: PTCHD1, PTCHD2, and PTCHD3, as well as the Niemann-Pick disease type C protein. [4] These paralogs are far less conserved than strict orthologs, which may suggest that they split long before speciation. [4]
Human PTCHD4 is located on the negative strand of chromosome 6, at 6p12.3. [1] From there, it covers 190,350 base pairs, which includes three exons and two introns.
(LOCATION IMAGE) Figure 3, which shows the location of gene PTCHD4 on chromosome 6, is courtesy of GeneCards.org.
Nominal expression of PTCHD4 was found in the brain, connective tissue, embryonic tissue, lungs, placenta, testis, trachea, and uterus, with the greatest expression in the trachea. [5] Nominal expression was also found in the following disease states: chondrosarcoma, germ cell tumors, non-neoplasia, and uterine tumors. Protein localization was found in all tissues examined except the salivary glands, yet RNA expression was scarcely found anywhere. [6] This may suggest that PTCHD4 protein is particularly resilient to degradation, and that it is only produced under key circumstances or at key life stages.
Homo sapiens PTCHD4 variant 1 is 846 amino acids in length, [1] weighs 96.4 kilodaltons, [3] and has an isoelectric point of 8.8. It has two isomers, which are denoted as variant 2 and variant X1. It is phenylalanine rich, [3] and found in the cell membrane. [7]
Phyre2 extrapolated the following 3D secondary structure with 100% certainty after analyzing 745 residues, (an 88% majority) of PTCHD4’s human protein sequence. [8] Results were remarkably similar in such distant orthologs as earthworms, boneless fish, and green algae. There are an estimate 10 transmembrane helices. [7]
(STRUCTURE)
PTCHD4 has been implicated as an integral component of cellular membrane, [9] and as a protein receptor in the hedgehog pathway. The later of these two functions causes an inhibitory effect during the development of embryonic cells in all bilaterians and vertebrates. [10] In mammals, the hedgehog pathway is vital to the proper development of the brain, skeleton, musculature, gastrointestinal tract, and lungs. It also appears to be important to adult animals, as it has been implicated in the regulation of adult stem cells, while its malfunction is associated basal cell carcinoma. [11]
Protein patched homolog 1 is a protein that is the member of the patched family and in humans is encoded by the PTCH1 gene.
GATA3 is a transcription factor that in humans is encoded by the GATA3 gene. Studies in animal models and humans indicate that it controls the expression of a wide range of biologically and clinically important genes.
Transmembrane protein 8B is a protein that in humans is encoded by the TMEM8B gene. It encodes for a transmembrane protein that is 338 amino acids long, and is located on human chromosome 9. Aliases associated with this gene include C9orf127, NAG-5, and NGX61.
Iroquois-class homeodomain protein IRX-1, also known as Iroquois homeobox protein 1, is a protein that in humans is encoded by the IRX1 gene. All members of the Iroquois (IRO) family of proteins share two highly conserved features, encoding both a homeodomain and a characteristic IRO sequence motif. Members of this family are known to play numerous roles in early embryo patterning. IRX1 has also been shown to act as a tumor suppressor gene in several forms of cancer.
KIAA0895 is a protein that in Homo sapiens is encoded by the KIAA0895 gene. The gene encodes a protein commonly known as the KIAA0895 protein. It's aliases include hypothetical protein LOC23366, OTTHUMP00000206979, OTTHUMP00000206980, 9530077C05Rik, and 1110003N12Rik. It is located at 7p14.2.
CZIB is a gene in the human genome that encodes the protein CXXC motif containing zinc binding protein. CZIB was previously referred to as C1orf123.
TMEM143 is a protein that in humans is encoded by TMEM143 gene. TMEM143, a dual-pass protein, is predicted to reside in the mitochondria and high expression has been found in both human skeletal muscle and the heart. Interaction with other proteins indicate that TMEM143 could potentially play a role in tumor suppression/expression and cancer regulation.
Transmembrane protein 251, also known as C14orf109 or UPF0694, is a protein that in humans is encoded by the TMEM251 gene. One notable feature of this protein is the presence of proline residues on one of its predicted transmembrane domains., which is a determinant of the intramitochondrial sorting of inner membrane proteins.
Niban-like protein 2.(NLP2) is a protein that in humans is encoded by the FAM129C gene. Paralogs of this gene include FAM129A, and FAM129B. Its aliases include B-Cell Novel Protein 1 (BCNP1), and Family with Sequence Similarity 129 Member C (FAM129C).
Chromosome 11 open reading frame 86, also known as C11orf86, is a protein-coding gene in humans. It encodes for a protein known as uncharacterized protein C11orf86, which is predicted to be a nuclear protein. The function of this protein is currently unknown.
OCC-1 is a protein, which in humans is encoded by the gene C12orf75. The gene is approximately 40,882 bp long and encodes 63 amino acids. OCC-1 is ubiquitously expressed throughout the human body. OCC-1 has shown to be overexpressed in various colon carcinomas. Novel splice variant of this gene was also detected in various human cancer types; in addition to encoding a novel smaller protein, OCC-1 gene produces a non-protein coding RNA splice variant lncRNA.
Uncharacterized protein Chromosome 16 Open Reading Frame 71 is a protein in humans, encoded by the C16orf71 gene. The gene is expressed in epithelial tissue of the respiratory system, adipose tissue, and the testes. Predicted associated biological processes of the gene include regulation of the cell cycle, cell proliferation, apoptosis, and cell differentiation in those tissue types. 1357 bp of the gene are antisense to spliced genes ZNF500 and ANKS3, indicating the possibility of regulated alternate expression.
C21orf62 is a protein that, in humans, is encoded by the C21orf62 gene. C21orf62 is found on human chromosome 21, and it is thought to be expressed in tissues of the brain and reproductive organs. Additionally, C21orf62 is highly expressed in ovarian surface epithelial cells during normal regulation, but is not expressed in cancerous ovarian surface epithelial cells.
Retroelement silencing factor 1 is a protein that in humans is encoded by the RESF1 gene. RESF1 is broadly expressed in the lymph nodes, ovaries, appendix and spleen. RESF1 shows characteristics of being a minor histocompatibility antigen, as well as tumor suppressor capabilities. The high expression in the lymph nodes and spleen indicate function in the immune system.
Uncharacterized protein Chromosome 1 Open Reading Frame 27 is a protein in humans, encoded by the C1orf27 gene. It is accession number NM_017847. This is a membrane protein that is 3926 base pairs long with the most extensive string of amino acids being 454aa long. C1orf27 exhibits cytoplasmic expression in epidermal tissues. Predicted associated biological processes of the gene include cell fate specification and developmental properties.
C2orf74, also known as LOC339804, is a protein encoding gene located on the short arm of chromosome 2 near position 15 (2p15). Isoform 1 of the gene is 19,713 base pairs long. C2orf74 has orthologs in 135 different species, including primarily placental mammals and some marsupials.
Transmembrane protein 101 (TMEM101) is a protein that in humans is encoded by the TMEM101 gene. The TMEM101 protein has been demonstrated to activate the NF-κB signaling pathway. High levels of expression of TMEM101 have been linked to breast cancer.
C12orf29 is a protein that in humans is encoded by chromosome 12 open reading frame 29. The gene is ubiquitously expressed in various tissues. The protein has 325 amino acids. The biological process of C12orf29 has been annotated as hematopoietic progenitor cell differentiation. The molecular and cellular functions of C12orf29 gene have not yet well understood by the scientific community.
C22orf15 is a protein which, in humans, is encoded by the C22orf15 gene.
Chromosome 20 open reading frame 85, or most commonly known as C20orf85 is a gene that encodes for the C20orf85 Protein. This gene is not yet well understood by the scientific community.