NHL Repeat Containing Protein 2, or NHLRC2, is a protein encoded by the NHLRC2 gene . [1]
NHLRC2 has also been referred to as Novel NHL Repeat Domain Containing Protein and 1200003F01Rik. [1]
NHLRC2 is located on the positive strand of chromosome 10, at position 10q25.3. [2] The full gene spans 62,533 base pairs (bp). [1] Eleven exons are transcribed in the protein-coding mRNA. [3] [ failed verification ] There is a second, less prevalent transcript variant of NHLRC2; this non-coding isoform is 522 bp of retained intron. [4] [ failed verification ]
The translated NHLRC2 protein is 726 amino acids long in humans and has a molecular weight of 79,442.59 g/mol. [1] It has been found in the cytosol, mitochondria, and/or peroxisomes of cells in which it is expressed. [1] [5]
NHLRC2 is a part of the NHL-repeat superfamily of n-bladed beta-propellers. [1] Specifically, NHLRC2 is a 6-bladed beta-propeller. The NHL repeat is found in many eukaryotic and prokaryotic proteins, and is associated with growth factor regulators. [6] [7]
The function of NHLRC2 is not yet fully understood. Using bioinformatic analysis, a "YVAD" motif was found to be conserved in eukaryotes, bacteria, and archaea. [8] "YVAD" shows up three times alone in the human NHLRC2 protein. [8] This motif is potentially involved in inhibiting caspases 1, 2, 3, 4 & 5, thus possibly having anti-apoptotic properties. [9] [10]
According to NCBI GEO microarray expression patterns, as well as UniGene's Expressed Sequence Tag (EST) profiles, NHLRC2 is expressed at low levels throughout most human tissues. [11] [12]
NCBI GEO Profiles detailed several conditions under which NHLRC2 expression is increased in comparison to base expression levels. Many, but not all, conditional cases involved an increase of NHLRC2 expression in various cancer types, including—but not limited to—the following: leukemia, lymphomas, breast & mammary gland cancer, colorectal cancer, Wilms' tumor, and lung cancer. [11]
NHLRC2 has no human paralogs. Extensive orthologs were identified, however, using NCBI's BLAST and BLAT programs. [13] [14] A select group of orthologs are detailed in the table below. This is not an exhaustive list of orthologs; rather, it shows the diversity of NHLRC2 orthologs.
PSORT II predicted one possible cleavage site within NHLRC2 in humans: between amino aids 32Q and 33E. [5] PSORT II also predicted two possible vacuolar signaling motifs—KLPK and TLPK—in humans that were also conserved in close orthologs such as vertebrates and amphibians, but not in distant orthologs like insects or plants. [5]
C11orf49 is a protein coding gene that in humans encodes for the C11orf49 protein. It is heavily expressed in brain tissue and peripheral blood mononuclear cells, with the latter being an important component of the immune system. It is predicted that the C11orf49 protein acts as a kinase, and has been shown to interact with HTT and APOE2.
METTL26, previously designated C16orf13, is a protein-coding gene for Methyltransferase Like 26, also known as JFP2. Though the function of this gene is unknown, various data have revealed that it is expressed at high levels in various cancerous tissues. Underexpression of this gene has also been linked to disease consequences in humans.
Proline-rich 12 (PRR12) is a protein of unknown function encoded by the gene PRR12.
Glutamine-rich protein 2 is a protein that in humans is decoded by the QRICH2 gene on human chromosome 17. The function of QRICH2 protein is mostly unknown, but it has been shown that QRICH2 gene contains a high molecular weight Glutenin domain and an ATPase involved domain. QRICH2 gene is highly expressed in testis, and the subcellular location of QRICH2 protein is in the nucleus.
Fibroblast growth factor receptor oncogene partner 2 (FGFR1OP2) was identified in a study on myeloproliferative syndrome (EMS). The study aimed to identify the partner genes to the fibroblast growth factor receptor 1 (FGFR1) involved in the syndrome. Using the 5'-RACE PCR technique, FGFR1OP2 was identified as a novel gene with no known function.
Family with sequence similarity 63, member A is a protein that, is encoded by the FAM63A gene in humans,. It is located on the minus strand of chromosome 1 at locus 1q21.3.
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
FAM71F2 or Family with Sequence Similarity 71 member F2 is a protein that in humans is encoded by the Family with Sequence Similarity 71 member F2 gene. This gene is highly active in the reproductive tissues, specifically the testis, and may serve as a potential biomarker for determining metastatic testicular cancer.
C17orf98 is a protein which in humans is coded by the gene c17orf98. The protein is derived from Homo sapiens chromosome 17. The C17orf98 gene consists of a 6,302 base sequence. Its mRNA has three exons and no alternative splice sites. The protein has 154 amino acids, with no abnormal amino acid levels. C17orf98 has a domain of unknown function (DUF4542) and is 17.6kDa in weight. C17orf98 does not belong to any other families nor does it have any isoforms. The protein has orthologs with high percent similarity in mammals and reptiles. The protein has additional distantly related orthologs across the metazoan kingdom, culminating with the sponge family.
Chromosome 21 Open Reading Frame 58 (C21orf58) is a protein that in humans is encoded by the C21orf58 gene.
Chromosome 18 open reading frame 63 is a protein which in humans is encoded by the C18orf63 gene. This protein is not yet well understood by the scientific community. Research has been conducted suggesting that C18orf63 could be a potential biomarker for early stage pancreatic cancer and breast cancer.
Single-pass membrane and coiled-coil domain-containing protein 3 is a protein that is encoded in humans by the SMCO3 gene.
C22orf23 is a protein which in humans is encoded by the C22orf23 gene. Its predicted secondary structure consists of alpha helices and disordered/coil regions. It is expressed in many tissues and highest in the testes and it is conserved across many orthologs.
WD Repeat and Coiled-coiled containing protein (WDCP) is a protein which in humans is encoded by the WDCP gene. The function of the protein is not completely understood, but WDCP has been identified in a fusion protein with anaplastic lymphoma kinase found in colorectal cancer. WDCP has also been identified in the MRN complex, which processes double-stranded breaks in DNA.
ProteinFAM89A is a protein which in humans is encoded by the FAM89A gene. It is also known as chromosome 1 open reading frame 153 (C1orf153). Highest FAM89A gene expression is observed in the placenta and adipose tissue. Though its function is largely unknown, FAM89A is found to be differentially expressed in response to interleukin exposure, and it is implicated in immune responses pathways and various pathologies such as atherosclerosis and glioma cell expression.
SH3 Domain Binding Kinase Family Member 3 is an enzyme that in humans is encoded by the SBK3 gene. SBK3 is a member of the serine/threonine protein kinase family. The SBK3 protein is known to exhibit transferase activity, especially phosphotransferase activity, and tyrosine kinase activity. It is well-conserved throughout mammalian organisms and has two paralogs: SBK1 and SBK2.
C6orf136 is a protein in humans encoded by the C6orf136 gene. The gene is conserved in mammals, mollusks, as well some porifera. While the function of the gene is currently unknown, C6orf136 has been shown to be hypermethylated in response to FOXM1 expression in Head Neck Squamous Cell Carcinoma (HNSCC) tissue cells. Additionally, elevated expression of C6orf136 has been associated with improved survival rates in patients with bladder cancer. C6orf136 has three known isoforms.
FAM120AOS, or family with sequence similarity 120A opposite strand, codes for uncharacterized protein FAM120AOS, which currently has no known function. The gene ontology describes the gene to be protein binding. Overall, it appears that the thyroid and the placenta are the two tissues with the highest expression levels of FAM120AOS across a majority of datasets.
Family with sequence 98, member C or FAM98C is a gene that encodes for FAM98C has two aliases FLJ44669 and hypothetical protein LOC147965. FAM98C has two paralogs in humans FAM98A and FAM98B. FAM98C can be characterized for being a Leucine-rich protein. The function of FAM98C is still not defined. FAM98C has orthologs in mammals, reptiles, and amphibians and has a distant orhtologs in Rhinatrema bivittatum and Nanorana parkeri.
Zinc Finger Protein 548 (ZNF548) is a human protein encoded by the ZNF548 gene which is located on chromosome 19. It is found in the nucleus and is hypothesized to play a role in the regulation of transcription by RNA Polymerase II. It belongs to the Krüppel C2H2-type zinc-finger protein family as it contains many zinc-finger repeats.