Niban-like protein 2. [1] (NLP2) is a protein that in humans is encoded by the FAM129C [1] gene. Paralogs of this gene include FAM129A, and FAM129B. [2] Its aliases include B-Cell Novel Protein 1 (BCNP1), and Family with Sequence Similarity 129 Member C (FAM129C). [3] [4]
The FAM129C gene is 30,538 base pairs long and is mapped to 19p.13.112 on chromosome 19 (NC_000019.10) from 17523301 to 17553839 in humans. Chromosome 19 has highest gene density of all human chromosomes [5] and large clustered gene families corresponding to high G + C content, CpG islands, and high-density repetitive DNA suggest evolutionary significance for genes located here. [5] Based on location and expression of FAM129C gene, this would suggest it has a role in immune system function.
True orthologs of FAM129C seem to be highly conserved in mammals, reptiles, marsupials, bony and cartilaginous fish. The most distant ortholog of FAM129C were found to be in a cellular slime mould, Polysphondyllumpallidum, and even a species of barley, Hordeumvulgare.
Species | Common Name | NCBI Accession # | Sequence Length | E value |
---|---|---|---|---|
Homo sapiens | Human | AAI67806 | 697 | ––––––– |
Polysphondyllum pallidum | Cellular slime mould | ADBJ01000008.1 | 532 | 1.00E-05 |
Hordeum vulgare | Barley | AK366539.1 | 553 | 2.00E-04 |
In normal tissues, the highest expression was in lymph, bone marrow, and spleen tissue, with low expression in other parts of the human body. [6] [7] FAM129C contains pleckstrin homology domain that may cause the protein to associate with the plasma membrane. [8] It is expressed in early stages of B-cell differentiation, and in high levels in chronic lymphocytic leukemia, and in the activated subtype of diffuse large B-cell lymphoma. [9] FAM129C is mainly expressed in the cytoplasm. [2] The pattern of expression is similar to that of CXCR4, so may be involved in B cell development and B cell maturation during germinal center reaction. [8]
In the human GEO profile, FAM129C appears to be expressed at lower levels in tissues with dilated cardiomyopathy by almost 50% when compared to non-failing septum tissue. [10] This may mean that FAM129C plays a role in non-failing heart tissue. Another condition in which FAM129C is significantly down-regulated is with the wild-type genotype hippocampal tissue of Rubinstein-Taybi compared with the p300 +/- genotype. [11] People with this condition have an increased risk of developing noncancerous and cancerous tumors such as leukemia and lymphoma
The isoelectric point of NLP2 is 8.576000. [12] The molecular weight is 77.4 kdal. [12] The amino acid sequence is 697aa long [2]
The predicted tertiary structure for NLP2 shows the FAM129C PH domain. There are seven predicted β sheets at the N terminus. [8] [13] This will form the tertiary structure of the pleckstrin homology domain. [8]
Transmembrane domains, peptide cleavage sites, or strong glycosylation sites were not predicted for NLP2. [14] [15] [16] [17] [18] [19] A total of 32 likely phosphorylation sites were predicted on Serine (25,) Threonine (5), and Tyrosine (2). [20]
Interferon-inducible GTPase 5 also known as immunity-related GTPase cinema 1 (IRGC1) is an enzyme that in humans is coded by the IRGC gene. It is predicted to behave like other proteins in the p47-GTPase-like and IRG families. It is most expressed in the testis.
C5orf34 is a protein that in humans is encoded by the C5orf34 gene (5p12).
WD repeat-containing protein 90 is a protein that, in humans, is encoded by the WDR90 gene (16p13.3). This human protein is 1750 amino acids, and has a molecular weight of 187.7 kDa. It contains multiple WD40 repeat domains and one domain of unknown function. This protein is conserved all the way back to invertebrates. Proteins containing WD transducin repeating domains have been found to play a role in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control, autophagy and apoptosis.
Chromosome 11 open reading frame 86, also known as C11orf86, is a protein-coding gene in humans. It encodes for a protein known as uncharacterized protein C11orf86, which is predicted to be a nuclear protein. The function of this protein is currently unknown.
CXorf49 is a protein, which in humans is encoded by the gene chromosome X open reading frame 49(CXorf49).
Ankyrin repeat domain-containing protein 24 is a protein in humans that is coded for by the ANKRD24 gene. The gene is also known as KIAA1981. The protein's function in humans is currently unknown. ANKRD24 is in the protein family that contains ankyrin-repeat domains.
Chromosome 16 open reading frame 95 (C16orf95) is a gene which in humans encodes the protein C16orf95. It has orthologs in mammals, and is expressed at a low level in many tissues. C16orf95 evolves quickly compared to other proteins.
PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.
OCC-1 is a protein, which in humans is encoded by the gene C12orf75. The gene is approximately 40,882 bp long and encodes 63 amino acids. OCC-1 is ubiquitously expressed throughout the human body. OCC-1 has shown to be overexpressed in various colon carcinomas. Novel splice variant of this gene was also detected in various human cancer types; in addition to encoding a novel smaller protein, OCC-1 gene produces a non-protein coding RNA splice variant lncRNA.
Coiled-coil domain containing protein 180 (CCDC180) is a protein that in humans is encoded by the CCDC180 gene. This protein is known to localize to the nucleus and is thought to be involved in regulation of transcription as are many proteins containing coiled-coil domains. As it is expressed most highly in the testes and is regulated by SRY and SOX transcription factors, it could be involved in sex determination.
Leukocyte Receptor Cluster Member 9 is an uncharacterized protein encoded by the LENG9 gene. In humans, LENG9 is predicted to play a role in fertility and reproductive disorders associated with female endometrium structures.
C12orf66 is a protein that in humans is encoded by the C12orf66 gene. The C12orf66 protein is one of four proteins in the KICSTOR protein complex which negatively regulates mechanistic target of rapamycin complex 1 (mTORC1) signaling.
Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.
Chromosome 1 open reading frame 141, or C1orf141 is a protein which, in humans, is encoded by gene C1orf141. It is a precursor protein that becomes active after cleavage. The function is not yet well understood, but it is suggested to be active during development
Chromosome 1 open reading frame (C1orf167) is a protein which in humans is encoded by the C1orf167 gene. The NCBI accession number is NP_001010881. The protein is 1468 amino acids in length with a molecular weight of 162.42 kDa. The mRNA sequence was found to be 4689 base pairs in length.
C5orf46 is a protein coding gene located on chromosome 5 in humans. It is also known as sssp1, or skin and saliva secreted protein 1. There are two known isoforms known in humans, with isoform 2 being the longer of the two. The protein encoded is predicted to have one transmembrane domain, and has a predicted molecular weight of 9,692 Da, and a basal isoelectric point of 4.67.
C1orf122 is a gene in the human genome that encodes the cytosolic protein ALAESM.. ALAESM is present in all tissue cells and highly up-regulated in the brain, spinal cord, adrenal gland and kidney. This gene can be expressed up to 2.5 times the average gene in its highly expressed tissues. Although the function of C1orf122 is unknown, it is predicted to be used for mitochondria localization.
C12orf24 is a gene in humans that encodes a protein known as FAM216A. This gene is primarily expressed in the testis and brain, but has constitutive expression in 25 other tissues. FAM216A is an intracellular protein that has been predicted to reside within the nucleus of cells. The exact function of C12orf24 is unknown. FAM216A is highly expressed in Sertoli cells of the testis as well as different stage spermatids.
ProteinFAM89A is a protein which in humans is encoded by the FAM89A gene. It is also known as chromosome 1 open reading frame 153 (C1orf153). Highest FAM89A gene expression is observed in the placenta and adipose tissue. Though its function is largely unknown, FAM89A is found to be differentially expressed in response to interleukin exposure, and it is implicated in immune responses pathways and various pathologies such as atherosclerosis and glioma cell expression.
Transmembrane protein 221 (TMEM221) is a protein that in humans is encoded by the TMEM221 gene. The function of TMEM221 is currently not well understood.