BPIFA4P | |||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||||||||||||||||||||||||||||||||||||||||||||
Aliases | BPIFA4P , BASE, BPI fold containing family A member 4, pseudogene | ||||||||||||||||||||||||||||||||||||||||||||||||||
External IDs | OMIM: 607627 GeneCards: BPIFA4P | ||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||
Wikidata | |||||||||||||||||||||||||||||||||||||||||||||||||||
|
BPI fold containing family A, member 4 (BPIFA4) is a non-human protein encoded by the Bpifa4 gene in mammals such as monkey, cat, and cow but does not appear in rodents and humans. [3] [4] It is also known as Latherin in horse, encoded by the Lath/Bpifa4 gene but is somewhat divergent from the other species. [5] [6] [7] Latherin/BPIFA4 is a secreted protein found in saliva and sweat.
In humans, no functional protein expressed therefore BPIFB4P is referred to as a pseudogene . [7] [8] [9] However, a non-functional protein does appear; it known as BASE protein, secreted by breast cancer cell lines and salivary gland tissue (see section below). [10] [11]
BPIFA3 is a member of a BPI fold protein superfamily defined by the presence of the bactericidal/permeability-increasing protein fold (BPI fold) which is formed by two similar domains in a "boomerang" shape. [12] This superfamily is also known as the BPI/LBP/PLUNC family or the BPI/LPB/CETP family. [13] The BPI fold creates apolar binding pockets that can interact with hydrophobic and amphipathic molecules, such as the acyl carbon chains of lipopolysaccharide found on Gram-negative bacteria, but members of this family may have many other functions.
Genes for the BPI/LBP/PLUNC superfamily are found in all vertebrate species, including distant homologs in non-vertebrate species such as insects, mollusks, and roundworms. [7] [14] Within that broad grouping is the BPIF gene family whose members encode the BPI fold structural motif and are found clustered on a single chromosome, e.g., Chromosome 20 in humans, Chromosome 2 in mouse, Chromosome 3 in rat, Chromosome 17 in pig, Chromosome 13 in cow. The BPIF gene family is split into two groupings, BPIFA and BPIFB. In humans, BIPFA consists of 3 protein encoding genes BPIFA1 , BPIFA2 , BPIFA3 , and 1 pseudogene BPIFA4P; while BPIFB consists of 5 protein encoding genes BPIFB1 , BPIFB2 , BPIFB3 , BPIFB4 , BPIFB6 and 2 pseudogenes BPIFB5P , BPIFB9P . What appears as pseudogenes in humans may appear as fully functional genes in other species.
Although BPIFA4P is identified as a pseudogene in humans, [7] [9] the RNA sequence for a wikt:putative protein has been detected at moderate levels in several glands (including salivary and mammillary), skin, and breast cancer. [15] [11] That pattern is consistent with the expression of normal BPIFA4/Latherin found in saliva and sweat of other species such as cow, [4] horse, [5] [6] and sheep. [16] The function of BPIFA4 in those species is associated with BPIFA gene family members' properties of being a surfactant and binding to bacterial lipopolysaccharides. Sweat helps animals cool down, and in animals with pelts/fur BPIFA4/Latherin significantly reduces surface tension of sweat, acting as a wetting agent to facilitate evaporative cooling. Further, it is speculated that the presence of a surfactant protein in the saliva of ruminant animals (e.g., cow, horse, sheep) may assist in mastication of large quantities of vegetable matter in their diet. [6] BPIFA4 in saliva also may function as a first line of defense against bacteria, via bactericidal functions similar to other BIPFA and BIPFB family members.
The expression of a pseudogene product in humans has caused unresolved issues about BPIFA4 in humans. This uncommon situation was summarized by Bingle and colleagues at University of Sheffield, who did extensive work on the BPI/LBP/PLUNC family:
"Human BPIFA4 appears to be an example of a pseudogene (and should be properly identified as BPIFA4P) that is perhaps better described as a ‘dying’ gene, since it appears to be both transcribed and translated, but no longer encodes a functional protein product." [17]
That non-functional protein came to be known as BASE (Breast cancer And Salivary gland Expression) protein. [11] Using a screening method for identifying human genes that code for membrane proteins, researchers at the National Cancer Institute discovered in 2003 a previously uncharacterized gene in breast cancer cell lines. With RT-PCR and Northern blot techniques, they detected the expression of BASE RNA in several breast cancer cell lines but not normal breast tissue. Separately BASE RNA expression was found in salivary gland tumors and normal salivary gland tissue. [11] [18] Thus the acronym was devised to reflect that pattern. Investigators at EMBL subsequently confirmed BASE/BPIFA4P expression in ~50% of actual breast cancer tumor samples they tested. [19] In particular, BASE/BPIFA4P expression was present in tumors with high levels of ERα estrogen receptor but not in tumors with low ERα. However, it was shown experimentally that estrogen represses the expression of the BASE/BPIFA4P gene, while the transcription factor FOXA1 activates the expression of BASE/BPIFA4P. The undetermined interplay between ERα and FOXA1 is likely to be important in hormone receptor-positive disease and acquired anti-hormone resistance. [20] Although the BASE protein's potential function was never investigated, the presence of the BASE/BPIFA4P gene was nevertheless considered a potentially useful marker for breast cancer screening.
The BASE gene deposited into US and European databases was eventually recognized as being a member of the BPI/LBP/PLUNC family and subsequently relabeled as BPIFA4P. [7] It was further recognized that unlike other primate genes for BPIFA4, the human BASE/BPIFA4P gene was missing a single nucleotide in exon 6. That deletion causes a frameshift mutation which results in a "premature" stop codon. [17] The resulting human BASE protein is much shorter than the functional BPIFA4 and Latherin proteins of other species. The original analysis predicted the BASE protein to be 19.5 kDa in size, [9] but Western blots show the protein migrates at a size larger than 22 kDa. [21] This truncated human BASE protein lacks key structural elements of a functional BPIFA4 protein, namely a long α-helical segment that creates the BPI fold. Without that, BASE cannot function like any other BPI/LBP/PLUNC family member and is thus considered functionless.
Chromosome 20 is one of the 23 pairs of chromosomes in humans. Chromosome 20 spans around 66 million base pairs and represents between 2 and 2.5 percent of the total DNA in cells. Chromosome 20 was fully sequenced in 2001 and was reported to contain over 59 million base pairs. Since then, due to sequencing improvements and fixes, the length of chromosome 20 has been updated to just over 66 million base pairs.
BPI fold containing family A, member 1 (BPIFA1), also known as Palate, lung, and nasal epithelium clone (PLUNC), is a protein that in humans is encoded by the BPIFA1 gene. It was also formerly known as "Secretory protein in upper respiratory tracts" (SPURT). The BPIFA1 gene sequence predicts 4 transcripts ; 3 mRNA variants have been well characterized. The resulting BPIFA1 is a secreted protein, expressed at very high levels in mucosa of the airways and salivary glands; at high levels in oropharyneal epithelium, including tongue and tonsils; and at moderate levels many other tissue types and glands including pituitary, testis, lung, bladder, blood, prostate, pancreas, levels in the digestive tract and pancreas. The protein can be detected on the apical side of epithelial cells and in airway surface liquid, nasal mucus, and sputum.
GATA3 is a transcription factor that in humans is encoded by the GATA3 gene. Studies in animal models and humans indicate that it controls the expression of a wide range of biologically and clinically important genes.
Cystatin-S is a protein that in humans is encoded by the CST4 gene.
BPI fold-containing family B, member 2, (BPIFB2) also known as bactericidal/permeability-increasing protein-like 1, is a protein that in humans is encoded by the BPIFB2 gene.
In molecular biology, the lipid-binding serum glycoproteins family, also known as the BPI/LBP/Plunc family or LBP/BPI/CETP family represents a family which includes mammalian lipid-binding serum glycoproteins and/or proteins containing a structural motif known as the BPI fold. Members of this family include:
BPI fold containing family A, member 3 (BPIFA3) is a protein that in humans is encoded by the BPIFA3 gene. The gene is also known as SPLUNC3 and C20orf71 in humans and the orthologous gene in mice is 1700058C13Rik. There are multiple variants of the BPIFA3 projected to be a secreted protein. It is very highly expressed in testis with little or no expression in other tissues. The Human Protein Atlas project and Mouse ENCODE Consortium report RNA-Seq expression at RPKM levels of 29.1 for human testis and 69.4 for mouse, but 0 for all other tissues. Similarly, the Bgee consortium, using multiple techniques in addition to RNA-Seq, reports a relative Expression Score of 95.8 out of 100 for testis and 99.0 for sperm in humans; however low levels of BPIFA3 between 20 and 30 were seen for a variety of tissues such as muscle, glands, prostate, nervous system, and skin.
CZIB is a gene in the human genome that encodes the protein CXXC motif containing zinc binding protein. CZIB was previously referred to as C1orf123.
BPI fold containing family B, member 4 (BPIFB4) is a protein that in humans is encoded by the BPIFB4 gene. It was formerly known as "Long palate, lung and nasal epithelium carcinoma-associated protein 4" encoded by the LPLUNC4 gene. The BPIFB4 gene sequence predicts 4 transcripts ; 3 isoforms have been well characterized. In a variety of mammals, BPIFB4 is generally expressed in very high levels in the olfactory epithelium, high levels in the gonads and pituitary, moderate levels in white blood cells (monocytes) It can occur either localized in the cytoplasm of cells or secreted and circulated systemically in blood plasma.
Transmembrane protein 151B is a protein that in humans is encoded by the TMEM151B gene.
BPI fold-containing family B member 1 (BPIFB1) is a protein that in humans is encoded by the BPIFB1 gene. BPIFB1 is a secreted protein, expressed at very high levels in mucosa of the airways and salivary glands, and at moderate levels in the digestive tract and pancreas.
Chromosome 9 open reading frame 43 is a protein that in humans is encoded by the C9orf43 gene. The gene is also known as MGC17358 and LOC257169. C9orf43 contains DUF 4647 and a polyglutamine repeat region although protein function is not well understood.
c7orf26 is a gene in humans that encodes a protein known as c7orf26. Based on properties of c7orf26 and its conservation over a long period of time, its suggested function is targeted for the cytoplasm and it is predicted to play a role in regulating transcription.
FAM227B is a protein that in humans is encoded by FAM227B gene. FAM227B stands for family with sequence similarity 227 member B and encodes protein FAM227B of the same name. Its aliases include C15orf33, MGC57432 and FLJ23800.
Chromosome 17 open reading frame 75 is a protein that in humans is encoded by the C17orf75 gene. C17orf75 is also known as SRI2 and is a human protein encoding gene located at 17q11.2 on the complementary strand. The protein this gene encodes is also known as NJMU-R1. The C17orf75 gene is ubiquitously expressed at medium-low levels throughout the body and at slightly higher levels in the brain and testes. This protein is thought to be part of a complex associated with golgin-mediated vesicle capture.
BPI fold containing family B, member 3 (BPIFB3) is a protein that in humans is encoded by the BPIFB3 gene. Two variants have been detected in humans.
BPI fold containing family B, member 5 is a non-human protein encoded by the Bpifb5 gene, also known as Lplunc5. The BPIFB5 protein and Bpifb5 gene have been characterized in mammals such as rodents and even-toed ungulates but are apparently lacking in primates and other vertebrates such as birds, reptiles, and amphibians. The protein in rodents is expressed at moderately high levels in mucosa of the airways and at moderate levels in salivary glands, esophagus, and gonads ; in even-toed ungulates expression is high in testis, moderate in brain and striated muscle, and low in kidney.
BPI fold containing family B, member 6 (BPIFB6), also known as bactericidal/permeability-increasing protein-like 3 (BPIL3), is a protein that in humans is encoded by the BPIFB6 gene, also known as BPIL3 and LPLUNC6. It is expressed at high levels in hypertrophic tonsils, at relatively moderate levels in oronasal epithelium including nasal mucosa, tongue, and salivary gland, as well as esophageal mucosa at lesser levels. Orthologs are present in many vertebrate species including mammals, birds, reptiles, and amphibians.
Vomeromodulin is a non-human protein also known as BPI fold containing family B, member 9 (BPIFB9) in the rat encoded by the Bpifb9/RYF3 gene, and as BPI fold containing family B, member 9A (BPIFB9A) encoded by the Bpifb9a gene in the mouse. This protein has been characterized in mammals such as rodents, carnivores, even-toed ungulates, insectivores, bats, lagomorphs, and shrews but is apparently absent in primates and other vertebrates such as birds, reptiles, and amphibians. Its function is associated with detection of chemical odorant pheromone molecules.
BPI fold containing family A, member 2 (BPIFA2), also known as Parotid Secretory Protein (PSP), is a protein that in humans is encoded by the BPIFA2 gene. The BPIFA2 gene sequence predicts multiple transcripts ; 2 mRNA variants have been well characterized. The resulting BPIFA2 is a secreted protein, expressed at very high levels in the parotid (salivary) gland; at high levels in oropharyngeal mucosa, including tongue; and at moderate levels many other tissue types and glands including mammary gland, testis, lung, bladder, blood, prostate, adrenal gland, kidney, and pancreas.
Accession NR_026760