NBPF26

Last updated
NBPF26
Identifiers
Aliases NBPF26 , NBPF member 26
External IDs HomoloGene: 41035 GeneCards: NBPF26
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001351372
NM_001395637
NM_001405520

n/a

RefSeq (protein)

n/a

n/a

Location (UCSC) Chr 1: 120.72 – 120.84 Mb n/a
PubMed search [2] n/a
Wikidata
View/Edit Human

NBPF26, or Neuroblastoma breakpoint family member 26, is a protein encoded by the NBPF26 gene in Homo sapiens. The alias for NBPF26 is notch 2 N-terminal like R (NOTCH2NLR). [3] NBPF26 encodes 13 Olduvai domains, which are thought to contribute to the rapid expansion of the neocortex in humans. [4]

Contents

Gene

Locus

The NBPF26 gene is located on the plus strand of chromosome 1 (1p11.2) from 120,723,945 to 120,842,229, spanning 118,285 base pairs. NOTCH2NLR is an alias of NBPF26 and overlaps with beginning of NBPF26's coding sequence.

Genomic neighborhood

Human NBPF26 genomic neighborhood and location. Human NBPF26 genomic neighborhood.gif
Human NBPF26 genomic neighborhood and location.

Within the genomic neighborhood of human NBPF26, phosphodiesterase 4D interacting protein-like pseudogene 2 (PDE4DIPP2), NOTCH2NLR, Rosellinia necatrix victorivirus 1-19 (RnVV1-19), and tRNA-Asn (anticodon GTT) 7-1 (TRN-GTT7-1) can be found.

mRNA

NBPF26 encodes three isoform variants shown in the table below. Isoform one is the longest variant and spans 6,891 base pairs. [5]

Human NBPF26 Isoforms
IsoformNucleotide Accession #mRNA Length (bp)Protein Accession #Protein Length (aa)Molecular Weight (kDa)
1NM_001405520.16,891NP_001392449.11,673190
2NM_001351372.26,666NP_001338301.21,598182
3NM_001395637.26,216NP_001382566.11448165

Protein

Composition

NBPF26 Tertiary Structure. Structure created with AlphaFold. NBPF26 Tertiary Structure.png
NBPF26 Tertiary Structure. Structure created with AlphaFold.

NBPF26 isoform one has a predicted molecular weight of 190 kDa and an isoelectric point of 4.6. [7] Relative to other proteins, NBPF26 is rich in glutamic acid, glutamine, and cysteine, and poor in isoleucine. [8] Two positive charge clusters, spanning 22 amino acids, are located at positions 1,241-1,262 and 1,560-1,581.

Motifs

Five EGF-like domains and one calcium-binding EGF-like domain are encoded in NBPF26. [9]

NBPF26 also encodes 13 Olduvai domains. [10] Positions and sequences of Conserved 1-3 (Con1-3) and Human lineage-specific 1-3 (HLS1-3) Olduvai clades are shown in the table below.

NBPF26 Olduvai Domains
CladePosition (aa)Sequence
Con1442-505EKVLESSAPREVQKTEESKVPEDSLEECAITCSNSHGPCDSNQPHKNIKITFEEDEVNSTLVVD
Con1713-775EKVQKSSAPREMQKAEEKEVPEDSLEECAITCSNSHGPYDCNQPHRKTKITFEEDKVDSTLIG
Con2799-862EEKGPVSPRNLQESEEEEVPQESWDEGYSTLSIPPEMLASYKSYSSTFHSLEEQQVCMAVDIG
HLS1871-937KEDHEATGPRLSRELLDEKGPEVLQDSLDRCYSTPSGCLELTDSCQPYRSAFYVLEQQRVGLAVDMD
HLS1946-1,012EEDQDPSCPRLSGELLDEKEPEVLQESLDRCYSTPSGCLELTDSCQPYRSAFYILEQQRVGLAVDMD
HLS11,021-1,087EEDQDPSCPRLSGELLDEKEPEVLQESLDRCYSTPSGCLELTDSCQPYRSAFYILEQQRVGLAVDMD
HLS21,096-1,162EEDQDPSCPRLSRELLDEKEPEVLQDSLGRCYSTPSGYLELPDLGQPYSSAVYSLEEQYLGLALDVD
HLS31,171-1,237EEDQGPPCPRLSRELLEVVEPEVLQDSLDRCYSTPSSCLEQPDSCQPYGSSFYALEEKHVGFSLDV
HLS11,265-1331EEDQNPPCPRLSRELLDEKGPEVLQDSLDRCYSTPSGCLELTDSCQPYRSAFYILEQQRVGLAVDMD
HLS31,340-1,406EEDQDPSCPRLSRELLEVVEPEVLQDSLDRCYSTPSSCLEQPDSCQPYGSSFYALEEKHVGFSLDVG
HLS21,415-1,481EEDQDPSCPRLSRELLDEKEPEVLQDSLGRCYSTPSGYLELPDLGQPYSSAVYSLEEQYLGLALDVD
HLS31,490-1,556EEDQGPPCPRLSRELLEVVEPEVLQDSLDRCYSTPSSCLEQPDSCQPYGSSFYALEEKHVGFSLDVG
Con31,584-1,650EEDQNPPCPRLNSMLMEVEEPEVLQDSLDICYSTPSMYFELPDSFQHYRSVFYSFEEEHISFALYVD

Post-translational modifications

NBPF26 is phosphorylated at amino acids 46, 48, 97, and 228. [11] [12] In addition, NBPF26 is predicted to undergo sulfonation, C-mannosylation, N-glycosylation, and O-glycosylation. [13] [14] [15] [16] Three furin cleavage sites are found at amino acid positions 379, 1252, and 1571. [17]

Protein interactions

NBPF26 is predicted to interact with Estrogen receptor 1 (ESR1), APEX nuclease (multifunctional DNA repair enzyme) 1 (APEX1), lysine (K)-specific demethylase 1A (KDM1A), trans-golgi network protein 2 (TGOLN2), and tripartite motif containing 25 (TRIM25). [18] [19]

Homology

Orthologs

Distant orthologs for human NBPF26 can be found in primates, birds, reptiles, amphibians, fish, and some invertebrates. Select orthologs are shown in the table below.

NBPF26 Orthologs
ClassGenus and SpeciesCommon nameGene nameAccession #Length (aa)Identity (%)Similarity (%)
MammaliaPan troglodytes Chimpanzee NBPF14XP_0549585241,61553.957.8
MammaliaGorilla gorilla gorilla Gorilla NBPF12XP_055212742.12,42854.356.0
MammaliaMus musculus House Mouse NOTCH2NP_035058.22,47316.021.2
MammaliaEnhydra lutris kenyoni Northern Sea Otter NBPF26XP_022351917.1 2,35812.321.6
AvesDryobates pubescens Downy woodpecker NOTCH2XP_054021673.12,45320.129.7
AvesAntrostomus carolinensis Chuck-will's-widow NOTCH2XP_0101741612,40119.128.5
AvesStrigops habroptila Owl Parrot NOTCH2XP_030350503.12,43215.622.4
AvesAnser cygnoides Swan goose NBPF26XP_047936201.1 2,35913.724.1
ReptiliaTrachemys scripta elegans Red-eared slider NOTCH2XP_034634874.12,39720.029.8
ReptiliaAlligator mississippiensis American alligator NOTCH2XP_006260017.12,43516.724.2
ReptiliaPodarcis raffonei Aeolian wall lizard NOTCH1XP_053230981.12,53114.622.8
ReptiliaPseudonaja textilis Eastern brown snake NOTCH3XP_026552021.12,41614.321.3
AmphibiaSpea bombifrons Plains spadefoot toad NOTCH2LXP_053317909.12,41917.025.7
AmphibiaXenopus laevis African clawed frog NOTCH2XP_018114267.12,45015.022.6
AmphibiaGeotrypetes seraphini Gaboon caecilian NOTCH3XP_033779639.12,47214.621.8
AmphibiaHyla sarda Sardinian tree frog MTF1XP_056404323.11,16912.422.3
ChordataAcipenser ruthenus Sterlet NOTCH2XP_058887354.12,45416.424.7
ChordataCallorhinchus milii Australian ghostshark NOTCH2XP_042190840.12,81916.023.8
ChordataPetromyzon marinus Sea lamprey NOTCH1LXP_032815028.12,53614.923.2
ChordataParamormyrops kingsleyae Elephantfish NOTCH2LXP_023699719.12,42014.422.2
InvertebratesAmphimedon queenslandica Sponge FBPP1LXP_0198558556988.114.4

Paralogs

Human NBPF26 has multiple paralogs. The top 10 are shown in the table below.

Human NBPF26 Paralogs
NameAccession #Length (aa)Identity (%)Similarity (%)
NBPF1 NP_001392595.11,70468.871.6
NBPF7NP_001392671.11,70468.271.0
LOC102724250NP_001392459.11,70467.770.4
NBPF1LNP_001393481.11,94966.168.7
NBPF12KAI4082365.11,45965.167.3
NBPF9NP_001264373.11,11164.765.2
NBPF20XP_047301971.11,06561.762.5
KIAA1693BAB21784.190150.751.8
NBPF8XP_047285792.11,35348.650.0
NBPF11NP_001095133.386546.347.9

Expression

NBPF26 RNA expression is ubiquitous throughout human tissue. [20]

Clinical Significance

In lung cancer, mutations in NBPF26 were associated with local disease recurrence. [21] NBPF26 expression in monocytes was increased in pregnant patients with rheumatoid arthritis compared to pregnancies without rheumatoid arthritis. [22]

Conceptual translation

A conceptual translation of human NBPF26 is shown below.

NBPF26 conceptual translation part 1.png
NBPF26 conceptual translation part 2.png
NBPF26 conceptual translation part 3.png
NBPF26 conceptual translation part 4.png
NBPF26 conceptual translation part 5.png
NBPF26 conceptual translation part 6.png

Related Research Articles

<span class="mw-page-title-main">ERICH2</span> Protein-coding gene in the species Homo sapiens

Glutamate Rich Protein 2 is a protein in humans encoded by the gene ERICH2. This protein is expressed heavily in male tissues specifically in the testes, and proteins are specifically found in the nucleoli fibrillar center and the vesicles of these testicular cells. The protein has multiple protein interactions which indicate that it may play a role in histone modification and proper histone functioning.

<span class="mw-page-title-main">C16orf46</span> Human gene

Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.

C11orf42 is an uncharacterized protein in Homo sapiens that is encoded by the C11orf42 gene. It is also known as chromosome 11 open reading frame 42 and uncharacterized protein C11orf42, with no other aliases. The gene is mostly conserved in mammals, but it has also been found in rodents, reptiles, fish and worms.

<span class="mw-page-title-main">C1orf185</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 open reading frame 185, also known as C1orf185, is a protein that in humans is encoded by the C1orf185 gene. In humans, C1orf185 is a lowly expressed protein that has been found to be occasionally expressed in the circulatory system.

<span class="mw-page-title-main">C20orf202</span>

C20orf202 is a protein that in humans is encoded by the C20orf202 gene. In humans, this gene encodes for a nuclear protein that is primarily expressed in the lung and placenta.

<span class="mw-page-title-main">KRBA1</span> Protein-coding gene in the species Homo sapiens

KRBA1 is a protein that in humans is encoded by the KRBA1 gene. It is located on the plus strand of chromosome 7 from 149,411,872 to 149,431,664. It is also commonly known under two other aliases: KIAA1862 and KRAB A Domain Containing 1 gene and encodes the KRBA1 protein in humans. The KRBA family of genes is understood to encode different transcriptional repressor proteins

<span class="mw-page-title-main">LSMEM2</span> Protein-coding gene in the species Homo sapiens

Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.

<span class="mw-page-title-main">TMEM221</span> Protein

Transmembrane protein 221 (TMEM221) is a protein that in humans is encoded by the TMEM221 gene. The function of TMEM221 is currently not well understood.

<span class="mw-page-title-main">SMIM15</span> Mammalian protein found in Homo sapiens

SMIM15(small integral membrane protein 15) is a protein in humans that is encoded by the SMIM15 gene. It is a transmembrane protein that interacts with PBX4. Deletions where SMIM15 is located have produced mental defects and physical deformities. The gene has been found to have ubiquitous but variable expression in many tissues throughout the body.

<span class="mw-page-title-main">TMEM247</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 247 is a multi-pass transmembrane protein of unknown function found in Homo sapiens encoded by the TMEM247 gene. Notable in the protein are two transmembrane regions near the c-terminus of the translated polypeptide. Transmembrane protein 247 has been found to be expressed almost entirely in the testes.

TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.

<span class="mw-page-title-main">C11orf98</span> Protein-coding gene in the species Homo sapiens

C11orf98 is a protein-encoding gene on chromosome 11 in humans of unknown function. It is otherwise known as c11orf48. The gene spans the chromosomal locus from 62,662,817-62,665,210. There are 4 exons. It spans across 2,394 base pairs of DNA and produces an mRNA that is 646 base pairs long.

<span class="mw-page-title-main">C4orf19</span> Human C4orf19 gene

C4orf19 is a protein which in humans is encoded by the C4orf19 gene.

Chromosome 4 open reading frame 50 is a protein that in humans is encoded by the C4orf50 gene. The protein localizes in the nucleus. C4orf50 has orthologs in vertebrates but not invertebrates

<span class="mw-page-title-main">TMEM144</span> Transmembrane Protein 144

Transmembrane Protein 144 (TMEM144) is a protein in humans encoded by the TMEM144 gene.

<span class="mw-page-title-main">C13orf46</span> C13of46 Gene and Protein

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

<span class="mw-page-title-main">Chromosome 5 open reading frame 47</span> Human C5ORF47 Gene

Chromosome 5 Open Reading Frame 47, or C5ORF47, is a protein which, in humans, is encoded by the C5ORF47 gene. It also goes by the alias LOC133491. The human C5ORF47 gene is primarily expressed in the testis.

<span class="mw-page-title-main">ANKMY1</span> Protein in humans

Ankyrin Repeat And MYND Domain Containing 1 (ANKMY1) is a protein that in humans is encoded by the ANKMY1 gene. Known aliases of ANKMY1 include Zinc Finger Myeloid, Nervy and DEAF-1 or ZMYND13.

<span class="mw-page-title-main">PRR23A</span>

Proline-Rich Protein 23A is a protein that is encoded by the Proline-Rich 23A (PRR23A) gene.

<span class="mw-page-title-main">ZFP62</span> Gene in Humans

Zinc Finger Protein 62, also known as "ZNF62," "ZNF755," or "ZET," is a protein that in humans is encoded by the ZFP62 gene. ZFP62 is part of the C2H2 Zinc Finger family of genes.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000273136 - Ensembl, May 2017
  2. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  3. NCBI gene entry for NBPF26
  4. Fiddes, I. T., Pollen, A. A., Davis, J. M., & Sikela, J. M. (2019). Paired involvement of human-specific Olduvai domains and NOTCH2NL genes in human brain evolution. Human genetics, 138, 715-721.
  5. NCBI gene entry for NBPF26
  6. AlphaFold <https://alphafold.ebi.ac.uk/entry/A0A2J8IZQ5>
  7. Expasy compute pI <https://web.expasy.org/cgi-bin/compute_pi/pi_tool>
  8. Statistical Analysis of Protein Sequences (SAPS)<https://www.ebi.ac.uk/Tools/seqstats/saps>
  9. MotifFinder <https://www.genome.jp/tools-bin/search_motif_lib>
  10. MyHits Motif Scan <https://myhits.sib.swiss/cgi-bin/motif_scan>
  11. DTU NetPhos <https://services.healthtech.dtu.dk/services/NetPhos-3.1/>
  12. GPS <https://gps.biocuckoo.cn/online.php>
  13. Sulfinator <https://web.expasy.org/sulfinator/>
  14. NetCGlyc <https://services.healthtech.dtu.dk/services/NetCGlyc-1.0/>
  15. NetNGlyc 1.0 < https://services.healthtech.dtu.dk/services/NetNGlyc-1.0/>
  16. NetOGlyc 4.0 <https://services.healthtech.dtu.dk/services/NetOGlyc-4.0/>
  17. ProP 1.0 <https://services.healthtech.dtu.dk/services/ProP-1.0/>
  18. IntAct database <https://www.ebi.ac.uk/intact/home>
  19. BioGrid < https://thebiogrid.org/>
  20. NBPF26 RNA expression <https://www.ncbi.nlm.nih.gov/gene/101060684>
  21. Valter, A., Luhari, L., Pisarev, H., Truumees, B., Planken, A., Smolander, O. P., & Oselin, K. (2023). Genomic alterations as independent prognostic factors to predict the type of lung cancer recurrence. Gene, 885, 147690. https://doi.org/10.1016/j.gene.2023.147690
  22. Lien, H. J. T., Pedersen, T. T., Jakobsen, B., Flatberg, A., Chawla, K., Sætrom, P., & Fenstad, M. H. (2023). Single-cell resolution of longitudinal blood transcriptome profiles in rheumatoid arthritis, systemic lupus erythematosus and healthy control pregnancies. Annals of the rheumatic diseases, ard-2023-224644. Advance online publication. https://doi.org/10.1136/ard-2023-224644