NBEAL1

NBEAL1 is a protein that in humans is encoded by the NBEAL1 gene. [1] It is found on chromosome 2q33.2 of Homo sapiens.

Neurobeachin-like protein 1
Identifiers
Symbol: NBEAL1
Alt. names: ALS2CR16, ALS2CR17
NCBI gene: 65065
HGNC: 20681
OMIM: 609816
UniProt: Q6ZS30
Other data
Locus: Chr. 2 q33.2

Based on the domains of this protein, NBEAL1 is predicted to be involved in the following cellular mechanisms: vesicle trafficking, membrane dynamics, receptor signaling, pre-mRNA processing, signal transduction, and cytoskeleton assembly. [2] [3] [4] NBEAL1 is also known by the Amyotrophic Lateral Sclerosis 2 Chromosomal Region aliases ALS2CR16 and ALS2CR17. [1]

This ideogram, using a red oval, depicts the location of NBEAL1 on human chromosome 2.

Protein Properties

Transcript

The mRNA for this protein is a linear sequence of 9058 bases, with the coding sequence beginning at base 334 and extending through base 8418. [5] The transcript comprises a total of 56 exons, and the translated protein has a final length of 2694 amino acids. [6] There are currently 9 known isoforms within humans. [3]
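The transcript coordinates above can be checked against the protein length with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the coding sequence is a stop codon; the function and key names are illustrative only.

```python
# Coordinates quoted in the text (assumed 1-based and inclusive).
MRNA_LENGTH = 9058
CDS_START = 334
CDS_END = 8418


def cds_summary(cds_start, cds_end, mrna_length):
    """Derive coding-sequence statistics from transcript coordinates."""
    cds_length = cds_end - cds_start + 1
    codons, remainder = divmod(cds_length, 3)
    return {
        "cds_length": cds_length,
        "codons": codons,
        "in_frame": remainder == 0,
        # Assumption: the last codon is a stop codon and encodes no residue.
        "residues": codons - 1,
        "utr5_length": cds_start - 1,
        "utr3_length": mrna_length - cds_end,
    }


summary = cds_summary(CDS_START, CDS_END, MRNA_LENGTH)
for key, value in summary.items():
    print(f"{key}: {value}")
```

Under these assumptions the coding sequence is 8085 bases, or 2695 codons, which matches the stated 2694 amino acids plus one stop codon.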

Domains

Neurobeachin-like1 contains five domains: DUF4704, DUF4800, PH_BEACH, Beach, and WD40 repeats. [6]

DUF4704

DUF4704 is a domain of unknown function. Although its function is unknown, it is conserved within neurobeachin proteins in eukaryotes. [4] It spans amino acids 859 to 1115. [3]

DUF4800

DUF4800 is a domain of unknown function. It begins at amino acid 1580, spanning until 1833. [3] While it is uncharacterized in function, it is found within eukaryotes. [7]

PH_BEACH

Spanning from amino acid 1886 to amino acid 1983, this domain is referred to as a Pleckstrin Homology (PH) domain in the BEACH domain. [8] It is named PH because its fold is similar to that of a PH domain, although its sequence is not similar to the sequences of canonical PH domains. The PH_BEACH domain is not able to bind phospholipids. [9]

Beach

The Beige and Chediak-Higashi (BEACH) domain is one of the most significant domains within this protein. It is a highly conserved domain of roughly 280 amino acids that is present in nine human BEACH domain-containing proteins (BDCPs). [10] While the exact function of the BEACH domain is not well understood, BDCPs are known to serve many purposes within cellular mechanisms, including vesicular transport, apoptosis, membrane dynamics, and receptor signaling. [10] This protein family is of great clinical importance because mutations in its members have been identified in multiple human disorders. NBEAL1 itself has been linked to disease through its expression: neurobeachin-like 1 is upregulated in glioma, although its expression decreases as the pathological grade of the glioma increases. [2] In NBEAL1, the BEACH domain follows the PH_BEACH domain, beginning at amino acid 2005 and ending at amino acid 2284. [3]

WD40

NBEAL1 has one WD40 domain, annotated from amino acid 2409 to amino acid 2682. A WD40 repeat structural motif is annotated at the start of this region, from amino acid 2406 to 2439. The WD40 domain is found in a number of eukaryotic proteins that have multiple functions. These include, but are not limited to, acting as adaptor/regulatory modules in signal transduction, pre-mRNA processing, and cytoskeleton assembly. [3]
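The domain coordinates quoted in the subsections above can be collected and sanity-checked in a few lines. This is a minimal sketch, assuming all coordinates are 1-based and inclusive; the data structure and function names are illustrative and do not come from any annotation tool.

```python
PROTEIN_LENGTH = 2694

# (name, first residue, last residue) as quoted in the text.
DOMAINS = [
    ("DUF4704", 859, 1115),
    ("DUF4800", 1580, 1833),
    ("PH_BEACH", 1886, 1983),
    ("BEACH", 2005, 2284),
    ("WD40", 2409, 2682),
]


def domain_lengths(domains):
    """Return the length in residues of each domain (inclusive coordinates)."""
    return {name: end - start + 1 for name, start, end in domains}


def is_ordered_without_overlap(domains, protein_length):
    """Check that domains run N- to C-terminal, do not overlap, and fit the protein."""
    previous_end = 0
    for _, start, end in domains:
        if start <= previous_end or end < start or end > protein_length:
            return False
        previous_end = end
    return True


lengths = domain_lengths(DOMAINS)
for name, start, end in DOMAINS:
    print(f"{name:9s} {start:5d}-{end:<5d} {lengths[name]:4d} aa")
print("ordered, non-overlapping:", is_ordered_without_overlap(DOMAINS, PROTEIN_LENGTH))
```

With these coordinates the BEACH domain comes out at exactly 280 residues, in line with the "roughly 280 amino acid" size given for BEACH domains.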

Properties

Structure

Secondary

The secondary structure of NBEAL1 is predicted to be a combination of alpha helices, beta sheets and random coils. [13]

Tertiary

I-TASSER predicted this 3D structure for amino acids 1-1500 of NBEAL1.
Predicted structure via I-TASSER of NBEAL1, amino acids 1501-2694.

I-TASSER was used to predict a 3D structure of NBEAL1. [14] Because NBEAL1 is longer than the maximum sequence length accepted as input, it was split into two segments (amino acids 1-1500 and 1501-2694), and a structure was predicted for each segment so that the whole protein was covered.
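The splitting step can be illustrated with a short helper. This is only a sketch: the 1500-residue cap is an assumption inferred from the segment boundaries in the figure captions, and `split_into_segments` is a hypothetical name, not part of I-TASSER.

```python
def split_into_segments(sequence_length, max_length):
    """Split residues 1..sequence_length into consecutive 1-based, inclusive
    segments, each at most max_length residues long."""
    if sequence_length < 1 or max_length < 1:
        raise ValueError("sequence_length and max_length must be positive")
    segments = []
    start = 1
    while start <= sequence_length:
        end = min(start + max_length - 1, sequence_length)
        segments.append((start, end))
        start = end + 1
    return segments


# Assumed input cap of 1500 residues; NBEAL1 is 2694 residues long.
segments = split_into_segments(2694, 1500)
print(segments)
```

Structures predicted for separate segments carry no information about how the segments pack against each other, so the two models are best read independently rather than as one assembled structure.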

Post-Translational Modifications

The different post-translational modifications are illustrated in a conceptually annotated transcript (Wiki conceptually aligned transcript.pdf).

Expression

The UniGene EST abundance profile reports NBEAL1 expression by body site, health state, and developmental stage. [15] NBEAL1 is expressed in the brain, embryonic tissue, eye, intestine, kidney, liver, lung, mammary gland, ovary, pancreas, pharynx, placenta, prostate, skin, stomach, testis, thyroid, and trachea. Expression is highest in the stomach, at 62 transcripts per million, followed by the trachea and pancreas at 38 and 37 transcripts per million, respectively. It is lowest in the brain, eye, placenta, and testis, each at 4 transcripts per million. Broken down by health state, NBEAL1 is highly expressed in multiple tumors. [15] Abundance is again highest in gastrointestinal tumors, consistent with the high expression of NBEAL1 within the stomach. However, NBEAL1 expression is not seen in pancreatic tumors, which may be informative about its function within the pancreas. Abundance also differs between developmental stages: it is highest in the fetal stage, at 21 transcripts per million, compared with 14 transcripts per million in the adult.
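The figures quoted above can be ranked and compared directly. The sketch below uses only the transcripts-per-million values stated in the text; tissues listed without a value are left out, and the variable and function names are illustrative.

```python
# Transcripts per million (TPM) as quoted in the text.
TPM_BY_SITE = {
    "stomach": 62,
    "trachea": 38,
    "pancreas": 37,
    "brain": 4,
    "eye": 4,
    "placenta": 4,
    "testis": 4,
}
TPM_BY_STAGE = {"fetus": 21, "adult": 14}


def rank_by_tpm(tpm):
    """Sort (name, TPM) pairs from highest to lowest TPM, ties broken by name."""
    return sorted(tpm.items(), key=lambda item: (-item[1], item[0]))


ranked_sites = rank_by_tpm(TPM_BY_SITE)
for site, value in ranked_sites:
    print(f"{site:9s} {value:3d} TPM")

# Ratio between the highest and lowest quoted values.
fold_range = ranked_sites[0][1] / ranked_sites[-1][1]
fetal_to_adult = TPM_BY_STAGE["fetus"] / TPM_BY_STAGE["adult"]
print(f"highest/lowest site ratio: {fold_range:.1f}")
print(f"fetal/adult ratio: {fetal_to_adult:.1f}")
```

EST counts are sparse, so ratios such as these are best treated as rough indications rather than precise measurements.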

Function

The function of NBEAL1 is not yet well understood by the scientific community. However, given the function of the different domains and disease associations, it is predicted that the NBEAL1 protein may be involved in a variety of functions. As of now they include, but are not limited to, protein-protein interactions, vesicle trafficking, membrane dynamics, receptor signaling, apoptosis, adaptor/regulatory modules in signal transduction, pre-mRNA processing, and cytoskeleton assembly. [3] [2]

Clinical Significance

Diseases that have been associated with NBEAL1 include juvenile amyotrophic lateral sclerosis and adenocarcinoma, [1] although the function of NBEAL1 in these diseases has not yet been identified.

Homology

Neurobeachin-like1 is a highly conserved protein. It has orthologs found in many life forms, including but not limited to: reptiles, birds, amphibians, mammals, fish, and a few invertebrates. The following table presents some of the orthologs found using searches in BLAST [16] and BLAT. [17]

| Scientific Name | Common Name | Accession Number | Sequence Length (aa) | Percent Identity (%) |
|---|---|---|---|---|
| Homo sapiens | Human | NP_001107604.1 | 2694 | - |
| Pan troglodytes | Chimpanzee | XP_525997.3 | 2694 | 99 |
| Gorilla gorilla gorilla | Western Lowland Gorilla | XP_018878299.1 | 2694 | 99 |
| Mus musculus | Mouse | NP_775602 | 688 | 98 |
| Cercocebus atys | Sooty Mangabey | XP_011903312 | 2678 | 97 |
| Canis lupus familiaris | Dog | XP_545603.3 | 2693 | 93 |
| Ailuropoda melanoleuca | Giant Panda | XP_019655126.1 | 2693 | 93 |
| Trichechus manatus latirostris | West Indian Manatee | XP_004378299 | 2682 | 93 |
| Tursiops truncatus | Common Bottlenose Dolphin | XP_019794654.1 | 2682 | 92.7 |
| Eptesicus fuscus | Big Brown Bat | XP_008144758.1 | 2722 | 92 |
| Zonotrichia albicollis | White-throated Sparrow | XP_014120514.1 | 2707 | 80 |
| Gallus gallus | Chicken | XP_004942730.1 | 2725 | 78.7 |
| Python bivittatus | Burmese Python | XP_007422078.1 | 2687 | 79 |
| Xenopus tropicalis | Western Clawed Frog | XP_012826463.1 | 2687 | 74 |
| Callorhinchus milii | Australian Ghostshark | XP_007888887.1 | 2749 | 70.2 |
| Danio rerio | Zebrafish | XP_009300392 | 2723 | 66.3 |
| Octopus bimaculoides | California Two-spot Octopus | XP_014777916.1 | 2584 | 41.2 |
| Daphnia magna | Planktonic Crustacean | KZS03729 | 2734 | 34.4 |
| Drosophila busckii | Fruit Fly | XP_017842328.1 | 2722 | 34 |

This unrooted phylogenetic tree, produced by Biology Workbench, illustrates the evolution of NBEAL1.

Paralogs

According to GeneCards, NBEAL1 has several paralogs: NBEAL2, NBEA, LRBA, WDFY3, and lysosomal trafficking regulator (LYST). [19] The table below summarizes the paralogs of NBEAL1.

| Gene Name | Species | Accession Number | Sequence Length (aa) | Percent Identity (%) |
|---|---|---|---|---|
| NBEAL2 | Homo sapiens | NP_055990 | 2754 | 46 |
| NBEA | Homo sapiens | NP_056483.3 | 2946 | 22.8 |
| LRBA | Homo sapiens | NP_996717.2 | 2863 | 22.5 |
| WDFY3 | Homo sapiens | XP_016863397.1 | 3544 | 21.8 |
| LYST | Homo sapiens | NP_001288294.1 | 3801 | 19.3 |

Using Biology Workbench, a multiple sequence alignment was done to illustrate the highly conserved aspects of NBEAL1 in close orthologs of the Homo sapiens gene.


References

  1. Database GH. "NBEAL1 Gene - GeneCards | NBEL1 Protein | NBEL1 Antibody". www.genecards.org. Retrieved 2017-02-03.
  2. Chen J, Lu Y, Xu J, Huang Y, Cheng H, Hu G, Luo C, Lou M, Cao G, Xie Y, Ying K (June 2004). "Identification and characterization of NBEAL1, a novel human neurobeachin-like 1 protein gene from fetal brain, which is up regulated in glioma". Brain Research. Molecular Brain Research. 125 (1–2): 147–55. doi:10.1016/j.molbrainres.2004.02.022. PMID 15193433.
  3. "homo sapiens NBEAL1 - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2017-02-24.
  4. de Souza N, Vallier LG, Fares H, Greenwald I (February 2007). "SEL-2, the C. elegans neurobeachin/LRBA homolog, is a negative regulator of lin-12/Notch activity and affects endosomal traffic in polarized epithelial cells". Development. 134 (4): 691–702. doi:10.1242/dev.02767. PMID 17215302. S2CID 18838265.
  5. "Homo sapiens neurobeachin like 1 (NBEAL1), mRNA - Nucleotide - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2017-02-24.
  6. "neurobeachin-like protein 1 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2017-02-24.
  7. "Pfam: Family: DUF4800 (PF16057)". pfam.xfam.org. Retrieved 2017-04-23.
  8. "NCBI CDD Conserved Protein Domain PH_BEACH". www.ncbi.nlm.nih.gov. Retrieved 2017-02-27.
  9. Gebauer D, Li J, Jogl G, Shen Y, Myszka DG, Tong L (November 2004). "Crystal structure of the PH-BEACH domains of human LRBA/BGL". Biochemistry. 43 (47): 14873–80. doi:10.1021/bi049498y. PMID 15554694.
  10. Cullinane AR, Schäffer AA, Huizing M (July 2013). "The BEACH is hot: a LYST of emerging roles for BEACH-domain containing proteins in human disease". Traffic. 14 (7): 749–66. doi:10.1111/tra.12069. PMC 3761935. PMID 23521701.
  11. Kramer J (1990). "Molecular Weight". [permanent dead link]
  12. Nakai and Horton (1997). "PSORTII". PSORT.
  13. "ExPASy: SIB Bioinformatics Resource Portal - Home". www.expasy.org. Retrieved 2017-04-25.
  14. "I-TASSER server for protein structure and function prediction". zhanglab.ccmb.med.umich.edu. Retrieved 2017-04-30.
  15. Group S. "EST Profile - Hs.648846". www.ncbi.nlm.nih.gov. Retrieved 2017-05-06.
  16. "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2017-04-22.
  17. "Genome Browser FAQ". genome.ucsc.edu. Retrieved 2017-04-22.
  18. Workbench NB. "SDSC Biology Workbench". seqtool.sdsc.edu. Archived from the original on 2003-08-11. Retrieved 2017-05-07.
  19. Database GH. "NBEAL1 Gene - GeneCards | NBEL1 Protein | NBEL1 Antibody". www.genecards.org. Retrieved 2017-02-20.

Further reading