Alpha-1-B glycoprotein

Last updated
A1BG
Identifiers
Aliases A1BG , A1B, ABG, GAB, HYST2477, alpha-1-B glycoprotein
External IDs OMIM: 138670 MGI: 2152878 HomoloGene: 11167 GeneCards: A1BG
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_130786

NM_001081067

RefSeq (protein)

NP_570602

NP_001074536

Location (UCSC) Chr 19: 58.35 – 58.35 Mb Chr 15: 60.77 – 60.79 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

Alpha-1-B glycoprotein is a 54.3 kDa protein in humans that is encoded by the A1BG gene. [5] The protein encoded by this gene is a plasma glycoprotein of unknown function. The protein shows sequence similarity to the variable regions of some immunoglobulin supergene family member proteins. Patients who have pancreatic ductal adenocarcinoma show an overexpression of A1BG in pancreatic juice. [6]

Contents

Gene

Neighborhood

A1BG is located on the negative DNA strand of chromosome 19 from 58,858,172 – 58,864,865. [7] Additionally, A1BG is located directly adjacent to the ZSCAN22 gene (58,838,385-58,853,712)) on the positive DNA strand, as well as the ZNF837 (58,878,990 - 58,892,389, complement) and ZNF497 (58865723 - 58,874,214, complement) genes on the negative strand. [7]

Expression

GEO Profile of A1BG tissue expression.jpg#filelinks

A1BG is expressed at high levels in the adult and fetal liver. [8] Additionally, the mammary gland shows roughly half as much expression as the liver. [8] Trace amounts of A1BG expression can be found in the blood, brain, lung, lymph node, ovary, testis, pancreas, and pancreas. [8] Liver tumors exhibit elevated levels of A1BG transcripts. [8]

mRNA

mRNA structure

The gene contains 20 distinct introns. [9] Transcription produces 15 different mRNAs, 10 alternatively spliced variants and 5 unspliced forms. [9] There are 4 probable alternative promoters, 4 non overlapping alternative last exons and 7 validated alternative polyadenylation sites. [9] The mRNAs appear to differ by truncation of the 5' end, truncation of the 3' end, presence or absence of 4 cassette exons, overlapping exons with different boundaries, splicing versus retention of 3 introns. [9]

Protein

Properties

The San Diego Super Computer's Statistical Analysis of Protein (SAPS) program determined that alpha-1B glycoprotein has 495 amino acids residues, an isoelectric point of 5.47, and a molecular mass of 54.3 kDa. Additionally, it suggested that no transmembrane domains exist in alpha-1B glycoprotein. [10] According to NCBI, the amino acid sequence MLVVFLLLWGVTWGPVTEA is a signal peptide on the N-terminus of the protein that might function as an endoplasmic reticulum import signal. [10]

Post-translational modifications

The NetAcet 1.0 program calculated that the first five amino acid residues serve as an N-acetylation site. [11] The NetGlycate 1.0 program predicted that the lysines located at residue 78, 114, and 227 serve as glycation points. [12] The NetNES 1.1 program predicted the leucine at residue 47 to be a nuclear export signal. [13] The NetNGlyc 1.0 program predicted four N-glycosylation sites - two of which are highly conserved internally repeated sequences. [14] [15] The NetCGlyc1.0 program predicted that none of the tryptophan residues serve as C-mannosylation sites. [16]

Protein interactions

A study by Udby et al. showed that Cysteine-rich secretory protein 3 is a ligand of alpha-1B glycoprotein in human plasma and they suggest that the A1BG-CRISP-3 complex displays a similar function in protecting the circulation from a potentially harmful effect of free CRISP-3. [17]

Sex-specific role in cardiac function

Female mice with cardiac conditional knockout of A1BG exhibit poor cardiac function, but male mice with the same knockout are unaffected. [18]

Homology

Orthologs

In addition to the table below, alpha-1B glycoprotein is also conserved in the white-cheeked crested gibbon, baboon, bolivian squirrel monkey, sheep, dog, wild boar, Chinese tree shrew, Chinese hamster, black flying fox, rabbit, guinea pig, giant panda, cow, rat, and the naked mole-rat. [19] Additionally, it is very likely that A1BG is further conserved throughout the mammalian clade.

Genus species Organism common nameDivergence from humans (MYA) [20] NCBI protein accession numberSequence identityProtein lengthCommon gene name
Homo sapiens [21] Human --NP_570602100%495A1BG
Pan troglodytes [22] Chimpanzee 6.2XP_00114666997.0%501PREDICTED: Alpha-1B-glycoprotein isoform 4
Pan paniscus [23] Bonobo 6.3XP_00381667797.0%499A1BG
Gorilla gorilla gorilla [24] Gorilla 8.8XP_00406165295.0%275PREDICTED: alpha-1B-glycoprotein
Pongo pygmaeus [25] Orangutan 15.7XP_00282995395.0%495alpha-1B-glycoprotein isoform 1
Macaca mulatta [26] Rhesus monkey29.0XM_00110182188.0%351hypothetical protein EGK_11172, partial
Callithrix jacchus [27] Marmoset42.6XP_00276261983.0%500A1BG
Mus musculus [28] Mouse91.0NP_00107453644.0%512alpha-1B-glycoprotein precursor
Felis catus [29] Cat94.2XP_00399739962.0%481PREDICTED: alpha-1B-glycoprotein
Equus caballus [30] Horse97.4XP_00149534458.0%568PREDICTED: alpha-1B-glycoprotein-like
Loxodonta africana [31] African bush elephant104.7XP_00340672261.0%520PREDICTED: alpha-1B-glycoprotein-like

Paralogs

No paralogs have been found for alpha-1B glycoprotein. [32]

Homologous domains

An initial NCBI Blast alignment of alpha-1B glycoprotein illustrates that the protein is mainly composed of three immunoglobulin domains. [33] There is a large segment of amino acids from position 297 to 400 that is not shown to be an immunoglobulin domain. However, a NCBI BLAST alignment of just the amino acids from 297 to 400 does illustrate that the latter sequence is indeed a fourth immunoglobulin domain. [34] Ultimately, alpha-1B glycoprotein seems to be primarily composed of four immunoglobulin domains.

Clinical significance

Steroid-resistant nephrotic syndrome

The alpha-1-glycoprotein is upregulated 11-fold in the urine of patients who have steroid resistant nephrotic syndrome. [35] A1BG was present in 7/19 patients with SRNS and was absent from all patients with steroid sensitive nephrotic syndrome. [35] The 13.8 kDa A1BG fragment had a high discriminatory power for steroid resistance in pediatric nephrotic syndrome, but is only present in a subset of patients. [35]

Related Research Articles

<span class="mw-page-title-main">Immunoglobulin heavy constant alpha 1</span> Gene in the species Homo sapiens

Immunoglobulin heavy constant alpha 1 is a immunoglobulin gene with symbol IGHA1. It encodes a constant (C) segment of Immunoglobulin A heavy chain. Immunoglobulin A is an antibody that plays a critical role in immune function in the mucous membranes. IgA shows the same typical structure of other antibody classes, with two heavy chains and two light chains, and four distinct domains: one variable region, and three variable regions. As a major class of immunoglobulin in body secretions, IgA plays a role in defending against infection, as well as preventing the access of foreign antigens to the immunologic system.

<span class="mw-page-title-main">SOGA2</span>

SOGA2, also known as Suppressor of glucose autophagy associated 2 or CCDC165, is a protein that in humans is encoded by the SOGA2 gene. SOGA2 has two human paralogs, SOGA1 and SOGA3. In humans, the gene coding sequence is 151,349 base pairs long, with an mRNA of 6092 base pairs, and a protein sequence of 1586 amino acids. The SOGA2 gene is conserved in gorilla, baboon, galago, rat, mouse, cat, and more. There is distant conservation seen in organisms such as zebra finches and anoles. SOGA2 is ubiquitously expressed in humans, with especially high expression in brain, colon, pituitary gland, small intestine, spinal cord, testis and fetal brain.

<span class="mw-page-title-main">SUHW4</span> Protein-coding gene in the species Homo sapiens

Zinc finger protein 280D, also known as Suppressor Of Hairy Wing Homolog 4, SUWH4, Zinc Finger Protein 634, ZNF634, or KIAA1584, is a protein that in humans is encoded by the ZNF280D gene located on chromosome 15q21.3.

<span class="mw-page-title-main">Morn repeat containing 1</span>

MORN1 containing repeat 1, also known as Morn1, is a protein that in humans is encoded by the MORN1 gene.

<span class="mw-page-title-main">DEPDC5</span>

DEPDC5 is a human protein of poorly understood function but has been associated with cancer in several studies. It is encoded by a gene of the same name, located on chromosome 22.

<span class="mw-page-title-main">CCDC130</span> Protein-coding gene in the species Homo sapiens

Coiled-coil domain containing 130 is a protein that in humans is encoded by the CCDC130 gene. It is part of the U4/U5/U6 tri-snRNP in the U5 portion. This tri-snRNP comes together with other proteins to form complex B of the mature spliceosome. The mature protein is approximately 45 kilodaltons (kDa) and is extremely hydrophilic due to the abnormally high number of charged and polar amino acids. CCDC130 is a highly conserved protein, it has orthologous genes in some yeasts and plants that were found using nucleotide and protein versions of the basic local alignment search tool (BLAST) from the National Center for Biotechnology Information. GEO profiles for CCDC130 have shown that this protein is ubiquitously expressed, but the highest levels of expression are found in T-lymphocytes.

SNED1 is an extracellular matrix (ECM) protein expressed at low levels in a wide range of tissues. The gene encoding SNED1 is located in the human chromosome 2 at locus q37.3. The corresponding mRNA isolated from the spleen and is 6834bp in length, and the corresponding protein is 1413 amino-acid long. The mouse ortholog of SNED1 was cloned in 2004 from the embryonic kidney by Leimester et al. SNED1 present domains characteristic of ECM proteins, including an amino-terminal NIDO domain, several calcium binding EGF-like domains (EGF_CA), a Sushi domain also known as complement control protein (CCP) domain, and three type III fibronectin (FN3) domains in the carboxy-terminal region.

<span class="mw-page-title-main">FAM71F2</span> Protein-coding gene in the species Homo sapiens

FAM71F2 or Family with Sequence Similarity 71 member F2 is a protein that in humans is encoded by the Family with Sequence Similarity 71 member F2 gene. This gene is highly active in the reproductive tissues, specifically the testis, and may serve as a potential biomarker for determining metastatic testicular cancer.

<span class="mw-page-title-main">RTL6</span>

Retrotransposon Gag Like 6 is a protein encoded by the RTL6 gene in humans. RTL6 is a member of the Mart family of genes, which are related to Sushi-like retrotransposons and were derived from fish and amphibians. The RTL6 protein is localized to the nucleus and has a predicted leucine zipper motif that is known to bind nucleic acids in similar proteins, such as LDOC1.

<span class="mw-page-title-main">Proline-rich protein 30</span>

Proline-rich protein 30 is a protein in humans that is encoded for by the PRR30 gene. PRR30 is a member in the family of Proline-rich proteins characterized by their intrinsic lack of structure. Copy number variations in the PRR30 gene have been associated with an increased risk for neurofibromatosis.

The Family with sequence similarity 149 member B1 is an uncharacterized protein encoded by the human FAM149B1 gene, with one alias KIAA0974. The protein resides in the nucleus of the cell. The predicted secondary structure of the gene contains multiple alpha-helices, with a few beta-sheet structures. The gene is conserved in mammals, birds, reptiles, fish, and some invertebrates. The protein encoded by this gene contains a DUF3719 protein domain, which is conserved across its orthologues. The protein is expressed at slightly below average levels in most human tissue types, with high expression in brain, kidney, and testes tissues, while showing relatively low expression levels in pancreas tissues.

<span class="mw-page-title-main">SHLD1</span>

SHLD1 or shieldin complex subunit 1 is a gene on chromosome 20. The C20orf196 gene encodes an mRNA that is 1,763 base pairs long, and a protein that is 205 amino acids long.

<span class="mw-page-title-main">C2orf16</span> Protein-coding gene in the species Homo sapiens

C2orf16 is a protein that in humans is encoded by the C2orf16 gene. Isoform 2 of this protein is 1,984 amino acids long. The gene contains 1 exon and is located at 2p23.3. Aliases for C2orf16 include Open Reading Frame 16 on Chromosome 2 and P-S-E-R-S-H-H-S Repeats Containing Sequence.

<span class="mw-page-title-main">C22orf23</span>

C22orf23 is a protein which in humans is encoded by the C22orf23 gene. Its predicted secondary structure consists of alpha helices and disordered/coil regions. It is expressed in many tissues and highest in the testes and it is conserved across many orthologs.

<span class="mw-page-title-main">Fam89A</span>

ProteinFAM89A is a protein which in humans is encoded by the FAM89A gene. It is also known as chromosome 1 open reading frame 153 (C1orf153). Highest FAM89A gene expression is observed in the placenta and adipose tissue. Though its function is largely unknown, FAM89A is found to be differentially expressed in response to interleukin exposure, and it is implicated in immune responses pathways and various pathologies such as atherosclerosis and glioma cell expression.

<span class="mw-page-title-main">C14orf119</span> Protein-coding gene in the species Homo sapiens

C14orf119 is a protein that in humans is encoded by the c14orf119 gene. The c14orf119 protein is predicted to be localized in the nucleus. Additionally, c14orf119 expression is decreased in individuals with systemic lupus erythematosus (SLE) when compared with healthy individual and is increased in individuals with various types of lymphomas when compared to healthy individuals.

<span class="mw-page-title-main">ISLR</span> Protein-coding gene in the species Homo sapiens

In humans, the immunoglobulin super family containing leucine-rich repeat (ISLR) protein is encoded by the ISLR gene. Current RNA-seq studies show that the protein is highly expressed in the endometrium and ovary and shows expression among 25 other tissues. The protein is seen localized in the cytoplasm, plasma membrane, extracellular exosome, and platelet alpha granule lumen. Furthermore, the protein is known to play a role in platelet degranulation, cell adhesion, and response to elevated platelet cytosolic Ca2+.

Transmembrane protein 39B (TMEM39B) is a protein that in humans is encoded by the gene TMEM39B. TMEM39B is a multi-pass membrane protein with eight transmembrane domains. The protein localizes to the plasma membrane and vesicles. The precise function of TMEM39B is not yet well-understood by the scientific community, but differential expression is associated with survival of B cell lymphoma, and knockdown of TMEM39B is associated with decreased autophagy in cells infected with the Sindbis virus. Furthermore, the TMEM39B protein been found to interact with the SARS-CoV-2 ORF9C protein. TMEM39B is expressed at moderate levels in most tissues, with higher expression in the testis, placenta, white blood cells, adrenal gland, thymus, and fetal brain.

<span class="mw-page-title-main">FAM214B</span> Protein-coding gene in the species Homo sapiens

The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.

<span class="mw-page-title-main">FAM98C</span> Gene

Family with sequence 98, member C or FAM98C is a gene that encodes for FAM98C has two aliases FLJ44669 and hypothetical protein LOC147965. FAM98C has two paralogs in humans FAM98A and FAM98B. FAM98C can be characterized for being a Leucine-rich protein. The function of FAM98C is still not defined. FAM98C has orthologs in mammals, reptiles, and amphibians and has a distant orhtologs in Rhinatrema bivittatum and Nanorana parkeri.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000121410 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000022347 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. "Entrez Gene: Alpha-1-B glycoprotein" . Retrieved 2012-11-09.
  6. Tian M, Cui YZ, Song GH, Zong MJ, Zhou XY, Chen Y, Han JX (2008). "Proteomic analysis identifies MMP-9, DJ-1 and A1BG as overexpressed proteins in pancreatic juice from pancreatic ductal adenocarcinoma patients". BMC Cancer. 8: 241. doi:10.1186/1471-2407-8-241. PMC   2528014 . PMID   18706098.
  7. 1 2 "A1BG alpha-1-B glycoprotein" . Retrieved May 10, 2013.
  8. 1 2 3 4 "EST Profile - Hs.529161". UniGene. National Center for Biotechnology Information, U.S. National Library of Medicine. Retrieved 2013-05-11.
  9. 1 2 3 4 "AceView: A1BG" . Retrieved May 11, 2013.
  10. 1 2 Brendel V, Bucher P, Nourbakhsh IR, Blaisdell BE, Karlin S (March 1992). "Methods and algorithms for statistical analysis of protein sequences". Proc. Natl. Acad. Sci. U.S.A. 89 (6): 2002–6. Bibcode:1992PNAS...89.2002B. doi: 10.1073/pnas.89.6.2002 . PMC   48584 . PMID   1549558.
  11. Kiemer L, Bendtsen JD, Blom N (April 2005). "NetAcet: prediction of N-terminal acetylation sites". Bioinformatics. 21 (7): 1269–70. doi: 10.1093/bioinformatics/bti130 . PMID   15539450.
  12. Johansen MB, Kiemer L, Brunak S (September 2006). "Analysis and prediction of mammalian protein glycation". Glycobiology. 16 (9): 844–53. CiteSeerX   10.1.1.128.831 . doi:10.1093/glycob/cwl009. PMID   16762979.
  13. la Cour T, Kiemer L, Mølgaard A, Gupta R, Skriver K, Brunak S (June 2004). "Analysis and prediction of leucine-rich nuclear export signals". Protein Eng. Des. Sel. 17 (6): 527–36. doi: 10.1093/protein/gzh062 . PMID   15314210 . Retrieved May 10, 2013.
  14. Gupta, R. "Prediction of N-glycosylation sites in human proteins" . Retrieved May 10, 2013.
  15. Higgins DG, Bleasby AJ, Fuchs R (April 1992). "CLUSTAL V: improved software for multiple sequence alignment". Comput. Appl. Biosci. 8 (2): 189–91. doi:10.1093/bioinformatics/8.2.189. PMID   1591615.
  16. Julenius, Karin (2007). "NetCGlyc1.0: Prediction of mammalian C-mannosylation sites". Glycobiology. 17 (8): 868–876. doi: 10.1093/glycob/cwm050 . PMID   17494086 . Retrieved May 10, 2013.
  17. Udby L, Sørensen OE, Pass J, Johnsen AH, Behrendt N, Borregaard N, Kjeldsen L (October 2004). "Cysteine-rich secretory protein 3 is a ligand of alpha1B-glycoprotein in human plasma". Biochemistry. 43 (40): 12877–86. doi:10.1021/bi048823e. PMID   15461460.
  18. Shi, Wei; Sheng, Xinlei; Dorr, Kerry M.; Hutton, Josiah E.; Emerson, James I.; Davies, Haley A.; Andrade, Tia D.; Wasson, Lauren K.; Greco, Todd M.; Hashimoto, Yutaka; Federspiel, Joel D. (October 2021). "Cardiac proteomics reveals sex chromosome-dependent differences between males and females that arise prior to gonad formation". Developmental Cell. 56 (21): 3019–3034.e7. doi:10.1016/j.devcel.2021.09.022. PMC   9290207 . PMID   34655525.
  19. "NCBI Blast results for A1BG protein sequence" . Retrieved May 11, 2013.
  20. "Time Tree".
  21. "alpha-1-B glycoprotein [Homo sapiens]" . Retrieved May 11, 2013.
  22. "PREDICTED: alpha-1B-glycoprotein isoform 4 [Pan troglodytes]". NCBI. Retrieved May 10, 2013.
  23. "PREDICTED: alpha-1B-glycoprotein [Pan paniscus]" . Retrieved May 11, 2013.
  24. "PREDICTED: alpha-1B-glycoprotein" . Retrieved May 10, 2013.
  25. "Send to: PREDICTED: alpha-1B-glycoprotein isoform 1 [Pongo abelii]" . Retrieved May 11, 2013.
  26. "hypothetical protein EGK_11172, partial [Macaca mulatta]" . Retrieved May 11, 2013.
  27. "PREDICTED: alpha-1B-glycoprotein [Callithrix jacchus]" . Retrieved May 11, 2013.
  28. "alpha-1B-glycoprotein precursor [Mus musculus]" . Retrieved May 11, 2013.
  29. "PREDICTED: alpha-1B-glycoprotein [Felis catus]" . Retrieved May 11, 2013.
  30. "PREDICTED: alpha-1B-glycoprotein-like" . Retrieved May 11, 2013.
  31. "PREDICTED: alpha-1B-glycoprotein-like [Loxodonta africana]".
  32. "A1BG Gene". Weissman Institute of Science. Retrieved May 10, 2013.
  33. "NCBI conserved domain search" . Retrieved May 10, 2013.
  34. "NCBI Blast: Protein Sequence" . Retrieved May 10, 2013.
  35. 1 2 3 Piyaphanee N, Ma Q, Kremen O, Czech K, Greis K, Mitsnefes M, Devarajan P, Bennett MR (June 2011). "Discovery and initial validation of α 1-B glycoprotein fragmentation as a differential urinary biomarker in pediatric steroid-resistant nephrotic syndrome". Proteomics: Clinical Applications. 5 (5–6): 334–42. doi:10.1002/prca.201000110. PMC   7039306 . PMID   21591266.