KRBA1

Last updated
KRBA1
Identifiers
Aliases KRBA1 , KRAB-A domain containing 1
External IDs MGI: 1925077 HomoloGene: 15880 GeneCards: KRBA1
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001290187
NM_032534

NM_133922
NM_001347152
NM_001347153
NM_001362144
NM_001362145

Contents

RefSeq (protein)

NP_001277116
NP_115923

NP_001334081
NP_001334082
NP_598683
NP_001349073
NP_001349074

Location (UCSC) Chr 7: 149.71 – 149.73 Mb Chr 6: 48.37 – 48.4 Mb
PubMed search [3] [4]
Wikidata
View/Edit Human View/Edit Mouse

KRBA1 is a protein that in humans is encoded by the KRBA1 gene. It is located on the plus strand of chromosome 7 from 149,411,872 to 149,431,664. [5] It is also commonly known under two other aliases: KIAA1862 and KRAB A Domain Containing 1 gene and encodes the KRBA1 protein in humans. [6] The KRBA family of genes is understood to encode different transcriptional repressor proteins [7]

Gene

Location of KRBA1 Gene on chromosome 7 as designated by the red line KRBA1 gene location.png
Location of KRBA1 Gene on chromosome 7 as designated by the red line

The KRBA1 gene is located on chromosome 7 at 7q36 in humans. It is 3786 base pairs in length with 17 exons total, however the coding sequence is only 3196 base pairs long resulting in a protein with 1064 amino acids. [8]

Protein

The KRBA1 protein has a predicted isoelectric point of 7.8pI and a predicted molecular weight of 112 kD which makes it a relatively large and neutral protein. [9] This protein is primarily made up of four amino acids. The four major ones are: Proline at 15.7%, Glycine at 11.2%, Serine at 10.9%, and Leucine at 10.2%. [9] Proline is present in well above average proportions. However, it was the only one as such and the amino acids M, N, Y, I, and F are all present in just below average proportions.

Isoforms

There are two isoforms for this protein known simply as isoform 1 and 2. Isoform 1 is the smaller one as it has a shorter N-terminus. Additionally, its 5'-UTR is different and its start codon is farther downstream. [8]

Domains and motifs

The domain of function is the KRBA box which is predicted to start at base pair 7 and end at base pair 47. For motifs there was a leucine zipper and two nuclear localization signals (NLS) were found. [10] Additionally, two fairly large proline rich regions are present towards the C-terminus of the protein. [11]

The leucine zipper is present at base pair 837: LHSLGAALAEKLDRLATALAGL

One 4 pattern and one 7 pattern NLS were found

pat4: KRPR @ 763bp

pat7: PSRRKSH @ 217bp

Secondary and tertiary structure

Using the GOR4 program, it can be inferred that KRBA1 has very little specific secondary structure composed of mostly random coil regions. Random coil regions constitute 75.00% of the protein, while alpha helices constitute 16.26% and extended strands 8.74%. [12] The alpha helices are broken up into many sections throughout the protein.

DNA level regulation

Promoter

The promoter region was chosen using ElDorado at Genomatix, which assessed the KRBA1 gene locus for possible promoter regions. Out of nine possible promoter sets, Promoter Set 2 (GXP_660502) was chosen, as its transcripts had much higher numbers of CAGE tags than any other set. Coding transcript GXT_27212195 was used, as it had the highest number of exons (17/17 possible), and matched with the NM_001290187 accession number on NCBI for KRBA1 mRNA. [13]

Transcription factor binding sites

As determined by MatInspector at Genomatix, the following are transcription factors (TFs) predicted to bind to KRBA1 in the promoter region. They were selected based on increased similarity scores and increased occurrences. [13]
Matrix FamilyDetailed Family InformationMatrixDetailed Matrix InformationStarting bpEnding bpAnchor bpStrandSimilaritySequence
V$ZTREZinc transcriptional regulatory elementV$ZTRE.035' half site of ZTRE motif114111571149(+)0.993ccCTCCccctcggccca
V$ZTREZinc transcriptional regulatory elementV$ZTRE.035' half site of ZTRE motif898914906(-)0.989ccCTCCcctcgtccccg
V$ZTREZinc transcriptional regulatory elementV$ZTRE.043' half site of ZTRE motif113311491141(-)0.984gggGGAGgggcgggggt
V$ZTREZinc transcriptional regulatory elementV$ZTRE.043' half site of ZTRE motif906922914(+)0.983aggGGAGggggcgggga
V$PLAGPleomorphic adenoma geneV$PLAG1.01Pleomorphic adenoma gene (PLAG) 1, a developmentally regulated C2H2 zinc finger protein668877(+)0.951cgGAGGccgggacgaggggttgt
V$PLAGPleomorphic adenoma geneV$PLAG1.02Pleomorphic adenoma gene 1307329318(-)1aaGGGGgcctgcggggctaagcc
V$PLAGPleomorphic adenoma geneV$PLAG1.02Pleomorphic adenoma gene 1459481470(+)1caGGGGgaggcagaggagagtgg
V$PLAGPleomorphic adenoma geneV$PLAG1.02Pleomorphic adenoma gene 1305241(+)1gaGGGGgctttgcccgagtgggc
V$SMADVertebrate SMAD family of transcription factorsV$SMAD3.01Smad3 transcription factor involved in TGF-beta signaling199209204(+)0.994gctGTCTgggc
V$SMADVertebrate SMAD family of transcription factorsV$SMAD3.01Smad3 transcription factor involved in TGF-beta signaling213223218(+)0.996aagGTCTggac
V$SMADVertebrate SMAD family of transcription factorsV$SMAD3.01Smad3 transcription factor involved in TGF-beta signaling428438433(-)0.994ctgGTCTgggc
V$SMADVertebrate SMAD family of transcription factorsV$SMAD3.02Smad3 transcription factor involved in TGF-beta signaling factor PU.1363373368(-)0.993cctGTCTggag
V$AHRRAHR-arnt heterodimers and AHR-related factorsV$AHRARNT.03DRE (dioxin response elements), XRE (xenobiotic response elements) bound by AHR/ARNT heterodimers416553(+)0.971gcccgagtggGCGTgcgcctttcct
V$AHRRAHR-arnt heterodimers and AHR-related factorsV$AHRARNT.03DRE (dioxin response elements), XRE (xenobiotic response elements) bound by AHR/ARNT heterodimers246270258(+)0.952agtctcctctGCGTgggaccacagc
V$AHRRAHR-arnt heterodimers and AHR-related factorsV$AHRARNT.01Aryl hydrocarbon receptor / Arnt heterodimers556580568(-)0.97gtcacattttgCGTGcctgtttgct
V$AHRRAHR-arnt heterodimers and AHR-related factorsV$AHRARNT.03DRE (dioxin response elements), XRE (xenobiotic response elements) bound by AHR/ARNT heterodimers658682670(-)0.956gtccggccggGCGTgggtgggacag

Expression pattern

In humans, this gene is ubiquitously expressed at a fairly low level, approximately 0.6 times the expression of the average gene. [8] Keeping that in mind, this gene is expressed the most in the heart. There is some expression in other areas such as the reproductive organs and the brain, but it is much less. When it comes to the common mouse Mus musculus, the expression of the KRBA1 gene occurs in different areas of the body compared to humans. In mice, the highest expression isn't in the heart but in the reproductive organs along with the adrenal gland and thymus. [8]

Protein level regulation

Localization

This protein is localized in the nucleus. [5] The gene is predicted to contain two nuclear localization signals (NLS) both a 4 residue pattern and 7 residue pattern signal. [10]

Post translational modifications

There weren't many different types of post translational modifications that were found. However, the few that were found occur throughout the protein at high levels. The few that were found were: O-GlcNAc, [14] O-Glycosylation, [15] and net phosphorylation, [16] and glycation. [17] All four of them have a multitude of sites throughout the entire protein.

Homology and evolution

The mutation rate of KRBA1 compared to fibrinopeptides, hemoglobin, and cytochrome c. KRBA1 Evolution Graph.png
The mutation rate of KRBA1 compared to fibrinopeptides, hemoglobin, and cytochrome c.

KRBA1 orthologs have a relatively high mutation rate as seen in the graph below comparing KRBA1 with fibrinopeptides, hemoglobin, and cytochrome C. The orthologs in the table below are sorted by sequence identity and similarity. [8]

This table displays 10 relatively distant orthologs with sequence identity ranging from 63%-36% [8] [18]
Genus and SpeciesCommon nameTaxonomic GroupDate of divergence

(MYA)

Accession numberSequence length (aa)Sequence identity %Sequence similarity
Homo sapiens Humans primates n/aNP_001277116.11064100%100%
Equus asinus Asinus Equidae 96XP_014686007.1117963%69%
Leptonychotes weddellii Weddell seal Pinnipedia 96XP_030881320.1114263%69%
Phoca vitulina Harbor sealPinnipedia96XP_032257255.1118463%69%
Nannospalax galili Greater mole rat Rodentia 90XP_008832018.1110362%69%
Felis catus Cat Felidae 96XP_011278956.2115962%67%
Mus musculus MouseRodentia90NP_001334081.1107858%67%
Mus Pahari Garidner's shrewmouseRodentia90XP_029390880.1108657%66%
Gallus gallus Chicken Aves 312XP_025003155.155540%52%
Pogona vitticeps Central bearded dragon Reptilia 312XP_020655832.1169736%51%
This picture depicts a time-calibrated, unrooted, phylogenetic tree of a few organisms that have orthologs of the KRBA1 gene. KRBA1 Tree.png
This picture depicts a time-calibrated, unrooted, phylogenetic tree of a few organisms that have orthologs of the KRBA1 gene.

Function and biochemistry

The domain of function on this gene is known as the KRAB (Kruppel-associated box) domain - A box and it spans from amino acid 7 to 47. It is a transcription repression domain within the zinc finger proteins subgroup of the KRAB-ZFPs family. [7] The KRAB domain interacts with and recruits other proteins and corepressors and then represses transcription. The mechanism appears to be the following: the KRBA proteins via their KRAB domain recruit the repressor KAP1 (KRAB-associated protein-1, also known as transcription intermediary factor 1 beta, KRAB-A interacting protein). The KAP1/ KRAB complex then recruits the heterochromatin protein 1 (HP1), and other chromatin modulating proteins, leading to transcriptional repression through heterochromatin formation. [7]

Interacting proteins

Using the STRING database, two interacting proteins were found to have high confidence interaction scores. High interaction scores are any scores above 0.7. The two proteins are Transcription intermediary factor 1-beta (TRIM28) with a score of 0.903 and Leucine-rich repeat-containing protein 61 (LRRC61) with a score 0.803. [19]

Clinical significance

A research done by Zaibo Li et al. displayed the KRAB-A-Domain in action. The von-Hippel Lindau tumor suppressor (pVHL) is a part of protein complex known as E3 ubiquitin ligase and targets hypoxia-inducible-factor 1a (HIF-1a) for breakdown. It is reported that there is a new protein that was discovered that interacts with the pVHL protein and helps it with its job. [20] pVHL works by bringing in histone deacetylases, and it is helped by a certain domain on this new protein. It is shown in a study by Li et al. that this domain is the KRAB-A-domain and it mediates pVHL's transcriptional repression of HIF-1a. [20] Additionally, the research shows that the KRAB-A-domain is shown to repress transcription on top of its ability to recruit histone deacetylases. [20]

Related Research Articles

<span class="mw-page-title-main">YIF1A</span> Protein-coding gene in the species Homo sapiens

Protein YIF1A is a Yip1 domain family proteins that in humans is encoded by the YIF1A gene.

<span class="mw-page-title-main">WWC2</span> Protein-coding gene in the species Homo sapiens

WW and C2 domain containing 2 (WWC2) is a protein that in humans is encoded by the WWC2 gene (4q35.1). Though function of WWC2 remains unknown, it has been predicted that WWC2 may play a role in cancer.

CXorf49 is a protein, which in humans is encoded by the gene chromosome X open reading frame 49(CXorf49).

<span class="mw-page-title-main">ERICH2</span> Protein-coding gene in the species Homo sapiens

Glutamate Rich Protein 2 is a protein in humans encoded by the gene ERICH2. This protein is expressed heavily in male tissues specifically in the testes, and proteins are specifically found in the nucleoli fibrillar center and the vesicles of these testicular cells. The protein has multiple protein interactions which indicate that it may play a role in histone modification and proper histone functioning.

Leukocyte Receptor Cluster Member 9 is an uncharacterized protein encoded by the LENG9 gene. In humans, LENG9 is predicted to play a role in fertility and reproductive disorders associated with female endometrium structures.

<span class="mw-page-title-main">TMCO4</span> Protein-coding gene in the species Homo sapiens

Transmembrane and coiled-coil domains 4, TMCO4, is a protein in humans that is encoded by the TMCO4 gene. Currently, its function is not well defined. It is transmembrane protein that is predicted to cross the endoplasmic reticulum membrane three times. TMCO4 interacts with other proteins known to play a role in cancer development, hinting at a possible role in the disease of cancer.

<span class="mw-page-title-main">TMEM44</span> Protein-coding gene in the species Homo sapiens

TMEM44 is a protein that in humans is encoded by the TMEM44 gene. DKFZp686O18124 is a synonym of TMEM44.

<span class="mw-page-title-main">SMCO3</span> Protein-coding gene in the species Homo sapiens

Single-pass membrane and coiled-coil domain-containing protein 3 is a protein that is encoded in humans by the SMCO3 gene.

<span class="mw-page-title-main">TMEM155</span> Protein-coding gene in the species Homo sapiens

Transmembrane protein 155 is a protein that in humans is encoded by the TMEM155 gene. It is located on human chromosome 4, spanning 6,497 bases. It is also referred to as FLJ30834 and LOC132332. This protein is known to be expressed mainly in the brain, placenta, and lymph nodes and is conserved throughout most placental mammals. The function and structure of this protein is still not well understood, but its level of expression has been studied pertaining to various pathologies.

<span class="mw-page-title-main">C1orf185</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 open reading frame 185, also known as C1orf185, is a protein that in humans is encoded by the C1orf185 gene. In humans, C1orf185 is a lowly expressed protein that has been found to be occasionally expressed in the circulatory system.

<span class="mw-page-title-main">C5orf46</span> Protein-coding gene in the species Homo sapiens

C5orf46 is a protein coding gene located on chromosome 5 in humans. It is also known as sssp1, or skin and saliva secreted protein 1. There are two known isoforms known in humans, with isoform 2 being the longer of the two. The protein encoded is predicted to have one transmembrane domain, and has a predicted molecular weight of 9,692 Da, and a basal isoelectric point of 4.67.

<span class="mw-page-title-main">C16orf90</span> Protein-coding gene in the species Homo sapiens

C16orf90 or chromosome 16 open reading frame 90 produces uncharacterized protein C16orf90 in homo sapiens. C16orf90's protein has four predicted alpha-helix domains and is mildly expressed in the testes and lowly expressed throughout the body. While the function of C16orf90 is not yet well understood by the scientific community, it has suspected involvement in the biological stress response and apoptosis based on expression data from microarrays and post-translational modification data.

<span class="mw-page-title-main">ZNF337</span> Protein-coding gene in the species Homo sapiens

ZNF337, also known as zinc finger protein 337, is a protein that in humans is encoded by the ZNF337 gene. The ZNF337 gene is located on human chromosome 20 (20p11.21). Its protein contains 751 amino acids, has a 4,237 base pair mRNA and contains 6 exons total. In addition, alternative splicing results in multiple transcript variants. The ZNF337 gene encodes a zinc finger domain containing protein, however, this gene/protein is not yet well understood by the scientific community. The function of this gene has been proposed to participate in a processes such as the regulation of transcription (DNA-dependent), and proteins are expected to have molecular functions such as DNA binding, metal ion binding, zinc ion binding, which would be further localized in various subcellular locations. While there are no commonly associated or known aliases, an important paralog of this gene is ZNF875

<span class="mw-page-title-main">C1orf94</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 Opening Reading Frame 94 or C1orf94 is a protein in human coded by the C1orf94 gene. The function of this protein is still poorly understood.

<span class="mw-page-title-main">LSMEM2</span> Protein-coding gene in the species Homo sapiens

Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.

<span class="mw-page-title-main">TMEM221</span> Protein

Transmembrane protein 221 (TMEM221) is a protein that in humans is encoded by the TMEM221 gene. The function of TMEM221 is currently not well understood.

<span class="mw-page-title-main">SMIM15</span> Mammalian protein found in Homo sapiens

SMIM15(small integral membrane protein 15) is a protein in humans that is encoded by the SMIM15 gene. It is a transmembrane protein that interacts with PBX4. Deletions where SMIM15 is located have produced mental defects and physical deformities. The gene has been found to have ubiquitous but variable expression in many tissues throughout the body.

TMEM275 is a protein that in humans is encoded by the TMEM275 gene. TMEM275 has two, highly-conserved, helical trans-membrane regions. It is predicted to reside within the plasma membrane or the endoplasmic reticulum's membrane.

<span class="mw-page-title-main">FAM214B</span> Protein-coding gene in the species Homo sapiens

The FAM214B, also known as protein family with sequence similarity 214, B (FAM214B) is a protein that, in humans, is encoded by the FAM214B gene located on the human chromosome 9. The protein has 538 amino acids. The gene contain 9 exon. There has been studies that there are low expression of this gene in patients with major depression disorder. In most organisms such as mammals, amphibians, reptiles, and birds, there are high levels of gene expression in the bone marrow and blood. For humans in fetal development, FAM214B is mostly expressed in the brains and bone marrow.

<span class="mw-page-title-main">ZNF548</span> Protein-coding gene in the species Homo sapiens

Zinc Finger Protein 548 (ZNF548) is a human protein encoded by the ZNF548 gene which is located on chromosome 19. It is found in the nucleus and is hypothesized to play a role in the regulation of transcription by RNA Polymerase II. It belongs to the Krüppel C2H2-type zinc-finger protein family as it contains many zinc-finger repeats.

References

  1. 1 2 3 GRCh38: Ensembl release 89: ENSG00000133619 - Ensembl, May 2017
  2. 1 2 3 GRCm38: Ensembl release 89: ENSMUSG00000042810 - Ensembl, May 2017
  3. "Human PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  4. "Mouse PubMed Reference:". National Center for Biotechnology Information, U.S. National Library of Medicine.
  5. 1 2 "AceView: Gene:KRBA1, a comprehensive annotation of human, mouse and worm genes with mRNAs or ESTsAceView". www.ncbi.nlm.nih.gov. Retrieved 2020-05-02.
  6. "KRBA1 Gene - GeneCards | KRBA1 Protein | KRBA1 Antibody". www.genecards.org. Retrieved 2020-05-02.
  7. 1 2 3 "CDD Conserved Protein Domain Family: KRAB_A-box". www.ncbi.nlm.nih.gov. Retrieved 2020-05-02.
  8. 1 2 3 4 5 6 "KRBA1 KRAB-A domain containing 1 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-05-02.
  9. 1 2 "ExPASy - Compute pI/Mw tool". web.expasy.org. Retrieved 2020-05-02.
  10. 1 2 "PSORT II Prediction". psort.hgc.jp. Retrieved 2020-05-02.
  11. "ELM - Search the ELM resource". elm.eu.org. Retrieved 2020-05-02.
  12. "NPS@ : GOR4 secondary structure prediction". npsa-prabi.ibcp.fr. Retrieved 2020-05-02.
  13. 1 2 "Genomatix - NGS Data Analysis & Personalized Medicine". www.genomatix.de. Retrieved 2020-05-02.
  14. "YinOYang 1.2 Server". www.cbs.dtu.dk. Retrieved 2020-05-02.
  15. "NetOGlyc 4.0 Server". www.cbs.dtu.dk. Retrieved 2020-05-02.
  16. "NetPhos 3.1 Server". www.cbs.dtu.dk. Retrieved 2020-05-02.
  17. "NetGlycate 1.0 Server". www.cbs.dtu.dk. Retrieved 2020-05-02.
  18. "TimeTree :: The Timescale of Life". www.timetree.org. Retrieved 2020-05-03.
  19. "KRBA1 protein (human) - STRING interaction network". version11.string-db.org. Retrieved 2020-05-02.
  20. 1 2 3 Li Z, Wang D, Na X, Schoen SR, Messing EM, Wu G (April 2003). "The VHL protein recruits a novel KRAB-A domain protein to repress HIF-1alpha transcriptional activity". The EMBO Journal. 22 (8): 1857–67. doi:10.1093/emboj/cdg173. PMC   154465 . PMID   12682018.