RNF227

Last updated

RING Finger Protein 227, also known as RNF227 and LINC02581, is a protein which in humans is encoded by the RNF227 gene. [1] According to DNA microarray data, it is found in at least 15 tissues. [1] [ citation needed ]

Contents

Gene

In humans, the RNF227 gene is found on chromosome 17 p13.1. Its mRNA sequence is 2850 base pairs in length and includes 2 exons. The coding sequence is from base pairs 95 to 2835. [2]

Protein

The RNF227 protein is 190 amino acids in length, seen in the table below. [3]

Predicted secondary structure. I-TASSER Secondary Structure.png
Predicted secondary structure.
Predicted tertiary structure. Itasser 3.gif
Predicted tertiary structure.
1
MQLLVRVPSL PERGELDCNI CYRPFNLGCR APRRLPGTAR ARCGHTICTA CLRELAARGD
61
GGGAAARVVR LRRVVTCPFC RAPSQLPRGG LTEMALDSDL WSRLEEKARA KCERDEAGNP
121
AKESSDADGE AEEEGESEKG AGPRSAGWRA LRRLWDRVLG PARRWRRPLP SNVLYCAEIK
181
DIGHLTRCTL

Predicted properties

Using tools at Expasy, the predicted molecular weight of the protein sequence is 20,875 kilodaltons [3] with an isoelectric point of 9.23. [6] The Statistical Analysis of Protein Sequences tool detected two repetitive structures: CRAPRRLP from positions 29 to 36 and CRAPSQLP from positions 80 to 87. [7]

Zinc finger domain

RING Finger Protein 227 has a zinc finger domain from position 18 to 81 , which is highly conserved throughout many eukaryotic organisms. [8]

Secondary structure

The secondary structure was predicted by the I-TASSER server and shows 7 alpha helices, 4 beta strands, and 12 coils. [4]

Tertiary structure

The tertiary structure was predicted by the I-TASSER with a confidence score of -3.42, which is typically in the range from -5 to 2. [4]

Gene level regulation

RNA-seq was performed of tissue samples from 95 human individuals representing 27 different tissues in order to determine tissue-specificity of all protein-coding genes. The highest expression can be seen in the skin, with an expression value of 22 ± 4.5 Reads per Kilobase of transcript, per Million mapped reads (RPKM). Transcription profiling was done by high throughput sequencing of individual and mixtures of 16 human tissues RNA to show the highest expression in the testes. Additionally, the lowest expression is seen in the liver. RNA sequencing was conducted of the total RNA from 20 human tissues which showed high expression in the brain, both in the cerebellum and fetal tissues. 35 human fetal samples from 6 tissues (3 – 7 replicates per tissue) collected between 10- and 20-weeks gestational time were sequence using Illumina TruSeq Stranded Total RNA. This shows very high expression in the intestine after 11 weeks and the kidney after 10 weeks. [1]

Three experiments were found that show what conditions RNF227 rises and falls. A study conducted on T cell-driven IL-22 amplification of Il-1beta-driven inflammation in human adipose tissue shows how there is higher expression of RNF227 in obese non-diabetic patients. [9] An analysis of non-invasive NeuN cells and invasive NeuT cells treated with interstitial fluid flow resulted in higher expression of RNF227 in the NeuN cell line in both the static and flow protocols. This gives insight into the molecular pathways activated by interstitial fluid flow in ERBB2-positive breast cancer cells. [10] The last experiment showed how the effect of Rho kinase inhibition on long-term keratinocyte proliferation is rapid and conditional and resulted in higher expression in the control agent as compared to the Y-27632 agent. [11]

Transcript level regulation

Stem-Loop Diagram of 5' UTR of RNF227. Stem loop.png
Stem-Loop Diagram of 5' UTR of RNF227.
Conceptual translation of RNF227. RNF227 Conceptual Translation.png
Conceptual translation of RNF227.

The diagram to the right depicts the stem-loop formation of the 5' untranslated region of RNF227. [12] The BED4.02, ZFX.01, and ZIC3.03 transcription factors are seen with RNF227, which is notable because they are all associated with zinc finger domains. [13] Translation is initiated at the AUG start codon, as seen in the conceptual translation.

Protein level regulation

The Motif Scan tool at MyHits predicted casein kinase II phosphorylation sites (from positions 9 to 12, 102 to 105, and 125 to 128), N-myristylation sites (from positions 37 to 42 and 61 to 66), and protein kinase c phosphorylation sites (from positions 38 to 40 and 137 to 139). [14]

Additionally, PSORT II predicted a 69.6% chance for the protein sequence to be found in the nucleus of a cell. [15]

Homology

Multiple sequence alignment of mammalian orthologs. Mammal MSA.png
Multiple sequence alignment of mammalian orthologs.
Multiple sequence alignment of amphibian orthologs. Amphibia MSA.png
Multiple sequence alignment of amphibian orthologs.
Multiple sequence alignment of fish orthologs. Fish MSA.png
Multiple sequence alignment of fish orthologs.

RING Finger Protein 227 has no paralogs. It does, however, have numerous orthologs extending throughout eukaryotes. The following table presents a selection of orthologs found using searches in BLAST [19] and BLAT. [20] This is not meant to be a comprehensive list, rather a small sample that shows the diversity of species in which orthologs are found.

Genus and Species Common Name Taxonomic Group Date of Divergence (Million Years Ago) Accession Number Sequence Length (amino acids) Sequence Identity Sequence Similarity
Homo sapiens Human Primates 0NP_001345628.1190100%100%
Neotoma lepida Desert Woodrat Rodentia 90OBS67541.116467.9%73.7%
Microtus ochrogaster Prairie vole Rodentia 90XP_026636787.121467.4%74.4%
Dipodomys ordii Ord's Kangaroo Rat Rodentia 90XP_012868576.115864.8%68.9%
Balaenoptera acutorostrata scammoni Minke Whale Artiodactyla 90XP_028024073.116665.3%71.1%
Vulpes vulpes Red Fox Carnivora 96XP_025861213.116062.8%72.3%
Vicugna pacos Alpaca Artiodactyla 96XP_006218277.115624.8%30.4%
Vombatus ursinus Common Wombat Diprotondontia 159XP_027712916.118062.0%74.5%
Sarcophilus harrisii Tasmanian Devil Dasyuromorphia 159XP_023358488.217258.8%66.2%
Gallus gallus Chicken Galliformes 312XP_001234238.116825.9%36.1%
Geotrypetes seraphini Gaboon Caecilian Gymnophiona 351.8XP_033780950.115037.2%46.9%
Rhinatrema bivittatum Two-lined Caecilian Gymnophiona 351.8XP_029437562.115235.7%48.0%
Microcaecilia unicolor Cayenne Caecilian Gymnophiona 351.8XP_030043188.114834.4%46.9%
Xenopus tropicalis Western Clawed Frog Anura 351.8XP_031750786.117819.1%43.4%
Scleropages formosus Asian Arowana Osteoglossiformes 435XP_029113159.116530.6%44.9%
Astyanax mexicanus Mexican Tetra Characiformes 435XP_007231481.216128.0%37.4%
Paramormyrops kinsleyaeOld Calabar Mormyrid Osteoglossiformes 435XP_023674393.116526.6%32.3%
Lepisosteus oculatus Spotted Gar Lepisosteiformes 435XP_006640609.217224.6%33.0%
Salmo trutta Brown Trout Saloniformes 435XP_029622555.119024.2%29.6%
Danio rerio Zebrafish Cypriniformes 435NP_001121828.118723.6%35.6%

Related Research Articles

<span class="mw-page-title-main">C11orf49</span> Protein-coding gene in the species Homo sapiens

C11orf49 is a protein coding gene that in humans encodes for the C11orf49 protein. It is heavily expressed in brain tissue and peripheral blood mononuclear cells, with the latter being an important component of the immune system. It is predicted that the C11orf49 protein acts as a kinase, and has been shown to interact with HTT and APOE2.

<span class="mw-page-title-main">PRR29</span> Protein-coding gene in the species Homo sapiens

PRR29 is a protein encoded by the PRR29 gene located in humans on chromosome 17 at 17q23.

<span class="mw-page-title-main">C17orf53</span>

C17orf53 is a gene in humans that encodes a protein known as C17orf53, uncharacterized protein C17orf53. It has been shown to target the nucleus, with minor localization in the cytoplasm. Based on current findings C17orf53 is predicted to perform functions of transport, however further research into the protein could provide more specific evidence regarding its function.

<span class="mw-page-title-main">C16orf46</span> Human gene

Chromosome 16 open reading frame 46 is a protein of yet to be determined function in Homo sapiens. It is encoded by the C16orf46 gene with NCBI accession number of NM_001100873. It is a protein-coding gene with an overlapping locus.

<span class="mw-page-title-main">ZCCHC18</span> Protein-coding gene in the species Homo sapiens

Zinc finger CCHC-type containing 18 (ZCCHC18) is a protein that in humans is encoded by ZCCHC18 gene. It is also known as Smad-interacting zinc finger protein 2 (SIZN2), para-neoplastic Ma antigen family member 7b (PNMA7B), and LOC644353. Other names such as zinc finger, CCHC domain containing 12 pseudogene 1, P0CG32, ZCC18_HUMAN had been used to describe this protein.

<span class="mw-page-title-main">C9orf25</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 25 (C9orf25) is a domain that encodes the FAM219A gene. The terms FAM219A and C9orf25 are aliases and can be used interchangeably. The function of this gene is not yet completely understood.

<span class="mw-page-title-main">C19orf44</span> Mammalian protein found in Homo sapiens

Chromosome 19 open reading frame 44 is a protein that in humans is encoded by the C19orf44 gene. C19orf44 is an uncharacterized protein with an unknown function in humans. C19orf44 is non-limiting implying that the protein exists in other species besides human. The protein contains one domain of unknown function (DUF) that is highly conserved throughout its orthologs. This protein is most highly expressed in the testis and ovary, but also has significant expression in the thyroid and parathyroid. Other names for this protein include: LOC84167.

<span class="mw-page-title-main">C4orf51</span> Protein-coding gene in the species Homo sapiens

Chromosome 4 open reading frame 51 (C4orf51) is a protein which in humans is encoded by the C4orf51 gene.

<span class="mw-page-title-main">C16orf86</span> Protein-coding gene in the species Homo sapiens

Uncharacterized protein C16orf86 is a protein in humans that is encoded by the C16orf86 gene. It is mostly made of alpha helices and it is expressed in the testes, but also in other tissues such as the kidney, colon, brain, fat, spleen, and liver. For the function of C16orf86, it is not well understood, however it could be a transcription factor in the nucleus that regulates G0/G1 in the cell cycle for tissues such as the kidney, brain, and skeletal muscles as mentioned in the DNA microarray data below in the gene level regulation section.

Chromosome 1 open reading frame (C1orf167) is a protein which in humans is encoded by the C1orf167 gene. The NCBI accession number is NP_001010881. The protein is 1468 amino acids in length with a molecular weight of 162.42 kDa. The mRNA sequence was found to be 4689 base pairs in length.

<span class="mw-page-title-main">C20orf202</span>

C20orf202 is a protein that in humans is encoded by the C20orf202 gene. In humans, this gene encodes for a nuclear protein that is primarily expressed in the lung and placenta.

<span class="mw-page-title-main">C1orf94</span> Protein-coding gene in the species Homo sapiens

Chromosome 1 Opening Reading Frame 94 or C1orf94 is a protein in human coded by the C1orf94 gene. The function of this protein is still poorly understood.

<span class="mw-page-title-main">LSMEM2</span> Protein-coding gene in the species Homo sapiens

Leucine rich single-pass membrane protein 2 is a single-pass membrane protein rich in leucine, that in humans is encoded by the LSMEM2 gene. The LSMEM2 protein is conserved in mammals, birds, and reptiles. In humans, LSMEM2 is found to be highly expressed in the heart, skeletal muscle and tongue.

<span class="mw-page-title-main">TMEM169</span> Gene

Transmembrane protein 169 (TMEM169) in humans is encoded by TMEM169 gene. The aliases of TMEM169 include FLJ34263, DKFZp781L2456, and LOC92691. TMEM169 has the highest expression in the brain, particularly the fetal brain. TMEM169 has homologs mammals, reptiles, amphibians, birds, fish, chordates and invertebrates. The most distantly related homolog of TMEM169 is Anopheles albimanus.

C3orf56 is a protein encoding gene found on chromosome 3. Although, the structure and function of the protein is not well understood, it is known that the C3orf56 protein is exclusively expressed in metaphase II of oocytes and degrades as the oocyte develops towards the blastocyst stage. Degradation of the C3orf56 protein suggests that this gene plays a role in the progression from maternal to embryonic genome and in embryonic genome activation.

<span class="mw-page-title-main">C9orf85</span> Protein-coding gene in the species Homo sapiens

Chromosome 9 open reading frame 85, commonly known as C9orf85, is a protein in Homo sapiens encoded by the C9orf85 gene. The gene is located at 9q21.13. When spliced, four different isoforms are formed. C9orf85 has a predicted molecular weight of 20.17 kdal. Isoelectric point was found to be 9.54. The function of the gene has not yet been confirmed, however it has been found to show high levels of expression in cells of high differentiation.

<span class="mw-page-title-main">FAM120AOS</span> Protein-coding gene in the species Homo sapiens

FAM120AOS, or family with sequence similarity 120A opposite strand, codes for uncharacterized protein FAM120AOS, which currently has no known function. The gene ontology describes the gene to be protein binding. Overall, it appears that the thyroid and the placenta are the two tissues with the highest expression levels of FAM120AOS across a majority of datasets.

<span class="mw-page-title-main">C12orf29</span> Protein-coding gene in humans

C12orf29 is a protein that in humans is encoded by chromosome 12 open reading frame 29. The gene is ubiquitously expressed in various tissues. The protein has 325 amino acids. The biological process of C12orf29 has been annotated as hematopoietic progenitor cell differentiation. The molecular and cellular functions of C12orf29 gene have not yet well understood by the scientific community.

<span class="mw-page-title-main">THAP3</span> Protein in Humans

THAP domain-containing protein 3 (THAP3) is a protein that, in Homo sapiens (humans), is encoded by the THAP3 gene. The THAP3 protein is as known as MGC33488, LOC90326, and THAP domain-containing, apoptosis associated protein 3. This protein contains the Thanatos-associated protein (THAP) domain and a host-cell factor 1C binding motif. These domains allow THAP3 to influence a variety of processes, including transcription and neuronal development. THAP3 is ubiquitously expressed in H. sapiens, though expression is highest in the kidneys.

<span class="mw-page-title-main">C13orf46</span> C13of46 Gene and Protein

Chromosome 13 Open Reading Frame 46 is a protein which in humans is encoded by the C13orf46 gene. In humans, C13orf46 is ubiquitously expressed at low levels in tissues, including the lungs, stomach, prostate, spleen, and thymus. This gene encodes eight alternatively spliced mRNA transcript, which produce five different protein isoforms.

References

  1. 1 2 3 "RNF227 ring finger protein 227 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-12-17.
  2. "Homo sapiens ring finger protein 227 (RNF227), transcript variant 1, mRNA - Nucleotide - NCBI". www.ncbi.nlm.nih.gov. 9 June 2022.
  3. 1 2 "RING finger protein 227 [Homo sapiens] - Protein - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-12-17.
  4. 1 2 3 "I-TASSER results". zhanglab.ccmb.med.umich.edu. Retrieved 2020-12-17.
  5. "RNF227 ring finger protein 227 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2020-12-17.
  6. "Compute pI/MW". Expasy.[ permanent dead link ]
  7. "SAPS Results". www.ebi.ac.uk. Retrieved 2020-12-19.
  8. "RNF227 - RING finger protein 227 - Homo sapiens (Human) - RNF227 gene & protein". www.uniprot.org. Retrieved 2020-12-17.
  9. "Type 2 diabetic obese patients: visceral adipose tissue CD14+ cells". www.ncbi.nlm.nih.gov. Retrieved 2020-12-19.
  10. "Interstitial fluid flow effect on noninvasive and invasive ERBB2-positive breast cancer cells". www.ncbi.nlm.nih.gov. Retrieved 2020-12-19.
  11. "Rho kinase inhibition effect on epidermal keratinocyte in vitro". www.ncbi.nlm.nih.gov. Retrieved 2020-12-19.
  12. "RNAfold web server". rna.tbi.univie.ac.at. Retrieved 2020-12-19.
  13. "Genomatix: Retrieve and analyze promoters: Query Input". www.genomatix.de. Retrieved 2020-12-19.
  14. "Motif Scan". myhits.sib.swiss. Retrieved 2020-12-19.
  15. "PSORT II Prediction".[ permanent dead link ]
  16. "Clustal Omega < Multiple Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2020-12-19.
  17. "Clustal Omega < Multiple Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2020-12-19.
  18. "Clustal Omega < Multiple Sequence Alignment < EMBL-EBI". www.ebi.ac.uk. Retrieved 2020-12-19.
  19. "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. Retrieved 2020-12-19.
  20. "Human BLAT Search". genome.ucsc.edu. Retrieved 2020-12-19.