Human identical sequence

A human identical sequence (HIS) is an RNA element, 24 to 27 nucleotides long, that a coronavirus genome shares with the human genome. [1] During pathogenic progression, an HIS acts as a NamiRNA (nuclear activating miRNA) through the NamiRNA-enhancer network to activate neighboring host genes. [2] [3] The first HIS elements were identified in the SARS-CoV-2 genome, which has five; other human coronaviruses have one to five. [1] It has been suggested that these sequences could more generally be termed "host identical sequences", since similar matches have been found between the SARS-CoV-2 genome and the genomes of multiple potential hosts (bats, pangolins, ferrets, and cats). [1]
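
The idea of a sequence shared between a viral and a host genome can be illustrated with a minimal Python sketch. It is not the search pipeline used in the cited study; it simply scans a viral RNA string for 24-nucleotide windows that occur exactly in a host DNA string on either strand. The function names, the window size default, and the flanking sequences in the toy data are all assumptions made for illustration.

```python
def to_dna(seq):
    """Upper-case a sequence and write it in the DNA alphabet (U -> T)."""
    return seq.upper().replace("U", "T")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string made of A, C, G, T."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def shared_kmers(virus_rna, host_dna, k=24):
    """List (virus_position, kmer, strand) for every viral k-mer found in the host.

    Positions are 1-based. Both host strands are searched with exact matching.
    """
    virus = to_dna(virus_rna)
    host = to_dna(host_dna)
    strands = {"+": host, "-": reverse_complement(host)}
    hits = []
    for i in range(len(virus) - k + 1):
        kmer = virus[i:i + k]
        for strand, seq in strands.items():
            if kmer in seq:
                # Report the k-mer in RNA notation, as in the article's tables.
                hits.append((i + 1, kmer.replace("T", "U"), strand))
    return hits


# Toy data (hypothetical flanks): the 26-nt HIS-SARS2-1 element is embedded in a
# made-up "virus", and its reverse complement is embedded in a made-up "host".
his = "UGUCUAUGCUAAUGGAGGUAAAGGCU"
virus = "ACGUACGU" + his + "GGCCGGCC"
host = "TTTTT" + reverse_complement(to_dna(his)) + "CCCCC"

hits = shared_kmers(virus, host)
for position, kmer, strand in hits:
    print(position, kmer, strand)
```

With this toy input the 26-nucleotide element yields three overlapping 24-mers, starting at positions 9, 10 and 11 of the made-up virus, all matching the minus strand of the made-up host. A real search over whole genomes would need an indexed aligner rather than substring tests.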

SARS-CoV-2

| Name | Length (nt) | Sequence | Location in virus genome | Location in human genome | Neighboring genes | Note |
| --- | --- | --- | --- | --- | --- | --- |
| HIS-SARS2-1 | 26 | UGUCUAUGCUAAUGGAGGUAAAGGCU | 7570–7595 in ORF1a | Chr3: 124017420-124017395 | KALRN | |
| HIS-SARS2-2 | 24 | UAUAACACAUAUAAAAAUACGUGU | 12494–12517 in ORF1a | Chr3: 176597319-176597342 | | |
| HIS-SARS2-3 | 24 | UUAUAUGCCUUAUUUCUUUACUUU | 6766–6789 in ORF1a | Chr5: 28949255-28949232 | | |
| HIS-SARS2-4 | 27 | AGGAGAAUGACAAAAAAAAAAAAAAAA | 29860–29886 in 3' UTR | Chr18: 73670168-73670142 | FBXO15, TIMM21, CYB5A | same as HIS-SARS-2 |
| HIS-SARS2-5 | 24 | UUGUUGCUGCUAUUUUCUAUUUAA | 8610–8633 in ORF1a | ChrX: 99693480-99693457 | | |

SARS-CoV-1

| Name | Length (nt) | Sequence | Location in virus genome | Location in human genome | Neighboring genes | Note |
| --- | --- | --- | --- | --- | --- | --- |
| HIS-SARS-1 | 25 | UAACAUGCUUAGGAUAAUGGCCUCU | 15251–15275 in ORF1b | Chr4: 172887105–172887129; Chr8: 122356667-122356690 | HAS2, ZHX2 | |
| HIS-SARS-2 | 27 | AGGAGAAUGACAAAAAAAAAAAAAAAA | 29717–29743 in 3' UTR | Chr18: 73670168-73670142 | | same as HIS-SARS2-4 |
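
The length, sequence and coordinate columns of these tables can be cross-checked mechanically. The sketch below, using a few rows transcribed from the SARS-CoV-2 and SARS-CoV-1 tables, tests that each sequence uses only the RNA alphabet and that its length agrees with the stated length and with the span of each coordinate range. The helper names are hypothetical, and reading a descending human coordinate pair as a minus-strand match is an assumption about the tables' convention rather than something the source states.

```python
import re

RNA_ALPHABET = set("ACGU")


def parse_range(text):
    """Return (start, end) from strings such as 'Chr3: 124017420-124017395'.

    Only the last two integers are used, so a chromosome number is ignored
    and either a hyphen or an en dash may separate the coordinates.
    """
    start, end = map(int, re.findall(r"\d+", text)[-2:])
    return start, end


def span(text):
    """Number of positions covered by an inclusive coordinate range."""
    start, end = parse_range(text)
    return abs(end - start) + 1


def strand(text):
    """Assumed convention: ascending coordinates are '+', descending are '-'."""
    start, end = parse_range(text)
    return "+" if end >= start else "-"


def check_row(name, length, seq, virus_range, human_locus):
    """Return a list of human-readable inconsistencies found in one row."""
    problems = []
    if set(seq) - RNA_ALPHABET:
        problems.append(f"{name}: sequence contains non-RNA characters")
    if len(seq) != length:
        problems.append(f"{name}: sequence is {len(seq)} nt, stated length is {length}")
    if span(virus_range) != length:
        problems.append(f"{name}: virus range {virus_range} spans {span(virus_range)} nt")
    if span(human_locus) != length:
        problems.append(f"{name}: human locus {human_locus} spans {span(human_locus)} nt")
    return problems


# (name, stated length, sequence, virus range, human locus), one human locus per row.
ROWS = [
    ("HIS-SARS2-1", 26, "UGUCUAUGCUAAUGGAGGUAAAGGCU", "7570-7595", "Chr3: 124017420-124017395"),
    ("HIS-SARS-1", 25, "UAACAUGCUUAGGAUAAUGGCCUCU", "15251-15275", "Chr4: 172887105-172887129"),
    ("HIS-SARS-1", 25, "UAACAUGCUUAGGAUAAUGGCCUCU", "15251-15275", "Chr8: 122356667-122356690"),
    ("HIS-SARS-2", 27, "AGGAGAAUGACAAAAAAAAAAAAAAAA", "29717-29743", "Chr18: 73670168-73670142"),
]

problems = [p for row in ROWS for p in check_row(*row)]
for row in ROWS:
    print(row[0], row[4], "strand", strand(row[4]))
for p in problems:
    print("CHECK:", p)
```

On these rows the only line reported is for the Chr8 locus of HIS-SARS-1, which spans 24 positions as written against a stated length of 25. The table alone does not settle whether that is a partial match or a transcription slip.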

MERS-CoV

| Name | Length (nt) | Sequence | Location in virus genome | Location in human genome | Neighboring genes | Note |
| --- | --- | --- | --- | --- | --- | --- |
| HIS-MERS-1 | 24 | UUCCAUUUGCACAGAGUAUCUUUU | 24364–24387 in S | ChrX: 25635779-25635802 | | |

HCoV-HKU1

| Name | Length (nt) | Sequence | Location in virus genome | Location in human genome | Neighboring genes | Note |
| --- | --- | --- | --- | --- | --- | --- |
| HIS-HKU1-1 | 24 | UUAGAAUUGUUCAAAUGUUAUCUG | 18656-18679 | chr1:106816197-106816220 | | |
| HIS-HKU1-2 | 24 | UUUUCUAAGAAAGAUUGGUAUGAU | 14044-14067 | chr1:226438633-226438656; chr4:151100495-151100518; chr5:79284823-79284846; chr5:111192947-111192970; chr7:94695722-94695745; chr7:98386489-98386512; chr15:59768424-59768447; chr22:30137367-30137390 | | |
| HIS-HKU1-3 | 24 | AUUUGACUUUAAAUCUUCAUACUA | 26693-26716 | chr4:11718458-11718481 | | |
| HIS-HKU1-4 | 24 | GAUUGGUUGUAUUUUCAUUUUUAU | 23527-23550 | chr4:33759646-33759669 | | |
| HIS-HKU1-5 | 24 | UAGAUACUGUUAUUUUUAAAAAUA | 19844-19867 | chrX:81711130-81711153 | | |

HCoV-NL63

| Name | Length (nt) | Sequence | Location in virus genome | Location in human genome | Neighboring genes | Note |
| --- | --- | --- | --- | --- | --- | --- |
| HIS-NL63-1 | 24 | UUAUGAUUUUGGUGAUUUUGUUGU | 13044-13067 | chr1:215311768-215311791 | | |
| HIS-NL63-2 | 24 | GGUGUUUUUGUUGAUGAUGUUGUU | 14920-14943 | chr4:28254452-28254475 | | |
| HIS-NL63-3 | 24 | AUAGGCUUAAAUGCUUCUGUUACU | 20754-20777 | chr6:30469931-30469954 | | |
| HIS-NL63-4 | 24 | AAGUAAUUGUAUUAAGAUGUUAUC | 12124-12147 | chr7:19853545-19853568 | | |
| HIS-NL63-5 | 24 | AACUUUUAUGAUUUUGGUGAUUUU | 13039-13062 | chr9:1525276-1525299 | | |

HCoV-OC43

| Name | Length (nt) | Sequence | Location in virus genome | Location in human genome | Neighboring genes | Note |
| --- | --- | --- | --- | --- | --- | --- |
| HIS-OC43-1 | 24 | UACAGCUCUUUGUAAAUCUGGUAG | 22827-22850 | chr8:122471006-122471029 | HAS2, ZHX2 | |
| HIS-OC43-2 | 24 | UUGUAUGAGUGAUUUUAUGAGUGA | 24509-24532 | chr13:30510223-30510246 | | |

HCoV-229E

| Name | Length (nt) | Sequence | Location in virus genome | Location in human genome | Neighboring genes | Note |
| --- | --- | --- | --- | --- | --- | --- |
| HIS-229E-1 | 24 | AAUAUUUUAACAGUACCACGUUAU | 19817-19840 | chr8:42865576-42865599 | | |
| HIS-229E-2 | 24 | ACUUUGUAUUGUGUCCUCCUGGAA | 13139-13162 | chr11:112451251-112451274 | | |

References

  1. Li, W; Yang, S; Xu, P; Zhang, D; Tong, Y; Chen, L; Jia, B; Li, A; Lian, C; Ru, D; Zhang, B; Liu, M; Chen, C; Fu, W; Yuan, S; Gu, C; Wang, L; Li, W; Liang, Y; Yang, Z; Ren, X; Wang, S; Zhang, X; Song, Y; Xie, Y; Lu, H; Xu, J; Wang, H; Yu, W (February 2022). "SARS-CoV-2 RNA elements share human sequence identity and upregulate hyaluronan via NamiRNA-enhancer network". EBioMedicine. 76: 103861. doi:10.1016/j.ebiom.2022.103861. PMC 8811534. PMID 35124429.
  2. Yang, S; Ling, Y; Zhao, F; Li, W; Song, Z; Wang, L; Li, Q; Liu, M; Tong, Y; Chen, L; Ru, D; Zhang, T; Zhou, K; Zhang, B; Xu, P; Yang, Z; Li, W; Song, Y; Xu, J; Zhu, T; Shan, F; Yu, W; Lu, H (18 March 2022). "Hymecromone: a clinical prescription hyaluronan inhibitor for efficiently blocking COVID-19 progression". Signal Transduction and Targeted Therapy. 7 (1): 91. doi:10.1038/s41392-022-00952-w. PMC 8931182. PMID 35304437.
  3. Xiao, M; Li, J; Li, W; Wang, Y; Wu, F; Xi, Y; Zhang, L; Ding, C; Luo, H; Li, Y; Peng, L; Zhao, L; Peng, S; Xiao, Y; Dong, S; Cao, J; Yu, W (October 2017). "MicroRNAs activate gene transcription epigenetically as an enhancer trigger". RNA Biology. 14 (10): 1326–1334. doi:10.1080/15476286.2015.1112487. PMC 5711461. PMID 26853707.