An internal ribosome entry site, abbreviated IRES, is an RNA element that allows for translation initiation in a cap-independent manner, as part of the greater process of protein synthesis. Initiation of eukaryotic translation nearly always occurs at and is dependent on the 5' cap of mRNA molecules, where the translation initiation complex forms and ribosomes engage the mRNA. IRES elements, however allow ribosomes to engage the mRNA and begin translation independently of the 5' cap.
IRES sequences were first discovered in 1988 in the poliovirus (PV) and encephalomyocarditis virus (EMCV) RNA genomes in the laboratories of Nahum Sonenberg [1] and Eckard Wimmer, [2] respectively. They are described as distinct regions of RNA molecules that are able to recruit the eukaryotic ribosome to the mRNA. This process is also known as cap-independent translation. It has been shown that IRES elements have a distinct secondary or even tertiary structure, but similar structural features at the levels of either primary or secondary structure that are common to all IRES segments have not been reported to date.
Use of IRES sequences in molecular biology soon became common as a tool for expressing multiple genes from a single transcriptional unit in a genetic vector. In such vectors, translation of the first cistron is initiated at the 5' cap, and translation of any downstream cistron is enabled by an IRES element appended at its 5' end. [3]
IRES elements are most commonly found in the 5' untranslated region, but may also occur elsewhere in mRNAs. The mRNA of viruses of the Dicistroviridae family possess two open reading frames (ORFs), and translation of each is directed by a distinct IRES. It has also been suggested that some mammalian cellular mRNAs also have IRESs, although this has been a matter of dispute. [4] [5] A number of these cellular IRES elements are located within mRNAs encoding proteins involved in stress survival, and other processes critical to survival. As of September 2009, there are 60 animal and 8 plant viruses reported to contain IRES elements and 115 mRNA sequences containing them as well. [6]
IRESs are often used by viruses as a means to ensure that viral translation is active when host translation is inhibited. These mechanisms of host translation inhibition are varied, and can be initiated by both virus and host, depending on the type of virus. However, in the case of most picornaviruses, such as poliovirus, this is accomplished by viral proteolytic cleavage of eIF4G so that it cannot interact with the 5'cap binding protein eIF4E. Interaction between these two eukaryotic initiation factors (eIFs) of the eIF4F complex is necessary for 40S ribosomal subunit recruitment to the 5' end of mRNAs, which is further thought to occur with mRNA 5'cap to 3' poly(A) tail loop formation. The virus may even use partially-cleaved eIF4G to aid in initiation of IRES-mediated translation.
Cells may also use IRESs to increase translation of certain proteins during mitosis and programmed cell death. In mitosis, the cell dephosphorylates eIF4E so that it has little affinity for the 5'cap. As a result, the 40S ribosomal subunit, and the translational machinery is diverted to IRES within the mRNA. Many proteins involved in mitosis are encoded by IRES mRNA. In programmed cell death, cleavage of eIF-4G, such as performed by viruses, decreases translation. Lack of essential proteins contributes to the death of the cell, as does translation of IRES mRNA sequences coding proteins involved in controlling cell death. [7]
To date, the mechanism of viral IRES function is better characterized than the mechanism of cellular IRES function, [8] which is still a matter of debate. HCV-like IRESs directly bind the 40S ribosomal subunit to position their initiator codons are located in ribosomal P-site without mRNA scanning. These IRESs still use the eukaryotic initiation factors (eIFs) eIF2, eIF3, eIF5, and eIF5B, but do not require the factors eIF1, eIF1A, and the eIF4F complex. In contrast, picornavirus IRESs do not bind the 40S subunit directly, but are recruited instead through the eIF4G-binding site. [9] Many viral IRES (and cellular IRES) require additional proteins to mediate their function, known as IRES trans-acting factors (ITAFs). The role of ITAFs in IRES function is still under investigation.
Testing of sequences for potential IRES function has generally relied on the use of bicistronic reporter assays. In these tests, a candidate IRES segment is introduced into a plasmid between two cistrons encoding two different reporter proteins. A promoter upstream of the first cistron drives transcription of both cistrons in a single mRNA. Cells are transfected with the plasmid and assays are subsequently performed to quantitate expression of the two reporters in the cells. An increase in the ratio of expression of the downstream reporter relative to the upstream reporter is taken as evidence for IRES activity in the test sequence. However, without characterization of the mRNA species produced from such plasmids, other explanations for the increase in this ratio cannot be ruled out. [4] [5] For example, there are multiple known cases of suspected IRES elements that were later reported as having promoter function. Unexpected splicing activity within several reported IRES elements have also been shown to be responsible for the apparent IRES function observed in bicistronic reporter tests. [10] A promoter or splice acceptor within a test sequence can result in the production of monocistronic mRNA from which the downstream cistron is translated by conventional cap-dependent, rather than IRES-mediated, initiation. A later study that documented a variety of unexpected aberrant mRNA species arising from reporter plasmids revealed that splice acceptor sites can mimic both IRES and promoter elements in tests employing such plasmids, further highlighting the need for caution in the interpretation of reporter assay results in the absence of careful RNA analysis. [11]
IRES sequences are often used in molecular biology to co-express multiple genes under the control of the same promoter, thereby mimicking a polycistronic mRNA. Within the past decades, IRES sequences have been used to develop hundreds of genetically modified rodent animal models. [12] The advantage of this technique is that molecular handling is improved. The problem about IRES is that the expression for each subsequent gene is decreased. [13]
Another viral element to establish polycistronic mRNA in eukaryotes are 2A-peptides. Here, the potential decrease in gene expression and the degree of incomplete separation of proteins is context dependent. [14]
Ribosomes are macromolecular machines, found within all cells, that perform biological protein synthesis. Ribosomes link amino acids together in the order specified by the codons of messenger RNA molecules to form polypeptide chains. Ribosomes consist of two major components: the small and large ribosomal subunits. Each subunit consists of one or more ribosomal RNA molecules and many ribosomal proteins. The ribosomes and associated molecules are also known as the translational apparatus.
The 5′ untranslated region is the region of a messenger RNA (mRNA) that is directly upstream from the initiation codon. This region is important for the regulation of translation of a transcript by differing mechanisms in viruses, prokaryotes and eukaryotes. While called untranslated, the 5′ UTR or a portion of it is sometimes translated into a protein product. This product can then regulate the translation of the main coding sequence of the mRNA. In many organisms, however, the 5′ UTR is completely untranslated, instead forming a complex secondary structure to regulate translation.
The Shine–Dalgarno (SD) sequence is a ribosomal binding site in bacterial and archaeal messenger RNA, generally located around 8 bases upstream of the start codon AUG. The RNA sequence helps recruit the ribosome to the messenger RNA (mRNA) to initiate protein synthesis by aligning the ribosome with the start codon. Once recruited, tRNA may add amino acids in sequence as dictated by the codons, moving downstream from the translational start site.
Eukaryotic translation is the biological process by which messenger RNA is translated into proteins in eukaryotes. It consists of four phases: initiation, elongation, termination, and recapping.
Initiation factors are proteins that bind to the small subunit of the ribosome during the initiation of translation, a part of protein biosynthesis.
Eukaryotic initiation factors (eIFs) are proteins or protein complexes involved in the initiation phase of eukaryotic translation. These proteins help stabilize the formation of ribosomal preinitiation complexes around the start codon and are an important input for post-transcription gene regulation. Several initiation factors form a complex with the small 40S ribosomal subunit and Met-tRNAiMet called the 43S preinitiation complex. Additional factors of the eIF4F complex recruit the 43S PIC to the five-prime cap structure of the mRNA, from which the 43S particle scans 5'-->3' along the mRNA to reach an AUG start codon. Recognition of the start codon by the Met-tRNAiMet promotes gated phosphate and eIF1 release to form the 48S preinitiation complex, followed by large 60S ribosomal subunit recruitment to form the 80S ribosome. There exist many more eukaryotic initiation factors than prokaryotic initiation factors, reflecting the greater biological complexity of eukaryotic translation. There are at least twelve eukaryotic initiation factors, composed of many more polypeptides, and these are described below.
4EGI-1 is a synthetic chemical compound which has been found to interfere with the growth of certain types of cancer cells in vitro. Its mechanism of action involves interruption of the binding of cellular initiation factor proteins involved in the translation of transcribed mRNA at the ribosome. The inhibition of these initiation factors prevents the initiation and translation of many proteins whose functions are essential to the rapid growth and proliferation of cancer cells.
This family represents the internal ribosome entry site (IRES) of the hepatitis A virus. HAV IRES is a 450 nucleotide long sequence located in the 735 nt long 5’ UTR of Hepatitis A viral RNA genome. IRES elements allow cap and end-independent translation of mRNA in the host cell. The IRES achieves this by mediating the internal initiation of translation by recruiting a ribosomal 40S pre-initiation complex directly to the initiation codon and eliminates the requirement for eukaryotic initiation factor, eIF4F.
The Hepatitis C virus internal ribosome entry site, or HCV IRES, is an RNA structure within the 5'UTR of the HCV genome that mediates cap-independent translation initiation.
A ribosome binding site, or ribosomal binding site (RBS), is a sequence of nucleotides upstream of the start codon of an mRNA transcript that is responsible for the recruitment of a ribosome during the initiation of translation. Mostly, RBS refers to bacterial sequences, although internal ribosome entry sites (IRES) have been described in mRNAs of eukaryotic cells or viruses that infect eukaryotes. Ribosome recruitment in eukaryotes is generally mediated by the 5' cap present on eukaryotic mRNAs.
Rotavirus translation, the process of translating mRNA into proteins, occurs in a different way in Rotaviruses. Unlike the vast majority of cellular proteins in other organisms, in Rotaviruses the proteins are translated from capped but nonpolyadenylated mRNAs. The viral nonstructural protein NSP3 specifically binds the 3'-end consensus sequence of viral mRNAs and interacts with the eukaryotic translation initiation factor eIF4G. The Rotavirus replication cycle occurs entirely in the cytoplasm. Upon virus entry, the viral transcriptase synthesizes capped but nonpolyadenylated mRNA The viral mRNAs bear 5' and 3' untranslated regions (UTR) of variable length and are flanked by two different sequences common to all genes.
Eukaryotic translation initiation factor 4 gamma 1 is a protein that in humans is encoded by the EIF4G1 gene.
Eukaryotic translation initiation factor 4 gamma 3 is a protein that in humans is encoded by the EIF4G3 gene. The gene encodes a protein that functions in translation by aiding the assembly of the ribosome onto the messenger RNA template. Confusingly, this protein is usually referred to as eIF4GII, as although EIF4G3 is the third gene that is similar to eukaryotic translation initiation factor 4 gamma, the second isoform EIF4G2 is not an active translation initiation factor.
Eukaryotic initiation factor 4A-I is a 46 kDa cytosolic protein that, in humans, is encoded by the EIF4A1 gene, which is located on chromosome 17. It is the most prevalent member of the eIF4A family of ATP-dependant RNA helicases, and plays a critical role in the initiation of cap-dependent eukaryotic protein translation as a component of the eIF4F translation initiation complex. eIF4A1 unwinds the secondary structure of RNA within the 5'-UTR of mRNA, a critical step necessary for the recruitment of the 43S preinitiation complex, and thus the translation of protein in eukaryotes. It was first characterized in 1982 by Grifo, et al., who purified it from rabbit reticulocyte lysate.
40S ribosomal protein S25 (eS25) is a protein that in humans is encoded by the RPS25 gene.
Eukaryotic translation initiation factor 1 (eIF1) is a protein that in humans is encoded by the EIF1 gene. It is related to yeast SUI1.
Eukaryotic translation initiation factor 4 G (eIF4G) is a protein involved in eukaryotic translation initiation and is a component of the eIF4F cap-binding complex. Orthologs of eIF4G have been studied in multiple species, including humans, yeast, and wheat. However, eIF4G is exclusively found in domain Eukarya, and not in domains Bacteria or Archaea, which do not have capped mRNA. As such, eIF4G structure and function may vary between species, although the human EIF4G1 has been the focus of extensive studies.
The eukaryotic initiation factor-4A (eIF4A) family consists of 3 closely related proteins EIF4A1, EIF4A2, and EIF4A3. These factors are required for the binding of mRNA to 40S ribosomal subunits. In addition these proteins are helicases that function to unwind double-stranded RNA.
Translational regulation refers to the control of the levels of protein synthesized from its mRNA. This regulation is vastly important to the cellular response to stressors, growth cues, and differentiation. In comparison to transcriptional regulation, it results in much more immediate cellular adjustment through direct regulation of protein concentration. The corresponding mechanisms are primarily targeted on the control of ribosome recruitment on the initiation codon, but can also involve modulation of peptide elongation, termination of protein synthesis, or ribosome biogenesis. While these general concepts are widely conserved, some of the finer details in this sort of regulation have been proven to differ between prokaryotic and eukaryotic organisms.
Eukaryotic initiation factor 4F (eIF4F) is a heterotrimeric protein complex that binds the 5' cap of messenger RNAs (mRNAs) to promote eukaryotic translation initiation. The eIF4F complex is composed of three non-identical subunits: the DEAD-box RNA helicase eIF4A, the cap-binding protein eIF4E, and the large "scaffold" protein eIF4G. The mammalian eIF4F complex was first described in 1983, and has been a major area of study into the molecular mechanisms of cap-dependent translation initiation ever since.