Membrane protein | |||||||||
---|---|---|---|---|---|---|---|---|---|
Identifiers | |||||||||
Symbol | CoV_M | ||||||||
Pfam | PF01635 | ||||||||
InterPro | IPR002574 | ||||||||
PROSITE | PS51927 | ||||||||
|
The membrane (M) protein (previously called E1, sometimes also matrix protein [2] ) is an integral membrane protein that is the most abundant of the four major structural proteins found in coronaviruses. [3] [4] [5] The M protein organizes the assembly of coronavirus virions through protein-protein interactions with other M protein molecules as well as with the other three structural proteins, the envelope (E), spike (S), and nucleocapsid (N) proteins. [4] [6] [7] [8]
The M protein is a transmembrane protein with three transmembrane domains and is around 230 amino acid residues long. [8] [9] In SARS-CoV-2, the causative agent of COVID-19, the M protein is 222 residues long. [10] Its membrane topology orients the C-terminus toward the cytosolic face of the membrane and thus into the interior of the virion. It has a short N-terminal segment and a larger C-terminal domain. Although the protein sequence is not well conserved across all coronavirus groups, there is a conserved amphipathic region near the C-terminal end of the third transmembrane segment. [8] [9]
M functions as a homodimer. [4] [5] Studies of the M protein in multiple coronaviruses by cryo-electron microscopy have identified two distinct functional protein conformations, thought to have different roles in forming protein-protein interactions with other structural proteins. [5] M protein of SARS-CoV-2 is homologous to the prokaryotic sugar transport protein SemiSWEET. [11]
M is a glycoprotein whose glycosylation varies according to coronavirus subgroup; N-linked glycosylation is typically found in the alpha and gamma groups while O-linked glycosylation is typically found in the beta group. [8] [9] There are some exceptions; for example, in SARS-CoV, a betacoronavirus, the M protein has one N-glycosylation site. [8] [6] Glycosylation state does not appear to have a measurable effect on viral growth. [6] [9] [12] No other post-translational modifications have been described for the M protein. [4]
NCBI genome ID | 86693 |
---|---|
Genome size | 29,903 bases |
Year of completion | 2020 |
Genome browser (UCSC) |
The gene encoding the M protein is located toward the 3' end of the virus's positive-sense RNA genome, along with the genes for the other three structural proteins and various virus-specific accessory proteins. [6] [8] M is translated by membrane-bound polysomes [6] to be inserted into the endoplasmic reticulum (ER) and trafficked to the endoplasmic reticulum-Golgi intermediate compartment (ERGIC), the intracellular compartment that gives rise to the coronavirus viral envelope, or to the Golgi apparatus. [8] [7] [6] The exact localization is dependent on the specific virus protein. [13] Investigations of the subcellular localization of the MERS-CoV M protein found C-terminal sequence signals associated with trafficking to the Golgi. [14]
The M protein is the most abundant protein in coronavirus virions. [8] [5] [4] It is essential for viral replication. [4]
The primary function of the M protein is organizing assembly of new virions. [4] It is involved in establishing viral shape and morphology. Individual M molecules interact with each other to form the viral envelope [7] [9] [8] and may be able to exclude host cell proteins from the viral membrane. [5] Studies of the SARS-CoV M protein suggest that M-M interactions involve both the N- and C-termini. [6] Coronaviruses are moderately pleomorphic and conformational variations of M appear to be associated with virion size. [5]
M forms protein-protein interactions with all three other major structural proteins. [4] [7] M is necessary but not sufficient for viral assembly; M and the E protein expressed together are reportedly sufficient to form virus-like particles, [7] though some reports vary depending on experimental conditions and the specific virus studied. [6] [13] In some reports M appears to be capable of inducing membrane curvature, [5] though others report M alone is insufficient for this and E is required. [7] Although the E protein is not necessarily essential, it appears to be required for normal viral morphology and may be responsible for establishing curvature or initiating viral budding. [7] M also appears to have functional roles in the later stages of viral maturation, secretion, and budding. [4]
Incorporation of the spike protein (S) - which is required for assembly of infectious virions - is reported to occur though M interactions and may depend on specific conformations of M. [5] [13] The conserved amphipathic region C-terminal to the third transmembrane segment is important for spike interactions. [13] Interactions with M appear to be required for correct subcellular localization of S at the viral budding site. [12] M interacts directly with the nucleocapsid (N) protein without requiring the presence of RNA. [6] This interaction appears to occur primarily through both proteins' C-termini. [4]
The M protein in MERS-CoV, SARS-CoV, and SARS-CoV-2 has been described as an antagonist of interferon response. [4] [17]
The M protein is immunogenic and has been reported to be a determinant of humoral immunity. [4] Cytotoxic T cell responses to M have been described. [16] Antibodies to epitopes found in the M protein have been identified in patients recovered from severe acute respiratory syndrome (SARS). [18]
Other recent research has identified that SAS-COV-2 membrane protein when treated on human PBMC's causes a significant increase in pro inflammatory mediators such as TNF and IL-6. [19] . The effects of exogenous SARS-COV-2 membrane protein challenge in mice was also studied. In these studies, exogenous membrane protein treated intra nasally caused a significant increase in pulmonary inflammation in mice leading to histological changes within the lungs. [20] .
It has been reported that human coronavirus NL63 relies on the M protein as well as the S protein to mediate host cell interactions preceding viral entry. M is thought to bind heparan sulfate proteoglycans exposed on the cell surface. [21]
A study of SARS-CoV-2 sequences collected during the COVID-19 pandemic found that missense mutations in the M gene were relatively uncommon and suggested it was under purifying selection. [22] Similar results have been described for broader population genetics analyses over a wider range of related viruses, finding that the sequences of M and several non-structural proteins in the coronavirus genome are most subject to evolutionary constraints. [23]
Coronaviruses are a group of related RNA viruses that cause diseases in mammals and birds. In humans and birds, they cause respiratory tract infections that can range from mild to lethal. Mild illnesses in humans include some cases of the common cold, while more lethal varieties can cause SARS, MERS and COVID-19. In cows and pigs they cause diarrhea, while in mice they cause hepatitis and encephalomyelitis.
Severe-acute-respiratory-syndrome–related coronavirus is a species of virus consisting of many known strains. Two strains of the virus have caused outbreaks of severe respiratory diseases in humans: severe acute respiratory syndrome coronavirus 1, which caused the 2002–2004 outbreak of severe acute respiratory syndrome (SARS), and severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which is causing the ongoing pandemic of COVID-19. There are hundreds of other strains of SARSr-CoV, which are only known to infect non-human mammal species: bats are a major reservoir of many strains of SARSr-CoV; several strains have been identified in Himalayan palm civets, which were likely ancestors of SARS-CoV-1.
Coronaviridae is a family of enveloped, positive-strand RNA viruses which infect amphibians, birds, and mammals. The group includes the subfamilies Letovirinae and Orthocoronavirinae; the members of the latter are known as coronaviruses.
Human coronavirus NL63 (HCoV-NL63) is a species of coronavirus, specifically a Setracovirus from among the Alphacoronavirus genus. It was identified in late 2004 in patients in the Netherlands by Lia van der Hoek and Krzysztof Pyrc using a novel virus discovery method VIDISCA. Later on the discovery was confirmed by the researchers from the Rotterdam, the Netherlands The virus is an enveloped, positive-sense, single-stranded RNA virus which enters its host cell by binding to ACE2. Infection with the virus has been confirmed worldwide, and has an association with many common symptoms and diseases. Associated diseases include mild to moderate upper respiratory tract infections, severe lower respiratory tract infection, croup and bronchiolitis.
In virology, a spike protein or peplomer protein is a protein that forms a large structure known as a spike or peplomer projecting from the surface of an enveloped virus. The proteins are usually glycoproteins that form dimers or trimers.
A viral structural protein is a viral protein that is a structural component of the mature virus.
Feline coronavirus (FCoV) is a positive-stranded RNA virus that infects cats worldwide. It is a coronavirus of the species Alphacoronavirus 1, which includes canine coronavirus (CCoV) and porcine transmissible gastroenteritis coronavirus (TGEV). FCoV has two different forms: feline enteric coronavirus (FECV), which infects the intestines, and feline infectious peritonitis virus (FIPV), which causes the disease feline infectious peritonitis (FIP).
ORF7a is a gene found in coronaviruses of the Betacoronavirus genus. It expresses the Betacoronavirus NS7A protein, a type I transmembrane protein with an immunoglobulin-like protein domain. It was first discovered in SARS-CoV, the virus that causes severe acute respiratory syndrome (SARS). The homolog in SARS-CoV-2, the virus that causes COVID-19, has about 85% sequence identity to the SARS-CoV protein.
Human coronavirus HKU1 (HCoV-HKU1) is a species of coronavirus in humans and animals. It causes an upper respiratory disease with symptoms of the common cold, but can advance to pneumonia and bronchiolitis. It was first discovered in January 2004 from one man in Hong Kong. Subsequent research revealed it has global distribution and earlier genesis.
Betacoronavirus is one of four genera of coronaviruses. Member viruses are enveloped, positive-strand RNA viruses that infect mammals, including humans. The natural reservoir for betacoronaviruses are bats and rodents. Rodents are the reservoir for the subgenus Embecovirus, while bats are the reservoir for the other subgenera.
Human coronavirus 229E (HCoV-229E) is a species of coronavirus which infects humans and bats. It is an enveloped, positive-sense, single-stranded RNA virus which enters its host cell by binding to the APN receptor. Along with Human coronavirus OC43, it is one of the viruses responsible for the common cold. HCoV-229E is a member of the genus Alphacoronavirus and subgenus Duvinacovirus.
Bat SARS-like coronavirus WIV1, also sometimes called SARS-like coronavirus WIV1, is a strain of severe acute respiratory syndrome–related coronavirus (SARSr-CoV) isolated from Chinese rufous horseshoe bats in 2013. Like all coronaviruses, virions consist of single-stranded positive-sense RNA enclosed within an envelope.
The envelope (E) protein is the smallest and least well-characterized of the four major structural proteins found in coronavirus virions. It is an integral membrane protein less than 110 amino acid residues long; in SARS-CoV-2, the causative agent of Covid-19, the E protein is 75 residues long. Although it is not necessarily essential for viral replication, absence of the E protein may produce abnormally assembled viral capsids or reduced replication. E is a multifunctional protein and, in addition to its role as a structural protein in the viral capsid, it is thought to be involved in viral assembly, likely functions as a viroporin, and is involved in viral pathogenesis.
The nucleocapsid (N) protein is a protein that packages the positive-sense RNA genome of coronaviruses to form ribonucleoprotein structures enclosed within the viral capsid. The N protein is the most highly expressed of the four major coronavirus structural proteins. In addition to its interactions with RNA, N forms protein-protein interactions with the coronavirus membrane protein (M) during the process of viral assembly. N also has additional functions in manipulating the cell cycle of the host cell. The N protein is highly immunogenic and antibodies to N are found in patients recovered from SARS and COVID-19.
Spike (S) glycoprotein is the largest of the four major structural proteins found in coronaviruses. The spike protein assembles into trimers that form large structures, called spikes or peplomers, that project from the surface of the virion. The distinctive appearance of these spikes when visualized using negative stain transmission electron microscopy, "recalling the solar corona", gives the virus family its main name.
ORF3a is a gene found in coronaviruses of the subgenus Sarbecovirus, including SARS-CoV and SARS-CoV-2. It encodes an accessory protein about 275 amino acid residues long, which is thought to function as a viroporin. It is the largest accessory protein and was the first of the SARS-CoV accessory proteins to be described.
ORF7b is a gene found in coronaviruses of the genus Betacoronavirus, which expresses the accessory protein Betacoronavirus NS7b protein. It is a short, highly hydrophobic transmembrane protein of unknown function.
ORF8 is a gene that encodes a viral accessory protein, Betacoronavirus NS8 protein, in coronaviruses of the subgenus Sarbecovirus. It is one of the least well conserved and most variable parts of the genome. In some viruses, a deletion splits the region into two smaller open reading frames, called ORF8a and ORF8b - a feature present in many SARS-CoV viral isolates from later in the SARS epidemic, as well as in some bat coronaviruses. For this reason the full-length gene and its protein are sometimes called ORF8ab. The full-length gene, exemplified in SARS-CoV-2, encodes a protein with an immunoglobulin domain of unknown function, possibly involving interactions with the host immune system. It is similar in structure to the ORF7a protein, suggesting it may have originated through gene duplication.
ORF6 is a gene that encodes a viral accessory protein in coronaviruses of the subgenus Sarbecovirus, including SARS-CoV and SARS-CoV-2. It is not present in MERS-CoV. It is thought to reduce the immune system response to viral infection through interferon antagonism.
ORF9b is a gene that encodes a viral accessory protein in coronaviruses of the subgenus Sarbecovirus, including SARS-CoV and SARS-CoV-2. It is an overlapping gene whose open reading frame is entirely contained within the N gene, which encodes coronavirus nucleocapsid protein. The encoded protein is 97 amino acid residues long in SARS-CoV and 98 in SARS-CoV-2, in both cases forming a protein dimer.