Male gamete fusion factor | |
---|---|
Identifiers | |
Symbol | HAP2-GCS1 |
Pfam | PF10699 |
InterPro | IPR018928 |

HAP2 (hapless 2), also known as GCS1 (generative cell-specific protein 1), is a family of membrane fusion proteins found in the sperm cells of diverse eukaryotes, including Toxoplasma, thale cress, and fruit flies. This protein is essential for gamete fusion, and therefore for fertilization, in these organisms. [1]
By eukaryotic taxonomy, it is found in:
For a full list of organisms in which it occurs, consult the taxonomy listing for InterPro entry IPR018928.
HAP2/GCS1 belongs to the broader group of fusexins, which also includes the C. elegans developmental fusogens EFF-1 and AFF-1, the haloarchaeal protein Fsx1, and viral "class II" fusion proteins. [5] In an unrooted phylogenetic tree from 2021, HAP2/GCS1 and EFF-1/AFF-1 occupy opposite ends of the tree, with viral sequences in between; this suggests that the two families may have been acquired separately. [6] A more recent structure-based unrooted phylogenetic tree by Brukman et al. (2022), which takes into account the newly discovered archaeal sequences, shows that Fsx1 groups with HAP2/GCS1 and that both are separated from EFF-1 by a number of viral sequences. Depending on where the root is placed, different hypotheses can be generated about the history of these families, in particular about their horizontal transfer and vertical inheritance. [5] Older comparisons that excluded archaeal sequences strongly favored an interpretation in which HAP2/GCS1 was acquired from a virus, [1] but the grouping of Fsx1 with HAP2/GCS1 allows for a much more ancient source. [5]