Biological dark matter is an informal term for unclassified or poorly understood genetic material. This genetic material may refer to genetic material produced by unclassified microorganisms. By extension, biological dark matter may also refer to the un-isolated microorganisms whose existence can only be inferred from the genetic material that they produce. Some of the genetic material may not fall under the three existing domains of life: Bacteria, Archaea and Eukaryota; thus, it has been suggested that a possible fourth domain of life may yet be discovered, [1] [2] although other explanations are also probable. Alternatively, the genetic material may refer to non-coding DNA (so-called "junk DNA") [3] [4] [5] and non-coding RNA produced by known organisms. [6] [7] [8]
Much of the genomic dark matter is thought to originate from ancient transposable elements and from other low-complexity repetitive elements. [9] [10] Uncategorized genetic material is found in humans and many other species. [1] [11] Their phylogenetic novelty could indicate the cellular organisms or viruses from which they evolved. [12]
Up to 99% of all living microorganisms cannot be cultured, [13] [14] [15] [16] [17] so few functional insights exist about the metabolic potential of these organisms.
Sequences that are believed to be derived from unknown microbes are referred to as the microbial dark matter, [18] the dark virome, [19] or dark matter fungi. [20] Such sequences are not rare. It has been estimated that in material from humans, between 40 and 90% of viral sequences are from dark matter. [21] [22] [23] Human blood contains over three thousand different DNA sequences which cannot yet be identified. [24] A mycological study from 2023 found that dark matter fungi seem to dominate the fungal kingdom. [25]
Algorithms have been developed that examine sequences for similarities to bacterial 16S RNA sequences, [26] K-mer similarities to known viruses, [27] specific features of codon usage, [28] or for inferring the existence of proteins. [29] These approaches have suggested, for example, the existence of a novel bacteriophage of the microviridae family, [29] and a novel bacterioidales-like phage. [30] Other studies have suggested the existence of 264 new viral genera, discovered in publicly available databases, [31] and a study of human blood suggested that 42% of people have at least one previously unknown virus each, adding up to 19 different new genera. [32] A comprehensive study of DNA sequences from multiple human samples inferred the existence of 4,930 species of microbes of which 77% were previously unreported. [33] Health-related findings include a prophage that might be associated with cirrhosis of the liver, [27] and seven novel sequences from children with type-1 diabetes that have characteristics of viruses. [34] Although they might exist, no organisms that clearly cause human disease have been discovered in the dark matter.
In February 2023, scientists reported the findings of unusual DNA strands from the microorganisms in the "dark microbiome" in the driest non-polar desert on Earth. [35] [36]
The human microbiome is the aggregate of all microbiota that reside on or within human tissues and biofluids along with the corresponding anatomical sites in which they reside, including the gastrointestinal tract, skin, mammary glands, seminal fluid, uterus, ovarian follicles, lung, saliva, oral mucosa, conjunctiva, and the biliary tract. Types of human microbiota include bacteria, archaea, fungi, protists, and viruses. Though micro-animals can also live on the human body, they are typically excluded from this definition. In the context of genomics, the term human microbiome is sometimes used to refer to the collective genomes of resident microorganisms; however, the term human metagenome has the same meaning.
Metagenomics is the study of genetic material recovered directly from environmental or clinical samples by a method called sequencing. The broad field may also be referred to as environmental genomics, ecogenomics, community genomics or microbiomics.
Microbial genetics is a subject area within microbiology and genetic engineering. Microbial genetics studies microorganisms for different purposes. The microorganisms that are observed are bacteria and archaea. Some fungi and protozoa are also subjects used to study in this field. The studies of microorganisms involve studies of genotype and expression system. Genotypes are the inherited compositions of an organism. Genetic Engineering is a field of work and study within microbial genetics. The usage of recombinant DNA technology is a process of this work. The process involves creating recombinant DNA molecules through manipulating a DNA sequence. That DNA created is then in contact with a host organism. Cloning is also an example of genetic engineering.
The Human Microbiome Project (HMP) was a United States National Institutes of Health (NIH) research initiative to improve understanding of the microbiota involved in human health and disease. Launched in 2007, the first phase (HMP1) focused on identifying and characterizing human microbiota. The second phase, known as the Integrative Human Microbiome Project (iHMP) launched in 2014 with the aim of generating resources to characterize the microbiome and elucidating the roles of microbes in health and disease states. The program received $170 million in funding by the NIH Common Fund from 2007 to 2016.
Virophages are small, double-stranded DNA viral phages that require the co-infection of another virus. The co-infecting viruses are typically giant viruses. Virophages rely on the viral replication factory of the co-infecting giant virus for their own replication. One of the characteristics of virophages is that they have a parasitic relationship with the co-infecting virus. Their dependence upon the giant virus for replication often results in the deactivation of the giant viruses. The virophage may improve the recovery and survival of the host organism.
Microbiota are the range of microorganisms that may be commensal, mutualistic, or pathogenic found in and on all multicellular organisms, including plants. Microbiota include bacteria, archaea, protists, fungi, and viruses, and have been found to be crucial for immunologic, hormonal, and metabolic homeostasis of their host.
Marine microorganisms are defined by their habitat as microorganisms living in a marine environment, that is, in the saltwater of a sea or ocean or the brackish water of a coastal estuary. A microorganism is any microscopic living organism or virus, which is invisibly small to the unaided human eye without magnification. Microorganisms are very diverse. They can be single-celled or multicellular and include bacteria, archaea, viruses, and most protozoa, as well as some fungi, algae, and animals, such as rotifers and copepods. Many macroscopic animals and plants have microscopic juvenile stages. Some microbiologists also classify viruses as microorganisms, but others consider these as non-living.
The Earth Microbiome Project (EMP) was an initiative founded by Janet Jansson, Jack Gilbert and Rob Knight in 2010 to collect natural samples and analyze microbial life around the globe.
In metagenomics, binning is the process of grouping reads or contigs and assigning them to individual genome. Binning methods can be based on either compositional features or alignment (similarity), or both.
Microbial phylogenetics is the study of the manner in which various groups of microorganisms are genetically related. This helps to trace their evolution. To study these relationships biologists rely on comparative genomics, as physiology and comparative anatomy are not possible methods.
The human virome is the total collection of viruses in and on the human body. Viruses in the human body may infect both human cells and other microbes such as bacteria. Some viruses cause disease, while others may be asymptomatic. Certain viruses are also integrated into the human genome as proviruses or endogenous viral elements.
Viral metagenomics uses metagenomic technologies to detect viral genomic material from diverse environmental and clinical samples. Viruses are the most abundant biological entity and are extremely diverse; however, only a small fraction of viruses have been sequenced and only an even smaller fraction have been isolated and cultured. Sequencing viruses can be challenging because viruses lack a universally conserved marker gene so gene-based approaches are limited. Metagenomics can be used to study and analyze unculturable viruses and has been an important tool in understanding viral diversity and abundance and in the discovery of novel viruses. For example, metagenomics methods have been used to describe viruses associated with cancerous tumors and in terrestrial ecosystems.
A microbiome is the community of microorganisms that can usually be found living together in any given habitat. It was defined more precisely in 1988 by Whipps et al. as "a characteristic microbial community occupying a reasonably well-defined habitat which has distinct physio-chemical properties. The term thus not only refers to the microorganisms involved but also encompasses their theatre of activity". In 2020, an international panel of experts published the outcome of their discussions on the definition of the microbiome. They proposed a definition of the microbiome based on a revival of the "compact, clear, and comprehensive description of the term" as originally provided by Whipps et al., but supplemented with two explanatory paragraphs. The first explanatory paragraph pronounces the dynamic character of the microbiome, and the second explanatory paragraph clearly separates the term microbiota from the term microbiome.
Microbial dark matter (MDM) comprises the vast majority of microbial organisms that microbiologists are unable to culture in the laboratory, due to lack of knowledge or ability to supply the required growth conditions. Microbial dark matter is analogous to the dark matter of physics and cosmology due to its elusiveness in research and importance to our understanding of biological diversity. Microbial dark matter can be found ubiquitously and abundantly across multiple ecosystems, but remains difficult to study due to difficulties in detecting and culturing these species, posing challenges to research efforts. It is difficult to estimate its relative magnitude, but the accepted gross estimate is that as little as one percent of microbial species in a given ecological niche are culturable. In recent years, more effort has been directed towards deciphering microbial dark matter by means of recovering genome DNA sequences from environmental samples via culture independent methods such as single cell genomics and metagenomics. These studies have enabled insights into the evolutionary history and the metabolism of the sequenced genomes, providing valuable knowledge required for the cultivation of microbial dark matter lineages. However, microbial dark matter research remains comparatively undeveloped and is hypothesized to provide insight into processes radically different from known biology, new understandings of microbial communities, and increasing understanding of how life survives in extreme environments.
Virome refers to the assemblage of viruses that is often investigated and described by metagenomic sequencing of viral nucleic acids that are found associated with a particular ecosystem, organism or holobiont. The word is frequently used to describe environmental viral shotgun metagenomes. Viruses, including bacteriophages, are found in all environments, and studies of the virome have provided insights into nutrient cycling, development of immunity, and a major source of genes through lysogenic conversion. Also, the human virome has been characterized in nine organs of 31 Finnish individuals using qPCR and NGS methodologies.
Pharmacomicrobiomics, proposed by Prof. Marco Candela for the ERC-2009-StG project call, and publicly coined for the first time in 2010 by Rizkallah et al., is defined as the effect of microbiome variations on drug disposition, action, and toxicity. Pharmacomicrobiomics is concerned with the interaction between xenobiotics, or foreign compounds, and the gut microbiome. It is estimated that over 100 trillion prokaryotes representing more than 1000 species reside in the gut. Within the gut, microbes help modulate developmental, immunological and nutrition host functions. The aggregate genome of microbes extends the metabolic capabilities of humans, allowing them to capture nutrients from diverse sources. Namely, through the secretion of enzymes that assist in the metabolism of chemicals foreign to the body, modification of liver and intestinal enzymes, and modulation of the expression of human metabolic genes, microbes can significantly impact the ingestion of xenobiotics.
Auxiliary metabolic genes (AMGs) are found in many bacteriophages but originated in bacterial cells. AMGs modulate host cell metabolism during infection so that the phage can replicate more efficiently. For instance, bacteriophages that infect the abundant marine cyanobacteria Synechococcus and Prochlorococcus (cyanophages) carry AMGs that have been acquired from their immediate host as well as more distantly-related bacteria. Cyanophage AMGs support a variety of functions including photosynthesis, carbon metabolism, nucleic acid synthesis and metabolism. AMGs also have broader ecological impacts beyond their host including their influence on biogeochemical cycling.
Nikos Kyrpides is a Greek-American bioscientist who has worked on the origins of life, information processing, bioinformatics, microbiology, metagenomics and microbiome data science. He is a senior staff scientist at the Berkeley National Laboratory, head of the Prokaryote Super Program and leads the Microbiome Data Science program at the US Department of Energy Joint Genome Institute.
Marine viruses are defined by their habitat as viruses that are found in marine environments, that is, in the saltwater of seas or oceans or the brackish water of coastal estuaries. Viruses are small infectious agents that can only replicate inside the living cells of a host organism, because they need the replication machinery of the host to do so. They can infect all types of life forms, from animals and plants to microorganisms, including bacteria and archaea.
Virosphere was coined to refer to all those places in which viruses are found or which are affected by viruses. However, more recently virosphere has also been used to refer to the pool of viruses that occurs in all hosts and all environments, as well as viruses associated with specific types of hosts, type of genome or ecological niche.