Microbiome-wide association study

Last updated

A microbiome-wide association study (MWAS), otherwise known as a metagenome-wide association study (MGWAS), is a statistical methodology used to examine the full metagenome of a defined microbiome in various organisms to determine if some feature (as example, gene or species) of the microbiome is associated with a host trait. MWAS has been adopted by the field of metagenomics from the widely used genome-wide association study (GWAS).

Results of microbiome wide association analyses using single-OTU regression method between operational taxonomic units and residual feed intake (RFI) and feed conversion ratio (FCR). In the plots, the solid and dashed lines represent significance and suggestive significance at 5 and 10% family-wise type I error rates, respectively Manhattan Plot of Microbiome Wide Association Plot.webp
Results of microbiome wide association analyses using single-OTU regression method between operational taxonomic units and residual feed intake (RFI) and feed conversion ratio (FCR). In the plots, the solid and dashed lines represent significance and suggestive significance at 5 and 10% family-wise type I error rates, respectively

While MWAS is phonetically and conceptually tied to GWAS there are several key differentiations:

There are several ways to classify which feature of the microbiome will be used in a MWAS. MWAS can be assessed using a specific taxonomic level (species, genus, [8] phyla, etc.), operational taxonomic unit (OTU) [1] or amplicon sequence variant (ASV), transcriptome, [9] proteome, [10] and more. The approach used depends upon the research hypothesis as each method will often give differing results.

Often, a taxonomic level or OTU/ASV based approach is used to determine the correlations between the specific microbiome feature and the desired phenotype. Several methods can be employed, such as machine learning approaches like random forests, [11] and deep learning. [12] Feature association can also be established with programs like DESeq2 and ANCOM. However, correlations established by the wide array of tools available may not always translate into causality. Researchers determine causality through sequential testing. [13] Newer methods have explored inference of digital twins of microbial ecosystem to address some modeling challenges arising from the diversity of microbes in such environments, inter-host variability, and compositionality of measurements. [14]

Related Research Articles

<span class="mw-page-title-main">Human microbiome</span> Microorganisms in or on human skin and biofluids

The human microbiome is the aggregate of all microbiota that reside on or within human tissues and biofluids along with the corresponding anatomical sites in which they reside, including the gastrointestinal tract, skin, mammary glands, seminal fluid, uterus, ovarian follicles, lung, saliva, oral mucosa, conjunctiva, and the biliary tract. Types of human microbiota include bacteria, archaea, fungi, protists, and viruses. Though micro-animals can also live on the human body, they are typically excluded from this definition. In the context of genomics, the term human microbiome is sometimes used to refer to the collective genomes of resident microorganisms; however, the term human metagenome has the same meaning.

<span class="mw-page-title-main">Metagenomics</span> Study of genes found in the environment

Metagenomics is the study of genetic material recovered directly from environmental or clinical samples by a method called sequencing. The broad field may also be referred to as environmental genomics, ecogenomics, community genomics or microbiomics.

<span class="mw-page-title-main">Gut microbiota</span> Community of microorganisms in the gut

Gut microbiota, gut microbiome, or gut flora are the microorganisms, including bacteria, archaea, fungi, and viruses, that live in the digestive tracts of animals. The gastrointestinal metagenome is the aggregate of all the genomes of the gut microbiota. The gut is the main location of the human microbiome. The gut microbiota has broad impacts, including effects on colonization, resistance to pathogens, maintaining the intestinal epithelium, metabolizing dietary and pharmaceutical compounds, controlling immune function, and even behavior through the gut–brain axis.

<span class="mw-page-title-main">Integrated Microbial Genomes System</span> Genome browsing and annotation platform

The Integrated Microbial Genomes system is a genome browsing and annotation platform developed by the U.S. Department of Energy (DOE)-Joint Genome Institute. IMG contains all the draft and complete microbial genomes sequenced by the DOE-JGI integrated with other publicly available genomes. IMG provides users a set of tools for comparative analysis of microbial genomes along three dimensions: genes, genomes and functions. Users can select and transfer them in the comparative analysis carts based upon a variety of criteria. IMG also includes a genome annotation pipeline that integrates information from several tools, including KEGG, Pfam, InterPro, and the Gene Ontology, among others. Users can also type or upload their own gene annotations and the IMG system will allow them to generate Genbank or EMBL format files containing these annotations.

Jeffrey Ivan Gordon is a biologist and the Dr. Robert J. Glaser Distinguished University Professor and Director of the Center for Genome Sciences and Systems Biology at Washington University in St. Louis. He is internationally known for his research on gastrointestinal development and how gut microbial communities affect normal intestinal function, shape various aspects of human physiology including our nutritional status, and affect predisposition to diseases. He is a member of the National Academy of Sciences, the American Academy of Arts and Sciences, the Institute of Medicine of the National Academies, and the American Philosophical Society.

<span class="mw-page-title-main">Human Microbiome Project</span> Former research initiative

The Human Microbiome Project (HMP) was a United States National Institutes of Health (NIH) research initiative to improve understanding of the microbiota involved in human health and disease. Launched in 2007, the first phase (HMP1) focused on identifying and characterizing human microbiota. The second phase, known as the Integrative Human Microbiome Project (iHMP) launched in 2014 with the aim of generating resources to characterize the microbiome and elucidating the roles of microbes in health and disease states. The program received $170 million in funding by the NIH Common Fund from 2007 to 2016.

<span class="mw-page-title-main">Microbiota</span> Community of microorganisms

Microbiota are the range of microorganisms that may be commensal, mutualistic, or pathogenic found in and on all multicellular organisms, including plants. Microbiota include bacteria, archaea, protists, fungi, and viruses, and have been found to be crucial for immunologic, hormonal, and metabolic homeostasis of their host.

Prevotella is a genus of Gram-negative bacteria.

Metaproteomics is an umbrella term for experimental approaches to study all proteins in microbial communities and microbiomes from environmental sources. Metaproteomics is used to classify experiments that deal with all proteins identified and quantified from complex microbial communities. Metaproteomics approaches are comparable to gene-centric environmental genomics, or metagenomics.

Biological dark matter is an informal term for unclassified or poorly understood genetic material. This genetic material may refer to genetic material produced by unclassified microorganisms. By extension, biological dark matter may also refer to the un-isolated microorganism whose existence can only be inferred from the genetic material that they produce. Some of the genetic material may not fall under the three existing domains of life: Bacteria, Archaea and Eukaryota; thus, it has been suggested that a possible fourth domain of life may yet be discovered, although other explanations are also probable. Alternatively, the genetic material may refer to non-coding DNA and non-coding RNA produced by known organisms.

Microbial phylogenetics is the study of the manner in which various groups of microorganisms are genetically related. This helps to trace their evolution. To study these relationships biologists rely on comparative genomics, as physiology and comparative anatomy are not possible methods.

<span class="mw-page-title-main">Human virome</span> Total collection of viruses in and on the human body

The human virome is the total collection of viruses in and on the human body. Viruses in the human body may infect both human cells and other microbes such as bacteria. Some viruses cause disease, while others may be asymptomatic. Certain viruses are also integrated into the human genome as proviruses or endogenous viral elements.

<span class="mw-page-title-main">CrAssphage</span>

CrAss-like phage are a bacteriophage family that was discovered in 2014 by cross assembling reads in human fecal metagenomes. In silico comparative genomics and taxonomic analysis have found that crAss-like phages represent a highly abundant and diverse family of viruses. CrAss-like phage were predicted to infect bacteria of the Bacteroidota phylum and the prediction was later confirmed when the first crAss-like phage (crAss001) was isolated on a Bacteroidota host in 2018. The presence of crAss-like phage in the human gut microbiota is not yet associated with any health condition.

<span class="mw-page-title-main">Microbiome</span> Microbial community assemblage and activity

A microbiome is the community of microorganisms that can usually be found living together in any given habitat. It was defined more precisely in 1988 by Whipps et al. as "a characteristic microbial community occupying a reasonably well-defined habitat which has distinct physio-chemical properties. The term thus not only refers to the microorganisms involved but also encompasses their theatre of activity". In 2020, an international panel of experts published the outcome of their discussions on the definition of the microbiome. They proposed a definition of the microbiome based on a revival of the "compact, clear, and comprehensive description of the term" as originally provided by Whipps et al., but supplemented with two explanatory paragraphs. The first explanatory paragraph pronounces the dynamic character of the microbiome, and the second explanatory paragraph clearly separates the term microbiota from the term microbiome.

Metatranscriptomics is the set of techniques used to study gene expression of microbes within natural environments, i.e., the metatranscriptome.

<span class="mw-page-title-main">Wang Jun (scientist)</span> Chinese scientist

Wang Jun is a Chinese scientist, founder and CEO of iCarbonX, and former CEO of the Beijing Genomics Institute.

Hadesarchaea, formerly called the South-African Gold Mine Miscellaneous Euryarchaeal Group, are a class of thermophile microorganisms that have been found in deep mines, hot springs, marine sediments, and other subterranean environments.

<span class="mw-page-title-main">Pharmacomicrobiomics</span>

Pharmacomicrobiomics, proposed by Prof. Marco Candela for the ERC-2009-StG project call, and publicly coined for the first time in 2010 by Rizkallah et al., is defined as the effect of microbiome variations on drug disposition, action, and toxicity. Pharmacomicrobiomics is concerned with the interaction between xenobiotics, or foreign compounds, and the gut microbiome. It is estimated that over 100 trillion prokaryotes representing more than 1000 species reside in the gut. Within the gut, microbes help modulate developmental, immunological and nutrition host functions. The aggregate genome of microbes extends the metabolic capabilities of humans, allowing them to capture nutrients from diverse sources. Namely, through the secretion of enzymes that assist in the metabolism of chemicals foreign to the body, modification of liver and intestinal enzymes, and modulation of the expression of human metabolic genes, microbes can significantly impact the ingestion of xenobiotics.

Katherine Snowden Pollard is the Director of the Gladstone Institute of Data Science and Biotechnology and a professor at the University of California, San Francisco (UCSF). She is a Chan Zuckerberg Biohub Investigator. She was awarded Fellowship of the International Society for Computational Biology in 2020 and the American Institute for Medical and Biological Engineering in 2021 for outstanding contributions to computational biology and bioinformatics.

Trevor Lawley FMedSci is a Faculty member and Group Leader in the Host-Microbiota Interactions Lab at the Wellcome Sanger Institute (WSI). He is also co-founder and Chief Scientific Officer of the biotech company Microbiotica.

References

  1. 1 2 Aliakbari, Amir; Zemb, Olivier; Cauquil, Laurent; Barilly, Céline; Billon, Yvon; Gilbert, Hélène (2022-04-25). "Microbiability and microbiome-wide association analyses of feed efficiency and performance traits in pigs". Genetics Selection Evolution. 54 (1): 29. doi: 10.1186/s12711-022-00717-7 . ISSN   1297-9686. PMC   9036775 . PMID   35468740.
  2. MetaHIT Consortium; Qin, Junjie; Li, Ruiqiang; Raes, Jeroen; Arumugam, Manimozhiyan; Burgdorf, Kristoffer Solvsten; Manichanh, Chaysavanh; Nielsen, Trine; Pons, Nicolas; Levenez, Florence; Yamada, Takuji; Mende, Daniel R.; Li, Junhua; Xu, Junming; Li, Shaochuan (2010-03-04). "A human gut microbial gene catalogue established by metagenomic sequencing". Nature. 464 (7285): 59–65. Bibcode:2010Natur.464...59.. doi:10.1038/nature08821. ISSN   0028-0836. PMC   3779803 . PMID   20203603.
  3. Gilbert, Jack A.; Quinn, Robert A.; Debelius, Justine; Xu, Zhenjiang Z.; Morton, James; Garg, Neha; Jansson, Janet K.; Dorrestein, Pieter C.; Knight, Rob (2016-07-07). "Microbiome-wide association studies link dynamic microbial consortia to disease". Nature. 535 (7610): 94–103. Bibcode:2016Natur.535...94G. doi:10.1038/nature18850. ISSN   0028-0836. PMID   27383984. S2CID   4455429.
  4. Koenig, Jeremy E.; Spor, Aymé; Scalfone, Nicholas; Fricker, Ashwana D.; Stombaugh, Jesse; Knight, Rob; Angenent, Largus T.; Ley, Ruth E. (2011-03-15). "Succession of microbial consortia in the developing infant gut microbiome". Proceedings of the National Academy of Sciences. 108 (supplement_1): 4578–4585. Bibcode:2011PNAS..108.4578K. doi: 10.1073/pnas.1000081107 . ISSN   0027-8424. PMC   3063592 . PMID   20668239.
  5. Vandeputte, Doris; De Commer, Lindsey; Tito, Raul Y.; Kathagen, Gunter; Sabino, João; Vermeire, Séverine; Faust, Karoline; Raes, Jeroen (2021-11-18). "Temporal variability in quantitative human gut microbiome profiles and implications for clinical research". Nature Communications. 12 (1): 6740. Bibcode:2021NatCo..12.6740V. doi:10.1038/s41467-021-27098-7. ISSN   2041-1723. PMC   8602282 . PMID   34795283.
  6. Gloor, Gregory B.; Macklaim, Jean M.; Pawlowsky-Glahn, Vera; Egozcue, Juan J. (2017-11-15). "Microbiome Datasets Are Compositional: And This Is Not Optional". Frontiers in Microbiology. 8: 2224. doi: 10.3389/fmicb.2017.02224 . ISSN   1664-302X. PMC   5695134 . PMID   29187837.
  7. Aitchison, John; Greenacre, Michael (2002-10-08). "Biplots of compositional data". Journal of the Royal Statistical Society Series C: Applied Statistics. 51 (4): 375–392. doi:10.1111/1467-9876.00275. hdl: 10230/1046 . ISSN   0035-9254.
  8. Davenport, Emily R.; Cusanovich, Darren A.; Michelini, Katelyn; Barreiro, Luis B.; Ober, Carole; Gilad, Yoav (2015-11-03). White, Bryan A (ed.). "Genome-Wide Association Studies of the Human Gut Microbiota". PLOS ONE. 10 (11): e0140301. Bibcode:2015PLoSO..1040301D. doi: 10.1371/journal.pone.0140301 . ISSN   1932-6203. PMC   4631601 . PMID   26528553.
  9. Huang, Hechen; Ren, Zhigang; Gao, Xingxing; Hu, Xiaoyi; Zhou, Yuan; Jiang, Jianwen; Lu, Haifeng; Yin, Shengyong; Ji, Junfang; Zhou, Lin; Zheng, Shusen (2020-11-23). "Integrated analysis of microbiome and host transcriptome reveals correlations between gut microbiota and clinical outcomes in HBV-related hepatocellular carcinoma". Genome Medicine. 12 (1): 102. doi: 10.1186/s13073-020-00796-5 . ISSN   1756-994X. PMC   7682083 . PMID   33225985.
  10. Riddell, Nina; Crewther, Sheila G. (2017-01-30). "Integrated Comparison of GWAS, Transcriptome, and Proteomics Studies Highlights Similarities in the Biological Basis of Animal and Human Myopia". Investigative Ophthalmology & Visual Science. 58 (1): 660–669. doi: 10.1167/iovs.16-20618 . ISSN   1552-5783. PMID   28135361.
  11. Zajac, Diana J.; Green, Stefan J.; Johnson, Lance A.; Estus, Steven (2021-08-03). "APOE Genetics Influence Murine Gut Microbiome". doi: 10.21203/rs.3.rs-746611/v1 .{{cite journal}}: Cite journal requires |journal= (help)
  12. Zhu, Qiang; Li, Bojing; He, Tingting; Li, Guangrong; Jiang, Xingpeng (2020-02-15). "Robust biomarker discovery for microbiome-wide association studies". Methods. 173: 44–51. doi:10.1016/j.ymeth.2019.06.012. PMID   31238097. S2CID   195660885.
  13. Surana, Neeraj K.; Kasper, Dennis L. (2017-12-14). "Moving beyond microbiome-wide associations to causal microbe identification". Nature. 552 (7684): 244–247. Bibcode:2017Natur.552..244S. doi:10.1038/nature25019. ISSN   0028-0836. PMC   5730484 . PMID   29211710.
  14. Sizemore, Nicholas; Oliphant, Kaitlyn; Zheng, Ruolin; Martin, Camilia R.; Claud, Erika C.; Chattopadhyay, Ishanu (2024-04-12). "A digital twin of the infant microbiome to predict neurodevelopmental deficits". Science Advances. 10 (15): eadj0400. Bibcode:2024SciA...10J.400S. doi:10.1126/sciadv.adj0400. ISSN   2375-2548. PMC   11006218 . PMID   38598636.