Jason H. Moore

Last updated
Jason H. Moore
NationalityAmerican
Alma mater Florida State University
University of Michigan
Known for Multifactor dimensionality reduction
AwardsFellow of AAAS, ACMI, ASA
Scientific career
Fields Translational Bioinformatics
Biomedical Informatics
Human Genetics
Institutions Vanderbilt University
Dartmouth College
University of Pennsylvania
Academic advisorsCharles F. Sing, Ph.D.
Notable students Marylyn D. Ritchie

Jason H. Moore is a translational bioinformatics scientist, biomedical informatician, and human geneticist, the Edward Rose Professor of Informatics and Director of the Institute for Biomedical Informatics at the Perelman School of Medicine at the University of Pennsylvania, where he is also Senior Associate Dean for Informatics and Director of the Division of Informatics in the Department of Biostatistics, Epidemiology, and Informatics.

Contents

Biography

He was a founding Director of the Advanced Computing Center for Research and Education at Vanderbilt University from 2000 until 2004 and founding Director of the Institute for Quantitative Biomedical Sciences at Geisel School of Medicine of Dartmouth College from 2010 until 2015.

He's the editor-in-chief of the BioData Mining journal since 2008.

Research

Moore's research focuses on the development and application of artificial intelligence and machine learning methods for modeling complex patterns in biomedical big data. A central focus is using informatics methods for identifying combinations of DNA sequence variations and environmental factors that are predictive of human health and complex disease. For example, he developed the multifactor dimensionality reduction (MDR) [1] [2] machine learning method for detecting and characterizing combinations of attributes or independent variables that interact to influence a dependent or class variable. He then applied MDR for improved understanding of the interplay of multiple genetic polymorphisms of complex traits in genome-wide association studies. More recent work focuses on computational methods such as the tree-based pipeline optimization tool (TPOT) [3] [4] for automated machine learning and data science. Current work also focuses on methods and software for accessible artificial intelligence. [5] [6]

He is a former member of the National Library of Medicine grant review committee (BLIRC). He is the founding Editor-in-Chief of the journal BioData Mining. He has published more than 500 peer reviewed articles, book chapters and editorials. His translational bioinformatics research program has been continuously funded by multiple grants from the National Institutes of Health for nearly 20 years.

Honors

In 2011 he was elected as a Fellow of the American Association for the Advancement of Science (AAAS) [7] and was selected as a Kavli Fellow of the National Academy of Sciences (NAS) [8] in 2013. In 2015 he was elected a Fellow of the American College of Medical Informatics (ACMI). [9] In 2017 he was elected a Fellow of the American Statistical Association (ASA). [10] In 2021 he was elected a Fellow of the International Academy of Health Sciences Informatics (IAHSI). [11] In 2021 he was elected a Member of the International Statistics Institute (ISI). [12]

Related Research Articles

<span class="mw-page-title-main">Bioinformatics</span> Computational analysis of large, complex sets of biological data

Bioinformatics is an interdisciplinary field of science that develops methods and software tools for understanding biological data, especially when the data sets are large and complex. Bioinformatics uses biology, chemistry, physics, computer science, computer programming, information engineering, mathematics and statistics to analyze and interpret biological data. The subsequent process of analyzing and interpreting data is referred to as computational biology.

Biomedical text mining refers to the methods and study of how text mining may be applied to texts and literature of the biomedical domain. As a field of research, biomedical text mining incorporates ideas from natural language processing, bioinformatics, medical informatics and computational linguistics. The strategies in this field have been applied to the biomedical literature available through services such as PubMed.

Multifactor dimensionality reduction (MDR) is a statistical approach, also used in machine learning automatic approaches, for detecting and characterizing combinations of attributes or independent variables that interact to influence a dependent or class variable. MDR was designed specifically to identify nonadditive interactions among discrete variables that influence a binary outcome and is considered a nonparametric and model-free alternative to traditional statistical methods such as logistic regression.

GenePattern is a freely available computational biology open-source software package originally created and developed at the Broad Institute for the analysis of genomic data. Designed to enable researchers to develop, capture, and reproduce genomic analysis methodologies, GenePattern was first released in 2004. GenePattern is currently developed at the University of California, San Diego.

<span class="mw-page-title-main">David Haussler</span> American bioinformatician

David Haussler is an American bioinformatician known for his work leading the team that assembled the first human genome sequence in the race to complete the Human Genome Project and subsequently for comparative genome analysis that deepens understanding the molecular function and evolution of the genome.

<span class="mw-page-title-main">Søren Brunak</span> Danish bioinformatics professor, scientist

Søren Brunak is a Danish biological and physical scientist working in bioinformatics, systems biology, and medical informatics. He is a professor of Disease Systems Biology at the University of Copenhagen and professor of bioinformatics at the Technical University of Denmark. As Research Director at the Novo Nordisk Foundation Center for Protein Research at the University of Copenhagen Medical School, he leads a research effort where molecular-level systems biology data are combined with phenotypic data from the healthcare sector, such as electronic patient records, registry information, and biobank questionnaires. A major aim is to understand the network biology basis for time-ordered comorbidities and discriminate between treatment-related disease correlations and other comorbidities in disease trajectories. Søren Brunak also holds a position as a Medical Informatics Officer at Rigshospitalet, the Capital Region of Denmark.

Bioimage informatics is a subfield of bioinformatics and computational biology. It focuses on the use of computational techniques to analyze bioimages, especially cellular and molecular images, at large scale and high throughput. The goal is to obtain useful knowledge out of complicated and heterogeneous image and related metadata.

<span class="mw-page-title-main">Philip Bourne</span>

Philip Eric Bourne is an Australian bioinformatician, non-fiction writer, and businessman. He is currently Stephenson Chair of Data Science and Director of the School of Data Science and Professor of Biomedical Engineering and was the first associate director for Data Science at the National Institutes of Health, where his projects include managing the Big Data to Knowledge initiative, and formerly Associate Vice Chancellor at UCSD. He has contributed to textbooks and is a strong supporter of open-access literature and software. His diverse interests have spanned structural biology, medical informatics, information technology, structural bioinformatics, scholarly communication and pharmaceutical sciences. His papers are highly cited, and he has an h-index above 50.

<span class="mw-page-title-main">Olga Troyanskaya</span> American academic

Olga G. Troyanskaya is a Professor in the Department of Computer Science and the Lewis-Sigler Institute for Integrative Genomics at Princeton University and the Deputy Director for Genomics at the Flatiron Institute's Center for Computational Biology in NYC. She studies protein function and interactions in biological pathways by analyzing genomic data using computational tools.

<span class="mw-page-title-main">Robert F. Murphy (computational biologist)</span>

Robert F. Murphy is Ray and Stephanie Lane Professor of Computational Biology Emeritus and Director of the M.S. Program in Automated Science at Carnegie Mellon University. Prior to his retirement in May 2021, he was the Ray and Stephanie Lane Professor of Computational Biology as well as Professor of Biological Sciences, Biomedical Engineering, and Machine Learning. He was founding Director of the Center for Bioimage Informatics at Carnegie Mellon and founded the Joint CMU-Pitt Ph.D. Program in Computational Biology. He also founded the Computational Biology Department at Carnegie Mellon University and served as its head from 2009 to 2020.

Suzanna (Suzi) E. Lewis was a scientist and Principal investigator at the Berkeley Bioinformatics Open-source Project based at Lawrence Berkeley National Laboratory until her retirement in 2019. Lewis led the development of open standards and software for genome annotation and ontologies.

Translational bioinformatics (TBI) is a field that emerged in the 2010s to study health informatics, focused on the convergence of molecular bioinformatics, biostatistics, statistical genetics and clinical informatics. Its focus is on applying informatics methodology to the increasing amount of biomedical and genomic data to formulate knowledge and medical tools, which can be utilized by scientists, clinicians, and patients. Furthermore, it involves applying biomedical research to improve human health through the use of computer-based information system. TBI employs data mining and analyzing biomedical informatics in order to generate clinical knowledge for application. Clinical knowledge includes finding similarities in patient populations, interpreting biological information to suggest therapy treatments and predict health outcomes.

<span class="mw-page-title-main">Christopher G. Chute</span>

Christopher G. Chute is a Bloomberg Distinguished Professor at Johns Hopkins University, physician-scientist and biomedical informatician known for biomedical terminologies and health information technology (IT) standards. He chairs the World Health Organization Revision Steering Group for the revision of the International Classification of Diseases (ICD-11).

<span class="mw-page-title-main">Relief (feature selection)</span> Feature selection algorithm used in binary classification

Relief is an algorithm developed by Kira and Rendell in 1992 that takes a filter-method approach to feature selection that is notably sensitive to feature interactions. It was originally designed for application to binary classification problems with discrete or numerical features. Relief calculates a feature score for each feature which can then be applied to rank and select top scoring features for feature selection. Alternatively, these scores may be applied as feature weights to guide downstream modeling. Relief feature scoring is based on the identification of feature value differences between nearest neighbor instance pairs. If a feature value difference is observed in a neighboring instance pair with the same class, the feature score decreases. Alternatively, if a feature value difference is observed in a neighboring instance pair with different class values, the feature score increases. The original Relief algorithm has since inspired a family of Relief-based feature selection algorithms (RBAs), including the ReliefF algorithm. Beyond the original Relief algorithm, RBAs have been adapted to (1) perform more reliably in noisy problems, (2) generalize to multi-class problems (3) generalize to numerical outcome problems, and (4) to make them robust to incomplete data.

<span class="mw-page-title-main">Núria López Bigas</span> Researcher on computational cancer genomics

Núria López Bigas is a Spanish biologist and research professor with expertise in medical genetics, computational biology, and bioinformatics. She is an ICREA professor at Pompeu Fabra University and she also leads the Biomedical Genomics Research Group at the Institute for Research in Biomedicine in Barcelona, Spain. Her research is focused on developing computational approaches to investigate cancer genomes.

<span class="mw-page-title-main">Umit Catalyurek</span>

Ümit V. Çatalyürek is a professor of computer science at the Georgia Institute of Technology, and Adjunct Professor in department of Biomedical Informatics at the Ohio State University. He is known for his work on graph analytics, parallel algorithms for scientific applications, data-intensive computing, and large scale genomic and biomedical applications. He was the director of the High Performance Computing Lab at the Ohio State University. He was named Fellow of the Institute of Electrical and Electronics Engineers (IEEE) in 2016 for contributions to combinatorial scientific computing and parallel computing.

Michel Justin Dumontier is a Distinguished Professor of Data Science at Maastricht University. His research focuses on methods to represent knowledge on the web with applications for drug discovery and personalized medicine. He was previously an Associate Professor of Medicine at the Stanford University School of Medicine and an Associate Professor of Bioinformatics at Carleton University. He is best known for his work in biomedical ontologies, linked data and biomedical knowledge discovery. He has taught courses on biochemistry, bioinformatics, computational systems biology, and translational medicine. His research has been funded by Natural Sciences and Engineering Research Council, Canada Foundation for Innovation, Mitacs Canada, the Ontario Ministry of Research, Innovation and Science, CANARIE, and the US National Institutes of Health. Dumontier has an h-index of over 30 and has authored over 125 scientific publications in journals and conferences. He lives in Maastricht with his wife Tifany Irene Leung and their lionhead rabbit Storm.

Keith A. Crandall is an American computational biologist, bioinformaticist, and population geneticist, at George Washington University, where he is the founding director of the Computational Biology Institute, and professor in the Department of Biostatistics and Bioinformatics.

Aidong Zhang is a computer scientist whose research topics include machine learning and bioinformatics. She is William Wulf Faculty Fellow and Professor of Computer Science at the University of Virginia, where she also holds affiliations with the Department of Biomedical Engineering and School of Data Science.

Biomedical data science is a multidisciplinary field which leverages large volumes of data to promote biomedical innovation and discovery. Biomedical data science draws from various fields including Biostatistics, Biomedical informatics, and machine learning, with the goal of understanding biological and medical data. It can be viewed as the study and application of data science to solve biomedical problems. Modern biomedical datasets often have specific features which make their analyses difficult, including:

References

  1. Ritchie, Marylyn D.; Hahn, Lance W.; Roodi, Nady; Bailey, L. Renee; Dupont, William D.; Parl, Fritz F.; Moore, Jason H. (2001-07-01). "Multifactor-Dimensionality Reduction Reveals High-Order Interactions among Estrogen-Metabolism Genes in Sporadic Breast Cancer". The American Journal of Human Genetics. 69 (1): 138–147. doi:10.1086/321276. PMC   1226028 . PMID   11404819.
  2. Hahn, L. W.; Ritchie, M. D.; Moore, J. H. (2003-02-12). "Multifactor dimensionality reduction software for detecting gene-gene and gene-environment interactions". Bioinformatics. 19 (3): 376–382. doi: 10.1093/bioinformatics/btf869 . ISSN   1367-4803. PMID   12584123.
  3. Olson, Randal S.; Urbanowicz, Ryan J.; Andrews, Peter C.; Lavender, Nicole A.; Kidd, La Creis; Moore, Jason H. (2016-03-30). Automating Biomedical Data Science Through Tree-Based Pipeline Optimization. Applications of Evolutionary Computation. Lecture Notes in Computer Science. Vol. 9597. Springer, Cham. pp. 123–137. arXiv: 1601.07925 . doi:10.1007/978-3-319-31204-0_9. ISBN   978-3-319-31203-3. S2CID   9709316.
  4. Olson, Randal S.; Bartley, Nathan; Urbanowicz, Ryan J.; Moore, Jason H. (2016-01-01). "Evaluation of a Tree-based Pipeline Optimization Tool for Automating Data Science". Proceedings of the Genetic and Evolutionary Computation Conference 2016. GECCO '16. New York, NY, USA: ACM. pp. 485–492. arXiv: 1603.06212 . doi:10.1145/2908812.2908918. ISBN   9781450342063. S2CID   7142590.
  5. Olson, Randal S.; Sipper, Moshe; La Cava, William; Tartarone, Sharon; Vitale, Steven; Fu, Weixuan; Holmes, John H.; Moore, Jason H. (2017-05-01). "A System for Accessible Artificial Intelligence". arXiv: 1705.00594 [cs.AI].
  6. "These Researchers Want the People to Seize the Means of AI Production". Motherboard. 2017-05-03. Retrieved 2017-05-06.
  7. "AAAS Members Elected as Fellows". AAAS - The World's Largest General Scientific Society. 2011-12-06. Retrieved 2017-05-06.
  8. "Geisel School of Medicine - Dartmouth's Jason Moore selected as a Kavli Fellow of the National Academy of Sciences". geiselmed.dartmouth.edu. Retrieved 2017-05-06.
  9. "13 Fellows Inducted into American College of Medical Informatics". PRWeb. Retrieved 2017-05-06.
  10. "Director of the Penn Institute for Biomedical Informatics Jason Moore Elected as a Fellow of the American Statistical Association – PR News". www.pennmedicine.org. Retrieved 2017-05-06.
  11. "The International Academy of Health Sciences Informatics announces the 2021 Class of Academy Fellows". 28 May 2021.
  12. "Elected Members of the International Statistics Institute (ISI)".