Sudipto Banerjee

Last updated

Sudipto Banerjee
Born (1972-10-23) October 23, 1972 (age 51)
Alma mater Presidency College, Kolkata
Indian Statistical Institute
University of Connecticut, Storrs
Known for Bayesian hierarchical modeling, Gaussian process, spatial data analysis, Wombling
Awards Mortimer Spiegelman Award, ASA Outstanding Statistical Application Award, George W. Snedecor Award
Scientific career
Fields Statistics
Institutions University of Minnesota, Twin Cities
University of California, Los Angeles
Thesis Multivariate Spatial Modelling in a Bayesian Setting  (2000)
Doctoral advisor Alan E. Gelfand
Doctoral students "Mathematics Genealogy".

Sudipto Banerjee (born October 23, 1972) is an Indian-American statistician best known for his work on Bayesian hierarchical modeling and inference for spatial data analysis. He is Professor of Biostatistics and Senior Associate Dean in the School of Public Health at the University of California, Los Angeles. He served as the Chair of the Department of Biostatistics at UCLA from 2014 through 2023. He served as the elected President of the International Society for Bayesian Analysis in 2022.

Contents

Early life and education

Banerjee was born in Kolkata, India. He attended Presidency College, Kolkata for his undergraduate studies, and the Indian Statistical Institute, graduating with an M.STAT in 1996. Subsequently, he moved to the United States and obtained an MS and PhD in statistics from the University of Connecticut in 2000, [1] where he was introduced to Bayesian statistics and hierarchical modeling by Alan Enoch Gelfand [1] who had been a pioneer in the development of the Gibbs sampler and Markov chain Monte Carlo algorithms in Bayesian statistics.

Career

Banerjee joined the University of Minnesota, Twin Cities in 2000 as an assistant professor of Biostatistics and was associated with the School of Public Health for 14 years. There he worked on a number of problems and wrote numerous articles on spatial statistics, developing theory and methods related to Bayesian modeling and inference for geographic data with wide-ranging applications in public and environmental health sciences, ecology, forestry, real estate economics and agronomy. In 2014, Banerjee joined the Department of Biostatistics in the School of Public Health at UCLA as Professor and Chair of Biostatistics. [2]

Research

Banerjee is widely recognized as a leading expert in spatial statistics and its diverse applications in environmental, social and health sciences. He has made fundamental and pioneering statistical contributions in the broad area of Bayesian statistics and hierarchical models for analyzing spatial-temporal data and, more specifically, in the following areas within space-time modeling: (i) statistical inference for spatial gradients and zones of rapid change (also called wombling); (ii) scaling up Gaussian process models for massive spatial data analysis; (iii) graphical models for high-dimensional spatial data analysis; (iii) spatial frailties and space-time survival analysis; and (iv) computational algorithms and software for spatial data analysis. His notable statistical innovations include Gaussian predictive process and Nearest-Neighbor Gaussian process models for massive spatial-temporal data, and multivariate Markov random fields for regionally aggregated spatial data.

Banerjee's interdisciplinary research contributions include his leadership in statistical science and oversee activities surrounding exposure data science in the GuLF Study (Gulf Long-term Follow-up Study) Program examining the human-health consequences of the Deepwater Horizon oil spill in April 2010. [3] The spill followed an explosion on a drilling rig leased by BP, the British oil company, and led to the release of over four million barrels of oil into the Gulf of Mexico, 48 miles off the coast of Louisiana in the United States. Banerjee has been actively involved in collaborative frameworks involving public health researchers with expertise in epidemiology, environmental and occupational health, and biostatistics that would be responsible for sound statistical practices including innovative methods for comprehensively analyzing the exposure of workers to potentially harmful chemicals. In another high-profile study, Banerjee was invited to serve on a committee formed by the National Research Council and the National Academy of Sciences in 2015-16 for his expertise in the use of spatial data science in analyzing and synthesizing geographically referenced flood insurance data in devising an affordability framework for Federal Emergency Management Agency (FEMA). Professor Banerjee contributed with his expertise in spatial data science and GIS technologies within a comprehensive policy framework to ascertain when and where premium increases from the Biggert–Waters Flood Insurance Reform Act of 2012 lose cost effectiveness. [4] [5]

Awards and honors

Selected works

Related Research Articles

Geostatistics is a branch of statistics focusing on spatial or spatiotemporal datasets. Developed originally to predict probability distributions of ore grades for mining operations, it is currently applied in diverse disciplines including petroleum geology, hydrogeology, hydrology, meteorology, oceanography, geochemistry, geometallurgy, geography, forestry, environmental control, landscape ecology, soil science, and agriculture. Geostatistics is applied in varied branches of geography, particularly those involving the spread of diseases (epidemiology), the practice of commerce and military planning (logistics), and the development of efficient spatial networks. Geostatistical algorithms are incorporated in many places, including geographic information systems (GIS).

In statistics, Markov chain Monte Carlo (MCMC) is a class of algorithms used to draw samples from a probability distribution. Given a probability distribution, one can construct a Markov chain whose elements' distribution approximates it – that is, the Markov chain's equilibrium distribution matches the target distribution. The more steps that are included, the more closely the distribution of the sample matches the actual desired distribution.

<span class="mw-page-title-main">David Spiegelhalter</span> English statistician (born 1953)

Sir David John Spiegelhalter is a British statistician and a Fellow of Churchill College, Cambridge. From 2007 to 2018 he was Winton Professor of the Public Understanding of Risk in the Statistical Laboratory at the University of Cambridge. Spiegelhalter is an ISI highly cited researcher.

<span class="mw-page-title-main">Spatial analysis</span> Formal techniques which study entities using their topological, geometric, or geographic properties

Spatial analysis is any of the formal techniques which studies entities using their topological, geometric, or geographic properties. Spatial analysis includes a variety of techniques using different analytic approaches, especially spatial statistics. It may be applied in fields as diverse as astronomy, with its studies of the placement of galaxies in the cosmos, or to chip fabrication engineering, with its use of "place and route" algorithms to build complex wiring structures. In a more restricted sense, spatial analysis is geospatial analysis, the technique applied to structures at the human scale, most notably in the analysis of geographic data. It may also be applied to genomics, as in transcriptomics data.

Spatial epidemiology is a subfield of epidemiology focused on the study of the spatial distribution of health outcomes; it is closely related to health geography.

In statistics, Wombling is any of a number of techniques used for identifying zones of rapid change, typically in some quantity as it varies across some geographical or Euclidean space. It is named for statistician William H. Womble.

In medical research, a dynamic treatment regime (DTR), adaptive intervention, or adaptive treatment strategy is a set of rules for choosing effective treatments for individual patients. Historically, medical research and the practice of medicine tended to rely on an acute care model for the treatment of all medical problems, including chronic illness. Treatment choices made for a particular patient under a dynamic regime are based on that individual's characteristics and history, with the goal of optimizing his or her long-term clinical outcome. A dynamic treatment regime is analogous to a policy in the field of reinforcement learning, and analogous to a controller in control theory. While most work on dynamic treatment regimes has been done in the context of medicine, the same ideas apply to time-varying policies in other fields, such as education, marketing, and economics.

Bani K. Mallick is a Distinguished Professor and Susan M. Arseven `75 Chair in Data Science and Computational Statistics in the Department of Statistics at Texas A&M University in College Station. He is the Director of the Center for Statistical Bioinformatics. Mallick is well known for his contribution to the theory and practice of Bayesian semiparametric methods and uncertainty quantification. Mallick is an elected fellow of American Association for the Advancement of Science, American Statistical Association, Institute of Mathematical Statistics, International Statistical Institute and the Royal Statistical Society. He received the Distinguished research award from Texas A&M University and the Young Researcher award from the International Indian Statistical Association.

David Brian Dunson is an American statistician who is Arts and Sciences Distinguished Professor of Statistical Science, Mathematics and Electrical & Computer Engineering at Duke University. His research focuses on developing statistical methods for complex and high-dimensional data. Particular themes of his work include the use of Bayesian hierarchical models, methods for learning latent structure in complex data, and the development of computationally efficient algorithms for uncertainty quantification. He is currently serving as joint Editor of the Journal of the Royal Statistical Society, Series B.

<span class="mw-page-title-main">JASP</span> Free and open-source statistical program

JASP is a free and open-source program for statistical analysis supported by the University of Amsterdam. It is designed to be easy to use, and familiar to users of SPSS. It offers standard analysis procedures in both their classical and Bayesian form. JASP generally produces APA style results tables and plots to ease publication. It promotes open science via integration with the Open Science Framework and reproducibility by integrating the analysis settings into the results. The development of JASP is financially supported by several universities and research funds. As the JASP GUI is developed in C++ using Qt framework, some of the team left to make a notable fork which is Jamovi which has its GUI developed in JavaScript and HTML5.

Vanja Dukic is an expert in computational statistics and mathematical epidemiology who works as a professor of applied mathematics at the University of Colorado Boulder. Her research includes work on using internet search engine access patterns to track diseases, and on the effects of climate change on the spread of diseases.

Alan Enoch Gelfand is an American statistician, and is currently the James B. Duke Professor of Statistics and Decision Sciences at Duke University. Gelfand’s research includes substantial contributions to the fields of Bayesian statistics, spatial statistics and hierarchical modeling.

Elizabeth Anne (Lianne) Sheppard is an American statistician. She specializes in biostatistics and environmental statistics, and in particular in the effects of air quality on health. She is a Professor of Environmental and Occupational Health Sciences and a Professor of Biostatistics in the University of Washington School of Public Health. In 2021, Dr. Sheppard was named to the Rohm & Haas Endowed Professorship of Public Health Sciences.

Dalene Kay Stangl is an American statistician known for development and promotion of Bayesian statistical methods in health-related research.

Robert E. Kass is the Maurice Falk Professor of Statistics and Computational Neuroscience in the Department of Statistics and Data Science, the Machine Learning Department, and the Neuroscience Institute at Carnegie Mellon University.

This is a comparison of statistical analysis software that allows doing inference with Gaussian processes often using approximations.

Vecchia approximation is a Gaussian processes approximation technique originally developed by Aldo Vecchia, a statistician at United States Geological Survey. It is one of the earliest attempts to use Gaussian processes in high-dimensional settings. It has since been extensively generalized giving rise to many contemporary approximations.

<span class="mw-page-title-main">Roderick J. A. Little</span> Ph.D. University of London 1974

Roderick Joseph Alexander Little is an academic statistician, whose main research contributions lie in the statistical analysis of data with missing values and the analysis of complex sample survey data. Little is Richard D. Remington Distinguished University Professor of Biostatistics in the Department of Biostatistics at the University of Michigan, where he also holds academic appointments in the Department of Statistics and the Institute for Social Research.

Barbara Elizabeth Engelhardt is an American computer scientist and specialist in bioinformatics. Working as a Professor at Stanford University, her work has focused on latent variable models, exploratory data analysis for genomic data, and QTLs. In 2021, she was awarded the Overton Prize by the International Society for Computational Biology.

Guosheng Yin is a statistician, data scientist, educator and researcher in Biostatistics, Statistics, machine learning, and AI. Presently, Guosheng Yin is Chair in Statistics in Department of Mathematics at Imperial College London. Previously, he served as the Head of Department and the Patrick S C Poon Endowed Chair in Statistics and Actuarial Science, at the University of Hong Kong. Before he joined the University of Hong Kong, Yin worked at the University of Texas M.D. Anderson Cancer Center till 2009 as a tenured Associate Professor of Biostatistics.

References

  1. 1 2 Sudipto Banerjee at the Mathematics Genealogy Project
  2. "Sudipto Banerjee". UCLA.
  3. "GuLF Study" (PDF). National Institute of Environmental Health Sciences. September 9, 2012.
  4. Ferraro, Thomas (January 30, 2014). "U.S. Senate passes bill to delay hikes in flood insurance rates". Reuters. Retrieved January 31, 2014.
  5. "S. 1926 – Summary". United States Congress. January 31, 2014. Retrieved January 31, 2014.