Paul R. Rosenbaum | |
---|---|
Born | Paul R. Rosenbaum |
Education | Hampshire College Harvard University |
Known for | Observational Studies Propensity Score Causal Inference |
Awards | COPSS R. A. Fisher Award, 2019 COPSS George W. Snedecor Award, 2003 IMS Medallion Lecture, 2020 Nathan Mantel Award, 2017 Long-Term Excellence Award from the Health Policy Statistics Section of the American Statistical Association, 2018 |
Scientific career | |
Fields | Statistics, Observational Studies, Causal Inference |
Institutions | Wharton School University of Pennsylvania |
Paul R. Rosenbaum is the Robert G. Putzel Professor Emeritus in the Department of Statistics and Data Science at Wharton School of the University of Pennsylvania, where he worked from 1986 through 2021. He has written extensively about causal inference in observational studies, [1] [2] [3] [4] [5] including sensitivity analysis, [6] [7] [8] optimal matching, [9] [10] design sensitivity, [11] [12] [13] evidence factors, [14] [15] quasi-experimental devices, [16] [17] [18] [19] and (with Donald B. Rubin) the propensity score. [20] [21] [22] [23] [24] With various coauthors, he has also written about health outcomes, [25] [26] [27] [28] racial disparities in health outcomes, [29] [30] [31] instrumental variables, [32] [33] psychometrics [34] [35] [36] and experimental design. [37] [38] [39]
Rosenbaum is the author of several books: (i) Observational Studies, first edition 1995, second edition 2002, in the Springer Series in Statistics, New York: Springer, (ii) Design of Observational Studies, first edition 2010, second edition 2020, in the Springer Series in Statistics, New York: Springer, (iii) Observation and Experiment: An Introduction to Causal Inference, 2017, Cambridge, MA: Harvard University Press, (iv) Replication and Evidence Factors in Observational Studies, 2021, in the Chapman & Hall/CRC Monographs on Statistics and Applied Probability, 167, New York: CRC Press/Taylor & Francis Group, (v) Causal Inference, 2023, in the MIT Press Essential Knowledge Series.
For work in causal inference, the Committee of Presidents of Statistical Societies gave Rosenbaum the R. A. Fisher Award and Lectureship in 2019 and the George W. Snedecor Award in 2003. His R. A. Fisher Lecture is available on YouTube beginning at minute 32. He received Nathan Mantel Award from the Section on Statistics in Epidemiology of the American Statistical Association in 2017, and the Long-Term Excellence Award from the Health Policy Statistics Section of the American Statistical Association in 2018. He delivered an Institute of Mathematical Statistics Medallion Lecture about evidence factors in 2020, and a complete and a short version of the lecture are available on YouTube. He is a Fellow of the American Statistical Association. [40]
In statistics, the Lilliefors test is a normality test based on the Kolmogorov–Smirnov test. It is used to test the null hypothesis that data come from a normally distributed population, when the null hypothesis does not specify which normal distribution; i.e., it does not specify the expected value and variance of the distribution. It is named after Hubert Lilliefors, professor of statistics at George Washington University.
Analytic and enumerative statistical studies are two types of scientific studies:
In statistics, ignorability is a feature of an experiment design whereby the method of data collection does not depend on the missing data. A missing data mechanism such as a treatment assignment or survey sampling strategy is "ignorable" if the missing data matrix, which indicates which variables are observed or missing, is independent of the missing data conditional on the observed data.
In fields such as epidemiology, social sciences, psychology and statistics, an observational study draws inferences from a sample to a population where the independent variable is not under the control of the researcher because of ethical concerns or logistical constraints. One common observational study is about the possible effect of a treatment on subjects, where the assignment of subjects into a treated group versus a control group is outside the control of the investigator. This is in contrast with experiments, such as randomized controlled trials, where each subject is randomly assigned to a treated group or a control group. Observational studies, for lacking an assignment mechanism, naturally present difficulties for inferential analysis.
Donald Bruce Rubin is an Emeritus Professor of Statistics at Harvard University, where he chaired the department of Statistics for 13 years. He also works at Tsinghua University in China and at Temple University in Philadelphia.
Computational statistics, or statistical computing, is the study which is the intersection of statistics and computer science, and refers to the statistical methods that are enabled by using computational methods. It is the area of computational science specific to the mathematical science of statistics. This area is fast developing. The view that the broader concept of computing must be taught as part of general statistical education is gaining momentum.
In the statistical analysis of observational data, propensity score matching (PSM) is a statistical matching technique that attempts to estimate the effect of a treatment, policy, or other intervention by accounting for the covariates that predict receiving the treatment. PSM attempts to reduce the bias due to confounding variables that could be found in an estimate of the treatment effect obtained from simply comparing outcomes among units that received the treatment versus those that did not. Paul R. Rosenbaum and Donald Rubin introduced the technique in 1983.
Matching is a statistical technique that evaluates the effect of a treatment by comparing the treated and the non-treated units in an observational study or quasi-experiment. The goal of matching is to reduce bias for the estimated treatment effect in an observational-data study, by finding, for every treated unit, one non-treated unit(s) with similar observable characteristics against which the covariates are balanced out. By matching treated units to similar non-treated units, matching enables a comparison of outcomes among treated and non-treated units to estimate the effect of the treatment reducing bias due to confounding. Propensity score matching, an early matching technique, was developed as part of the Rubin causal model, but has been shown to increase model dependence, bias, inefficiency, and power and is no longer recommended compared to other matching methods. A simple, easy-to-understand, and statistically powerful method of matching known as Coarsened Exact Matching or CEM.
Causal analysis is the field of experimental design and statistics pertaining to establishing cause and effect. Typically it involves establishing four elements: correlation, sequence in time, a plausible physical or information-theoretical mechanism for an observed effect to follow from a possible cause, and eliminating the possibility of common and alternative ("special") causes. Such analysis usually involves one or more artificial or natural experiments.
Causal inference is the process of determining the independent, actual effect of a particular phenomenon that is a component of a larger system. The main difference between causal inference and inference of association is that causal inference analyzes the response of an effect variable when a cause of the effect variable is changed. The study of why things occur is called etiology, and can be described using the language of scientific causal notation. Causal inference is said to provide the evidence of causality theorized by causal reasoning.
Jurimetrics is the application of quantitative methods, and often especially probability and statistics, to law. In the United States, the journal Jurimetrics is published by the American Bar Association and Arizona State University. The Journal of Empirical Legal Studies is another publication that emphasizes the statistical analysis of law.
Sudipto Banerjee is an Indian-American statistician best known for his work on Bayesian hierarchical modeling and inference for spatial data analysis. He is Professor of Biostatistics and Senior Associate Dean in the School of Public Health at the University of California, Los Angeles. He served as the Chair of the Department of Biostatistics at UCLA from 2014 through 2023. He served as the elected President of the International Society for Bayesian Analysis in 2022.
Alan Enoch Gelfand is an American statistician, and is currently the James B. Duke Professor of Statistics and Decision Sciences at Duke University. Gelfand’s research includes substantial contributions to the fields of Bayesian statistics, spatial statistics and hierarchical modeling.
Causal analysis is the field of experimental design and statistical analysis pertaining to establishing cause and effect. Exploratory causal analysis (ECA), also known as data causality or causal discovery is the use of statistical algorithms to infer associations in observed data sets that are potentially causal under strict assumptions. ECA is a type of causal inference distinct from causal modeling and treatment effects in randomized controlled trials. It is exploratory research usually preceding more formal causal research in the same way exploratory data analysis often precedes statistical hypothesis testing in data analysis
Continuous Individualized Risk Index (CIRI) (initialism pronounced /ˈsɪri/) is to a set of probabilistic risk models utilizing Bayesian statistics for integrating diverse cancer biomarkers over time to produce a unified prediction of outcome risk, as originally described by Kurtz, Esfahani, et al. (2019) from Ash Alizadeh's laboratory at Stanford. Inspired by in game win probability models for predicting winners in sports and political elections, CIRI incorporates serial information obtained throughout a given patient's course to estimate a personalized estimate of various cancer-related risks over time. CIRI models have been developed available for various cancer types, including breast cancer (BRCA), diffuse large B-cell lymphoma (DLBCL), and chronic lymphocytic leukemia (CLL).The serial information integrated can be diverse, including choice of therapy and the associated responses observed, whether using liquid biopsies or radiological studies, pathological and other dynamic measurements.
Robert E. Kass is the Maurice Falk Professor of Statistics and Computational Neuroscience in the Department of Statistics and Data Science, the Machine Learning Department, and the Neuroscience Institute at Carnegie Mellon University.
Babette Anne Brumback is an American biostatistician known for her work on causal inference. She is a professor of biostatistics at the University of Florida.
A causal map can be defined as a network consisting of links or arcs between nodes or factors, such that a link between C and E means, in some sense, that someone believes or claims C has or had some causal influence on E.
Michael David Escobar is an American biostatistician who is known for Bayesian nonparametrics, mixture models.
John Anthony Hartigan is an Australian-American statistician, the Eugene Higgins Professor of Statistics emeritus at Yale University. He made fundamental contributions to clustering algorithms, including the famous Hartigan-Wong method and biclustering, and Bayesian statistics.