Petrus (Piet) Groeneboom | |
---|---|
Born | |
Awards | Rollo Davidson Prize (1985) |
Scientific career | |
Fields | Statistics Probability theory |
Institutions | |
Thesis | Large Deviations and Asymptotic Efficiencies [1] (1979) |
Doctoral advisor | Jacobus (Kobus) Oosterhoff |
Doctoral students | Marloes Maathuis |
Petrus (Piet) Groeneboom (born 24 September 1941 [2] in Scheveningen [3] ) is a Dutch statistician who made major advances in the field of shape-constrained statistical inference such as isotonic regression, and also worked in probability theory.
At the beginning of his tertiary studies in 1959, Groeneboom enrolled in medicine at the University of Amsterdam but quickly switched to psychology at the same university, obtaining a candidate degree in 1963. [3] During his studies he attended a course on logic by analytic philosopher Else M. Barth, whose influence, along with that by Lambert Meertens after his (Groeneboom's) candidate degree, he later stated as having made him decide to study mathematics. He was an assistant of Johannes de Groot. He obtained a master's degree in mathematics in 1971, also at the University of Amsterdam, and studied at the Vrije Universiteit Amsterdam from 1975 under Kobus Oosterhoff, obtaining his Ph.D. degree in 1979. [3]
Before and immediately after obtaining his master's degree, Groeneboom worked at the psychological laboratory of the University of Amsterdam. After his second stint there ended in 1973, he moved to the Centrum Wiskunde & Informatica , which at the time was called Mathematisch Centrum (Mathematical Centre), in the same city. [3]
From 1979 to 1981, Groeneboom was a visiting assistant professor at the University of Washington, [2] to where he would return from 1999 to 2013 as affiliate professor in the department of statistics. From 1981 on, he was again based at the Mathematical Centre before being appointed full professor of statistics at the University of Amsterdam in 1984. In 1988, he moved to Delft University of Technology, where he stayed until his retirement in 2006. From 2000 to 2006 he was additionally a part-time professor at the Vrije Universiteit Amsterdam. [3]
Groeneboom has been a professor emeritus of statistics at Delft University of Technology [4] since his retirement in 2006. He has also held positions at the Vrije Universiteit Amsterdam and the University of Washington. Following his retirement he came to public attention for his statistical work in the retrial of Lucia de Berk, a Dutch nurse, who had been convicted of murder.
In 1979, Groeneboom, together with Kobus Oosterhoff and Frits H. Ruymgaart, formulated and proved Sanov's theorem in a finer topology than had been known at the time. [5] A paper he published in 1983 on properties of Brownian motion gave rise to a large body of literature on minorants of more general stochastic processes. [6] One of the main areas of work of Groeneboom has been shape constrained statistical inference, which includes isotonic regression, an area with links to the aforementioned minorant problems, [6] as a special case. His interest in shape constrained inference began in the second half of his two-year stay at the University of Washington. [3] [7] In a 1985 article on an estimator of a monotone density named after Ulf Grenander, he introduced the switching (or switch) relation, which came to be used widely in the area. [8] [9] He found the analytic form of Chernoff's distribution, which later was understood to be omnipresent in monotone problems, [10] in the 1980s, [3] independently of others who worked on the problem at the same time. His paper on the problem came to be regarded as a benchmark in the field of shape constrained inference. In the 2010s he returned to the problem, giving new proofs in collaboration with Steve Lalley and Nico Temme. [11] [12]
Since the late 1980s, Groeneboom has also worked on censored regression models. [3] He established the asymptotic distribution of the nonparametric maximum likelihood estimator of the survival function in the case of "case 1 censoring". The iterative convex minorant algorithm which he introduced in 1991 found use in statistical estimation for proportional hazards models. [13]
Together with Eric Cator, Groeneboom contributed to the probabilistic analysis of the Hammersley process, a continuous interacting particle system (IPS). Methods similar to theirs were subsequently applied to other IPSs. [14] He is known to influence academic thought amongst some American statisticians such as Jon A. Wellner. [12]
In the late 2000s, Groeneboom joined Richard D. Gill in the attempt to overturn the conviction of Lucia de Berk, a Dutch nurse, who had been found guilty of murdering four of her patients, and attempting to kill three others. The matter was a high-profile case in the Netherlands, notable because it depended on the probabilities of certain events. [15] They argued that statistical considerations that had led to the initial suspicions of murder, and those which had remained at the center stage of the case afterwards, were flawed. The effort was ultimately successful and de Berk was finally acquitted of all accusations in 2010. [3] [4]
For his paper on Chernoff's distribution, written in 1984 but appearing much later in 1989, Groeneboom was awarded the Rollo Davidson Prize 1985. [11] [16]
Groeneboom is a fellow of the Institute of Mathematical Statistics, and an elected member of the International Statistical Institute. In 2013, he delivered the Wald lectures at the Joint Statistical Meetings in Montreal. [17] [18]
The median of a set of numbers is the value separating the higher half from the lower half of a data sample, a population, or a probability distribution. For a data set, it may be thought of as the “middle" value. The basic feature of the median in describing data compared to the mean is that it is not skewed by a small proportion of extremely large or small values, and therefore provides a better representation of the center. Median income, for example, may be a better way to describe the center of the income distribution because increases in the largest incomes alone have no effect on the median. For this reason, the median is of central importance in robust statistics.
Statistical inference is the process of using data analysis to infer properties of an underlying probability distribution. Inferential statistical analysis infers properties of a population, for example by testing hypotheses and deriving estimates. It is assumed that the observed data set is sampled from a larger population.
The following outline is provided as an overview of and topical guide to statistics:
Model selection is the task of selecting a model from among various candidates on the basis of performance criterion to choose the best one. In the context of machine learning and more generally statistical analysis, this may be the selection of a statistical model from a set of candidate models, given data. In the simplest cases, a pre-existing set of data is considered. However, the task can also involve the design of experiments such that the data collected is well-suited to the problem of model selection. Given candidate models of similar predictive or explanatory power, the simplest model is most likely to be the best choice.
In probability theory, an empirical process is a stochastic process that characterizes the deviation of the empirical distribution function from its expectation. In mean field theory, limit theorems are considered and generalise the central limit theorem for empirical measures. Applications of the theory of empirical processes arise in non-parametric statistics.
Bootstrapping is a procedure for estimating the distribution of an estimator by resampling one's data or a model estimated from the data. Bootstrapping assigns measures of accuracy to sample estimates. This technique allows estimation of the sampling distribution of almost any statistic using random sampling methods.
Richard David Gill is a British-Dutch mathematician. He has held academic positions in the Netherlands. As a probability theorist and statistician, Gill has researched counting processes. He is also known for his consulting and advocacy on behalf of alleged victims of statistical misrepresentation, including the reversal of the murder conviction of a Dutch nurse who had been jailed for six years.
In probability theory, Chernoff's distribution, named after Herman Chernoff, is the probability distribution of the random variable
Erich Leo Lehmann was a German-born American statistician, who made a major contribution to nonparametric hypothesis testing. He is one of the eponyms of the Lehmann–Scheffé theorem and of the Hodges–Lehmann estimator of the median of a population.
Ulf Grenander was a Swedish statistician and professor of applied mathematics at Brown University.
Gábor J. Székely is a Hungarian-American statistician/mathematician best known for introducing energy statistics (E-statistics). Examples include: the distance correlation, which is a bona fide dependence measure, equals zero exactly when the variables are independent; the distance skewness, which equals zero exactly when the probability distribution is diagonally symmetric; the E-statistic for normality test; and the E-statistic for clustering.
In statistical inference, the concept of a confidence distribution (CD) has often been loosely referred to as a distribution function on the parameter space that can represent confidence intervals of all levels for a parameter of interest. Historically, it has typically been constructed by inverting the upper limits of lower sided confidence intervals of all levels, and it was also commonly associated with a fiducial interpretation, although it is a purely frequentist concept. A confidence distribution is NOT a probability distribution function of the parameter of interest, but may still be a function useful for making inferences.
Pranab Kumar Sen was an Indian-American statistician who was a professor of statistics and the Cary C. Boshamer Professor of Biostatistics at the University of North Carolina at Chapel Hill.
Michael Barrett Woodroofe was an American probabilist and statistician. He was a professor of statistics and of mathematics at the University of Michigan, where he was the Leonard J. Savage Professor until his retirement. He was noted for his work in sequential analysis and nonlinear renewal theory, in central limit theory, and in nonparametric inference with shape constraints.
Dorota Maria Dabrowska is a Polish statistician known for applying nonparametric statistics and semiparametric models to counting processes and survival analysis. Dabrowska's estimator, from her paper "Kaplan–Meier estimate on the plane" is a widely used tool for bivariate survival under random censoring.
Charles "Chuck" Joel Stone was an American statistician and mathematician.
Marloes Henriette Maathuis is a Dutch statistician known for her work on causal inference using graphical models, particularly in high-dimensional data from applications in biology and epidemiology. She is a professor of statistics at ETH Zurich in Switzerland.
Hannu Frans Vilhelm Oja is a Finnish mathematical statistician and biostatistician known for his contribution to nonparametric inference, robust statistics, and multivariate statistical methods. He introduced the Oja median for multivariate distributions.
Jon August Wellner is an American statistician known for his contributions to the fields of statistical inference, empirical process theory, and survival analysis.
A class of functions is considered a Donsker class if it satisfies Donsker's theorem, a functional generalization of the central limit theorem.
The formulation and proof of Sanov's theorem in the τ-topology is due to Groeneboom, Oosterhoff, and Ruymgart
This paper was awarded the Rollo Davidson prize 1985 (Cambridge, UK)