Piet Groeneboom

Last updated
Petrus (Piet) Groeneboom
Piet Groeneboom, 2016.jpg
Piet Groeneboom in 2016
Born (1941-09-24) 24 September 1941 (age 82)
Awards Rollo Davidson Prize (1985)
Scientific career
Fields Statistics
Probability theory
Institutions
Thesis Large Deviations and Asymptotic Efficiencies [1]  (1979)
Doctoral advisor Jacobus (Kobus) Oosterhoff
Doctoral students Marloes Maathuis

Petrus (Piet) Groeneboom (born 24 September 1941 [2] in Scheveningen [3] ) is a Dutch statistician who made major advances in the field of shape-constrained statistical inference such as isotonic regression, and also worked in probability theory.

Contents

Education and career

At the beginning of his tertiary studies in 1959, Groeneboom enrolled in medicine at the University of Amsterdam but quickly switched to psychology at the same university, obtaining a candidate degree in 1963. [3] During his studies he attended a course on logic by analytic philosopher Else M. Barth, whose influence, along with that by Lambert Meertens after his (Groeneboom's) candidate degree, he later stated as having made him decide to study mathematics. He was an assistant of Johannes de Groot. He obtained a master's degree in mathematics in 1971, also at the University of Amsterdam, and studied at the Vrije Universiteit Amsterdam from 1975 under Kobus Oosterhoff, obtaining his Ph.D. degree in 1979. [3]

Before and immediately after obtaining his master's degree, Groeneboom worked at the psychological laboratory of the University of Amsterdam. After his second stint there ended in 1973, he moved to the Centrum Wiskunde & Informatica , which at the time was called Mathematisch Centrum (Mathematical Centre), in the same city. [3]

From 1979 to 1981, Groeneboom was a visiting assistant professor at the University of Washington, [2] to where he would return from 1999 to 2013 as affiliate professor in the department of statistics. From 1981 on, he was again based at the Mathematical Centre before being appointed full professor of statistics at the University of Amsterdam in 1984. In 1988, he moved to Delft University of Technology, where he stayed until his retirement in 2006. From 2000 to 2006 he was additionally a part-time professor at the Vrije Universiteit Amsterdam. [3]

Groeneboom has been a professor emeritus of statistics at Delft University of Technology [4] since his retirement in 2006. He has also held positions at the Vrije Universiteit Amsterdam and the University of Washington. Following his retirement he came to public attention for his statistical work in the retrial of Lucia de Berk, a Dutch nurse, who had been convicted of murder.

Research

In 1979, Groeneboom, together with Kobus Oosterhoff and Frits H. Ruymgaart, formulated and proved Sanov's theorem in a finer topology than had been known at the time. [5] A paper he published in 1983 on properties of Brownian motion gave rise to a large body of literature on minorants of more general stochastic processes. [6] One of the main areas of work of Groeneboom has been shape constrained statistical inference, which includes isotonic regression, an area with links to the aforementioned minorant problems, [6] as a special case. His interest in shape constrained inference began in the second half of his two-year stay at the University of Washington. [3] [7] In a 1985 article on an estimator of a monotone density named after Ulf Grenander, he introduced the switching (or switch) relation, which came to be used widely in the area. [8] [9] He found the analytic form of Chernoff's distribution, which later was understood to be omnipresent in monotone problems, [10] in the 1980s, [3] independently of others who worked on the problem at the same time. His paper on the problem came to be regarded as a benchmark in the field of shape constrained inference. In the 2010s he returned to the problem, giving new proofs in collaboration with Steve Lalley and Nico Temme. [11] [12]

Since the late 1980s, Groeneboom has also worked on censored regression models. [3] He established the asymptotic distribution of the nonparametric maximum likelihood estimator of the survival function in the case of "case 1 censoring". The iterative convex minorant algorithm which he introduced in 1991 found use in statistical estimation for proportional hazards models. [13]

Together with Eric Cator, Groeneboom contributed to the probabilistic analysis of the Hammersley process, a continuous interacting particle system (IPS). Methods similar to theirs were subsequently applied to other IPSs. [14] He is known to influence academic thought amongst some American statisticians such as Jon A. Wellner. [12]

Statistical advocacy in Lucia de Berk case

In the late 2000s, Groeneboom joined Richard D. Gill in the attempt to overturn the conviction of Lucia de Berk, a Dutch nurse, who had been found guilty of murdering four of her patients, and attempting to kill three others. The matter was a high-profile case in the Netherlands, notable because it depended on the probabilities of certain events. [15] They argued that statistical considerations that had led to the initial suspicions of murder, and those which had remained at the center stage of the case afterwards, were flawed. The effort was ultimately successful and de Berk was finally acquitted of all accusations in 2010. [3] [4]

Honors and awards

For his paper on Chernoff's distribution, written in 1984 but appearing much later in 1989, Groeneboom was awarded the Rollo Davidson Prize 1985. [11] [16]

Groeneboom is a fellow of the Institute of Mathematical Statistics, and an elected member of the International Statistical Institute. In 2013, he delivered the Wald lectures at the Joint Statistical Meetings in Montreal. [17] [18]

Books authored

Related Research Articles

<span class="mw-page-title-main">Median</span> Middle quantile of a data set or probability distribution

In statistics and probability theory, the median is the value separating the higher half from the lower half of a data sample, a population, or a probability distribution. For a data set, it may be thought of as "the middle" value. The basic feature of the median in describing data compared to the mean is that it is not skewed by a small proportion of extremely large or small values, and therefore provides a better representation of the center. Median income, for example, may be a better way to describe the center of the income distribution because increases in the largest incomes alone have no effect on the median. For this reason, the median is of central importance in robust statistics.

<span class="mw-page-title-main">Statistical inference</span> Process of using data analysis

Statistical inference is the process of using data analysis to infer properties of an underlying distribution of probability. Inferential statistical analysis infers properties of a population, for example by testing hypotheses and deriving estimates. It is assumed that the observed data set is sampled from a larger population.

The following outline is provided as an overview of and topical guide to statistics:

In statistics, the Behrens–Fisher problem, named after Walter-Ulrich Behrens and Ronald Fisher, is the problem of interval estimation and hypothesis testing concerning the difference between the means of two normally distributed populations when the variances of the two populations are not assumed to be equal, based on two independent samples.

Model selection is the task of selecting a model from among various candidates on the basis of performance criterion to choose the best one. In the context of machine learning and more generally statistical analysis, this may be the selection of a statistical model from a set of candidate models, given data. In the simplest cases, a pre-existing set of data is considered. However, the task can also involve the design of experiments such that the data collected is well-suited to the problem of model selection. Given candidate models of similar predictive or explanatory power, the simplest model is most likely to be the best choice.

In probability theory, an empirical process is a stochastic process that characterizes the deviation of the empirical distribution function its expectation. In mean field theory, limit theorems are considered and generalise the central limit theorem for empirical measures. Applications of the theory of empirical processes arise in non-parametric statistics.

<span class="mw-page-title-main">Richard D. Gill</span> British mathematician

Richard David Gill is a British-Dutch mathematician. He has held academic positions in the Netherlands. As a probability theorist and statistician, Gill has researched counting processes. He is also known for his consulting and advocacy on behalf of alleged victims of statistical misrepresentation, including the reversal of the murder conviction of a Dutch nurse who had been jailed for six years.

In probability theory, Chernoff's distribution, named after Herman Chernoff, is the probability distribution of the random variable

<span class="mw-page-title-main">Erich Leo Lehmann</span> American statistician (1917–2009)

Erich Leo Lehmann was a German-born American statistician, who made a major contribution to nonparametric hypothesis testing. He is one of the eponyms of the Lehmann–Scheffé theorem and of the Hodges–Lehmann estimator of the median of a population.

<span class="mw-page-title-main">Brownian excursion</span> Stochastic process

In probability theory a Brownian excursion process is a stochastic process that is closely related to a Wiener process. Realisations of Brownian excursion processes are essentially just realizations of a Wiener process selected to satisfy certain conditions. In particular, a Brownian excursion process is a Wiener process conditioned to be positive and to take the value 0 at time 1. Alternatively, it is a Brownian bridge process conditioned to be positive. BEPs are important because, among other reasons, they naturally arise as the limit process of a number of conditional functional central limit theorems.

<span class="mw-page-title-main">Ulf Grenander</span> Swedish American mathematician

Ulf Grenander was a Swedish statistician and professor of applied mathematics at Brown University.

<span class="mw-page-title-main">Gábor J. Székely</span>

Gábor J. Székely is a Hungarian-American statistician/mathematician best known for introducing energy statistics (E-statistics). Examples include: the distance correlation, which is a bona fide dependence measure, equals zero exactly when the variables are independent; the distance skewness, which equals zero exactly when the probability distribution is diagonally symmetric; the E-statistic for normality test; and the E-statistic for clustering.

Pranab Kumar Sen was an Indian-American statistician who was a professor of statistics and the Cary C. Boshamer Professor of Biostatistics at the University of North Carolina at Chapel Hill.

In Bayesian inference, the Bernstein–von Mises theorem provides the basis for using Bayesian credible sets for confidence statements in parametric models. It states that under some conditions, a posterior distribution converges in the limit of infinite data to a multivariate normal distribution centered at the maximum likelihood estimator with covariance matrix given by , where is the true population parameter and is the Fisher information matrix at the true population parameter value:

<span class="mw-page-title-main">Aad van der Vaart</span>

Adrianus Willem "Aad" van der Vaart is a Dutch professor of Stochastics at the Delft Institute of Applied Mathematics at Delft University of Technology.

<span class="mw-page-title-main">Michael Woodroofe</span> American probabilist and statistician (1940–2022)

Michael Barrett Woodroofe was an American probabilist and statistician. He was a professor of statistics and of mathematics at the University of Michigan, where he was the Leonard J. Savage Professor until his retirement. He was noted for his work in sequential analysis and nonlinear renewal theory, in central limit theory, and in nonparametric inference with shape constraints.

Dorota Maria Dabrowska is a Polish statistician known for applying nonparametric statistics and semiparametric models to counting processes and survival analysis. Dabrowska's estimator, from her paper "Kaplan–Meier estimate on the plane" is a widely used tool for bivariate survival under random censoring.

Charles "Chuck" Joel Stone was an American statistician and mathematician.

Marloes Henriette Maathuis is a Dutch statistician known for her work on causal inference using graphical models, particularly in high-dimensional data from applications in biology and epidemiology. She is a professor of statistics at ETH Zurich in Switzerland.

Jon August Wellner is an American statistician known for his contributions to the fields of statistical inference, empirical process theory, and survival analysis.

References

  1. Piet Groeneboom at the Mathematics Genealogy Project
  2. 1 2 Cator, Eric A.; Jongbloed, Geurt; Kraaikamp, Cor; Lopuhaä, Hendrik P.; Wellner, Jon A., eds. (2007). Asymptotics: Particles, Processes and Inverse Problems: A Festschrift for Piet Groeneboom. Lecture Notes–Monograph Series. Vol. 55. Beachwood, Ohio: Institute of Mathematical Statistics. doi:10.1214/lnms/1196797058. ISBN   978-0-940600-71-3.
  3. 1 2 3 4 5 6 7 8 9 Jongbloed, Geurt (2019). "A Conversation with Piet Groeneboom". Statistical Science . 34 (1): 156–168. doi: 10.1214/18-STS663 . S2CID   145849794.
  4. 1 2 Gill, Richard D.; Groeneboom, Piet; Jong, Peter de (2019). "Elementary Statistics on Trial—The Case of Lucia de Berk" . Chance . 31 (4): 9–15. arXiv: 1009.0802v3 . doi:10.1080/09332480.2018.1549809. S2CID   5245768.
  5. Dembo, Amir; Zeitouni, Ofer (1998). Large Deviations Techniques and Applications. Applications of Mathematics. Vol. 38 (2nd ed.). New York: Springer. p. 307. ISBN   978-3-642-03310-0. The formulation and proof of Sanov's theorem in the τ-topology is due to Groeneboom, Oosterhoff, and Ruymgart
  6. 1 2 Ouaki, Medi; Pitman, Jim (2022). "Markovian structure in the concave minorant of Brownian motion". Electronic Journal of Probability . 27 57: 1–21. arXiv: 2105.11042 . doi:10.1214/22-EJP769. S2CID   235166920.
  7. Groeneboom, Piet; Pyke, Ronald (1983). "Asymptotic normality of statistics based on the convex minorants of empirical distribution functions". The Annals of Probability . 11 (2): 328–345. doi: 10.1214/aop/1176993599 .
  8. Dümbgen, Lutz; Wellner, Jon A.; Wolff, Malcolm (2018). "A law of the iterated logarithm for Grenander's estimator". Stochastic Processes and Their Applications . 126 (12): 3854–3864. doi:10.1016/j.spa.2016.04.012. PMC   5193173 . PMID   28042197.
  9. Westling, Ted; Carone, Marco (2020). "A unified study of nonparametric inference for monotone functions". The Annals of Statistics . 48 (2): 1001–1024. arXiv: 1806.01928 . doi:10.1214/19-AOS1835. PMC   7377427 . PMID   32704192.
  10. Groeneboom, Piet; Jongbloed, Geurt (2018). "Some Developments in the Theory of Shape Constrained Inference". Statistical Science . 33 (4): 473–492. doi: 10.1214/18-STS657 . S2CID   13672538.
  11. 1 2 Groeneboom, Piet; Lalley, Steven; Temme, Nico (2015). "Chernoff's distribution and differential equations of parabolic and Airy type". Journal of Mathematical Analysis and Applications . 423 (2): 1804–1824. arXiv: 1305.6053 . doi:10.1016/j.jmaa.2014.10.051. S2CID   119173815.
  12. 1 2 Banerjee, Moulinath; Samworth, Richard J. (2018). "A Conversation with Jon Wellner" (PDF). Statistical Science . 33 (4): 633–651. doi:10.1214/18-STS670. S2CID   88523234.
  13. Huang, Jian; Wellner, Jon A. (1997). "Interval censored survival data: a review of recent progress" . In Lin, Danyu; Fleming, Thomas R. (eds.). Proceedings of the first Seattle symposium in biostatistics. Lecture Notes in Statistics. Vol. 123. New York: Springer. pp. 123–169. doi:10.1007/978-1-4684-6316-3_8.
  14. Kriecherbauer, Thomas; Krug, Joachim (2010). "A pedestrian's view on interacting particle systems, KPZ universality and random matrices" . Journal of Physics A . 43 (40): 403001. arXiv: 0803.2796 . doi:10.1088/1751-8113/43/40/403001. S2CID   55894237.
  15. "Elementary Statistics on Trial—The Case of Lucia de Berk | CHANCE". chance.amstat.org. Retrieved 2023-01-23.
  16. Groeneboom, Piet (1989). "Brownian motion with a parabolic drift and Airy functions". Probability Theory and Related Fields . 81 (1): 79–109. doi: 10.1007/BF00343738 . S2CID   119980629. This paper was awarded the Rollo Davidson prize 1985 (Cambridge, UK)
  17. "Wald Lectures: Piet Groeneboom". Institute of Mathematical Statistics . 16 July 2013. Retrieved 24 October 2022.
  18. "Individual members". International Statistical Institute . n.d. Retrieved 15 January 2023.