Benjamin Baumer | |
---|---|
Born | |
Citizenship | American |
Alma mater | Wesleyan University (BA); University of California, San Diego (MA); City University of New York (PhD) |
Spouse | Cory Mescon |
Scientific career | |
Fields | |
Institutions | Smith College |
Thesis | (2012) |
Benjamin Strong Baumer is a statistician and sabermetrician. He is a professor of statistical and data sciences at Smith College, and was formerly the statistical analyst for the New York Mets.
Baumer grew up in Northampton, Massachusetts. [1] His parents are Polly Baumer and Don Baumer, a former magazine owner and professor of government at Smith College. [2] [3] [4]
Baumer received a Bachelor of Arts in economics from Wesleyan University and a master's degree in applied mathematics from the University of California, San Diego. [5] [6] He completed a PhD at the City University of New York. [6]
Baumer is married to Cory Mescon, a public defender. [2] [7]
Baumer is known for his work in sabermetrics, including the book The Sabermetric Revolution: Assessing the Growth of Analytics in Baseball, written with Andrew Zimbalist. [8] [5] He was the statistical analyst for the New York Mets for eight years, from 2004 to 2012. [9] [10] He joined the team shortly after the publication of Moneyball, when the use of statistical analysis in baseball was still new. [9]
Since leaving the Mets, Baumer has been a professor at Smith College. Upon arrival at Smith, he taught in the mathematics department. [10] He was instrumental in the development of Smith's program in statistical and data sciences, and is now appointed in that program. [11] The program is one of the first undergraduate majors in data science in the United States, and the first at a women's college. [12] [13] Baumer is also a member of the advisory board for the MassMutual data science initiative, a joint effort with Smith College, Mount Holyoke College, and MassMutual. [14] [15]
Baumer has written a textbook for use in data science courses, Modern Data Science with R. [16] [17] He has several highly cited papers on pedagogical techniques for undergraduate data science education. [18] [19] He has taught online data science courses for DataCamp. [20] He is a member of the national organizing committee for DataFest, a weekend-long data hackathon for undergraduate students. Baumer has also organized the Five College DataFest since 2014. [21] [22] [23]
He is the author of several R packages, including openWAR, a package for analyzing baseball data, and etl, a package for Extract, Transform, Load operations on medium data. [24] [25] [26]
Baumer received the 2016 Contemporary Baseball Analysis Award. [27] His project, The Great Analytics Rankings, was nominated for a 2015 EPPY award. [28]
In sports analytics, sabermetrics is the empirical analysis of baseball, especially baseball statistics that measure in-game activity. Sabermetricians collect and summarize the relevant data from this in-game activity to answer specific questions. The term is derived from the acronym SABR, which stands for the Society for American Baseball Research, founded in 1971. The term "sabermetrics" was coined by Bill James, who is one of its pioneers and is often considered its most prominent advocate and public face.
The posterior probability is a type of conditional probability that results from updating the prior probability with information summarized by the likelihood via an application of Bayes' rule. From an epistemological perspective, the posterior probability contains everything there is to know about an uncertain proposition, given prior knowledge and a mathematical model describing the observations available at a particular time. After the arrival of new information, the current posterior probability may serve as the prior in another round of Bayesian updating.
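The updating cycle described above can be sketched in a few lines of Python. This is a minimal illustration, not a definitive implementation: it assumes a Beta prior on a success probability with binomial data (a conjugate pair, so the posterior is again a Beta distribution), and the batting numbers and function names are hypothetical.

```python
from fractions import Fraction


def update_beta(prior, hits, at_bats):
    """Conjugate Beta-Binomial update.

    prior is an (alpha, beta) pair; the posterior adds the observed
    successes to alpha and the observed failures to beta.
    """
    alpha, beta = prior
    return (alpha + hits, beta + (at_bats - hits))


def beta_mean(params):
    """Mean of a Beta(alpha, beta) distribution, as an exact fraction."""
    alpha, beta = params
    return Fraction(alpha, alpha + beta)


# Hypothetical data: a uniform Beta(1, 1) prior on a batter's hit probability.
prior = (1, 1)

# First round: 3 hits in 10 at-bats.
first_posterior = update_beta(prior, hits=3, at_bats=10)

# Second round: the previous posterior serves as the new prior.
second_posterior = update_beta(first_posterior, hits=5, at_bats=10)

# Updating once with all the data gives the same posterior.
all_at_once = update_beta(prior, hits=8, at_bats=20)

print(first_posterior, second_posterior, all_at_once)
print(float(beta_mean(second_posterior)))
```

In this conjugate setting, updating in two rounds and updating once with the pooled data lead to the same posterior, which is the sense in which a posterior can serve as the prior for the next round.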
R is a programming language for statistical computing and data visualization. It has been adopted in the fields of data mining, bioinformatics, and data analysis.
Moneyball: The Art of Winning an Unfair Game is a book by Michael Lewis, published in 2003, about the Oakland Athletics baseball team and its general manager Billy Beane. It describes the team's sabermetric approach to assembling a competitive baseball team on a small budget. It led to the 2011 film Moneyball, starring Brad Pitt and Jonah Hill.
Chapman University is a private research university in Orange, California. Encompassing eleven colleges, the university is classified among "R2: Doctoral Universities – High research activity". The school maintains its founding affiliations with the Christian Church and the United Church of Christ, but is a secular university.
Sir David Roxbee Cox was a British statistician and educator. His wide-ranging contributions to the field of statistics included introducing logistic regression, the proportional hazards model and the Cox process, a point process named after him.
Andrew S. Zimbalist is an American economist and author of twenty-four books. He is the Robert A. Woods Professor of Economics at Smith College.
A mixed model, mixed-effects model or mixed error-component model is a statistical model containing both fixed effects and random effects. These models are useful in a wide variety of disciplines in the physical, biological and social sciences. They are particularly useful in settings where repeated measurements are made on the same statistical units, or where measurements are made on clusters of related statistical units. Mixed models are often preferred over traditional analysis of variance regression models because they do not rely on the assumption of independent observations. They are also flexible in dealing with missing values and uneven spacing of repeated measurements. Mixed model analysis allows measurements to be explicitly modeled with a wider variety of correlation and variance-covariance structures, which helps avoid biased estimates.
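For reference, a linear mixed model is commonly written in matrix form as below. The notation is the conventional textbook one and is an assumption here, not taken from the text above: y is the vector of responses, X and Z are the design matrices for the fixed effects β and the random effects u, and G and R are the covariance matrices of the random effects and the residual errors.

```latex
\[
\mathbf{y} = X\boldsymbol{\beta} + Z\mathbf{u} + \boldsymbol{\varepsilon},
\qquad \mathbf{u} \sim \mathcal{N}(\mathbf{0}, G),
\qquad \boldsymbol{\varepsilon} \sim \mathcal{N}(\mathbf{0}, R)
\]
```

Under this formulation, with u and ε independent, the responses have covariance ZGZᵀ + R, which is how correlation among repeated or clustered measurements enters the model.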
Norman Saul Matloff is an American professor of computer science at the University of California, Davis.
In statistics, a generalized estimating equation (GEE) is used to estimate the parameters of a generalized linear model with a possible unmeasured correlation between observations from different timepoints. GEEs are sometimes believed to be robust in every respect to a wrong choice of working correlation matrix; in fact, they are robust only against loss of consistency under a wrong choice.
Hadley Alexander Wickham is a New Zealand statistician known for his work on open-source software for the R statistical programming environment. He is the chief scientist at Posit PBC and an adjunct professor of statistics at the University of Auckland, Stanford University, and Rice University. His work includes the data visualisation system ggplot2 and the tidyverse, a collection of R packages for data science based on the concept of tidy data.
Yihui Xie is a Chinese statistician, data scientist and software engineer who formerly worked for Posit PBC. He is the principal author of the open-source software package knitr for data analysis in the R programming language, and has also written the book Dynamic Documents with R and knitr.
Sports analytics are collections of relevant historical statistics that can provide a competitive advantage to a team or individual by informing players, coaches and other staff and by facilitating decision-making both during and prior to sporting events. The term "sports analytics" was popularized in mainstream sports culture following the release of the 2011 film Moneyball. In this film, Oakland Athletics general manager Billy Beane relies heavily on baseball analytics to build a competitive team on a minimal budget, building upon and extending the established practice of sabermetrics.
Mine Çetinkaya-Rundel is a Turkish-American statistician and professor of the practice at Duke University, and a professional educator at RStudio. She is the author of several open source statistics textbooks and is an instructor for Coursera. She is the chair-elect of the Statistical Education Section of the American Statistical Association. Previously, she was a senior lecturer at University of Edinburgh.
Jeanette Grasselli Brown is an American analytical chemist and spectroscopist who is known for her work with Standard Oil of Ohio as an industrial researcher in the field of spectroscopy.
Jennifer "Jenny" Bryan is a data scientist and an associate professor of statistics at the University of British Columbia, where she developed the Master in Data Science Program. Based in Vancouver, Canada, she is a statistician and software engineer at RStudio and is known for creating open source tools that connect R to Google Sheets and Google Drive.
Galit Shmueli is a data scientist who works in Taiwan as Tsing Hua Distinguished Professor at the Institute of Service Science, National Tsing Hua University. She is the author of many textbooks in business statistics and is known for her work on information quality, and on clarifying the difference between explanations and predictions in statistical analyses.
Sherri Nichols is an American software engineer, data scientist, and baseball statistician best known for her contributions to baseball's sabermetrics movement. Having grown up loving baseball and math, Nichols fused the two passions to start analyzing baseball in a stats-driven manner. Her influence on the early stages of the sabermetrics movement in the 1980s and 1990s can be seen in works such as Nichols' Law of Catcher Defense, her collection of play-by-play data, and, most notably, her co-creation of Defensive Average. Nichols' assertiveness and knowledge have greatly influenced other notable baseball statisticians and paved the way for other women to enter the male-dominated industry.
Nicholas (Nick) Horton is an American statistics professor and author. He is the Beitzel Professor in Technology and Society at Amherst College. In 2022, he began a 3-year term as the vice president of the American Statistical Association.