Ben Baumer

Benjamin Baumer
Born
Citizenship American
Alma mater Wesleyan University (BA)
University of California, San Diego (MA)
City University of New York (PhD)
Spouse Cory Mescon
Scientific career
Fields
Institutions Smith College
Thesis  (2012)

Benjamin Strong Baumer is a statistician and sabermetrician. He is a professor of statistical and data sciences at Smith College, and was formerly the statistical analyst for the New York Mets.

Life

Baumer grew up in Northampton, Massachusetts. [1] His parents are Polly Baumer and Don Baumer, a former magazine owner and professor of government at Smith College. [2] [3] [4]

Baumer received a Bachelor of Arts in economics from Wesleyan University and a master's degree in applied mathematics from the University of California, San Diego. [5] [6] He completed a PhD at the City University of New York. [6]

Baumer is married to Cory Mescon, a public defender. [2] [7]

Work

Baumer is known for his work in sabermetrics, including the book The Sabermetric Revolution: Assessing the Growth of Analytics in Baseball, co-written with Andrew Zimbalist. [8] [5] He was the statistical analyst for the New York Mets from 2004 to 2012. [9] [10] He joined the team shortly after the publication of Moneyball, when the use of statistical analysis in baseball was still a new field. [9]

Since leaving the Mets, Baumer has been a professor at Smith College. Upon arrival at Smith, he taught in the mathematics department. [10] He was instrumental in the development of Smith's program in statistical and data sciences, and is now appointed in that program. [11] The program is one of the first undergraduate majors in data science in the United States, and the first at a women's college. [12] [13] Baumer is also a member of the advisory board for the MassMutual data science initiative, a joint effort with Smith College, Mount Holyoke College, and MassMutual. [14] [15]

Baumer has written a textbook for use in data science courses, Modern Data Science with R. [16] [17] He has several highly cited papers on pedagogical techniques for undergraduate data science education. [18] [19] He has taught online data science courses for DataCamp. [20] He is a member of the national organizing committee for DataFest, a weekend-long data hackathon for undergraduate students. Baumer has also organized the Five College DataFest since 2014. [21] [22] [23]

He is the author of several R packages, including openWAR, a package for analyzing baseball data, and etl, a package for Extract, Transform, Load operations on medium data. [24] [25] [26]

Awards

Baumer received the 2016 Contemporary Baseball Analysis Award. [27] His project, The Great Analytics Rankings, was nominated for a 2015 EPPY award. [28]

Bibliography

References

  1. "Ben Baumer | Smith College". www.smith.edu. Retrieved 2018-02-25.
  2. "Cory Mescon, Benjamin Baumer". The New York Times. 2010-06-18. ISSN 0362-4331. Retrieved 2018-02-25.
  3. "Donald C. Baumer | Smith College". www.smith.edu. Retrieved 2018-02-25.
  4. Amanda Drane (2015-12-28). "When an accident took Maggie Baumer of Northampton's arm she rebuilt her life to help others". Daily Hampshire Gazette.
  5. David Low (2014-03-14). "Books by Gilbert '98, Baumer '00, Zimbalist P'02 Take Swings at Baseball History, Analytics". News @ Wesleyan.
  6. "Ben Baumer". Statistics.com. Archived from the original on 2018-02-26.
  7. "Northampton (Dist PD) | Directories". www.publiccounsel.net. Retrieved 2018-02-25.
  8. "The Sabermetric Revolution". University of Pennsylvania Press.
  9. Matthew Yaspan (2014-03-14). "An interview with former Mets stat guru Ben Baumer, Part 1". Amazin' Avenue.
  10. Adam Rubin (2012-05-28). "Stat guru Baumer leaving Mets to teach". ESPN.
  11. Cas Sweeney (2017-05-20). "A look into Statistical and Data Sciences- One of Smith's newest and fastest growing majors". Archived from the original on 2018-02-26. Retrieved 2018-02-25.
  12. Emily Cutts (2018-02-20). "Smith College provost named president of the College of William & Mary". Daily Hampshire Gazette.
  13. Steve Pierson (2014-12-08). "Universities and Colleges Creating New Undergraduate Statistics (and Related) Programs". American Statistical Association.
  14. "Jim Kinney". MassLive. 2015-02-13.
  15. "The Center for Data Science and MassMutual Host Local Data Scientists & Business Leaders". UMass Amherst Center for Data Science.
  16. Baumer, Benjamin; Kaplan, Daniel; Horton, Nicholas (2017). Modern Data Science with R (1 ed.). Chapman and Hall/CRC.
  17. Baumer, Benjamin; Kaplan, Daniel; Horton, Nicholas (2021). Modern Data Science with R (2 ed.). Chapman and Hall/CRC.
  18. Baumer, Ben; Cetinkaya-Rundel, Mine; Bray, Andrew; Loi, Linda; Horton, Nicholas (2014-01-01). "R Markdown: Integrating A Reproducible Analysis Tool into Introductory Statistics". Technology Innovations in Statistics Education. 8 (1). arXiv:1402.1894. doi:10.5070/T581020118.
  19. Hardin, Johanna; Hoerl, Roger; Horton, Nicholas; Nolan, Deborah; Baumer, Ben; Hall-Holt, Olaf; Murrell, Paul; Peng, Roger; Roback, Paul; Temple Lang, Duncan; Ward, Mark (2015-10-02). "Data science in statistics curricula: Preparing students to 'think with data'". The American Statistician. 69 (4): 343–353. arXiv:1410.3127. Bibcode:2014arXiv1410.3127H. doi:10.1080/00031305.2015.1077729. S2CID 88520302.
  20. Gabriel de Selding (2017-02-01). "DataChats: An Interview with Ben Baumer". DataCamp.
  21. Gould, Robert; Baumer, Ben; Cetinkaya-Rundel, Mine; Bray, Andrew (2014-06-01). "Big Data Goes to College". Amstat News.
  22. "ASA DataFest Contact".
  23. Carl Bialik (2014-05-02). "The Students Most Likely to Take Our Jobs". FiveThirtyEight.
  24. Ben Baumer (2014-03-17). "Introduction to openWAR". Exploring Baseball Data with R.
  25. Ben Baumer. "An R package enabling the computation of openWAR using MLBAM data". GitHub.
  26. "etl: Extract-Transform-Load Framework for Medium Data". Comprehensive R Archive Network. 17 May 2021.
  27. "Baumer, Brudnicki, McMurray win 2016 SABR Analytics Conference Research Awards". Society for American Baseball Research.
  28. "Editor & Publisher Announces the 2015 EPPY Award Finalists". Editor & Publisher.
  29. "The Sabermetric Revolution | Benjamin Baumer, Andrew Zimbalist". www.upenn.edu. Retrieved 2018-03-02.
  30. "Modern Data Science with R". CRC Press. Retrieved 2018-03-02.
  31. "Analyzing Baseball Data with R, Second Edition". CRC Press. Retrieved 2022-07-11.