Easystats

Last updated
Easystats
Initial release2019 (2019)
Written in R
Operating system All OS supported by R
Available inEnglish
Type Statistical software
License GPL-3.0
Website github.com/easystats/easystats

The easystats collection of open source R packages was created in 2019 and primarily includes tools dedicated to the post-processing of statistical models. [1] [2] As of May 2022, the 10 packages composing the easystats ecosystem have been downloaded more than 8 million times, and have been used in more than 1000 scientific publications. [3] [4] [5] The ecosystem is the topic of several statistical courses, video tutorials and books. [6] [7] [8] [9] [10] [11]

Contents

The aim of easystats is to provide a unifying and consistent framework to understand and report statistical results. It is also compatible with other collections of packages, such as the tidyverse. Notable design characteristics include its API, with a particular attention given to the names of functions and arguments (e.g., avoiding acronyms and abbreviations), and its low number of dependencies. [2] [ better source needed ]

History

In 2019, Dominique Makowski contacted software developer Daniel Lüdecke with the idea to collaborate around a collection of R packages aiming at facilitating data science for users without a statistical or computer science background. The first package of easystats, insight was created in 2019, and was envisioned as the foundation of the ecosystem. [1] The second package that emerged, bayestestR, benefitted from the joining of Bayesian expert Mattan S. Ben-Shachar. Other maintainers include Indrajeet Patil, Brenton M. Wiernik, Etienne Bacher, and Rémi Thériault. [12]

The easystats collection of packages as a whole received the 2023 Award from the Society for the Improvement of Psychological Science (SIPS). [13]

Packages

The easystats ecosystem contains ten semi-independent packages.

See also

Related Research Articles

Bayesian statistics is a theory in the field of statistics based on the Bayesian interpretation of probability, where probability expresses a degree of belief in an event. The degree of belief may be based on prior knowledge about the event, such as the results of previous experiments, or on personal beliefs about the event. This differs from a number of other interpretations of probability, such as the frequentist interpretation, which views probability as the limit of the relative frequency of an event after many trials. More concretely, analysis in Bayesian methods codifies prior knowledge in the form of a prior distribution.

<span class="mw-page-title-main">Helical wheel</span> Protein structure visual representation

A helical wheel is a type of plot or visual representation used to illustrate the properties of alpha helices in proteins.

The Longley–Rice model (LR) is a radio propagation model: a method for predicting the attenuation of radio signals for a telecommunication link in the frequency range of 40 MHz to 100 GHz.

Psychometric software refers to specialized programs used for the psychometric analysis of data obtained from tests, questionnaires, polls or inventories that measure latent psychoeducational variables. Although some psychometric analyses can be performed using general statistical software such as SPSS, most require specialized tools designed specifically for psychometric purposes.

GeneNetwork is a combined database and open-source bioinformatics data analysis software resource for systems genetics. This resource is used to study gene regulatory networks that link DNA sequence differences to corresponding differences in gene and protein expression and to variation in traits such as health and disease risk. Data sets in GeneNetwork are typically made up of large collections of genotypes and phenotypes from groups of individuals, including humans, strains of mice and rats, and organisms as diverse as Drosophila melanogaster, Arabidopsis thaliana, and barley. The inclusion of genotypes makes it practical to carry out web-based gene mapping to discover those regions of genomes that contribute to differences among individuals in mRNA, protein, and metabolite levels, as well as differences in cell function, anatomy, physiology, and behavior.

<span class="mw-page-title-main">Icosahedral twins</span> Structure found in atomic clusters and nanoparticles

An icosahedral twin is a nanostructure found in atomic clusters and also nanoparticles with some thousands of atoms. These clusters are twenty-faced, with twenty interlinked tetrahedral crystals joined along triangular faces having three-fold symmetry. A related, more common structure has five units similarly arranged with twinning, which were known as "fivelings" in the 19th century, more recently as "decahedral multiply twinned particles", "pentagonal particles" or "star particles". A variety of different methods lead to the icosahedral form at size scales where surface energies are more important than those from the bulk.

<span class="mw-page-title-main">Stan (software)</span> Probabilistic programming language for Bayesian inference

Stan is a probabilistic programming language for statistical inference written in C++. The Stan language is used to specify a (Bayesian) statistical model with an imperative program calculating the log probability density function.

<span class="mw-page-title-main">Aperture Photometry Tool</span>

Aperture Photometry Tool (APT) is software with a graphical user interface for computing aperture photometry on astronomical imagery. Image overlays, graphical representations, statistics, models, options and controls for aperture-photometry calculations are brought together into a single package. The software handles only images in FITS file format. The software also can be used as a FITS-image viewer. APT is executed on desktop and laptop computers, and is free of charge under a license that limits its use to astronomical research and education. The software may be downloaded from its official website, and requires the Java Virtual Machine to be installed on the user's computer.

<i>Journal of Open Source Software</i> Academic journal

The Journal of Open Source Software is a peer-reviewed open-access scientific journal covering open-source software from any research discipline. The journal was founded in 2016 by editors Arfon Smith, Kyle Niemeyer, Dan Katz, Kevin Moerman, and Karthik Ram. The editor-in-chief is Arfon Smith. The journal is a sponsored project of NumFOCUS and an affiliate of the Open Source Initiative. The journal uses GitHub as publishing platform.

<span class="mw-page-title-main">Tidyverse</span> Collection of R packages

The tidyverse is a collection of open source packages for the R programming language introduced by Hadley Wickham and his team that "share an underlying design philosophy, grammar, and data structures" of tidy data. Characteristic features of tidyverse packages include extensive use of non-standard evaluation and encouraging piping.

<span class="mw-page-title-main">Flux (machine-learning framework)</span> Open-source machine-learning software library

Flux is an open-source machine-learning software library and ecosystem written in Julia. Its current stable release is v0.14.5 . It has a layer-stacking-based interface for simpler models, and has a strong support on interoperability with other Julia packages instead of a monolithic design. For example, GPU support is implemented transparently by CuArrays.jl. This is in contrast to some other machine learning frameworks which are implemented in other languages with Julia bindings, such as TensorFlow.jl, and thus are more limited by the functionality present in the underlying implementation, which is often in C or C++. Flux joined NumFOCUS as an affiliated project in December of 2021.

ArviZ is a Python package for exploratory analysis of Bayesian models. It is specifically designed to work with the output of probabilistic programming libraries like PyMC, Stan, and others by providing a set of tools for summarizing and visualizing the results of Bayesian inference in a convenient and informative way. ArviZ also provides a common data structure for manipulating and storing data commonly arising in Bayesian analysis, like posterior samples or observed data.

This is a comparison of statistical analysis software that allows doing inference with Gaussian processes often using approximations.

<span class="mw-page-title-main">Vega and Vega-Lite visualisation grammars</span> Graphics software tools

Vega and Vega-Lite are visualization tools implementing a grammar of graphics, similar to ggplot2. The Vega and Vega-Lite grammars extend Leland Wilkinson's Grammar of Graphics by adding a novel grammar of interactivity to assist in the exploration of complex datasets.

pvlib python Software for simulating solar power

pvlib python is open source software for simulating solar power of photovoltaic energy systems.

In Bayesian statistics, the probability of direction (pd) is a measure of effect existence representing the certainty with which an effect is positive or negative. This index is numerically similar to the frequentist p-value.

<span class="mw-page-title-main">Jamovi</span> Graphical user interface for R programming language

jamovi is a free and open-source computer program for data analysis and performing statistical tests. The core developers of jamovi are Jonathon Love, Damian Dropmann, and Ravi Selker, who were developers for the JASP project.

NeuroKit ("nk") is an open source toolbox for physiological signal processing. The most recent version, NeuroKit2, is written in Python and is available from the PyPI package repository. As of June 2022, the software was used in 94 scientific publications. NeuroKit2 is presented as one of the most popular and contributor-friendly open-source software for neurophysiology based on the number of downloads, the number of contributors, and other GitHub metricsa.

PreliZ is a Python package for exploring and eliciting probability distributions. While it is primarily focused on prior elicitation—the process of converting domain-specific knowledge into well-defined probability distributions—it can also be used to analyze distributions outside the context of Bayesian statistics.

References

  1. 1 2 "easystats: one year already. What's next?". r-bloggers. 23 January 2020. Retrieved 14 January 2022.
  2. 1 2 "easystats". GitHub. 14 January 2022. Retrieved 14 January 2022.
  3. "easystats Downloads". GitHub. 14 January 2022. Retrieved 14 January 2022.
  4. "Project "easystats"". ResearchGate. Retrieved 16 January 2022.
  5. "Dominique Makowski's Google Scholar Profile". scholar.google.fr.
  6. "easystats: Quickly investigate model performance". Business Science. 13 July 2021. Retrieved 17 January 2022.
  7. "Automate Textual Reports of Statistical Models in R! report / easystats". YouTube. 29 November 2021. Retrieved 17 January 2022.
  8. Field, Andy P. (2012). Discovering statistics using R. Thousand Oaks, California. ISBN   978-1446200469.{{cite book}}: CS1 maint: location missing publisher (link)
  9. "Analyse des corrélations avec easystats". rzine.fr. Retrieved 17 January 2022.
  10. Kennedy, Ryan (2021). Introduction to R for social scientists a Tidy programming approach. Boca Raton. ISBN   9781000353877.{{cite book}}: CS1 maint: location missing publisher (link)
  11. Monkman, Martin. Data Science with R: A Resource Compendium . Retrieved 18 May 2022.
  12. "easystats Authors". GitHub. 11 November 2024. Retrieved 11 November 2024.
  13. "SIPS 2023 Awards Announced!". improvingpsych. 22 August 2023. Retrieved 29 September 2023.
  14. Lüdecke, Daniel; Waggoner, Philip D.; Makowski, Dominique (25 June 2019). "insight: A Unified Interface to Access Information from Model Objects in R". Journal of Open Source Software. 4 (38): 1412. Bibcode:2019JOSS....4.1412L. doi: 10.21105/joss.01412 . S2CID   198640623.
  15. Patil, Indrajeet; Makowski, Dominique; Ben-Shachar, Mattan S.; Wiernik, Brenton M.; Bacher, Etienne; Lüdecke, Daniel (9 October 2022). "datawizard: An R Package for Easy Data Preparation and Statistical Transformations" (PDF). Journal of Open Source Software. 7 (78): 4684. doi:10.21105/joss.04684 . Retrieved 29 September 2023.
  16. Makowski, Dominique; Ben-Shachar, Mattan; Lüdecke, Daniel (13 August 2019). "bayestestR: Describing Effects and their Uncertainty, Existence and Significance within the Bayesian Framework". Journal of Open Source Software. 4 (40): 1541. Bibcode:2019JOSS....4.1541M. doi: 10.21105/joss.01541 . S2CID   201882316.
  17. "SIPS Awards". 24 July 2018. Retrieved 21 August 2022.
  18. Makowski, Dominique; Ben-Shachar, Mattan; Patil, Indrajeet; Lüdecke, Daniel (16 July 2020). "Methods and Algorithms for Correlation Analysis in R". Journal of Open Source Software. 5 (51): 2306. Bibcode:2020JOSS....5.2306M. doi: 10.21105/joss.02306 . S2CID   225530918.
  19. Lüdecke, Daniel; Ben-Shachar, Mattan; Patil, Indrajeet; Waggoner, Philip; Makowski, Dominique (21 April 2021). "performance: An R Package for Assessment, Comparison and Testing of Statistical Models". Journal of Open Source Software. 6 (60): 3139. Bibcode:2021JOSS....6.3139L. doi: 10.21105/joss.03139 . S2CID   233378359.
  20. Ben-Shachar, Mattan; Lüdecke, Daniel; Makowski, Dominique (23 December 2020). "effectsize: Estimation of Effect Size Indices and Standardized Parameters". Journal of Open Source Software. 5 (56): 2815. Bibcode:2020JOSS....5.2815B. doi: 10.21105/joss.02815 . S2CID   229576898.
  21. Lüdecke, Daniel; Ben-Shachar, Mattan; Patil, Indrajeet; Makowski, Dominique (9 September 2020). "Extracting, Computing and Exploring the Parameters of Statistical Models using R". Journal of Open Source Software. 5 (53): 2445. Bibcode:2020JOSS....5.2445L. doi: 10.21105/joss.02445 . S2CID   225319884.
  22. Lüdecke, Daniel; Patil, Indrajeet; Ben-Shachar, Mattan S.; Wiernik, Brenton M.; Waggoner, Philip; Makowski, Dominique (6 August 2021). "see: An R Package for Visualizing Statistical Models". Journal of Open Source Software. 6 (64): 3393. Bibcode:2021JOSS....6.3393L. doi: 10.21105/joss.03393 . S2CID   238778250.