Easystats

Initial release 2019
Written in R
Operating system All OS supported by R
Available in English
Type Statistical software
License GPL-3.0
Website github.com/easystats/easystats

The easystats collection of open-source R packages was created in 2019 and primarily includes tools dedicated to the post-processing of statistical models. [1] [2] As of May 2022, the 10 packages composing the easystats ecosystem had been downloaded more than 8 million times and used in more than 1,000 scientific publications. [3] [4] [5] The ecosystem is the topic of several statistical courses, video tutorials and books. [6] [7] [8] [9] [10] [11] [12]

The aim of easystats is to provide a unifying and consistent framework for understanding and reporting statistical results. It is also compatible with other collections of packages, such as the tidyverse. Notable design characteristics include its API, with particular attention given to the names of functions and arguments (e.g., avoiding acronyms and abbreviations), and its low number of dependencies. [2]

History

In 2019, Dominique Makowski contacted software developer Daniel Lüdecke with the idea of collaborating on a collection of R packages intended to make data science easier for users without a statistical or computer science background. The first easystats package, insight, was created in 2019 and was envisioned as the foundation of the ecosystem. [1] The second package to emerge, bayestestR, benefited from the arrival of Bayesian expert Mattan S. Ben-Shachar. Other maintainers include Indrajeet Patil and Brenton M. Wiernik. [2]

Packages

The easystats ecosystem contains ten semi-independent packages.

References

  1. "easystats: one year already. What's next?". r-bloggers. 23 January 2020. Retrieved 14 January 2022.
  2. "easystats". GitHub. 14 January 2022. Retrieved 14 January 2022.
  3. "easystats Downloads". GitHub. 14 January 2022. Retrieved 14 January 2022.
  4. "Project "easystats"". ResearchGate. Retrieved 16 January 2022.
  5. "Dominique Makowski's Google Scholar Profile". scholar.google.fr.
  6. "easystats: Quickly investigate model performance". Business Science. 13 July 2021. Retrieved 17 January 2022.
  7. "Automate Textual Reports of Statistical Models in R! report / easystats". YouTube. Retrieved 17 January 2022.
  8. Field, Andy P. (2012). Discovering statistics using R. Thousand Oaks, California. ISBN 978-1446200469.
  9. "Analyse des corrélations avec easystats". rzine.fr. Retrieved 17 January 2022.
  10. Su, Gang (2 September 2020). "A Comprehensive List of Handy R Packages". towardsdatascience.com. Retrieved 17 January 2022.
  11. Kennedy, Ryan (2021). Introduction to R for social scientists: a Tidy programming approach. Boca Raton. ISBN 9781000353877.
  12. Monkman, Martin. "Data Science with R: A Resource Compendium". Retrieved 18 May 2022.
  13. Lüdecke, Daniel; Waggoner, Philip D.; Makowski, Dominique (25 June 2019). "insight: A Unified Interface to Access Information from Model Objects in R". Journal of Open Source Software. 4 (38): 1412. Bibcode:2019JOSS....4.1412L. doi:10.21105/joss.01412. S2CID 198640623.
  14. Makowski, Dominique; Ben-Shachar, Mattan; Lüdecke, Daniel (13 August 2019). "bayestestR: Describing Effects and their Uncertainty, Existence and Significance within the Bayesian Framework". Journal of Open Source Software. 4 (40): 1541. Bibcode:2019JOSS....4.1541M. doi:10.21105/joss.01541. S2CID 201882316.
  15. "SIPS Awards". Retrieved 21 August 2022.
  16. Makowski, Dominique; Ben-Shachar, Mattan; Patil, Indrajeet; Lüdecke, Daniel (16 July 2020). "Methods and Algorithms for Correlation Analysis in R". Journal of Open Source Software. 5 (51): 2306. Bibcode:2020JOSS....5.2306M. doi:10.21105/joss.02306. S2CID 225530918.
  17. Lüdecke, Daniel; Ben-Shachar, Mattan; Patil, Indrajeet; Waggoner, Philip; Makowski, Dominique (21 April 2021). "performance: An R Package for Assessment, Comparison and Testing of Statistical Models". Journal of Open Source Software. 6 (60): 3139. Bibcode:2021JOSS....6.3139L. doi:10.21105/joss.03139. S2CID 233378359.
  18. Ben-Shachar, Mattan; Lüdecke, Daniel; Makowski, Dominique (23 December 2020). "effectsize: Estimation of Effect Size Indices and Standardized Parameters". Journal of Open Source Software. 5 (56): 2815. Bibcode:2020JOSS....5.2815B. doi:10.21105/joss.02815. S2CID 229576898.
  19. Lüdecke, Daniel; Ben-Shachar, Mattan; Patil, Indrajeet; Makowski, Dominique (9 September 2020). "Extracting, Computing and Exploring the Parameters of Statistical Models using R". Journal of Open Source Software. 5 (53): 2445. Bibcode:2020JOSS....5.2445L. doi:10.21105/joss.02445. S2CID 225319884.
  20. Lüdecke, Daniel; Patil, Indrajeet; Ben-Shachar, Mattan S.; Wiernik, Brenton M.; Waggoner, Philip; Makowski, Dominique (6 August 2021). "see: An R Package for Visualizing Statistical Models". Journal of Open Source Software. 6 (64): 3393. Bibcode:2021JOSS....6.3393L. doi:10.21105/joss.03393. S2CID 238778250.