Data Science and Predictive Analytics

*Data Science and Predictive Analytics: Biomedical and Health Applications using R*
Author	Ivo D. Dinov
Language	English
Series	The Springer Series in Applied Machine Learning
Subject	Computer science, data science, artificial intelligence
Publisher	Springer
Publication date	2018 (1st ed.), 2023 (2nd edition)
Publication place	Switzerland
Media type	Print (hardcover and softcover), electronic (PDF and EPub)
ISBN	978-3-031-17483-4 978-3-319-72346-4, 978-3-031-17485-8, 978-3-031-17482-7

Last updated October 09, 2025

The first edition of the textbook Data Science and Predictive Analytics: Biomedical and Health Applications using R, authored by Ivo D. Dinov, was published in August 2018 by Springer.^[1] The second edition of the book was printed in 2023.^[2]

By using the statistical computing platform R and a broad range of biomedical case-studies, the 23 chapters of the book first edition provide explicit examples of importing, exporting, processing, modeling, visualizing, and interpreting large, multivariate, incomplete, heterogeneous, longitudinal, and incomplete datasets (big data).^[4]

Structure

First edition table of contents

The first edition of the Data Science and Predictive Analytics (DSPA) textbook^[1] is divided into the following 23 chapters, each progressively building on the previous content.

Motivation
Foundations of R
Managing Data in R
Data Visualization
Linear Algebra & Matrix Computing
Dimensionality Reduction
Lazy Learning: Classification Using Nearest Neighbors
Probabilistic Learning: Classification Using Naive Bayes
Decision Tree Divide and Conquer Classification
Forecasting Numeric Data Using Regression Models
Black Box Machine-Learning Methods: Neural Networks and Support Vector Machines
Apriori Association Rules Learning
k-Means Clustering
Model Performance Assessment
Improving Model Performance
Specialized Machine Learning Topics
Variable/Feature Selection
Regularized Linear Modeling and Controlled Variable Selection
Big Longitudinal Data Analysis
Natural Language Processing/Text Mining
Prediction and Internal Statistical Cross Validation
Function Optimization
Deep Learning, Neural Networks

Second edition table of contents

The significantly reorganized revised edition of the book (2023)^[2] expands and modernizes the presented mathematical principles, computational methods, data science techniques, model-based machine learning and model-free artificial intelligence algorithms. The 14 chapters of the new edition start with an introduction and progressively build foundational skills to naturally reach biomedical applications of deep learning.

Introduction
Basic Visualization and Exploratory Data Analytics
Linear Algebra, Matrix Computing, and Regression Modeling
Linear and Nonlinear Dimensionality Reduction
Supervised Classification
Black Box Machine Learning Methods
Qualitative Learning Methods—Text Mining, Natural Language Processing, and Apriori Association Rules Learning
Unsupervised Clustering
Model Performance Assessment, Validation, and Improvement
Specialized Machine Learning Topics
Variable Importance and Feature Selection
Big Longitudinal Data Analysis
Function Optimization
Deep Learning, Neural Networks

Reception

The materials in the Data Science and Predictive Analytics (DSPA) textbook have been peer-reviewed in the Journal of the American Statistical Association,^[5] International Statistical Institute’s ISI Review Journal,^[3] and the Journal of the American Library Association.^[4] Many scholarly publications reference the DSPA textbook.^[6]^[7]

As of January 17, 2021, the electronic version of the book first edition ( ISBN 978-3-319-72347-1) is freely available on SpringerLink^[8] and has been downloaded over 6 million times. The textbook is globally available in print (hardcover and softcover) and electronic formats (PDF and EPub) in many college and university libraries^[9] and has been used for data science, computational statistics, and analytics classes at various institutions.^[10]

References

1 2 Dinov, Ivo (2018). Data Science and Predictive Analytics: Biomedical and Health Applications Using R. Springer.
1 2 Dinov, Ivo (2023). Data Science and Predictive Analytics: Biomedical and Health Applications Using R. The Springer Series in Applied Machine Learning. Springer. doi:10.1007/978-3-031-17483-4. ISBN 978-3-031-17482-7. S2CID 256875731.
1 2 Capaldi, Mindy (April 2019) [5 April 2019]. "(Review) Data Science and Predictive Analytics: Biomedical and Health Applications Using R" . International Statistical Review. 87 (1): 181–182. doi:10.1111/insr.12317. S2CID 132379032.
1 2 Saracco, Benjamin (2020) [April 2020]. "Review of Data Science and Predictive Analytics: Biomedical and Health Applications Using R". Journal of the Medical Library Association. 108 (2): 344. doi: 10.5195/jmla.2020.901 . PMC 7069824 . S2CID 214729817.
↑ Qiu, Xing (2024). "Book Review: Data Science and Predictive Analytics, 2nd ed". Journal of the American Statistical Association. 119 (546): 1692–1693. doi:10.1080/01621459.2024.2303323.
↑ "Altmetric – Data Science and Predictive Analytics".
↑ "Google Scholar".
↑ Dinov, Ivo D. (2018). Data Science and Predictive Analytics. doi:10.1007/978-3-319-72347-1. ISBN 978-3-319-72346-4. S2CID 52098523.
↑ Textbook library availability
↑ Courses using the DSPA textbook

External links

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[DSPA_2018-1] 1 2 Dinov, Ivo (2018). Data Science and Predictive Analytics: Biomedical and Health Applications Using R. Springer.

[DSPA_2023-2] 1 2 Dinov, Ivo (2023). Data Science and Predictive Analytics: Biomedical and Health Applications Using R. The Springer Series in Applied Machine Learning. Springer. doi:10.1007/978-3-031-17483-4. ISBN 978-3-031-17482-7. S2CID 256875731.

[Capaldi_2019-3] 1 2 Capaldi, Mindy (April 2019) [5 April 2019]. "(Review) Data Science and Predictive Analytics: Biomedical and Health Applications Using R" . International Statistical Review. 87 (1): 181–182. doi:10.1111/insr.12317. S2CID 132379032.

[Saracco_2020-4] 1 2 Saracco, Benjamin (2020) [April 2020]. "Review of Data Science and Predictive Analytics: Biomedical and Health Applications Using R". Journal of the Medical Library Association. 108 (2): 344. doi: 10.5195/jmla.2020.901 . PMC 7069824 . S2CID 214729817.

[Qiu_DSPA_2024_Review-5] Qiu, Xing (2024). "Book Review: Data Science and Predictive Analytics, 2nd ed". Journal of the American Statistical Association. 119 (546): 1692–1693. doi:10.1080/01621459.2024.2303323.

[6] "Altmetric – Data Science and Predictive Analytics".

[7] "Google Scholar".

[8] Dinov, Ivo D. (2018). Data Science and Predictive Analytics. doi:10.1007/978-3-319-72347-1. ISBN 978-3-319-72346-4. S2CID 52098523.

[Libraries-9] Textbook library availability

[DSPA_Courses-10] Courses using the DSPA textbook

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]