Introduction to Data Science
Statistics and Prediction Algorithms Through Case Studies
Rafael A. Irizarry (Author)
Chapman & Hall/CRC (Publisher)
2nd Edition
Expected publication date: approx. 30 October 2026
Book
Paperback/Softback
479 pages
978-1-032-41987-9 (ISBN)
Description
Introduction to Data Science: Statistics and Prediction Algorithms Through Case Studies teaches data science as a way of thinking statistically, not just as a collection of computational tools. Building on the topics covered in Introduction to Data Science: Data Wrangling and Visualization with R, this book is designed for students with some programming experience and basic mathematical maturity. It builds the foundations of probability, statistical inference, regression, high-dimensional data analysis, and machine learning through real data examples and reproducible R code. It is suitable for a one-semester course in advanced data science.
The book shows how to reason about variability, uncertainty, prediction error, model assumptions, and validation. Through case studies involving polling, genetics, baseball, recommendation systems, image classification, and other modern datasets, readers learn how to connect probability models to data, summarize complex information, quantify uncertainty, fit and interpret models, evaluate prediction algorithms, and understand the statistical ideas behind machine learning. Each chapter is designed to support classroom teaching, self-study, and hands-on analysis, with exercises and companion web materials available through the book website.
Key Features:
Includes base R, data.table, and tidyverse code.
Focuses on the statistical and probabilistic foundations of machine learning.
Contains real-world case studies.
Rafael A. Irizarry is Professor and Chair of the Department of Data Science at Dana-Farber Cancer Institute and Professor of Applied Statistics at Harvard. His research focuses on genomics, and he has taught several data science courses.
More details
Edition
2nd edition
Language
English
Place of publication
United Kingdom
Publishing group
Taylor & Francis Ltd
Target group
College/higher education
Postgraduate, Undergraduate Advanced, and Undergraduate Core
Illustrations
12 Tables, black and white; 113 Line drawings, color; 102 Line drawings, black and white; 4 Halftones, color; 117 Illustrations, color; 102 Illustrations, black and white
Dimensions
Height: 254 mm
Width: 178 mm
ISBN-13
978-1-032-41987-9 (9781032419879)
Copyright in bibliographic data is held by Nielsen Book Services Limited or its licensors: all rights reserved.
Other editions
Rafael A. Irizarry
Introduction to Data Science
Statistics and Prediction Algorithms Through Case Studies
E-Book
approx. 10/2026
2nd Edition
Chapman and Hall
€68.49
Not yet available
Rafael A. Irizarry
Introduction to Data Science
Statistics and Prediction Algorithms Through Case Studies
Book
approx. 10/2026
2nd Edition
Chapman & Hall/CRC
€207.98
Not yet published
Person
Rafael A. Irizarry is Professor and Chair of the Department of Data Science at Dana-Farber Cancer Institute and Professor of Applied Statistics at Harvard. His research focuses on genomics, and he has taught several data science courses.
Author
Dept. of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, USA
Content
Distributions
Numerical Summaries
Comparing Groups
Connecting Data and Probability
Discrete Probability
Continuous Probability
Random Variables
Sampling Models and the Central Limit Theorem
Estimates and Confidence Intervals
Data-Driven Models
Bayesian Statistics
Hierarchical Models
Hypothesis Testing
Bootstrap
Introduction to Regression
The Linear Model Framework
Treatment Effect Models
Generalized Linear Models
Association Is Not Causation
Multivariable Regression
Working with Matrices in R
Applied Linear Algebra
Dimension Reduction
Regularization
Latent Factor Models
Notation and Terminology
Performance Metrics
Conditional Expectations and Smoothing
Resampling and Model Assessment
Supervised Learning Methods
Building Machine Learning Models
Unsupervised Learning: Clustering