Introduction to Data Science
Statistics and Prediction Algorithms Through Case Studies
Rafael A. Irizarry(Author)
Chapman and Hall (Publisher)
2nd Edition
Will be published approx. on 30. October 2026
479 pages
E-Book
978-1-040-69616-3 (ISBN)
System requirements
for ePUB without DRM
E-Book Single Licence
You are acquiring a single user licence for this eBook, which you might not transfer. [L]
Not yet available
Description
Alles über E-Books | Antworten auf Fragen rund um E-Books, Kopierschutz und Dateiformate finden Sie in unserem Info- & Hilfebereich.
Introduction to Data Science: Statistics and Prediction Algorithms Through Case Studies teaches data science as a way of thinking statistically, not just as a collection of computational tools. Building on the topics covered in Introduction to Data Science: Data Wrangling and Visualization with R, this book is designed for students with some programming experience and basic mathematical maturity, this book builds the foundations of probability, statistical inference, regression, high-dimensional data analysis, and machine learning through real data examples and reproducible R code. It is suitable for one-semester course in advanced data science.
The book shows how to reason about variability, uncertainty, prediction error, model assumptions, and validation. Through case studies involving polling, genetics, baseball, recommendation systems, image classification, and other modern datasets, readers learn how to connect probability models to data, summarize complex information, quantify uncertainty, fit and interpret models, evaluate prediction algorithms, and understand the statistical ideas behind machine learning. Each chapter is designed to support classroom teaching, self-study, and hands-on analysis, with exercises and companion web materials available through the book website.
Key Features:
Includes base R, data.table, and tidyverse code.
Focuses on the statistical and probabilistic foundations of machine learning.
Contains real-world case studies.
Rafael A. Irizarry is Professor and Chair of the Department of Data Science at Dana-Farber Cancer Institute and Professor of Applied Statistics at Harvard. His research focuses on Genomics and he has taught several Data Science courses.
The book shows how to reason about variability, uncertainty, prediction error, model assumptions, and validation. Through case studies involving polling, genetics, baseball, recommendation systems, image classification, and other modern datasets, readers learn how to connect probability models to data, summarize complex information, quantify uncertainty, fit and interpret models, evaluate prediction algorithms, and understand the statistical ideas behind machine learning. Each chapter is designed to support classroom teaching, self-study, and hands-on analysis, with exercises and companion web materials available through the book website.
Key Features:
Includes base R, data.table, and tidyverse code.
Focuses on the statistical and probabilistic foundations of machine learning.
Contains real-world case studies.
Rafael A. Irizarry is Professor and Chair of the Department of Data Science at Dana-Farber Cancer Institute and Professor of Applied Statistics at Harvard. His research focuses on Genomics and he has taught several Data Science courses.
More details
Series
Edition
2nd edition
Language
English
Place of publication
London
United Kingdom
Publishing group
Taylor & Francis Ltd
Target group
College/higher education
Product notice
Reflowable
Illustrations
12 Tables, black and white; 113 Line drawings, color; 102 Line drawings, black and white; 4 Halftones, color; 117 Illustrations, color; 102 Illustrations, black and white
ISBN-13
978-1-040-69616-3 (9781040696163)
Copyright in bibliographic data is held by Nielsen Book Services Limited or its licensors: all rights reserved.
Schweitzer Classification
Other editions
Additional editions
Rafael A. Irizarry
Introduction to Data Science
Statistics and Prediction Algorithms Through Case Studies
Book
approx. 10/2026
2nd Edition
Chapman & Hall/CRC
€84.36
Not yet published
Rafael A. Irizarry
Introduction to Data Science
Statistics and Prediction Algorithms Through Case Studies
Book
approx. 10/2026
2nd Edition
Chapman & Hall/CRC
€207.98
Not yet published
Person
Rafael A. Irizarry is Professor and Chair of the Department of Data Science at Dana-Farber Cancer Institute and Professor of Applied Statistics at Harvard. His research focuses on Genomics and he has taught several Data Science courses.
Author
Dept. of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, USA
Content
Distributions Numerical Summaries Comparing Groups Connecting Data and Probability Discrete Probability Continuous Probability Random Variables Sampling Models and the Central Limit Theorem Estimates and Confidence Intervals Data-Driven Models Bayesian Statistics Hierarchical Models Hypothesis Testing Bootstrap Introduction to Regression The Linear Model Framework Treatment Effect Models Generalized Linear Models Association Is Not Causation Multivariable Regression Working with Matrices in R Applied Linear Algebra Dimension Reduction Regularization Latent Factor Models Notation and Terminology Performance Metrics Conditional Expectations and Smoothing Resampling and Model Assessment Supervised Learning Methods Building Machine Learning Models Unsupervised Learning: Clustering
System requirements
File format: ePUB
Copy protection: without DRM (Digital Rights Management)
System requirements:
- Computer (Windows; MacOS X; Linux): Use a reader that can handle the file format ePUB, such as Adobe Digital Editions or FBReader – both free (see eBook Help).
- Tablet/Smartphone (Android; iOS): Install the free app Adobe Digital Editions or the app PocketBook (see eBook Help).
- E-reader: Bookeen, Kobo, Pocketbook, Sony, Tolino and many more (not Kindle).
The file format ePUB works well for novels and non-fiction books – i.e., 'flowing' text without complex layout. On an e-reader or smartphone, line and page breaks automatically adjust to fit the small displays.
This eBook does not use copy protection or Digital Rights Management
For more information, see our eBook Help page.