
Test-Driven Data Analysis
Nicholas J. Radcliffe(Author)
Chapman & Hall/CRC (Publisher)
1st Edition
Published on 18. May 2026
Book
Paperback/Softback
424 pages
978-1-032-89670-0 (ISBN)
Description
Test-driven data analysis is the synthesis of ideas from test-driven development of software to data-intensive work including data science, data analysis, and data engineering. It is a methodology for improving the quality of data and of analytical pipelines and processes. It can be thought of as data analysis as if the answers actually matter.
Test-driven data analysis can be thought of as a sibling to reproducible research, with similar concerns, but greater emphasis on automated testing, and less requirement for a human to reproduce results. Extensive checklists are provided that can be used to improve quality before,during, and after analysis.
Key Features:
Prevents costly errors in analytical processes before they reach production through automated data validation and reference testing of data pipelines.
* Provides actionable checklists for issues beyond the reach of automated testing.
* Equips readers with open-source Python tools and language-agnostic command-line interfaces.
* Addresses testing challenges for modern LLM-based systems including chat-bots and coding assistants.
* Instills in analysts an inner voice that is always asking: "How is this misleading data misleading me?"
Test-driven data analysis can be thought of as a sibling to reproducible research, with similar concerns, but greater emphasis on automated testing, and less requirement for a human to reproduce results. Extensive checklists are provided that can be used to improve quality before,during, and after analysis.
Key Features:
Prevents costly errors in analytical processes before they reach production through automated data validation and reference testing of data pipelines.
* Provides actionable checklists for issues beyond the reach of automated testing.
* Equips readers with open-source Python tools and language-agnostic command-line interfaces.
* Addresses testing challenges for modern LLM-based systems including chat-bots and coding assistants.
* Instills in analysts an inner voice that is always asking: "How is this misleading data misleading me?"
More details
Series
Language
English
Place of publication
United Kingdom
Publishing group
Taylor & Francis Ltd
Target group
Professional and scholarly
Professional Practice & Development
Product notice
Paperback (trade)
Illustrations
54 s/w Abbildungen, 8 s/w Photographien bzw. Rasterbilder, 46 s/w Zeichnungen, 14 s/w Tabellen
14 Tables, black and white; 46 Line drawings, black and white; 8 Halftones, black and white; 54 Illustrations, black and white
Dimensions
Height: 234 mm
Width: 156 mm
Thickness: 23 mm
Weight
622 gr
ISBN-13
978-1-032-89670-0 (9781032896700)
Copyright in bibliographic data and cover images is held by Nielsen Book Services Limited or by the publishers or by their respective licensors: all rights reserved.
Schweitzer Classification
Other editions
Additional editions

Nicholas J. Radcliffe
Test-Driven Data Analysis
E-Book
05/2026
1st Edition
Chapman and Hall
€78.99
Available for download

Nicholas J. Radcliffe
Test-Driven Data Analysis
E-Book
05/2026
1st Edition
Chapman and Hall
€78.99
Available for download

Nicholas J. Radcliffe
Test-Driven Data Analysis
Book
05/2026
1st Edition
Chapman & Hall/CRC
€74.50
Shipment within 10-20 days
Person
Nicholas Radcliffe is the Founder and Director of Stochastic Solutions Limited, a Scottish company specializing in consulting in data science, data analysis, and data engineering. He has also, since 1995, been a Visiting Professor in the Operations Research Group in the School of Mathematics at the University of Edinburgh. He is known for developing forma analysis (sic) of genetic algorithms and uplift modeling, before more recent work on test-driven data analysis.
Content
Foreword Preface Acknowledgements Author 1 Orientation I Data Validation with Constraints 2 Data Validation 3 Textual Data 4 Profiling and Auditing Data 5 Constraint Discovery and Validation 6 Custom Constraints 7 Practical Considerations 8 Serial Data II Reference Testing 9 Introduction to Reference Tests 10 Modern Software Testing 11 Reference Tests for Analytical Pipelines 12 Testing Models and Modeling III Errors of Interpretation, of Process, & of Applicability 13 Errors of Interpretation I: Formulation 14 Errors of Interpretation II: Communication 15 Errors of Interpretation III: Graphing Sins 16 Errors of Process 17 Errors of Applicability and Errors of Judgement IV Appendices A The TDDA Library, Resources, & Tools B Glossary Bibliography