Schweitzer Fachinformationen
Wenn es um professionelles Wissen geht, ist Schweitzer Fachinformationen wegweisend. Kunden aus Recht und Beratung sowie Unternehmen, öffentliche Verwaltungen und Bibliotheken erhalten komplette Lösungen zum Beschaffen, Verwalten und Nutzen von digitalen und gedruckten Medien.
Discover how to achieve business goals by relying on high-quality, robust data
In Data Quality: Empowering Businesses with Analytics and AI, veteran data and analytics professional delivers a practical and hands-on discussion on how to accelerate business results using high-quality data. In the book, you'll learn techniques to define and assess data quality, discover how to ensure that your firm's data collection practices avoid common pitfalls and deficiencies, improve the level of data quality in the business, and guarantee that the resulting data is useful for powering high-level analytics and AI applications.
The author shows you how to:
An essential resource for data scientists, data analysts, business intelligence professionals, chief technology and data officers, and anyone else with a stake in collecting and using high-quality data, Data Quality: Empowering Businesses with Analytics and AI will also earn a place on the bookshelves of business leaders interested in learning more about what sets robust data apart from the rest.
PRASHANTH SOUTHEKAL, PHD, is a data, analytics, and AI consultant, author, and professor. He has worked and consulted for over 80 organizations including P&G, GE, Shell, Apple, FedEx, and SAP. Dr. Southekal is the author of Data for Business Performance and Analytics Best Practices (ranked #1 analytics books of all time by BookAuthority) and writes regularly on data, analytics, and AI in Forbes and CFO.University. He serves on the Editorial Board of MIT CDOIQ Symposium and is an advisory board member at BGV (Benhamou Global Ventures) a Silicon Valley-based venture capital firm. Apart from his consulting and advisory pursuits, he has trained over 3,000 professionals worldwide in data and analytics. Dr. Southekal is also an adjunct professor of data and analytics at IE Business School (Madrid, Spain). CDO Magazine included him in the top 75 global academic data leaders of 2022. He holds a PhD from ESC Lille (FR), an MBA from the Kellogg School of Management (US), and holds the ICD.D designation from the Institute of Corporate Directors (Canada).
Foreword xvii
Preface xix
About the Book xix
Quality Principles Applied in This Book xx
Organization of the Book xxi
Who Should Read This Book? xxiii
References xxiii
Acknowledgments xxv
PART I: DEFINE PHASE 1
Chapter 1: Introduction 3
Chapter 2: Business Data 17
Chapter 3: Data Quality in Business 37
PART II: ANALYZE PHASE 63
Chapter 4: Causes for Poor Data Quality 65
Chapter 5: Data Lifecycle and Lineage 81
Chapter 6: Profiling for Data Quality 93
PART III: REALIZE PHASE 113
Chapter 7: Reference Architecture for Data Quality 115
Chapter 8: Best Practices to Realize Data Quality 133
Chapter 9: Best Practices to Realize Data Quality 161
PART IV: SUSTAIN PHASE 191
Chapter 10: Data Governance 193
Chapter 11: Protecting Data 211
Appendix 1: Abbreviations and Acronyms 237
Appendix 2: Glossary 241
Appendix 3: Data Literacy Competencies 245
About the Author 249
Index 251
"The promise of the 'data economy' (i.e., data is the new oil), combined with the naive belief that AI can turn a company's data into gold, is leading many enterprises to experiment with adopting AI. But early adopters are learning that one of the primary causes of AI failure is poor-quality data being fed into AI models (garbage in, garbage out). This highlights that raw data is insufficient and that it must be refined to create value. Consequently, implementing data-centric AI is rapidly becoming a best practice at both start-ups and established technology companies. This book provides a pragmatic approach for enterprises to acquire and manage good-quality data based on proven best practices. If you are a C-level executive or an AI practitioner seeking to deploy AI at scale to drive value creation, I highly recommend this book."
-Anik Bose
BGV managing general partner and founder, Ethical AI Governance Group (EAIGG) (United States)
"Prashanth Southekal has nailed the 'why' and the 'how' about data quality's relation to analytics in this highly readable book. To be an economically viable company in today's transparent, global, and competitive world, business leaders must champion the data quality and analytics journey and embed them in decision support systems as an operational core competency. The companies that advocate data quality and analytics integrate them in their DNA to outsmart their competitors in strategic and tactical decision making that yields sustainable success."
-Gary Cokins
President, Analytics-Based Performance Management LLC;
co-author, Predictive Business Analytics (United States)
"Good data is a source of myriad opportunities, while bad data is a tremendous burden. Data is now exposed at a much more strategic level (e.g., through business intelligence systems), increasing manifold the stakes involved for individuals and corporations, as well as government agencies. There, the lack of knowledge about data accuracy, currency, or completeness can have erroneous and even catastrophic results. In this book, Dr. Southekal provides very detailed and thorough coverage of all aspects of data quality management and best practices to improve data quality that would suit all ranges of expertise from beginner to advanced practitioner."
-Michael Taylor
AI chief data scientist, Siemens Mobility (Singapore)
"Dr. Southekal has a way of distilling complexity into practical applications for global leaders who span multiple industries and market types. He leverages his relationships to gain more perspective on data management. His book, Data Quality: Empowering Businesses with Analytics and AI, builds on his prior successes and hits the mark once again. 'Improving data quality should be a top priority for all business leaders.'"
-Victor Ojeleye
Planning & Reporting Manager, FP&A, Cargill Protein North America (United States)
"Data quality is a fundamental building block of success in today's digitally agile world. It creates disruption in long-established industries, but also allows traditional companies to innovate and drive more efficient and effective decision-making practices. Data quality utilization will grow revenues and reduce risk through better-connected intelligence. Dr. Southekal has created a book that explains the why, what, and how of data quality. Written in a structured, logical approach that allows all industries and leaders to fully understand the importance of getting their data quality correct for true value generation, Data Quality: Empowering Businesses with Analytics and AI is a must-read."
-Matthew Small
Managing director, Data Value Creation Ltd (United Kingdom)
"Prashanth provides an in-depth and scientific perspective on a very critical topic for businesses and organizations today. He addresses the complete lifecycle related to data quality, provides detailed explanation in each chapter, and starts from what data quality is to how to capture DQ issues proactively, govern the data, comply with regulations, and secure and sustain DQ practices. There are key callouts depicted in the insight boxes within each chapter. With the future trends indicating the shift toward right data from big data, data quality is a concept that needs to be ingrained in a company's business fabric. The book is a very practical guide to data quality that will be part of my toolkit."
-Ramdas Narayanan
Vice president, Bank of America (United States)
"Data-driven organizations understand that useful data is not simply found and organized by itself. In this book, his third, Dr. Prashanth Southekal shows business leaders the foundations needed to create a company that wants data- and analytics-led decisions to be part of their strategy. For data leaders and practitioners, this book will not only guide you but will also trigger new thoughts and ideas."
-Mark Stern
Vice president of Analytics and BI, BetMGM (United States)
"Data is growing and almost every company is a data company. The majority of organizations want to be in the data-driven space, utilizing and monetizing data through advanced analytics and AI. Although the thought process is great, when it comes to practical implementations most companies are struggling to get value out of their investment. In the consulting space we are seeing repeatedly the need for getting the basics and foundation right. Dr. Southekal's book Data Quality: Empowering Businesses with Analytics and AI is empowering business and data leaders and giving practical guidance on how to build good-quality data to get the most value from analytics and AI projects."
-Rathi Subbaraj
Senior manager, Dufrain (United Kingdom)
"The most thorough and comprehensive book I've seen on data quality. It covers the entire lifecycle of data management in the current enterprise, AI, and analytics landscape. The book contains a wealth of valuable strategic and tactical elements, as well as best practices for getting the most value from data for the business. A must-read for anyone looking to leverage the value of enterprise data."
-Tobias Zwingmann
Managing partner, RAPYD.AI (Germany)
"In today's world, where almost every company is dealing with petabyte-scale data, data quality is something that should be ingrained in all phases of the data lifecycle. In this book Dr. Southekal takes you on a journey of data quality and its lifecycle. It provides an in-depth perspective and the right approach to manage DQ. This book provides a detail explanation of the DARS approach, the DQ lifecycle and its difference phases, multiple dimensions of DQ, data decay, best practices, and a lot more. Dr. Southekal hits the mark again, and this book should be part of the toolkit for all levels of DQ and data practitioners."
-Ujjwal Goel
Director, Data Architecture & Data Engineering, Loblaw (Canada)
"The economy of data has been a trending subject for some time now. But the poor quality of the data affects the decision-making ability and the performance results. Most of the publications in this space refer to the physical flaws in data quality, like data downtime, whereas the author extends the definition to the logical flaws in data, which are much harder to spot and resolve. Dr. Southekal created a playbook for delivering business value from data, with prescriptive recommendations based on best practices for data governance and management practices, all based on the proprietary evaluation framework for data quality."
-Inna Tokarev Sela
CEO, Illumex AI (Israel)
"Dr. Southekal has done it again: given the data science community a gem of a framework (DARS: Design-Assess-Realize-Sustain) that they can apply to maximize ROI from their data and analytics initiatives. Data Quality: Empowering Businesses with Analytics and AI does a phenomenal job explaining nuanced concepts in a language that can be very easily understood by both technical and business audiences."
-Swapnil Srivastava
VP and Global Head of Analytics, Evalueserve (United States)
"Like his previous two books, Data Quality: Empowering Businesses with Analytics and AI is yet another great read for enterprise data leaders. In this book, Dr. Southekal first sets up a framework to understand and measure the quality of business data (the Define and Assess phases); he then provides a guidebook to implement data quality programs (the Realize and Sustain phases). In today's AI-driven world, this book will help business leaders build a solid data foundation."
-Li Kang
Head of Strategy, CelerData (United States)
"With accelerating change, decision-cycle times are narrowing, placing increased pressure on organizations to make faster and effective decisions to drive the biggest impact. In this environment, data quality issues can amplify the impact and costs of incorrect decisions. In this book, Dr. Southekal provides a comprehensive approach, practical frameworks and best practices to defining and addressing data...
Dateiformat: ePUBKopierschutz: Adobe-DRM (Digital Rights Management)
Systemvoraussetzungen:
Das Dateiformat ePUB ist sehr gut für Romane und Sachbücher geeignet – also für „fließenden” Text ohne komplexes Layout. Bei E-Readern oder Smartphones passt sich der Zeilen- und Seitenumbruch automatisch den kleinen Displays an. Mit Adobe-DRM wird hier ein „harter” Kopierschutz verwendet. Wenn die notwendigen Voraussetzungen nicht vorliegen, können Sie das E-Book leider nicht öffnen. Daher müssen Sie bereits vor dem Download Ihre Lese-Hardware vorbereiten.Bitte beachten Sie: Wir empfehlen Ihnen unbedingt nach Installation der Lese-Software diese mit Ihrer persönlichen Adobe-ID zu autorisieren!
Weitere Informationen finden Sie in unserer E-Book Hilfe.