
Building an Anonymization Pipeline
Description
Alles über E-Books | Antworten auf Fragen rund um E-Books, Kopierschutz und Dateiformate finden Sie in unserem Info- & Hilfebereich.
How can you use data in a way that protects individual privacy but still provides useful and meaningful analytics? With this practical book, data architects and engineers will learn how to establish and integrate secure, repeatable anonymization processes into their data flows and analytics in a sustainable manner.
Luk Arbuckle and Khaled El Emam from Privacy Analytics explore end-to-end solutions for anonymizing device and IoT data, based on collection models and use cases that address real business needs. These examples come from some of the most demanding data environments, such as healthcare, using approaches that have withstood the test of time.
- Create anonymization solutions diverse enough to cover a spectrum of use cases
- Match your solutions to the data you use, the people you share it with, and your analysis goals
- Build anonymization pipelines around various data collection models to cover different business needs
- Generate an anonymized version of original data or use an analytics platform to generate anonymized outputs
- Examine the ethical issues around the use of anonymized data
More details
Other editions
Additional editions

Content
- Intro
- Copyright
- Table of Contents
- Preface
- Why We Wrote This Book
- Who This Book Was Written For
- How This Book Is Organized
- Conventions Used in This Book
- O'Reilly Online Learning
- How to Contact Us
- Acknowledgments
- Chapter 1. Introduction
- Identifiability
- Getting to Terms
- Laws and Regulations
- States of Data
- Anonymization as Data Protection
- Approval or Consent
- Purpose Specification
- Re-identification Attacks
- Anonymization in Practice
- Final Thoughts
- Chapter 2. Identifiability Spectrum
- Legal Landscape
- Disclosure Risk
- Types of Disclosure
- Dimensions of Data Privacy
- Re-identification Science
- Defined Population
- Direction of Matching
- Structure of Data
- Overall Identifiability
- Final Thoughts
- Chapter 3. A Practical Risk-Management Framework
- Five Safes of Anonymization
- Safe Projects
- Safe People
- Safe Settings
- Safe Data
- Safe Outputs
- Five Safes in Practice
- Final Thoughts
- Chapter 4. Identified Data
- Requirements Gathering
- Use Cases
- Data Flows
- Data and Data Subjects
- From Primary to Secondary Use
- Dealing with Direct Identifiers
- Dealing with Indirect Identifiers
- From Identified to Anonymized
- Mixing Identified with Anonymized
- Applying Anonymized to Identified
- Final Thoughts
- Chapter 5. Pseudonymized Data
- Data Protection and Legal Authority
- Pseudonymized Services
- Legal Authority
- Legitimate Interests
- A First Step to Anonymization
- Revisiting Primary to Secondary Use
- Analytics Platforms
- Synthetic Data
- Biometric Identifiers
- Final Thoughts
- Chapter 6. Anonymized Data
- Identifiability Spectrum Revisited
- Making the Connection
- Anonymized at Source
- Additional Sources of Data
- Pooling Anonymized Data
- Pros/Cons of Collecting at Source
- Methods of Collecting at Source
- Safe Pooling
- Access to the Stored Data
- Feeding Source Anonymization
- Final Thoughts
- Chapter 7. Safe Use
- Foundations of Trust
- Trust in Algorithms
- Techniques of AIML
- Technical Challenges
- Algorithms Failing on Trust
- Principles of Responsible AIML
- Governance and Oversight
- Privacy Ethics
- Data Monitoring
- Final Thoughts
- Index
- About the Authors
- Colophon
System requirements
File format: PDF
Copy-Protection: Adobe-DRM (Digital Rights Management)
System requirements:
- Computer (Windows; MacOS X; Linux): Install the free reader Adobe Digital Editions prior to download (see eBook Help).
- Tablet/smartphone (Android; iOS): Install the free app Adobe Digital Editions or the app PocketBook before downloading (see eBook Help).
- E-reader: Bookeen, Kobo, Pocketbook, Sony, Tolino and many more (only limited: Kindle).
The file format PDF always displays a book page identically on any hardware. This makes PDF suitable for complex layouts such as those used in textbooks and reference books (images, tables, columns, footnotes). Unfortunately, on the small screens of e-readers or smartphones, PDFs are rather annoying, requiring too much scrolling.
This eBook uses Adobe-DRM, a „hard” copy protection. If the necessary requirements are not met, unfortunately you will not be able to open the eBook. You will therefore need to prepare your reading hardware before downloading.
Please note: We strongly recommend that you authorise using your personal Adobe ID after installation of any reading software.
For more information, see our eBook Help page.