Book description
Updated as of August 2014, this practical book will demonstrate proven methods for anonymizing health data to help your organization share meaningful datasets, without exposing patient identity. Leading experts Khaled El Emam and Luk Arbuckle walk you through a risk-based methodology, using case studies from their efforts to de-identify hundreds of datasets.
Clinical data is valuable for research and other types of analytics, but making it anonymous without compromising data quality is tricky. This book demonstrates techniques for handling different data types, based on the authors’ experiences with a maternal-child registry, inpatient discharge abstracts, health insurance claims, electronic medical record databases, and the World Trade Center disaster registry, among others.
- Understand different methods for working with cross-sectional and longitudinal datasets
- Assess the risk of adversaries who attempt to re-identify patients in anonymized datasets
- Reduce the size and complexity of massive datasets without losing key information or jeopardizing privacy
- Use methods to anonymize unstructured free-form text data
- Minimize the risks inherent in geospatial data, without omitting critical location-based health information
- Look at ways to anonymize coding information in health data
- Learn the challenge of anonymously linking related datasets
Publisher resources
Table of contents
- Preface
- 1. Introduction
- 2. A Risk-Based De-Identification Methodology
- 3. Cross-Sectional Data: Research Registries
- 4. Longitudinal Discharge Abstract Data: State Inpatient Databases
- 5. Dates, Long Tails, and Correlation: Insurance Claims Data
- 6. Longitudinal Events Data: A Disaster Registry
- 7. Data Reduction: Research Registry Revisited
- 8. Free-Form Text: Electronic Medical Records
- 9. Geospatial Aggregation: Dissemination Areas and ZIP Codes
- 10. Medical Codes: A Hackathon
- 11. Masking: Oncology Databases
- 12. Secure Linking
- 13. De-Identification and Data Quality: A Clinical Data Warehouse
- Index
- Colophon
- Copyright
Product information
- Title: Anonymizing Health Data
- Author(s):
- Release date: December 2013
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781449363079
You might also like
book
Real World Health Care Data Analysis
Discover best practices for real world data research with SAS code and examples Real world health …
book
Longitudinal Data Analysis
This book provides accessible treatment to state-of-the-art approaches to analyzing longitudinal studies. Comprehensive coverage of the …
book
Common Statistical Methods for Clinical Research with SAS Examples, Third Edition, 3rd Edition
Glenn Walker and Jack Shostak's Common Statistical Methods for Clinical Research with SAS Examples, Third Edition, …
book
Big Data Analytics with R
Utilize R to uncover hidden patterns in your Big Data About This Book Perform computational analyses …