Book description
The world around us is full of data that holds unique insights and valuable stories, and this book will help you uncover them. Whether you already work with data or want to learn more about its possibilities, the examples and techniques in this practical book will help you more easily clean, evaluate, and analyze data so that you can generate meaningful insights and compelling visualizations.
Complementing foundational concepts with expert advice, author Susan E. McGregor provides the resources you need to extract, evaluate, and analyze a wide variety of data sources and formats, along with the tools to communicate your findings effectively. This book delivers a methodical, jargon-free way for data practitioners at any level, from true novices to seasoned professionals, to harness the power of data.
- Use Python 3.8+ to read, write, and transform data from a variety of sources
- Understand and use programming basics in Python to wrangle data at scale
- Organize, document, and structure your code using best practices
- Collect data from structured data files, web pages, and APIs
- Perform basic statistical analyses to make meaning from datasets
- Visualize and present data in clear and compelling ways
Publisher resources
Table of contents
- Preface
- 1. Introduction to Data Wrangling and Data Quality
- 2. Introduction to Python
- 3. Understanding Data Quality
- 4. Working with File-Based and Feed-Based Data in Python
-
5. Accessing Web-Based Data
- Accessing Online XML and JSON
- Introducing APIs
- Basic APIs: A Search Engine Example
- Specialized APIs: Adding Basic Authentication
- Reading API Documentation
- Protecting Your API Key When Using Python
- Specialized APIs: Working With OAuth
- API Ethics
- Web Scraping: The Data Source of Last Resort
- Conclusion
- 6. Assessing Data Quality
- 7. Cleaning, Transforming, and Augmenting Data
-
8. Structuring and Refactoring Your Code
- Revisiting Custom Functions
- Understanding Scope
- Defining the Parameters for Function “Ingredients”
- Return Values
- Climbing the “Stack”
- Refactoring for Fun and Profit
- Documenting Your Custom Scripts and Functions with pydoc
- The Case for Command-Line Arguments
- Where Scripts and Notebooks Diverge
- Conclusion
- 9. Introduction to Data Analysis
- 10. Presenting Your Data
- 11. Beyond Python
- A. More Python Programming Resources
- B. A Bit More About Git
- C. Finding Data
- D. Resources for Visualization and Information Design
- Index
- About the Author
Product information
- Title: Practical Python Data Wrangling and Data Quality
- Author(s):
- Release date: December 2021
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781492091509
You might also like
book
Data Wrangling with Python
How do you take your data analysis skills beyond Excel to the next level? By learning …
book
Python Data Analysis - Third Edition
Understand data analysis pipelines using machine learning algorithms and techniques with this practical guide Key Features …
book
Python for Data Science
Python is an ideal choice for accessing, manipulating, and gaining insights from data of all kinds. …
book
Practical Data Science with Python
Learn to effectively manage data and execute data science projects from start to finish using Python …