Book description
Get to grips with pandas, a versatile and high-performance Python library for data manipulation, analysis, and discovery
Key Features
- Perform efficient data analysis and manipulation tasks using pandas
- Apply pandas to different real-world domains with the help of step-by-step demonstrations
- Get accustomed to using pandas as an effective data exploration tool
Book Description
Data analysis has become a necessary skill in a variety of domains where knowing how to work with data and extract insights can generate significant value.
Hands-On Data Analysis with Pandas will show you how to analyze your data, get started with machine learning, and work effectively with Python libraries often used for data science, such as pandas, NumPy, matplotlib, seaborn, and scikit-learn. Using real-world datasets, you will learn how to use the powerful pandas library to perform data wrangling to reshape, clean, and aggregate your data. Then, you will be able to conduct exploratory data analysis by calculating summary statistics and visualizing the data to find patterns. In the concluding chapters, you will explore some applications of anomaly detection, regression, clustering, and classification using scikit-learn to make predictions based on past data.
By the end of this book, you will be equipped with the skills you need to use pandas to ensure the veracity of your data, visualize it for effective decision-making, and reliably reproduce analyses across multiple datasets.
What you will learn
- Understand how data analysts and scientists gather and analyze data
- Perform data analysis and data wrangling using Python
- Combine, group, and aggregate data from multiple sources
- Create data visualizations with pandas, matplotlib, and seaborn
- Apply machine learning (ML) algorithms to identify patterns and make predictions
- Use Python data science libraries to analyze real-world datasets
- Use pandas to solve common data representation and analysis problems
- Build Python scripts, modules, and packages for reusable analysis code
Who this book is for
This book is for data analysts, data science beginners, and Python developers who want to explore each stage of data analysis and scientific computing using a wide range of datasets. You will also find this book useful if you are a data scientist looking to use pandas in machine learning work. A working knowledge of the Python programming language will be beneficial.
Table of contents
- Title Page
- Copyright and Credits
- Dedication
- About Packt
- Foreword
- Contributors
- Preface
- Section 1: Getting Started with Pandas
- Introduction to Data Analysis
- Chapter materials
- Fundamentals of data analysis
- Statistical foundations
- Setting up a virtual environment
- Summary
- Exercises
- Further reading
- Working with Pandas DataFrames
- Section 2: Using Pandas for Data Analysis
- Data Wrangling with Pandas
- Aggregating Pandas DataFrames
- Visualizing Data with Pandas and Matplotlib
- Plotting with Seaborn and Customization Techniques
- Section 3: Applications - Real-World Analyses Using Pandas
- Financial Analysis - Bitcoin and the Stock Market
- Rule-Based Anomaly Detection
- Section 4: Introduction to Machine Learning with Scikit-Learn
- Getting Started with Machine Learning in Python
- Making Better Predictions - Optimizing Models
- Machine Learning Anomaly Detection
- Section 5: Additional Resources
- The Road Ahead
- Solutions
- Appendix
- Other Books You May Enjoy
Product information
- Title: Hands-On Data Analysis with Pandas
- Author(s):
- Release date: July 2019
- Publisher(s): Packt Publishing
- ISBN: 9781789615326