Book description
Want to speed up your data analysis and work with larger-than-memory datasets? Python Polars offers a blazingly fast, multithreaded, and elegant API for data loading, manipulation, and processing. With this hands-on guide, you'll walk through every aspect of Polars and learn how to tackle practical use cases using real-world datasets.
Jeroen Janssens and Thijs Nieuwdorp from Xomnia in Amsterdam show you how this superfast DataFrame library is perfect for efficient data wrangling, ETL pipelines, and so much more. This book helps you quickly learn the syntax and understand Polars' underlying concepts. You don't need to have experience with pandas or Spark, but if you do, this book will help you make a smooth transition.
With this definitive guide at your side, you'll be able to:
- Process larger-than-memory datasets at record speed
- Apply the eager, lazy, and streaming APIs of Polars and decide when to use them
- Transition smoothly from pandas or Spark to Polars
- Integrate Polars into your existing code base
- Work with Arrow and Parquet to efficiently read and write data
- Translate complex ETL tasks into efficient and elegant queries
Table of contents
- Foreword
- Preface
- I. Begin
- 1. Introducing Polars
- 2. Getting Started
- Setting Up Your Environment
- Crash Course JupyterLab
- Installing Polars on Other Projects
- All Optional Dependencies
- Optional Dependencies for Interoperability
- Optional Dependencies for Working with Spreadsheets
- Optional Dependencies for Working with Databases
- Optional Dependencies for Working with Remote File Systems
- Optional Dependencies for Other I/O Formats
- Optional Dependencies for Extra Functionality
- Installing Optional Dependencies
- Configuring Polars
- Compiling Polars from Scratch
- Takeaways
- 3. Moving from Pandas to Polars
- II. Form
- 4. Data Structures and Data Types
- 5. Eager and Lazy APIs
- 6. Reading and Writing Data
- III. Express
- 7. Beginning Expressions
- 8. Continuing Expressions
- 9. Combining Expressions
- IV. Transform
- 10. Selecting and Creating Columns
- 11. Filtering and Sorting Rows
- 12. Working with Textual, Temporal, and Nested Data Types
- 13. Summarizing and Aggregating
- 14. Joining and Concatenating
- 15. Reshaping
- V. Advance
- 16. Visualizing Data
- 17. Extending Polars
- 18. Polars Internals
- A. Accelerating Polars with the GPU
- Index
- About the Authors
Product information
- Title: Python Polars: The Definitive Guide
- Author(s): Jeroen Janssens, Thijs Nieuwdorp
- Release date: February 2025
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781098156084