Mastering Large Datasets with Python

Brief Table of Contents

Copyright

Table of Contents

Preface

Acknowledgments

About this book

About the author

About the cover illustration

1.

Chapter 1. Introduction

Chapter 2. Accelerating large dataset work: Map and parallel computing

Chapter 3. Function pipelines for mapping complex transformations

Chapter 4. Processing large datasets with lazy workflows

Chapter 5. Accumulation operations with reduce

Chapter 6. Speeding up map and reduce with advanced parallelization

2.

Chapter 7. Processing truly big datasets with Hadoop and Spark

Chapter 8. Best practices for large data with Apache Streaming and mrjob

Chapter 9. PageRank with map and reduce in PySpark

Chapter 10. Faster decision-making with machine learning ...

Mastering Large Datasets with Python by John Wolohan
