What is a resilient distributed dataset?
In this segment, Alex Robbins explores resilient distributed datasets (RDDs), the central abstraction in Spark and essential knowledge for anyone working with the system.
This segment is taken from the training video “Introduction to PySpark,” in which Alex Robbins guides you through an in-depth look at the Python API for Apache Spark.