Running a word count application using Spark
How to use Apache Spark’s Resilient Distributed Dataset (RDD) API.
This is a highlight from Ted Malaska’s Introduction to Apache Spark for Java and Scala developers.
Visit Safari to view the full session from the 2016 O’Reilly OSCON Conference in Austin.