Errata

Errata for Building Pipelines for Natural Language Understanding with Spark

The errata list records errors found after the product was released, together with their corrections.

The following errata were submitted by our customers and have not yet been approved or disproved by the author or editor. They solely represent the opinion of the customer.

Version	Location	Description	Submitted by	Date submitted
Other Digital Version	Notebook 1, cell "docRdd.first"	In Notebook 1, the cell "docRdd.first" does not run; it throws an exception. In the definition of docRdd, I tried adding the line "docName" after the "Document" line at the end of the function, and the cell then runs (but returns the document name rather than the Document object). So the Document does get created, but the map cannot seem to return an Array of them. I can't get past this error. I am using Spark 2.1.0, Scala 2.11.8, and Apache Toree as the Jupyter kernel.	Andreas Kaempf	Mar 02, 2017