Errata for Data Science on AWS

The errata list is a list of errors and their corrections that were found after the product was released. If the error was corrected in a later version or reprint, the date of the correction is displayed in the column titled "Date Corrected".

The following errata were submitted by our customers and approved as valid errors by the author or editor.

Version Location Description Submitted By Date submitted Date corrected
Printed
Page 1
chapter 1 -> Create a Parquet-based Table in Athena

SELECT *
FROM dsoaws.amazon_reviews_tsv
WHERE product_category = 'Digital_Video_Download' LIMIT 10

should be
SELECT *
FROM dsoaws.amazon_reviews_parquet
WHERE product_category = 'Digital_Video_Download' LIMIT 10

Note from the Author or Editor:
Good catch. Needs to be updated to reflect the correct table `FROM dsoaws.amazon_reviews_parquet`. Thanks.

Yangyang Cai  May 10, 2020  Mar 11, 2021
Printed
Page 162
data_bias_config

When I set group_name = "product_category" for the clarify.BiasConfig, as set in the book and accompanying notebook, I get a ClientError:

"ClientError: facet name_or_index:'product_category' cannot be the same as group_variable. Please specify a facet other than group_variable. group_variable: product_category"

What is the correct solution? Thanks in advance.

Note from the Author or Editor:
Thanks for submitting the erratum.

Here you can find the corrected code sample:
https://github.com/data-science-on-aws/workshop/blob/ad542c15233fa87ab39701585b22332897aa01ee/05_explore/04_Run_Data_Bias_Analysis_ProcessingJob.ipynb

We will update the book in the next release.

Anonymous  Jun 07, 2021
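
Editor's sketch (not from the book or the linked notebook): the error message above states that the facet column and the group variable must be different columns. The plain-Python check below only illustrates that constraint so the clash can be caught before a processing job is launched. The helper name `check_bias_columns` is hypothetical, and leaving `group_name` unset is an assumption about one possible way to avoid the clash, not necessarily the authors' fix. For that, see the notebook linked in the author's note.

```python
def check_bias_columns(facet_name, group_name=None):
    """Hypothetical pre-check mirroring the rule in the ClientError:
    the facet column must not also be used as the group variable."""
    if group_name is not None and facet_name == group_name:
        raise ValueError(
            f"facet {facet_name!r} cannot be the same as group_name {group_name!r}"
        )
    return {"facet_name": facet_name, "group_name": group_name}


# The combination reported in this erratum is rejected up front.
try:
    check_bias_columns("product_category", group_name="product_category")
    conflict_detected = False
except ValueError:
    conflict_detected = True

# Assumption: omitting group_name is one way to satisfy the constraint.
ok_columns = check_bias_columns("product_category")
```

The returned dictionary uses the same keyword names (`facet_name`, `group_name`) that the submitter passes to `clarify.BiasConfig`, so the checked values could be forwarded to it unchanged.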