Errata
The errata list is a list of errors and their corrections that were found after the product was released. If the error was corrected in a later version or reprint the date of the correction will be displayed in the column titled "Date Corrected".
The following errata were submitted by our customers and approved as valid errors by the author or editor.
Color key: Serious technical mistake Minor technical mistake Language or formatting error Typo Question Note Update
Version | Location | Description | Submitted By | Date submitted | Date corrected |
---|---|---|---|---|---|
Page 2 1st paragraph 1st line |
typo: availablility --> availability Note from the Author or Editor: |
Joon-Yong Lee | Jul 29, 2017 | May 11, 2018 | |
Page 6-8 examples, code etc. |
Less of an erratum; more of a suggestion. Note from the Author or Editor: |
Kevin Casey | Mar 04, 2018 | May 11, 2018 | |
Printed | Page 15, 18 15 formula, 18 near bottom of page |
On each of these two pages, MAD has been written as Mean Absolution Deviation. In other places, the A is referenced as 'absolute'. The word absolution is not really a possibility here is it?, having checked its definition. Note from the Author or Editor: |
Tom Robey | Jul 24, 2017 | May 11, 2018 |
Printed, PDF, ePub | Page 16 2nd paragraph |
The last sentence of the second paragraph reads "However, if you divide by n - 1 instead of n, the standard deviation becomes an unbiased estimate." Dividing by n - 1 instead of n produces an unbiased estimate of the variance, but the estimate of the standard deviation is still biased. See https://en.wikipedia.org/wiki/Unbiased_estimation_of_standard_deviation. Note from the Author or Editor: |
David W. Body | Mar 09, 2018 | May 11, 2018 |
Page 27 Top of page |
Table 1-3 is a repetition of Table 1-2 Note from the Author or Editor: |
Anonymous | Apr 21, 2017 | Jun 23, 2017 | |
Printed | Page 40 last paragraph |
The sentence "Now the picture is much clearer: tax-assessed value is much higher in some zip codes (98112, 98105) than in others (98108, 98057)." References two zip codes that aren't in Figure 1-12 (98112 and 98057.) Either it should read "Now the picture is much clearer: tax-assessed value is much higher in some zip codes (98105, 98126) than in others (98108, 98188)." or the plot titles in the figure are incorrect. Note from the Author or Editor: |
Anonymous | Apr 02, 2018 | May 11, 2018 |
Page 41 last paragraph |
This idea has propogated to various modern graphics systems --> This idea has propagated to various modern graphics systems Note from the Author or Editor: |
JOON-YONG LEE | Jan 02, 2018 | May 11, 2018 | |
Page 45 3rd paragraph |
The name was wrong. Note from the Author or Editor: |
JOON-YONG LEE | Jan 12, 2018 | May 11, 2018 | |
Page 62 1st para |
A mistake on this equation: [(1 – [x/100]) / 2]% --> [(100 – x) / 2]% Note from the Author or Editor: |
JOON-YONG LEE | Jan 19, 2018 | May 11, 2018 | |
Page 65 1st para |
typo: prodigous --> prodigious Note from the Author or Editor: |
JOON-YONG LEE | Jan 17, 2018 | May 11, 2018 | |
Page 67 last paragraph |
typo: anamolous --> anomalous Note from the Author or Editor: |
JOON-YONG LEE | Jan 17, 2018 | May 11, 2018 | |
Page 71 first equation |
the last term in the formula should be s/sqrt(n) and not s/n Note from the Author or Editor: |
Anonymous | Jun 29, 2017 | May 11, 2018 | |
Page 76 2nd para |
where the mean number of events per time period is 2 --> where the mean number of events per time period is 0.2. Note from the Author or Editor: |
JOON-YONG LEE | Jan 21, 2018 | May 11, 2018 | |
Printed | Page 87 Bottom of first major paragraph |
Text says, "This means that extreme chance results in only one direction direction count toward the p-value." Note from the Author or Editor: |
Tom Robey | Jul 24, 2017 | May 11, 2018 |
Printed | Page 93 For Further Reading section |
Bruce's Introductory Statistics and Analytics book is listed with a 2015 date. Pages 88 and 101, the book is listed as 2014. Note from the Author or Editor: |
Tom Robey | Jul 24, 2017 | May 11, 2018 |
Printed | Page 98 Bottom of Data Science and P-Values paragraph |
Sentence reads, " - a feature night be included or ... ". I am thinking the word should have been *might*. Note from the Author or Editor: |
Tom Robey | Jul 26, 2017 | May 11, 2018 |
Page 104 Bottom line |
The alternative hypothesis uses B > A instead of B < A (or the null hypothesis needs to be changed). Note from the Author or Editor: |
Anonymous | Jul 25, 2017 | May 11, 2018 | |
Printed | Page 111 Last sentence. |
First release (2017-05-09) of first print edition (May 2017) has Greek letter xi (Unicode 03BE) where Greek letter chi (Unicode 03C7) is meant. Same goes for second formula on page 113. Note from the Author or Editor: |
Stephen Frost | Jul 11, 2017 | May 11, 2018 |
Printed | Page 111-114 Throughout |
The text is inconsistent in its use of "chi-square" vs. "chi-squared". The main section is titled "Chi-Square Test", however page 113 references "the chi-squared statistic" twice, page 114 contains a section titled "Chi-Squared Test: Statistical Theory" (but mentions "chi-square distribution"), and the output given by R states "Pearson's Chi-squared test". Note from the Author or Editor: |
Matt Galisa | Aug 14, 2017 | May 11, 2018 |
Page 112 2nd paragraph |
Instead of "same result by random chance" - shouldn't it say same result or more extreme - or something like that? Note from the Author or Editor: |
Anonymous | May 02, 2017 | Jun 23, 2017 | |
Page 124 4th and 5th paragraph |
(30% instead of 10%) --> (50% instead of 10%): because 50% is used in the following paragraph as an example. Note from the Author or Editor: |
JOON YONG LEE | Feb 07, 2018 | May 11, 2018 | |
Page 129 1st paragraph |
interchangable --> interchangeable Note from the Author or Editor: |
JOON YONG LEE | Feb 09, 2018 | May 11, 2018 | |
Page 136 First equation |
Equation for RMSE shows estimate of y_i on LHS Note from the Author or Editor: |
Anonymous | May 29, 2017 | Jun 23, 2017 | |
Page 139 last paragraph |
"where p is the number of..." Here, p should be a capital P to keep consistency with P in the above AIC equation. Note from the Author or Editor: |
JOON YONG LEE | Feb 11, 2018 | May 11, 2018 | |
Printed | Page 153 1st paragraph |
The paragraph notes “adding a bathroom increases the sale price by $7,500” however in the previous code output, Bathrooms is shown as 5.537e+03 or about $5,500. Note from the Author or Editor: |
Peter Edstrom | Feb 04, 2018 | May 11, 2018 |
Printed | Page 154 Last paragraph |
The slope of the main effect SqFtTotLiving shows as 1.176e+02 ($117) in the R output but the paragraph says $177. Thus for a home in the highest ZipGroup the slope is the sum of the main effect plus the interaction SqFtTotLiving:ZipGroup5 ($117 + $230 = $347) - the text shows 177 + 230 = 447 which not only does not match the R output but is also arithmetically incorrect (177 + 230 actually equals 407). Note from the Author or Editor: |
Matt Galisa | Aug 14, 2017 | May 11, 2018 |
Page 157 last paragraph |
statuatory deed --> statutory deed Note from the Author or Editor: |
JOON-YONG LEE | Feb 15, 2018 | May 11, 2018 | |
Printed | Page 170 Middle of main paragraph, under Generalized Additive Models |
"Polynomial terms may not flexible enough ... " looks like the word 'be' is missing. Note from the Author or Editor: |
Tom Robey | Aug 09, 2017 | May 11, 2018 |
Printed | Page 170 Figure 4-12 |
Figure 4-12, described as representing spline regression, appears identical to Figure 4-10, representing polynomial regression on page 168 Note from the Author or Editor: |
Marshall Ehlinger | Feb 10, 2018 | May 11, 2018 |
Printed | Page 196 Figure 5-5 |
Both rows of the figure are labeled y = 1; the lower row should be labeled y = 0. Note from the Author or Editor: |
Matt Galisa | Aug 14, 2017 | May 11, 2018 |
Page 196 Figure 5-5 |
Shorthand for Specificity labeled as FP/(y=0). It should be Specificity TN/(y=0). Note from the Author or Editor: |
John Masiello | Sep 14, 2017 | May 11, 2018 | |
Printed | Page 197 Bottom of page |
The denominator in the equation for specificity is incorrect. ∑FalseNegative should be replaced with ∑FalsePositive. Note from the Author or Editor: |
Phil Terwilliger | Jan 11, 2018 | May 11, 2018 |
Page 201 last paragraph |
indiscriminantly --> indiscriminately Note from the Author or Editor: |
JOON-YONG LEE | Feb 27, 2018 | May 11, 2018 | |
Page 205 1st paragraph in Data Generation |
(see “Undersampling” on page 204) --> (see “Oversampling and Up/Down Weighting” on page 204) Note from the Author or Editor: |
JOON-YONG LEE | Feb 28, 2018 | May 11, 2018 | |
Page 208 in further reading |
Analytics Vidya --> Analytics Vidhya Note from the Author or Editor: |
JOON-YONG LEE | Feb 28, 2018 | May 11, 2018 | |
Printed | Page 212 3rd paragraph |
describes the paid off symbol as triangle but is actually a cross. states the qty of default (circle) as 14 and paid of (cross) as 6, but in the figure 6.2 it is 9 default and 11 paid off Note from the Author or Editor: |
David Pugh | Mar 01, 2018 | May 11, 2018 |
Printed | Page 221 diagram |
The decision tree diagram could do with an explanation of which branch to follow if the node question is true or false. Its not immediately obvious that you go to the left if true and to the right if false Note from the Author or Editor: |
David Pugh | Mar 01, 2018 | May 11, 2018 |
Page 222 last paragraph |
righthand region --> lefthand region Note from the Author or Editor: |
JOON-YONG LEE | Mar 06, 2018 | May 11, 2018 | |
Page 223 Figure 6-4 |
A caption for Figure 6-4 is same to the caption for Figure 6-3. It must be fixed. Note from the Author or Editor: |
JOON-YONG LEE | Mar 06, 2018 | May 11, 2018 | |
Page 230 top of page |
Says "refered to as random forest" instead of "referred" Note from the Author or Editor: |
Anonymous | Jul 18, 2017 | May 11, 2018 | |
Page 230 1st paragraph in Bagging |
n records. --> N records: in Step 1 of the bagging algorithm, n means the size of bootstrap resample. Note from the Author or Editor: |
JOON-YONG LEE | Apr 02, 2018 | May 11, 2018 | |
Printed, PDF | Page 244 last paragraph |
acting in a similar mannger --> manner Note from the Author or Editor: |
JOON-YONG LEE | Apr 09, 2018 | May 11, 2018 |
Page 267 last paragraph |
The oil stocks (XOM, CVS, SLB, COP) --> The oil stocks (XOM, CVX, SLB, COP) Note from the Author or Editor: |
JOON-YONG LEE | Apr 14, 2018 | May 11, 2018 | |
Page 268 main steps of the agglomerative algorithm |
for "D(Ck,Cl))" in step 2 and 3, right parentheses are duplicated. Note from the Author or Editor: |
JOON-YONG LEE | Apr 14, 2018 | May 11, 2018 | |
Page 272 1st paragraph in Mixtures of Normals |
N1(μ1),Σ1), N1(μ2),Σ2), ..., N1(μK),ΣK) has wrong a dimension and parentheses. Note from the Author or Editor: |
JOON-YONG LEE | Apr 14, 2018 | May 11, 2018 | |
Page 281 last code block |
i cannot find the definition of "dnd_cut". Note from the Author or Editor: |
JOON-YONG LEE | Apr 18, 2018 | May 11, 2018 | |
Other Digital Version | 567 Table 1-5 |
I am using the Kindle edition this is LOCATION 567 not page number Note from the Author or Editor: |
Duncan Williamson | Sep 15, 2017 | May 11, 2018 |