Errata

Python Data Science Handbook

Errata for Python Data Science Handbook

Submit your own errata for this product.

The errata list is a list of errors and their corrections that were found after the product was released. If the error was corrected in a later version or reprint the date of the correction will be displayed in the column titled "Date Corrected".

The following errata were submitted by our customers and approved as valid errors by the author or editor.

Color key: Serious technical mistake Minor technical mistake Language or formatting error Typo Question Note Update

Version Location Description Submitted By Date submitted Date corrected
PDF
Page lots

s/occurrance/occurrence/

Page 80: If you want to find occurrances
Page 80: in that they search for the first occurrance of a
Page 80: except they search for the first
occurrance from the end rather than the beginning of the string
Page 81: The replace() function returns a new string, and will replace all occurrances of the
input:
Page 193: which scans the list looking for the first occurrance of a value,
and returning its index
Page 194: rather
than finding the first occurrance of the value and stopping
Page 194: it finds all occurrances of
the value simultaneously
Page 194: If np.where is a bottleneck and your code requires quickly
finding the first occurrance of a value in an unsorted list
Page 195: For example, to count the occurrances of a value or other condition in an
array
Page 195: For counting occurrances of all values at once

Hugh Brown  Aug 20, 2015  Nov 17, 2016
Other Digital Version
Further Resources
"Further Resources" heading

Typo in "Pandas Online Documentation: this is the go-to source for complete docuemntation of the package."

"docuemntation" should be "documentation"

Anonymous  Sep 17, 2015  Nov 17, 2016
PDF
Page xviii
1st paragraph

" ...readers will be poised to use these Python
tools process, describe, model, and draw inferences... "

Missing word "to" between "tools" and "process".

Under "Other Miscellany" on p. xix:

"For readers using Windows, the
vast majority of this book still applicable;"

Missing word "is" between "book" and "still".

"...before going into detail on our recommended methor."

Should be spelled "method".

Michael Casebolt  Jan 04, 2016  Nov 17, 2016
PDF
Page xiv
1st paragraph

"third" is misspelled in the sentence "This is primarily because it took some time for many of the essential thrid-party packages and toolkits to be made compatible with the new language internals."

mringel  Sep 13, 2016  Nov 17, 2016
PDF
Page iii-x
entire table of contents

Table of Contents in 10/18/16 pre-print is not cross-linked to locations - they're no longer 'clickable' to navigate.

Daron Chabot  Oct 25, 2016  Nov 17, 2016
PDF
Page 28

s/Assinging variables in Python /Assigning variables in Python /

hughdbrown  Aug 19, 2015  Nov 17, 2016
PDF
Page 28
Second heading

Section title "Semicolon can Optionally Termnate a Statement" contains a typo. The word is correctly spelled "Terminate". This typo is duplicated in the table of contents for the same section.

Michael Casebolt  Jan 04, 2016  Nov 17, 2016
PDF
Page 47

s/As see here/As shown here/ or
s/As see here/As you can see here/ ??

hughdbrown  Aug 19, 2015  Nov 17, 2016
PDF
Page 109
Table of text entry keyboard shortcuts

This is in the .pdf version, downloaded Dec 13, 2015, from O'Reilly web site.

In the table of keyboard shortcuts for text entry on page 109 (according to page foot numeration), you say

Ctrl-u Cut all text in line

Ctrl-u in ipython cuts from the cursor to the beginning of the line, at least in my versions,

$ ipython
Python 3.5.1 |Anaconda 2.4.1 (x86_64)| (default, Dec 7 2015, 11:24:55)

and

$ ipython
Enthought Python Distribution -- www.enthought.com

Python 2.7.3 | 64-bit | (default, Aug 8 2013, 05:37:06)

JustBennet  Dec 13, 2015  Nov 17, 2016
PDF
Page 123

s/Despite this apparent heterogenaity/Despite this apparent heterogeneity/

hughdbrown  Aug 19, 2015  Nov 17, 2016
PDF
Page 171

s/We are then free do operate on these values as we wish. /We are then free to operate on these values as we wish. /

Hugh Brown  Aug 20, 2015  Nov 17, 2016
PDF
Page 171

gauge

s/The difference is this: "and" and "or" guage/The difference is this: "and" and "or" gauge/

Hugh Brown  Aug 20, 2015  Nov 17, 2016
PDF
Page 185
Last code block

in the example:

# construct a mask of all summer days (June 21st is the 172nd day)
summer = (np.arange(365) - 172 < 90)

Will select the first 261 days of the year, not just the summer.
Should become:
summer = (np.arange(365) - 172 < 90) & (np.arange(365) - 172 > 0)

Anonymous  May 29, 2016  Nov 17, 2016
PDF
Page 207 & 211

s/dimesnional/dimensional/

Page 207: Notice the potential point of confusion here: in a two-dimesnional NumPy array,
data[0] will return the first row.

Page 211: We’ll start with the simple case of the one-dimensional Series object, and then move
on to the more complicated two-dimesnional DataFrame object.

Hugh Brown  Aug 20, 2015  Nov 17, 2016
PDF
Page 212
first line

safter -> safer

Dale Visser  Oct 05, 2015  Nov 17, 2016
PDF
Page 218

s/Lets/Let's/

Lets start by defining a simple Series and Data‐
Frame on which to demonstrate this

Hugh Brown  Aug 20, 2015  Nov 17, 2016
PDF
Page 233
1st paragraph

"DataFrame objects can even be modified with ..." should be "Series objects can even be modified with ...", considering the context.

Toshiyuki Tega  May 15, 2016  Nov 17, 2016
PDF
Page 238

s/medial/medical/

Consider our toy medial
dataframe from above

Hugh Brown  Aug 20, 2015  Nov 17, 2016
PDF
Page 289
First paragraph of "Example: Recipe Database"

s/cleanin-up messy real-world data/cleaning-up messy real-world data/

Kristof  Sep 05, 2015  Nov 17, 2016
PDF
Page 291
third code block

The code recipes.description.str.contains('breakfast').sum() misses the recipes that have a capital 'B' for example.
Thus, to be correct, this should be:
recipes.description.str.contains('breakfast', case=False).sum()

Kristof  Sep 05, 2015  Nov 17, 2016
PDF
Page 305
First paragraph of Digging Into the Data

s/general trend inthe data/general trend in the data/

Kristof  Sep 06, 2015  Nov 17, 2016
PDF
Page 314

documentation

s/Pandas Online Documentation: this is the go-to source for complete docuemnta‐
tion of the package. /Pandas Online Documentation: this is the go-to source for complete documenta‐
tion of the package. /

Hugh Brown  Aug 20, 2015  Nov 17, 2016
PDF
Page 516
First paragraph of "What is Machine Learning?"

s/The study of machine learning cetainly/The study of machine learning certainly

Anonymous  Feb 16, 2016  Nov 17, 2016
Mobi
Page 2236
First paragraph of "Comutation on NumPy Arrays: Universal Functions" section

Typo in sentence. It says, "This section motivates the need for NumPy's ufunts...". Should refer to "ufuncs".

Dale Visser  Oct 05, 2015  Nov 17, 2016