Errata
The errata list is a list of errors and their corrections that were found after the product was released.
The following errata were submitted by our customers and have not yet been approved or disproved by the author or editor. They solely represent the opinion of the customer.
Color Key: Serious technical mistake Minor technical mistake Language or formatting error Typo Question Note Update
Version | Location | Description | Submitted by | Date submitted |
---|---|---|---|---|
Other Digital Version | Preface Using Code Examples |
Words wrong way around on wesmckinney.com |
Steven Mooney | Feb 16, 2024 |
Printed, ePub | Page Section 3.1, page 59 1st paragraph |
The example: |
Patrick Salkeld | Feb 16, 2024 |
Other Digital Version | Section 5.2; Indexing, Selection, and Filtering Selection on DataFrame with loc and iloc |
The word rows is misspelled as "roles". |
Andrei | Feb 17, 2024 |
Other Digital Version | Generator expressions 3rd code listing |
syntax typo for the statement `dict((i, i **2) for i inrange(5))` |
Ben To | Feb 19, 2024 |
Other Digital Version | Set hashable set elements part |
just missing a space before the **first parenthesis** in the sentence "set elements generally must be immutable, and they must be hashable(which means that calling hash on a value does not raise an exception)." |
Ben To | Feb 19, 2024 |
Printed, ePub | Page Page 98, Section 4.1 First example, first 3 paragraphs |
When tried to duplicate this example: |
Patrick Salkeld | Feb 19, 2024 |
Other Digital Version | Chapter 4 - Data Types for ndarrays Second note box |
Where the online text says "A signed integer can represent both positive and negative integers, while an unsigned integer can only represent nonzero integers", the phrase "nonzero integers" should be "non-negative integers". |
Ben To | Mar 04, 2024 |
O'Reilly learning platform | Page Chapter 10.x Throughout the chapter |
Chapter 10 uses DataFrame.groupby(...,axis="columns") on several occasions, which is deprecated. |
Jochen Schüttler | Apr 09, 2024 |
Other Digital Version | Chapter 4, Section "Data Types for ndarrays" The second Note (after Table 4.2) |
Text: |
Alessandro Botelho Bovo | Jun 06, 2024 |
Other Digital Version | Chapter 2, section "Numeric types" 3rd paragraph |
It says: |
Alessandro Botelho Bovo | Jun 06, 2024 |
Other Digital Version | Chapter 4, Section "Unique and Other Set Logic" 1st paragraph |
It says: "NumPy has some basic set operations for one-dimensional ndarrays. A commonly used one is numpy.unique, which returns the sorted unique values in an array:" |
Alessandro Botelho Bovo | Jun 11, 2024 |
ePub | Page Chapter 3, List Discussion regarding "Extend" |
Document at learning.oreilly.com. |
Steven O. Ellis | Jul 07, 2024 |
O'Reilly learning platform | Page Chapter 2 Tab Completion |
"Also, you can also complete methods and attributes on any object after typing a period:" double use of 'also' |
Anonymous | Sep 05, 2024 |
ePub | Page https://wesmckinney.com/book/data-analysis-examples#whetting_movielens In [98]: movies["genre"] = movies.pop("genres").str.split("|") |
In [98]: movies["genre"] = movies.pop("genres").str.split("|") |
Anonymous | Sep 11, 2024 |
Other Digital Version | Creating ndarrays Quinto parrafo. |
In [31]: np.empty((2, 3, 2)) |
Gerald Juárez | Sep 14, 2024 |
Other Digital Version | Section 11.1 Table 11.2 |
In the “Open Access” HTML version, Table 11.2: datetime format specification: |
Jihang Tang | Sep 26, 2024 |
ePub | Page Using Code Examples First sentence. |
The first sentence begins with "You can data find files", I assume it should be "You can find data files". |
Adel Siddiquei | Oct 15, 2024 |
Other Digital Version | Chapter 3, Section 3.1 6th Subtopic |
In the provided example, the description states that strings with a length of 2 or less should be filtered out. However, the code filters out strings where the length is greater than 2 (if len(x) > 2). This is inconsistent with the intended explanation. |
Syed Mohammad Hasan | Oct 22, 2024 |
ePub | Page 1 Preliminaries Installing Necessary Packages |
Sorry, I don't have a massive tech background. Is there something different about python 3.12.2? Or are there permission issues that I need to get around? |
Anonymous | Aug 21, 2024 |
Other Digital Version | 1.4 Installation and Setup Installing Necessary Packages |
On Windows, substitute a carat ^ for the line continuation \ used on Linux and macOS. |
Anonymous | May 15, 2024 |
ePub | Page 3.1, List Discussion of "Extend" |
Please disregard the errata I just submitted. I missed that the example was a list of lists. The text makes perfect sense. |
Steven O. Ellis | Jul 07, 2024 |
O'Reilly learning platform | Page 4 NumPy Basics: Arrays and Vectorized Computation Data Types for ndarrays |
In [45]: numeric_strings = np.array(["1.25", "-9.6", "42"], dtype=np.string_) |
Dmitry | Aug 27, 2024 |
Other Digital Version | 4.2 Pseudorandom Number Generation Table 4.3: NumPy random number generator methods |
duplicate `uniform` function listed in the table |
Ben To | Mar 09, 2024 |
O'Reilly learning platform | Page 4.2 Pseudorandom Number Generation Table 4.3: NumPy random number generator methods |
In Table 4.3, “uniform” distribution is repeated in the third and last row. |
Gao Lu | Oct 01, 2024 |
Other Digital Version | 4.4 Array-Oriented Programming with Arrays first code listing |
In [169]: points = np.arange(-5, 5, 0.01) # 100 equally spaced points |
Ben To | Mar 11, 2024 |
Other Digital Version | 4.6 Linear Algebra 4th code example |
The qr method in the import statement, is never used. |
Doug Richardson | Aug 15, 2024 |
ePub | Page 5 Indexing, Selection and Filtering Using Code Examples |
In the following sentence should 'columns' be changed to 'rows'. When I test this, it prints 2 rows and all the columns. |
Steven Mooney | Feb 21, 2024 |
ePub | Page 7.1.1 Filtering Out Missing Data 6th Paragragh and [38] |
"Suppose you want to keep only rows containing at most a certain number of missing observations. You can indicate this with the thresh argument:" |
Anonymous | May 07, 2024 |
Other Digital Version | 9 Plotting and Visualization Figure 9.27: Tipping percentage by day split by time/smoker |
The code to generate figure 9.27 does not match the generated figure, as the generated figure has a hue to the bars (indicating the day) which is missing from: |
Doug Richardson | Aug 19, 2024 |
Other Digital Version | 9 Plotting and Visualization Figure 9.28: Box plot of tipping percentage by day |
Figure 9.28 box plots have hues in the image, but the code to generate them does not match. |
Doug Richardson | Aug 19, 2024 |
O'Reilly learning platform | Page 10.2 6th code box, In [72] |
The code example is "grouped_pct.agg([("average", "mean"), ("stdev", np.std)])". There is a FutureWarning to use "grouped_pct.agg([("average", "mean"), ("stdev", "std")]) instead. |
Jochen Schüttler | Apr 09, 2024 |
Other Digital Version | 13.3 US Baby Names In[116] China edition page415 |
According to the up code block: def~~ |
Zhang yingtan | Mar 19, 2024 |
Page 135 4 & 6 |
"If a DataFrame’s index and columns have their name attributes set, these will also be displayed:" |
Emile Jacques Bosman | May 01, 2024 | |
Printed, ePub | Page 147 3rd paragraph |
The second sentence in the following text has the word "role" rather than "row: |
Anonymous | Jul 31, 2024 |
Printed, ePub | Page 159 1st paragraph |
The paragraph starts with "Here the function f, which…". Since the example function is named "f1", the paragraph should start with "Here the function f1, which…" |
Anonymous | Jul 31, 2024 |
Printed, ePub | Page 166 3rd paragraph |
"When an entire row or column contains all NA values, the sum is 0, whereas if any value is not NA, then the result is NA. " |
Anonymous | Jul 31, 2024 |
Printed | Page 169 In[285] |
In[283] and In[285] look exactly the same even though line above says that you could include more concise syntax. |
Jude Cancellieri | Mar 09, 2024 |
Printed, ePub | Page 210, Section 7.2 2nd paragraph |
The sentence is: |
Anonymous | Aug 26, 2024 |
Printed, ePub | Page 273 last paragraph, following subtitle 'Pivoting "long" to "Wide" Format' |
In the sentence "In this format, individual values are represented by a single row in a table rather than multiple values per row.", the text starting with "by" should be: "by a single column in a table rather than multiple values (i.e. columns) per row." |
Anonymous | Aug 29, 2024 |