Errata

Hands-On Large Language Models

Errata for Hands-On Large Language Models


The errata list contains errors, and their corrections, that were found after the product was released. If an error was corrected in a later version or reprint, the date of the correction is displayed in the column titled "Date Corrected".

The following errata were submitted by our customers and approved as valid errors by the author or editor.


Columns: Version, Location, Description, Submitted By, Date Submitted, Date Corrected
Printed
Page 300
The tip box

On page 300 of the printed book, there is a tip box about the batch size in MNR loss. It duplicates the tip box on page 307, so the one on page 300 can be safely ignored.

Note, though, that larger batch sizes generally also help speed up training, especially if you have enough VRAM. Likewise, the tip applies not only to MNR loss but also to other losses that share similar mechanics. In other words, increasing the batch size is seldom a bad idea if your device can handle it.
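As a rough illustration of why batch size matters for this family of losses, the sketch below counts the in-batch negatives each anchor sees. It assumes the common formulation in which a batch holds (anchor, positive) pairs and every other positive in the batch acts as a negative; the function names are hypothetical and are not taken from the book.

```python
# Illustrative only: counts the in-batch negatives available to an
# MNR-style loss, ASSUMING each batch holds (anchor, positive) pairs and
# every other positive in the batch serves as a negative for an anchor.

def in_batch_negatives(batch_size: int) -> int:
    """Negatives per anchor: every positive in the batch except its own."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    return batch_size - 1


def chance_accuracy(batch_size: int) -> float:
    """Probability of picking the true positive by random guessing."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    return 1.0 / batch_size


for batch_size in (8, 32, 128):
    # A larger batch means more candidates to rule out per anchor.
    print(batch_size, in_batch_negatives(batch_size), chance_accuracy(batch_size))
```

Under that assumption, a larger batch gives every anchor more candidates to discriminate against, which makes each training step a harder task.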

Maarten Grootendorst
 
Nov 13, 2024 
Printed
Page 338
bottom

At the bottom of the page, the text states that there are 20*32=680 samples. This should be 640 samples.

The same applies to 680*2=1280. The first factor should be 640 rather than 680, so the expression reads 640*2=1280; the result of 1280 is unchanged.
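The corrected figures can be checked with a few lines of arithmetic. The numbers come from the erratum; the variable names are hypothetical, since the erratum does not say what the two factors represent.

```python
# Figures taken from the erratum; variable names are hypothetical.
first_factor = 20
second_factor = 32

samples = first_factor * second_factor  # corrected sample count
doubled = samples * 2                   # second expression on the page

print(samples, doubled)
```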

Maarten Grootendorst
 
Dec 10, 2024