3 The Mechanics of Training LLMs

Here, we will guide you through the intricate process of training LLMs, starting with the crucial task of data preparation and management. This process is fundamental to getting LLMs to perform in a desired way. We will further explore the establishment of a robust training environment, delving into the science of hyperparameter tuning and elaborating on how to address overfitting, underfitting, and other common training challenges, giving you a thorough grounding in creating effective LLMs.

In this chapter, we’re going to cover the following main topics:

Data – preparing the fuel for LLMs
Setting up your training environment
Hyperparameter tuning – finding the sweet spot
Challenges in training LLMs – overfitting, ...

Get Decoding Large Language Models now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Decoding Large Language Models by Irena Cronin

3

The Mechanics of Training LLMs

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly