Chapter 2. Overview of the Machine Learning Process

The Cross-Industry Standard Process for Data Mining (CRISP-DM) is a methodology for running data mining projects. It defines six phases that can be repeated iteratively for continuous improvement:

  • Business understanding

  • Data understanding

  • Data preparation

  • Modeling

  • Evaluation

  • Deployment
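To make the phases above concrete, here is a minimal sketch that maps each one to a small function using only the Python standard library. Everything in it is an assumption for illustration: the toy records in `RAW`, the threshold "model", and the function names (`prepare`, `fit_threshold`, `predict`, `accuracy`, `run_workflow`) are hypothetical and do not come from CRISP-DM itself or from the workflow in Figure 2-1.

```python
# Hypothetical toy records: (hours_studied, passed). None marks a missing value.
# Business understanding (assumed goal): predict whether a student passes.
RAW = [(1.0, 0), (2.0, 0), (None, 1), (3.0, 0),
       (4.0, 1), (5.0, 1), (6.0, 1), (7.0, 1)]


def prepare(rows):
    """Data preparation: drop rows whose feature is missing."""
    return [(x, y) for x, y in rows if x is not None]


def fit_threshold(rows):
    """Modeling: put a cutoff midway between the two class means."""
    neg = [x for x, y in rows if y == 0]
    pos = [x for x, y in rows if y == 1]
    return (sum(neg) / len(neg) + sum(pos) / len(pos)) / 2


def predict(threshold, x):
    """Deployment: the function an application would call on new data."""
    return int(x > threshold)


def accuracy(threshold, rows):
    """Evaluation: fraction of rows predicted correctly."""
    hits = sum(predict(threshold, x) == y for x, y in rows)
    return hits / len(rows)


def run_workflow(raw):
    # Data understanding: count how many rows have a missing feature.
    n_missing = sum(1 for x, _ in raw if x is None)
    clean = prepare(raw)
    # Hold out every second row so evaluation uses data the model has not seen.
    train, test = clean[::2], clean[1::2]
    threshold = fit_threshold(train)
    return {
        "n_missing": n_missing,
        "n_clean": len(clean),
        "threshold": threshold,
        "test_accuracy": accuracy(threshold, test),
    }


if __name__ == "__main__":
    print(run_workflow(RAW))
```

In a real project each function would be far larger, and a poor evaluation score would send you back to an earlier phase, which is where the iterative nature of the process comes in.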

Figure 2-1 shows my workflow for creating a predictive model that expands on the CRISP-DM methodology. The walkthrough in the next chapter will cover these basic steps.

Figure 2-1. Common workflow for machine learning.
