© Karthik Ramasubramanian and Abhishek Singh 2017
Karthik Ramasubramanian and Abhishek Singh, Machine Learning Using R, 10.1007/978-1-4842-2334-5_2

2. Data Preparation and Exploration

Karthik Ramasubramanian and Abhishek Singh
New Delhi, Delhi, India
 
In our introductory chapter, we emphasized applying machine learning (ML) algorithms with a simplified process flow. In this chapter, we go deeper into the first block of that process flow: data exploration and preparation.
The subject of data exploration was formally introduced by John W. Tukey almost four decades ago in his book Exploratory Data Analysis (EDA). The methods discussed in the book were profound, and few software programs include all of them. ...
