Chapter 2: Data Collection and Preprocessing

WHAT'S IN THIS CHAPTER

  • Sources to obtain training data
  • Techniques to explore data
  • Techniques to impute missing values
  • Feature engineering techniques

In the previous chapter, you were given a general overview of machine learning and learned about the different types of machine learning systems. In this chapter, you will learn to use NumPy, pandas, and scikit-learn to perform common feature engineering tasks.
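As a small preview of the kind of task covered here, the following is a minimal sketch, not the chapter's own code. It assumes a made-up data set with hypothetical `age` and `income` columns, counts the missing values with pandas, fills them with scikit-learn's `SimpleImputer` using the column mean, and derives one new feature. The column names, values, and the derived feature are illustrative assumptions only.

```python
import numpy as np
import pandas as pd
from sklearn.impute import SimpleImputer

# Hypothetical toy data set; column names and values are invented for illustration.
df = pd.DataFrame({
    "age": [25.0, np.nan, 40.0, 31.0],
    "income": [50000.0, 62000.0, np.nan, 58000.0],
})

# Explore: count the missing values in each column.
missing_counts = df.isna().sum()

# Impute: replace each missing value with the mean of its column.
imputer = SimpleImputer(strategy="mean")
imputed = pd.DataFrame(imputer.fit_transform(df), columns=df.columns)

# Engineer: derive a new feature from two existing columns (an arbitrary example).
imputed["income_to_age"] = imputed["income"] / imputed["age"]
```

Mean imputation is only one possible strategy; `SimpleImputer` also accepts `"median"`, `"most_frequent"`, and `"constant"`, and the right choice depends on the data.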
