2

Data Profiling and Preparation before Data Wrangling

Now that we have a good idea about how data is stored and maintained in a database and how normalization and de-normalization are used to store data, in this chapter we will discuss the next stage in the process, which is cleaning and transforming data.

In this chapter, we will cover the following main topics:

  • Data wrangling and its importance
  • Structured and unstructured data
  • Data wrangling tools that are used in the industry
  • What is data profiling?

What is data wrangling?

Data wrangling is the process of cleaning, transforming, and organizing dirty data into clean data that can be used to generate powerful insights to enable stakeholders to make the right decisions. It is basically the ...

Get Data Wrangling with SQL now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.