CHAPTER 9Data Cleaning and Filtering

We shall briefly cover some practices and rules around data filtering and ways to extract the data we want from a table and a historical partitioned database.

9.1 PREDICATE FILTERING

9.1.1 The Where Clause

The where function is the q king of filters; we were first introduced to it in Chapter 1. It acts as a natural filter on a vector:

image
image

It is also our constraint filter in a select statement:

image
image

We recall from Chapter 2 that we separate multiple constraints using a ',' and these are applied from left to right, an exception to the usual q parsing. Note that for multiple where clauses, the filtering is cascading, i.e. the second filter is applied after the initial filter has already reduced our data set. This is a built-in optimisation in q:

image
image
image

Now in ...

Get Machine Learning and Big Data with kdb+/q now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.