Chapter 6. Hadoop Ecosystem – Apache Hive
In this chapter, we will cover the following recipes:
- Getting started with Apache Hive
- Creating databases and tables using Hive CLI
- Simple SQL-style data querying using Apache Hive
- Creating and populating Hive tables and views using Hive query results
- Utilizing different storage formats in Hive – storing table data using ORC files
- Using Hive built-in functions
- Hive batch mode – using a query file
- Performing a join with Hive
- Creating partitioned Hive tables
- Writing Hive User-defined Functions (UDF)
- HCatalog – performing Java MapReduce computations on data mapped to Hive tables
- HCatalog – Writing data to Hive tables from Java MapReduce computations
Introduction
Hadoop has a family of projects that are either built on ...
Get Hadoop MapReduce v2 Cookbook - Second Edition now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.