Video description
End-to-End Hive: HQL, Partitioning, Bucketing, UDFs, Windowing, Optimization, Map Joins, Indexes
About This Video
- Analytical Processing: Joins, Subqueries, Views, Table Generating Functions, Explode, Lateral View, Windowing and more
- Tuning Hive for better functionality: Partitioning, Bucketing, Join Optimizations, Map Side Joins, Indexes, Writing custom User Defined functions in Java. UDF, UDAF, GenericUDF, GenericUDTF, Custom functions in Python, Implementation of MapReduce for Select, Group by and Join
In Detail
Hive is like a new friend with an old face (SQL). This course is an end-to-end, practical guide to using Hive for Big Data processing. Let's parse that A new friend with an old face: Hive helps you leverage the power of Distributed computing and Hadoop for Analytical processing. Its interface is like an old friend: the very SQL like HiveQL. This course will fill in all the gaps between SQL and what you need to use Hive. End-to-End: The course is an end-to-end guide for using Hive: whether you are analyst who wants to process data or an Engineer who needs to build custom functionality or optimize performance - everything you'll need is right here. New to SQL? No need to look elsewhere. The course has a primer on all the basic SQL constructs, Practical: Everything is taught using real-life examples, working queries and code.
Table of contents
- Chapter 1 : You, Us This Course
- Chapter 2 : Introducing Hive
- Chapter 3 : Hadoop and Hive Install
- Chapter 4 : Hadoop and HDFS Overview
- Chapter 5 : Hive Basics
- Chapter 6 : Built-in Functions
- Chapter 7 : Sub-Queries
- Chapter 8 : Partitioning
- Chapter 9 : Bucketing
- Chapter 10 : Windowing
- Chapter 11 : Understanding MapReduce
- Chapter 12 : MapReduce logic for queries: Behind the scenes
- Chapter 13 : Join Optimizations in Hive
- Chapter 14 : Custom Functions in Python
-
Chapter 15 : Custom functions in Java
- Introducing UDFs - you're not limited by what Hive offers
- The Simple UDF: The standard function for primitive types
- The Simple UDF: Java implementation for replacetext()
- Generic UDFs, the Object Inspector and DeferredObjects
- The Generic UDF: Java implementation for containsstring()
- The UDAF: Custom aggregate functions can get pretty complex
- The UDAF: Java implementation for max()
- The UDAF: Java implementation for Standard Deviation
- The Generic UDTF: Custom table generating functions
- The Generic UDTF: Java implementation for namesplit()
- Chapter 16 : SQL Primer - Select Statements
- Chapter 17 : SQL Primer - Group By, Order by and Having
- Chapter 18 : SQL Primer – Joins
- Chapter 19 : Appendix
Product information
- Title: From 0 to 1: Hive for Processing Big Data
- Author(s):
- Release date: December 2017
- Publisher(s): Packt Publishing
- ISBN: 9781788995054
You might also like
video
Data Engineering Foundations LiveLessons Part 1: Using Spark, Hive, and Hadoop Scalable Tools
6+ Hours of Video Instruction One Line Sell The perfect way to get started with scalable …
video
Apache Spark with Python - Big Data with PySpark and Spark
This course covers all the fundamentals of Apache Spark with Python and teaches you everything you …
video
50 Hours of Big Data, PySpark, AWS, Scala, and Scraping
Part 1 is designed to reflect the most in-demand Scala skills. It provides an in-depth understanding …
video
Data Engineering with Python and AWS Lambda LiveLessons
7 Hours of Video Instruction Data Engineering with Python and AWS Lambda LiveLessons shows users how …