Book description
Apache Hadoop is the technology at the heart of the Big Data revolution, and Hadoop skills are in enormous demand. Now, in just 24 lessons of one hour or less, you can learn all the skills and techniques you'll need to deploy each key component of a Hadoop platform in your local environment or in the cloud, building a fully functional Hadoop cluster and using it with real programs and datasets. Each short, easy lesson builds on all that's come before, helping you master all of Hadoop's essentials, and extend it to meet your unique challenges. Apache Hadoop in 24 Hours, Sams Teach Yourself covers all this, and much more:
Understanding Hadoop and the Hadoop Distributed File System (HDFS)
Importing data into Hadoop, and processing it there
Mastering basic MapReduce Java programming, and using advanced MapReduce API concepts
Making the most of Apache Pig and Apache Hive
Implementing and administering YARN
Taking advantage of the full Hadoop ecosystem
Managing Hadoop clusters with Apache Ambari
Working with the Hadoop User Environment (HUE)
Scaling, securing, and troubleshooting Hadoop environments
Integrating Hadoop into the enterprise
Deploying Hadoop in the cloud
Getting started with Apache Spark
Step-by-step instructions walk you through common questions, issues, and tasks; Q-and-As, Quizzes, and Exercises build and test your knowledge; "Did You Know?" tips offer insider advice and shortcuts; and "Watch Out!" alerts help you avoid pitfalls. By the time you're finished, you'll be comfortable using Apache Hadoop to solve a wide spectrum of Big Data problems.
Table of contents
- About This E-Book
- Title Page
- Copyright Page
- Contents at a Glance
- Table of Contents
- Preface
- About the Author
- Acknowledgments
- Part I: Getting Started with Hadoop
- Part II: Using Hadoop
- Hour 7: Programming MapReduce Applications
- Hour 8: Analyzing Data in HDFS Using Apache Pig
- Hour 9: Using Advanced Pig
- Hour 10: Analyzing Data Using Apache Hive
- Hour 11: Using Advanced Hive
- Hour 12: Using SQL-on-Hadoop Solutions
- Hour 13: Introducing Apache Spark
- Hour 14: Using the Hadoop User Environment (HUE)
- Hour 15: Introducing NoSQL
- Part III: Managing Hadoop
- Hour 16: Managing YARN
- Hour 17: Working with the Hadoop Ecosystem
- Hour 18: Using Cluster Management Utilities
- Cluster Management Overview
- Deploying Clusters and Services Using Management Tools
- Configuration and Service Management Using Management Tools
- Monitoring, Troubleshooting, and Securing Hadoop Clusters Using Cluster Management Utilities
- Getting Started with the Cluster Management Utilities
- Summary
- Q&A
- Workshop
- Hour 19: Scaling Hadoop
- Hour 20: Understanding Cluster Configuration
- Hour 21: Understanding Advanced HDFS
- Hour 22: Securing Hadoop
- Hour 23: Administering, Monitoring and Troubleshooting Hadoop
- Hour 24: Integrating Hadoop into the Enterprise
- Index
- Code Snippets
Product information
- Title: Sams Teach Yourself Hadoop in 24 Hours
- Author(s):
- Release date: April 2017
- Publisher(s): Sams
- ISBN: 9780134456737