Book description
If you’ve been asked to maintain large and complex Hadoop clusters, this book is a must. Demand for operations-specific material has skyrocketed now that Hadoop is becoming the de facto standard for truly large-scale data processing in the data center. Eric Sammer, Principal Solution Architect at Cloudera, shows you the particulars of running Hadoop in production, from planning, installing, and configuring the system to providing ongoing maintenance.
Rather than run through all possible scenarios, this pragmatic operations guide calls out what works, as demonstrated in critical deployments.
- Get a high-level overview of HDFS and MapReduce: why they exist and how they work
- Plan a Hadoop deployment, from hardware and OS selection to network requirements
- Learn setup and configuration details with a list of critical properties
- Manage resources by sharing a cluster across multiple groups
- Get a runbook of the most common cluster maintenance tasks
- Monitor Hadoop clusters—and learn troubleshooting with the help of real-world war stories
- Use basic tools and techniques to handle backup and catastrophic failure
Publisher resources
Table of contents
- Hadoop Operations
- Dedication
- Preface
- 1. Introduction
- 2. HDFS
- 3. MapReduce
- 4. Planning a Hadoop Cluster
- 5. Installation and Configuration
- 6. Identity, Authentication, and Authorization
- 7. Resource Management
- 8. Cluster Maintenance
- 9. Troubleshooting
- 10. Monitoring
- 11. Backup and Recovery
- A. Deprecated Configuration Properties
- Index
- About the Author
- Colophon
- Copyright
Product information
- Title: Hadoop Operations
- Author(s):
- Release date: October 2012
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781449327057
You might also like
book
Hadoop in Action
Hadoop in Action introduces the subject and teaches you how to write programs in the MapReduce …
book
Hadoop Security
As more corporations turn to Hadoop to store and process their most valuable data, the risk …
book
Hadoop in Practice, Second Edition
Hadoop in Practice, Second Edition provides over 100 tested, instantly useful techniques that will help you …
video
Learning Apache Hadoop
In this Introduction to Hadoop training course, expert author Rich Morrow will teach you the tools …