Book description
HBase is a remarkable tool for indexing mass volumes of data, but getting started with this distributed database and its ecosystem can be daunting. With this hands-on guide, you’ll learn how to architect, design, and deploy your own HBase applications by examining real-world solutions. Along with HBase principles and cluster deployment guidelines, this book includes in-depth case studies that demonstrate how large companies solved specific use cases with HBase.
Authors Jean-Marc Spaggiari and Kevin O’Dell also provide draft solutions and code examples to help you implement your own versions of those use cases, from master data management (MDM) and document storage to near real-time event processing. You’ll also learn troubleshooting techniques to help you avoid common deployment mistakes.
- Learn exactly what HBase does, what its ecosystem includes, and how to set up your environment
- Explore how real-world HBase instances were deployed and put into production
- Examine documented use cases for tracking healthcare claims, digital advertising, data management, and product quality
- Understand how HBase works with tools and techniques such as Spark, Kafka, MapReduce, and the Java API
- Learn how to identify the causes and understand the consequences of the most common HBase issues
Publisher resources
Table of contents
- Foreword
- Preface
- I. Introduction to HBase
- 1. What Is HBase?
- 2. HBase Principles
- 3. HBase Ecosystem
- 4. HBase Sizing and Tuning Overview
- 5. Environment Setup
- II. Use Cases
- 6. Use Case: HBase as a System of Record
- 7. Implementation of an Underlying Storage Engine
- 8. Use Case: Near Real-Time Event Processing
- 9. Implementation of Near Real-Time Event Processing
- 10. Use Case: HBase as a Master Data Management Tool
- 11. Implementation of HBase as a Master Data Management Tool
- 12. Use Case: Document Store
- 13. Implementation of Document Store
- III. Troubleshooting
- 14. Too Many Regions
- 15. Too Many Column Families
- 16. Hotspotting
- 17. Timeouts and Garbage Collection
- 18. HBCK and Inconsistencies
- Index
Product information
- Title: Architecting HBase Applications
- Author(s):
- Release date: July 2016
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781491916100
You might also like
book
HBase in Action
HBase in Action has all the knowledge you need to design, build, and run applications using …
video
Building Apache HBase Applications
In this Building Apache HBase Applications training course, expert author Jonathan Hsieh will teach you how …
book
HBase: The Definitive Guide
If you're looking for a scalable storage solution to accommodate a virtually endless amount of data, …
book
HBase Administration Cookbook
Master HBase configuration and administration for optimum database performance with this book and ebook. Move large …