Horizontal scaling with automatic sharding of HBase tables

Automatics sharding is a nice feature in HBase. Auto sharding is the capability where the HBase tables are dynamically divided into smaller parts and distributed across the region servers when they become too large.

This capability to share the data and distribute parts of it to different regions helps HBase to scale Horizontally. Regions contain a subset of the table's data. This data is a contiguous sorted set of rows that are stored together.

As you can imagine, when you start with HBase and you start putting data in HBase tables, there would only be a single region. At some point, a region becomes too large and is split into two tables containing partial sets of rows. This is ...

Get Architecting Data-Intensive Applications now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.