Sunday, October 21, 2018

Apache HBase on Amazon S3


By using Amazon S3 as a data store for Apache HBase, you can separate your cluster’s storage and compute nodes.
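As a rough illustration of what that separation looks like in practice, the sketch below builds the kind of configuration that Amazon EMR accepts to point the HBase root directory at S3 instead of HDFS. The helper name `hbase_on_s3_configurations` and the bucket path are hypothetical, and the classification and property names reflect my understanding of the EMR settings, so check them against the whitepaper before relying on them.

```python
import json


def hbase_on_s3_configurations(root_dir: str) -> list:
    """Build EMR configuration classifications that point HBase at S3.

    `root_dir` is a hypothetical S3 location for the HBase root directory.
    """
    if not root_dir.startswith("s3://"):
        raise ValueError("root_dir must be an s3:// URI")
    return [
        # Tells HBase where its root directory lives (S3 rather than HDFS).
        {"Classification": "hbase-site", "Properties": {"hbase.rootdir": root_dir}},
        # Tells EMR to run HBase in S3 storage mode.
        {"Classification": "hbase", "Properties": {"hbase.emr.storageMode": "s3"}},
    ]


if __name__ == "__main__":
    # Example bucket name is an assumption for illustration only.
    config = hbase_on_s3_configurations("s3://example-bucket/hbase-root")
    print(json.dumps(config, indent=2))
```

The printed JSON could be supplied as the cluster configuration when the cluster is created. Because the data lives in S3, the compute nodes can then be resized or replaced without moving the stored tables.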

Many customers have taken advantage of the benefits of running Apache HBase on Amazon S3 for data storage. These benefits include lower costs, data durability, and more efficient scalability.

Customers such as the Financial Industry Regulatory Authority (FINRA) have lowered their costs by 60% by moving to an Apache HBase on Amazon S3 architecture. They have also gained the operational benefits that come with decoupling storage from compute and using Amazon S3 as the storage layer.

The AWS whitepaper provides an overview of Apache HBase on Amazon S3 and guides data engineers and software developers through migrating an on-premises or HDFS-backed Apache HBase cluster to Apache HBase on Amazon S3.

The whitepaper offers a migration plan with detailed steps for each stage, covering data migration, performance tuning, and operational guidance.

Whitepaper Ref: https://d1.awsstatic.com/whitepapers/Migrating_to_Apache_Hbase_on_Amazon_S3_on_Amazon_EMR.pdf
