spark-on-hbase


A generic solution for scanning and joining HBase tables as Spark RDDs, and for mutating HBase tables from RDDs.

This is a Spark-centric implementation built around two main classes, HBaseRDD and HBaseTable. HBaseRDD is used for scanning and for optimised single-stage joins, while HBaseTable mutates the underlying HBase tables using RDDs as input. The master branch is set up for HBase API 1.1.0.1, Scala 2.10 and Spark 1.4.1. Demos and tutorials for different use cases are in the examples package.


How to

This package has no releases published in the Spark Packages repository and no Maven coordinates supplied. You may have to build it from source, or it may simply be a script. To use this Spark Package, follow the instructions in the README.

Releases

No releases yet.