spark-hadoopoffice-ds (homepage)

A Spark datasource for the HadoopOffice library

@ZuInnoTe / (1)

A Spark datasource for the open source HadoopOffice library. This library enables you to read/write office documents, such as MS Excel, on Big Data platforms.


Tags

  • 1|data source
  • 1|office
  • 1|excel
  • 1|hadoopoffice

How to

Include this package in your Spark Applications using:

spark-shell, pyspark, or spark-submit

> $SPARK_HOME/bin/spark-shell --packages com.github.zuinnote:spark-hadoopoffice-ds_2.10:1.0.1

sbt

In your sbt build file, add:

libraryDependencies += "com.github.zuinnote" % "spark-hadoopoffice-ds_2.10" % "1.0.1"

Maven

In your pom.xml, add:
<dependencies>
  <!-- list of dependencies -->
  <dependency>
    <groupId>com.github.zuinnote</groupId>
    <artifactId>spark-hadoopoffice-ds_2.10</artifactId>
    <version>1.0.1</version>
  </dependency>
</dependencies>

Releases

Version: 1.0.1-s_2.10 ( 96cb93 | zip | jar ) / Date: 2017-01-07 / License: Apache-2.0 / Scala version: 2.10

Version: 1.0.1-s_2.11 ( 96cb93 | zip | jar ) / Date: 2017-01-07 / License: Apache-2.0 / Scala version: 2.11