seq-datasource-v2

Sequence Data Source for Apache Spark

@garawalid

The SeqDataSourceV2 package allows reading Hadoop SequenceFiles from Spark SQL.
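
As a rough illustration of what such a read might look like once the package is on the classpath, here is a minimal Scala sketch. This page does not state the data source's format name, so the short name "seq" and the default input path "data.seq" are assumptions made for illustration only; check the project's README for the real identifier. Only the standard Spark `DataFrameReader` calls (`format`, `load`) are used.

```scala
import org.apache.spark.sql.SparkSession

// Sketch only: intended to be launched through spark-submit (or pasted into
// spark-shell) with the seq-datasource-v2 package on the classpath.
object ReadSequenceFile {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("seq-datasource-v2-example") // hypothetical application name
      .getOrCreate()

    // ASSUMPTION: "seq" is a guessed short name for the data source;
    // it is not given on this page.
    val formatName = "seq"

    // ASSUMPTION: placeholder path; pass the real SequenceFile path as the first argument.
    val inputPath = args.headOption.getOrElse("data.seq")

    // Generic DataFrameReader API: pick a data source by name, then load a path.
    val df = spark.read.format(formatName).load(inputPath)

    df.printSchema() // inspect whatever columns the data source exposes
    df.show(5)       // preview the first few records

    spark.stop()
  }
}
```

If the short name is not registered, Spark also accepts the fully qualified class name of a data source in `format(...)`; that class name is likewise not listed here.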


Tags

  • sql
  • input
  • data source

How to

Include this package in your Spark applications using one of the following:

spark-shell, pyspark, or spark-submit

> $SPARK_HOME/bin/spark-shell --packages garawalid:seq-datasource-v2:0.2.0

sbt

If you use the sbt-spark-package plugin, add the following to your sbt build file:

spDependencies += "garawalid/seq-datasource-v2:0.2.0"

Otherwise, add:

resolvers += "Spark Packages Repo" at "http://dl.bintray.com/spark-packages/maven"

libraryDependencies += "garawalid" % "seq-datasource-v2" % "0.2.0"
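
For context, the two settings above might sit in a complete build file along the following lines. This is a sketch under stated assumptions: the project name is hypothetical, and the Spark version (2.4.8) is a guess based on the release being built for Scala 2.11; use the Spark version your cluster actually runs.

```scala
// build.sbt (sketch)
name := "seq-datasource-example" // hypothetical project name

// The 0.2.0 release is listed for Scala 2.11, so the build should use a 2.11.x compiler.
scalaVersion := "2.11.12"

// Repository URL copied from the instructions above.
resolvers += "Spark Packages Repo" at "http://dl.bintray.com/spark-packages/maven"

libraryDependencies ++= Seq(
  // ASSUMPTION: a Spark 2.4.x line that publishes Scala 2.11 artifacts;
  // "provided" because the cluster supplies Spark at runtime.
  "org.apache.spark" %% "spark-sql" % "2.4.8" % "provided",
  // Coordinates from this page; a single % is used because the artifact id carries no Scala suffix.
  "garawalid" % "seq-datasource-v2" % "0.2.0"
)
```

Marking Spark as "provided" keeps it out of an assembled jar, while the data source itself is bundled or supplied at launch time with --packages.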

Maven

In your pom.xml, add:
<dependencies>
  <!-- list of dependencies -->
  <dependency>
    <groupId>garawalid</groupId>
    <artifactId>seq-datasource-v2</artifactId>
    <version>0.2.0</version>
  </dependency>
</dependencies>
<repositories>
  <!-- list of other repositories -->
  <repository>
    <id>SparkPackagesRepo</id>
    <url>http://dl.bintray.com/spark-packages/maven</url>
  </repository>
</repositories>

Releases

Version: 0.2.0 (4e5929) / Date: 2021-03-20 / License: Apache-2.0 / Scala version: 2.11