spark-sas7bdat (homepage)

Splittable SAS (.sas7bdat) Input Format for Hadoop and Spark SQL

@saurfang

This package reads SAS binary files (.sas7bdat) in parallel as data frames in Spark SQL. It also provides a utility to export them as CSV (using spark-csv) or Parquet files.
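As a rough illustration of the read-then-export flow described above, here is a minimal PySpark sketch. The data source name `com.github.saurfang.sas.spark` comes from the project's README rather than from this page, so check it against the release you install. The helper name `sas_to_parquet` and the paths are hypothetical.

```python
def sas_to_parquet(spark, sas_path, parquet_path):
    """Read a .sas7bdat file as a DataFrame and write it out as Parquet.

    `spark` is an existing SparkSession that was started with the
    spark-sas7bdat package on its classpath (see "How to" on this page).
    """
    # Data source name as given in the project's README (assumption:
    # unchanged in the release you are using).
    df = spark.read.format("com.github.saurfang.sas.spark").load(sas_path)

    # Standard DataFrameWriter call; "overwrite" replaces any existing output.
    df.write.mode("overwrite").parquet(parquet_path)
    return df
```

A call might look like `sas_to_parquet(spark, "cars.sas7bdat", "cars.parquet")`, where both paths are placeholders. Because the input format is splittable, Spark can assign different parts of a large file to different tasks.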


Tags

  • sql
  • tools
  • sas
  • data source

How to

Include this package in your Spark Applications using:

spark-shell, pyspark, or spark-submit

> $SPARK_HOME/bin/spark-shell --packages saurfang:spark-sas7bdat:3.0.0-s_2.12
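When the session is created from a Python script rather than from the command line, the same coordinate can usually be passed through Spark's `spark.jars.packages` setting. The sketch below assumes PySpark is installed. The function names, the app name, and the explicit `spark.jars.repositories` entry are illustrative choices, not something this page prescribes.

```python
SPARK_PACKAGES_REPO = "https://repos.spark-packages.org/"


def package_coordinate(version="3.0.0", scala="2.12"):
    """Build the coordinate in the form used on this page:
    saurfang:spark-sas7bdat:<version>-s_<scala version>."""
    return f"saurfang:spark-sas7bdat:{version}-s_{scala}"


def build_session(app_name="sas7bdat-demo"):
    """Create a SparkSession that resolves the package at startup."""
    # Imported lazily so package_coordinate() works without PySpark installed.
    from pyspark.sql import SparkSession

    return (
        SparkSession.builder.appName(app_name)
        # Programmatic equivalent of the --packages flag.
        .config("spark.jars.packages", package_coordinate())
        # Extra repository to search; may be redundant on some Spark versions.
        .config("spark.jars.repositories", SPARK_PACKAGES_REPO)
        .getOrCreate()
    )
```

These settings only take effect if they are set before the first session is created in the process. The Scala suffix in the coordinate should match the Scala version your Spark distribution was built with.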

sbt

If you use the sbt-spark-package plugin, in your sbt build file, add:

spDependencies += "saurfang/spark-sas7bdat:3.0.0-s_2.12"

Otherwise,

resolvers += "Spark Packages Repo" at "https://repos.spark-packages.org/"

libraryDependencies += "saurfang" % "spark-sas7bdat" % "3.0.0-s_2.12"

Maven

In your pom.xml, add:
<dependencies>
  <!-- list of dependencies -->
  <dependency>
    <groupId>saurfang</groupId>
    <artifactId>spark-sas7bdat</artifactId>
    <version>3.0.0-s_2.12</version>
  </dependency>
</dependencies>
<repositories>
  <!-- list of other repositories -->
  <repository>
    <id>SparkPackagesRepo</id>
    <url>https://repos.spark-packages.org/</url>
  </repository>
</repositories>

Releases

Version: 3.0.0-s_2.12 ( 465b58 | zip | jar ) / Date: 2020-09-13 / License: Apache-2.0 / Scala version: 2.12

Version: 3.0.0-s_2.11 ( 465b58 | zip | jar ) / Date: 2020-09-13 / License: Apache-2.0 / Scala version: 2.11

Version: 2.1.0-s_2.11 ( de5307 | zip | jar ) / Date: 2018-12-10 / License: Apache-2.0 / Scala version: 2.11

Version: 2.1.0-s_2.10 ( de5307 | zip | jar ) / Date: 2018-12-10 / License: Apache-2.0 / Scala version: 2.10

Version: 2.0.0-s_2.10 ( 69ffdd | zip | jar ) / Date: 2018-02-03 / License: Apache-2.0 / Scala version: 2.10

Version: 2.0.0-s_2.11 ( 2b9bd7 | zip | jar ) / Date: 2018-02-03 / License: Apache-2.0 / Scala version: 2.11

Version: 1.1.5-s_2.11 ( ded9b1 | zip | jar ) / Date: 2016-11-20 / License: GPL-3.0 / Scala version: 2.11

Version: 1.1.4-s_2.10 ( 197e07 | zip | jar ) / Date: 2015-10-07 / License: GPL-3.0 / Scala version: 2.10

Spark Scala/Java API compatibility: - 90% , - 68% , - 100% , - 21% , - 76% , - 62%

Version: 1.1.3-s_2.10 ( 90c11b | zip | jar ) / Date: 2015-06-28 / License: GPL-3.0 / Scala version: 2.10

Spark Scala/Java API compatibility: - 22% , - 82% , - 65% , - 71%

Version: 1.1.2-s_2.10 ( 83cff4 | zip | jar ) / Date: 2015-06-12 / License: GPL-3.0 / Scala version: 2.10

Spark Scala/Java API compatibility: - 69% , - 21% , - 100% , - 63%

Version: 1.1.1-s_2.10 ( 49b096 | zip | jar ) / Date: 2015-06-12 / License: GPL-3.0 / Scala version: 2.10

Spark Scala/Java API compatibility: - 21% , - 100% , - 63% , - 69%

Version: 1.1.0-s_2.10 ( 5cdd25 | zip | jar ) / Date: 2015-06-11 / License: GPL-3.0 / Scala version: 2.10

Spark Scala/Java API compatibility: - 69% , - 21% , - 100% , - 63%

Version: 1.0.0-s_2.10 ( 3e3214 | zip | jar ) / Date: 2015-05-14 / License: GPL-3.0 / Scala version: 2.10

Spark Scala/Java API compatibility: - 21% , - 100% , - 63% , - 69%
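Since each release is published per Scala version, it can help to look up the newest release available for a given Scala version. The following is a small sketch built only from the release list above. The names `RELEASES` and `latest_for_scala` are hypothetical helpers, not part of the package.

```python
# (package version, Scala version) pairs copied from the release list above.
RELEASES = [
    ("3.0.0", "2.12"), ("3.0.0", "2.11"),
    ("2.1.0", "2.11"), ("2.1.0", "2.10"),
    ("2.0.0", "2.10"), ("2.0.0", "2.11"),
    ("1.1.5", "2.11"), ("1.1.4", "2.10"),
    ("1.1.3", "2.10"), ("1.1.2", "2.10"),
    ("1.1.1", "2.10"), ("1.1.0", "2.10"),
    ("1.0.0", "2.10"),
]


def latest_for_scala(scala):
    """Return the newest release string (e.g. '3.0.0-s_2.12') for a Scala version."""
    candidates = [version for version, s in RELEASES if s == scala]
    if not candidates:
        raise ValueError(f"no release listed for Scala {scala}")
    # Compare versions numerically, component by component.
    newest = max(candidates, key=lambda v: tuple(int(p) for p in v.split(".")))
    return f"{newest}-s_{scala}"
```

For example, `latest_for_scala("2.10")` yields `2.1.0-s_2.10`, because the 3.0.0 release is listed only for Scala 2.11 and 2.12.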