spark-select (homepage)

Spark select enables retrieving only required data from an object

@minio / (0)

S3 Select is supported with CSV and JSON files using s3selectCSV and s3selectJSON values to specify the data format.


Tags

  • 1|library
  • 1|sql
  • 1|input
  • 1|tutorial
  • 1|scala
  • 1|data source
  • 1|s3select

How to

Include this package in your Spark Applications using:

spark-shell, pyspark, or spark-submit

> $SPARK_HOME/bin/spark-shell --packages io.minio:spark-select_2.11:1.0.0

sbt

If you use the sbt-spark-package plugin, in your sbt build file, add:

spDependencies += "minio/spark-select:1.0.0"

Otherwise,

libraryDependencies += "io.minio" % "spark-select_2.11" % "1.0.0"

Maven

In your pom.xml, add:
<dependencies>
  <!-- list of dependencies -->
  <dependency>
    <groupId>io.minio</groupId>
    <artifactId>spark-select_2.11</artifactId>
    <version>1.0.0</version>
  </dependency>
</dependencies>

Releases

Version: 1.0.0-s_2.11 ( 356d74 | zip | jar ) / Date: 2018-12-05 / License: Apache-2.0 / Scala version: 2.11

Version: 0.0.1 ( 73695c | zip ) / Date: 2018-12-04 / License: Apache-2.0