spark-tools (homepage)

Executable Apache Spark Tools: Format Converter & SQL Processor

@tupol / (0)

This project contains some basic runnable tools that can help with various tasks around a Spark based project.

The main tools available:

FormatConverter Converts any acceptable file format into a different file format, providing also partitioning support.
SimpleSqlProcessor Applies a given SQL to the input files which are being mapped into tables.


Tags

  • 1|streaming
  • 1|sql
  • 1|kafka
  • 1|application
  • 1|scala
  • 1|tools
  • 1|examples
  • 1|utils
  • 1|converter
  • 1|sql-processor
  • 1|format
  • 1|delta

How to

Include this package in your Spark Applications using:

spark-shell, pyspark, or spark-submit

> $SPARK_HOME/bin/spark-shell --packages org.tupol:spark-tools_2.11:0.4.1

sbt

In your sbt build file, add:

libraryDependencies += "org.tupol" % "spark-tools_2.11" % "0.4.1"

Maven

In your pom.xml, add:
<dependencies>
  <!-- list of dependencies -->
  <dependency>
    <groupId>org.tupol</groupId>
    <artifactId>spark-tools_2.11</artifactId>
    <version>0.4.1</version>
  </dependency>
</dependencies>

Releases

Version: 0.4.1-s_2.11 ( 2e4ece | zip | jar ) / Date: 2020-09-12 / License: MIT / Scala version: 2.11

Version: 0.4.1-s_2.12 ( 2e4ece | zip | jar ) / Date: 2020-09-12 / License: MIT / Scala version: 2.12

Version: 0.4.1 ( 2e4ece | zip | jar ) / Date: 2020-09-12 / License: MIT / Scala version: 2.11

Version: 0.4.0 ( dc5949 | zip | jar ) / Date: 2019-08-28 / License: MIT / Scala version: 2.11

Version: 0.3.0 ( cc2a6a | zip | jar ) / Date: 2019-05-09 / License: MIT / Scala version: 2.11

Version: 0.2.1 ( a5a15c | zip | jar ) / Date: 2019-04-10 / License: MIT / Scala version: 2.11