A community index of third-party packages for Apache Spark.

Showing packages 1 - 17 out of 17 for search "tags:"Applications""

REST job server for Spark

@spark-jobserver / No release yet / (3)

  • 1|application
  • 1|REST
  • 1|Mesos


Zeppelin, a web-based notebook that enables interactive data analytics.

@NFLabs / No release yet / (3)

  • 1|Applications
  • 1|notebook
  • 1|interactive


Use Apache Spark straight from the Browser

@andypetrella / Latest release: v0.4.0 (2015-03-29) / Apache-2.0 / (2)

  • 1|notebook
  • 1|charts
  • 1|interactive


Spark and Spark SQL integration for Succinct

@amplab / Latest release: 0.1.8 (2019-07-10) / Apache-2.0 / (1)

  • 1|application
  • 1|data source


Docker-based, End-to-End, Real-time, Advanced Analytics Big Data Reference Pipeline using Spark, Spark SQL, Spark Streaming, ML, MLlib, GraphX, Kafka, Cassandra, Redis, Apache Zeppelin, Spark-Notebook, iPython/Jupyter Notebook, Tableau, H2O Flow, Tachyon,

@fluxcapacitor / No release yet / (3)

  • 2|streaming
  • 2|kafka
  • 1|machine learning


Solr Dictionary Annotator (Microservice for Spark)

@elsevierlabs-os / No release yet / (0)

  • 1|application
  • 1|tools


Create composable data processing pipelines in Spark, and execute them on a cluster using simple Scala code

@springnz / No release yet / (0)

  • 1|application
  • 1|testing
  • 1|tools


Livy, a REST Spark Server for submitting jobs and code snippets

@cloudera / No release yet / (2)

  • 1|application
  • 1|REST
  • 1|interactive


SparkSQL extension as a library for Apache Spark extending and improving its capabilities for a data federation system.

@Stratio / Latest release: 1.4.0 (2016-07-06) / Apache-2.0 / (6)

  • 3|SparkSQL
  • 3|sql
  • 2|library


A library to expose Apache Spark's metrics system

@groupon / Latest release: 1.0 (2016-05-21) / BSD 3-Clause / (0)

  • 1|metrics
  • 1|application
  • 1|core


A REST Api for CRUD operations on Cassandra using Apache Spark

@shiv4nsh / No release yet / (0)

  • 1|application
  • 1|spark
  • 1|example


Spark GraphX library to detect causalities across time related events

@aamend / Latest release: 1.0 (2017-07-14) / Apache-2.0 / (1)

  • 1|application
  • 1|graph


Optimus is the missing library for cleansing (cleaning and much more) and pre-processing data in a distributed fashion with Apache Spark.

@ironmussa / Latest release: 1.1.0 (2017-10-25) / Apache-2.0 / (2)

  • 1|machine learning
  • 1|tools
  • 1|pyspark


Structured Streaming is a reference application showing how to easily integrate structured streaming Apache Spark Structured Streaming, Apache Cassandra and Apache Kafka for fast, structured streaming computations on data.

@knoldus / Latest release: 0.1.0 (2018-01-05) / Apache-2.0 / (1)

  • 1|application
  • 1|structured-streaming
  • 1|scala


Kubernetes operator for specifying and running Apache Spark applications idiomatically on Kubernetes.

@GoogleCloudPlatform / No release yet / (0)

  • 1|application
  • 1|Kubernetes


Executable Apache Spark Tools: Format Converter & SQL Processor

@tupol / Latest release: 0.4.1-s_2.11 (2020-09-12) / MIT / (0)

  • 1|streaming
  • 1|sql
  • 1|kafka


Rumble: JSONiq for Apache Spark

@RumbleDB / No release yet / (1)

  • 1|Applications
  • 1|tools
  • 1|nosql