A community index of third-party packages for Apache Spark.
Showing packages 1 - 17 out of 17 for search "tags:"Applications""
spark-notebook
Use Apache Spark straight from the Browser
@andypetrella / Latest release: v0.4.0 (2015-03-29) / Apache-2.0 / (2)
pipeline
Docker-based, End-to-End, Real-time, Advanced Analytics Big Data Reference Pipeline using Spark, Spark SQL, Spark Streaming, ML, MLlib, GraphX, Kafka, Cassandra, Redis, Apache Zeppelin, Spark-Notebook, iPython/Jupyter Notebook, Tableau, H2O Flow, Tachyon,
@fluxcapacitor / No release yet / (3)
spark-crossdata
SparkSQL extension as a library for Apache Spark extending and improving its capabilities for a data federation system.
@Stratio / Latest release: 1.4.0 (2016-07-06) / Apache-2.0 / (6)
spark-metrics
A library to expose Apache Spark's metrics system
@groupon / Latest release: 1.0 (2016-05-21) / BSD 3-Clause / (0)
cassandra-spark-akka-http-starter-kit
A REST Api for CRUD operations on Cassandra using Apache Spark
@shiv4nsh / No release yet / (0)
Optimus
Optimus is the missing library for cleansing (cleaning and much more) and pre-processing data in a distributed fashion with Apache Spark.
@ironmussa / Latest release: 1.1.0 (2017-10-25) / Apache-2.0 / (2)
structured-streaming-application
Structured Streaming is a reference application showing how to easily integrate structured streaming Apache Spark Structured Streaming, Apache Cassandra and Apache Kafka for fast, structured streaming computations on data.
@knoldus / Latest release: 0.1.0 (2018-01-05) / Apache-2.0 / (1)
spark-on-k8s-operator
Kubernetes operator for specifying and running Apache Spark applications idiomatically on Kubernetes.
@GoogleCloudPlatform / No release yet / (0)
spark-tools
Executable Apache Spark Tools: Format Converter & SQL Processor
@tupol / Latest release: 0.4.1-s_2.11 (2020-09-12) / MIT / (0)