A community index of third-party packages for Apache Spark.
Showing packages 1 - 16 out of 16 for search 'tags:"Core"'
spark-indexedrdd
An efficient updatable key-value store for Apache Spark
@amplab / Latest release: 0.4.0 (2017-01-11) / Apache-2.0 / (1)
spark-sorted
Secondary sort and streaming reduce for Spark
@tresata / Latest release: 0.4.0-s_2.11 (2015-11-03) / Apache-2.0 / (0)
elasticsearch-hadoop
Official integration between Apache Spark and Elasticsearch real-time search and analytics
@elastic / Latest release: 5.3.1 (2017-04-21) / Apache-2.0 / (3)
spark-skewjoin
Joins for skewed datasets in Spark
@tresata / Latest release: 0.2.0-s_2.10 (2015-11-13) / Apache-2.0 / (0)
SparkCLR
C# API for Apache Spark. (Package moved to http://spark-packages.org/package/Microsoft/Mobius)
@skaarthik / No release yet / (2)
spark-tutorial
This tutorial provides a quick introduction to using Spark
@rklick-solutions / No release yet / (2)
spark-crossdata
A Spark SQL extension library for Apache Spark that extends and improves its capabilities for data federation.
@Stratio / Latest release: 1.4.0 (2016-07-06) / Apache-2.0 / (6)
spark-metrics
A library to expose Apache Spark's metrics system
@groupon / Latest release: 1.0 (2016-05-21) / BSD 3-Clause / (0)
spark-mergejoin
Robust and scalable join operators using a sort-merge algorithm (high data skew, low cardinality, etc.)
@hindog / Latest release: 2.0.1 (2017-04-04) / Apache-2.0 / (0)
spark-bigquery
Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
@samelamin / Latest release: 0.2.5 (2018-08-08) / Apache-2.0 / (1)
SMACK_Tweets
A small application that collects tweets from Twitter, processes them with Spark Streaming, and ingests them into a Cassandra ring
@phalodi / No release yet / (0)
spark-radar
A new scheduler for Spark Streaming that is aware of task size and node capability
@u2009cf / Latest release: 1.0.0 (2017-08-14) / Apache-2.0 / (1)
spark-extension
A library that provides useful extensions to Apache Spark and PySpark.
@G-Research / Latest release: 2.10.0-3.5 (2023-10-07) / Apache-2.0 / (1)