A community index of third-party packages for Apache Spark.

Showing packages 1 - 16 out of 16 for search "tags:"Core""

An efficient updatable key-value store for Apache Spark

@amplab / Latest release: 0.4.0 (2017-01-11) / Apache-2.0 / (1)

  • 2|core
  • 2|kv
  • 1|anothertag


Secondary sort and streaming reduce for Spark

@tresata / Latest release: 0.4.0-s_2.11 (2015-11-03) / Apache-2.0 / (0)

  • 1|core


Official integration between Apache Spark and Elasticsearch real-time search and analytics

@elastic / Latest release: 5.3.1 (2017-04-21) / Apache-2.0 / (3)

  • 1|search
  • 1|elasticsearch
  • 1|sql


Joins for skewed datasets in Spark

@tresata / Latest release: 0.2.0-s_2.10 (2015-11-13) / Apache-2.0 / (0)

  • 1|core


Spark Modularized View

@TresAmigosSD / No release yet / (0)

  • 1|core
  • 1|sql


C# API for Apache Spark. (Package moved to http://spark-packages.org/package/Microsoft/Mobius)

@skaarthik / No release yet / (2)

  • 1|streaming
  • 1|examples
  • 1|sql


This tutorial provides a quick introduction to using Spark

@rklick-solutions / No release yet / (2)

  • 2|RDD
  • 2|spark
  • 2|Spark SQL


SparkSQL extension as a library for Apache Spark extending and improving its capabilities for a data federation system.

@Stratio / Latest release: 1.4.0 (2016-07-06) / Apache-2.0 / (6)

  • 3|SparkSQL
  • 3|sql
  • 2|library


C# API for Apache Spark

@Microsoft / Latest release: 1.6.100 (2016-05-02) / MIT / (2)

  • 1|streaming
  • 1|examples
  • 1|sql


A library to expose Apache Spark's metrics system

@groupon / Latest release: 1.0 (2016-05-21) / BSD 3-Clause / (0)

  • 1|metrics
  • 1|application
  • 1|core


Rich Spark adds more to Apache Spark

@mashin-io / No release yet / (0)

  • 1|ml
  • 1|library
  • 1|streaming


Robust and scalable join operators using sort-merge algorithm (high data skew, low cardinality, etc)

@hindog / Latest release: 2.0.1 (2017-04-04) / Apache-2.0 / (0)

  • 1|core


Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.

@samelamin / Latest release: 0.2.5 (2018-08-08) / Apache-2.0 / (1)

  • 1|streaming
  • 1|sql
  • 1|core


Its a small application which collects tweets from twitter and process it with spark streaming and ingest it in cassandra ring

@phalodi / No release yet / (0)

  • 1|streaming
  • 1|example
  • 1|core


A new scheduler being aware of tasks' size and nodes' capability for spark streaming

@u2009cf / Latest release: 1.0.0 (2017-08-14) / Apache-2.0 / (1)

  • 1|streaming
  • 1|scheduler
  • 1|core


A library that provides useful extensions to Apache Spark and PySpark.

@G-Research / Latest release: 2.10.0-3.5 (2023-10-07) / Apache-2.0 / (1)

  • 1|core
  • 1|pyspark