A community index of third-party packages for Apache Spark.
Showing packages 301 - 350 out of 516
bahir:streaming-akka
Spark DStream connector for Akka
@apache / Latest release: 2.2.0 (2017-09-09) / Apache-2.0 / (0)
bahir:streaming-mqtt
Spark DStream connector for MQTT
@apache / Latest release: 2.2.0 (2017-09-09) / Apache-2.0 / (0)
bahir:sql-streaming-mqtt
Spark Structured Streaming data source for MQTT
@apache / Latest release: 2.2.0 (2017-09-09) / Apache-2.0 / (1)
bahir:streaming-twitter
Spark DStream connector for Twitter
@apache / Latest release: 2.2.0 (2017-09-09) / Apache-2.0 / (0)
bahir:streaming-zeromq
Spark DStream connector for ZeroMQ
@apache / Latest release: 2.2.0 (2017-09-09) / Apache-2.0 / (0)
spark-learning
Example code which can help in getting started with spark 2
@engineerpawan / Latest release: 1 (2016-12-28) / MIT / (1)
spark-graphx-twitter
An example of Spark and GraphX with Twitter as sample
@knoldus / No release yet / (0)
parquet-index
Spark SQL index for Parquet tables
@lightcopy / Latest release: 0.5.0-s_2.12 (2020-08-01) / Apache-2.0 / (1)
Twitter-Sentiment-Analyzer
Twitter Sentiment Analysis - PySpark
@DayneSorvisto / No release yet / (1)
spark-hadoopoffice-ds
A Spark datasource for the HadoopOffice library
@ZuInnoTe / Latest release: 1.7.0-s_2.13 (2022-10-29) / Apache-2.0 / (1)
cassandra-couchbase-transfer-plugin
A Spark Program that transfers data from Cassandra to Couchbase
@shiv4nsh / No release yet / (1)
spark-generic-connector
Generic Connector for Apache Spark
@alvsanand / Latest release: 0.2.0-spark_2x-s_2.11 (2017-01-17) / Apache-2.0 / (1)
graphx-overlapping-community
Graphx Overlapping Community Detection
@bhardwajank / Latest release: 1.0 (2017-01-23) / Apache-2.0 / (0)
spark-datetime-lite
A lightweight, dependency-free package for extending Spark's date and timestamp operations, focused on time periods.
@danielpes / Latest release: 0.2.0-s_2.11 (2018-01-30) / Apache-2.0 / (1)
spark-IS-streaming
A Nearest Neighbor Classifier for High-Speed Big Data Streams with Instance Selection
@sramirez / Latest release: 0.8 (2017-01-27) / Apache-2.0 / (0)
memsql-spark-connector
A connector for MemSQL and Spark
@memsql / Latest release: 4.1.1-spark-3.3.0 (2022-07-14) / Apache-2.0 / (6)
imllib-spark
supplementation machine learning algorithms for Spark
@Intel-bigdata / Latest release: 0.1 (2017-02-06) / Apache-2.0 / (0)
TensorFlowOnSpark
TensorFlowOnSpark brings TensorFlow programs onto Apache Spark clusters
@yahoo / No release yet / (0)
spark-tensorflow-connector
Spark Tensorflow Connector
@tapanalyticstoolkit / Latest release: 1.0.0-s_2.11 (2017-02-21) / Apache-2.0 / (3)
spark-bigquery
Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
@samelamin / Latest release: 0.2.5 (2018-08-08) / Apache-2.0 / (1)
spark-api
Akka-based library to help you manage your Apache Spark jobs
@JoaoVasques / No release yet / (0)
SMACK_Tweets
Its a small application which collects tweets from twitter and process it with spark streaming and ingest it in cassandra ring
@phalodi / No release yet / (0)
spark-scheduler
Library to schedule spark jobs related to time interval
@phalodi / No release yet / (0)
spark-marketo
Spark Marketo Connector
@springml / Latest release: 1.1.0 (2017-03-10) / Apache-2.0 / (1)
spark-daria
Open source Spark transformations and functions
@mrpowers / Latest release: 0.37.1-s_2.12 (2020-03-27) / MIT / (1)
sandpiper
Implementation of the Loopy Belief Propagation algorithm for Apache Spark
@HewlettPackard / No release yet / (0)
spark-streaming-with-google-cloud-example
an example of integrating Spark Streaming with Google Pub/Sub and Google Datastore
@yu-iskw / No release yet / (0)
spark-tree-plotting
A simple tool for plotting Spark ML's Decision Trees
@julioasotodv / Latest release: 0.2 (2017-03-25) / MIT / (1)
NoiseFramework
Noise Framework for removing noisy instances with three algorithms: HME-BD, HTE-BD and ENN.
@djgarcia / Latest release: 1.2 (2018-04-18) / Apache-2.0 / (2)
Lambda-Arch-Spark
Implementation of Lambda Architecture with Spark, Kafka, Cassandra and Twitter Streaming API
@knoldus / No release yet / (1)
spark-sas7bdat
Remove the splittable part
@chhokarpardeep / Latest release: 1.1.7-s_2.11 (2017-04-04) / GPL-3.0 / (0)
spark-fast-tests
Fast Apache Spark testing framework
@MrPowers / Latest release: 0.21.1-s_2.12 (2020-04-07) / MIT / (1)
algorithmia-scala
Scala Client for Algorithmia Algorithms and Data API
@algorithmiaio / Latest release: 0.9.2 (2017-05-24) / Apache-2.0 / (1)
spark-deep-learning
Deep Learning Pipelines for Apache Spark
@databricks / Latest release: 1.5.0-spark2.4-s_2.11 (2019-01-25) / Apache-2.0 / (3)
pyspark-cassandra
Python port of the awesome Datastax Spark Cassandra connector. Compatible w/ Spark 2.0+
@anguenot / Latest release: 2.4.1 (2022-08-03) / Apache-2.0 / (0)
azure-cosmosdb-spark
This project provides a client library that allows Azure Cosmos DB to act as an input source or output sink for Spark jobs.
@Azure / No release yet / (0)
spark-crowd
A package for dealing with crowdsourced big data.
@enriquegrodrigo / Latest release: 0.2.0 (2018-10-21) / MIT / (0)