A community index of third-party packages for Apache Spark.

Showing packages 1 - 50 out of 68 for search "tags:"Streaming""

High Performance Kafka Consumer for Spark Streaming.Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper. No Data-loss. No dependency on HDFS and WAL. In-built PID rate controller. Support Message Handler . Offset Lag checker

@dibbhatt / Latest release: 2.1.0 (2019-08-28) / Apache-2.0 / (7)

  • 4|streaming
  • 3|kafka


Pig on Apache Spark

@sigmoidanalytics / No release yet / (9)

  • 1|streaming
  • 1|spark
  • 1|pig


KillrWeather is a reference application (in progress) showing how to easily leverage and integrate Apache Spark, Apache Cassandra, and Apache Kafka for fast, streaming computations on time series data in asynchronous Akka event-driven environments.

@killrweather / No release yet / (1)

  • 1|streaming


Streaming CEP Engine Powered by Spark Streaming & Siddhi

@Stratio / Latest release: 0.6.2 (2015-01-14) / Apache-2.0 / (19)

  • 5|spark streaming
  • 5|cep
  • 4|complex event processing


Visualize streaming machine learning in Spark

@freeman-lab / No release yet / (1)

  • 1|streaming
  • 1|machine learning
  • 1|visualization


Apache Camel Streaming Consumer

@synsys / Latest release: 1.0.0 (2015-01-26) / Apache-2.0 / (0)

  • 1|streaming
  • 1|consumer
  • 1|camel


Connect Spark to HBase for reading and writing data with ease

@nerdammer / Latest release: 1.0.3 (2016-04-20) / Apache-2.0 / (3)

  • 1|streaming
  • 1|hbase
  • 1|library


Base classes to use when writing tests with Spark

@holdenk / Latest release: 2.2.2_0.11.0 (2018-12-23) / Apache-2.0 / (10)

  • 3|testing
  • 1|streaming
  • 1|tools


Low level integration of Spark and Kafka

@tresata / Latest release: 0.6.0-s_2.10 (2015-11-13) / Apache-2.0 / (0)

  • 1|streaming


Connects Spark to Cassandra

@datastax / Latest release: 2.4.0-s_2.11 (2018-11-29) / Apache-2.0 / (14)

  • 3|spark
  • 3|cassandra
  • 2|nosql


Power BI API adapter for Apache Spark

@granturing / Latest release: 1.5.0_0.0.7 (2015-09-13) / Apache-2.0 / (0)

  • 2|streaming
  • 1|sql
  • 1|realtime


Spark Streaming, Machine Learning and meetup.com streaming API.

@actions / No release yet / (1)

  • 1|ml
  • 1|example
  • 1|streaming


Deprecated, please see couchbase/couchbase-spark-connector

@couchbaselabs / Latest release: 1.0.0 (2015-10-20) / Apache-2.0 / (1)

  • 1|streaming
  • 1|library
  • 1|sql


RabbitMQ Spark Streaming receiver

@Stratio / Latest release: 0.4.0 (2016-12-20) / Apache-2.0 / (10)

  • 4|streaming


Streaming Recommendation Engine using matrix factorization with user and product bias

@brkyvz / Latest release: 0.1.0 (2015-05-26) / Apache-2.0 / (2)

  • 1|streaming
  • 1|ml
  • 1|machine learning


Manipulate Apache Spark Streaming by SQL

@Intel-bigdata / No release yet / (1)

  • 1|streaming
  • 1|sql


An Apache Spark standalone application using the Spark API in Scala. The application uses Simple Build(SBT) for building the project.

@prabeesh / Latest release: 0.1.0 (2015-08-04) / Apache-2.0 / (1)

  • 1|streaming
  • 1|sbt
  • 1|scala


An Apache Spark utility for pulling Tweets from Gnip's PowerTrack in realtime

@knoldus / No release yet / (1)

  • 1|streaming
  • 1|data source
  • 1|scala


Infinispan Spark Connector

@infinispan / Latest release: 0.9 (2018-11-05) / Apache-2.0 / (0)

  • 1|streaming
  • 1|sql
  • 1|scala


Machine Learning over Twitter's stream. Using Apache Spark, Web Server and Lightning Graph server.

@giorgioinf / Latest release: 0.2.0 (2016-06-19) / GPL-3.0 / (0)

  • 1|ml
  • 1|example
  • 1|streaming


Spark on Aliyun, supporting interactions with Aliyun's base services.

@aliyun / No release yet / (1)

  • 1|streaming
  • 1|data source


EventHubs Receiver for Spark Streaming

@hdinsight / No release yet / (0)

  • 1|Azure
  • 1|streaming
  • 1|eventhubs


Docker-based, End-to-End, Real-time, Advanced Analytics Big Data Reference Pipeline using Spark, Spark SQL, Spark Streaming, ML, MLlib, GraphX, Kafka, Cassandra, Redis, Apache Zeppelin, Spark-Notebook, iPython/Jupyter Notebook, Tableau, H2O Flow, Tachyon,

@fluxcapacitor / No release yet / (3)

  • 2|streaming
  • 2|kafka
  • 1|machine learning


C# API for Apache Spark. (Package moved to http://spark-packages.org/package/Microsoft/Mobius)

@skaarthik / No release yet / (2)

  • 1|streaming
  • 1|examples
  • 1|sql


ScalaCheck for Spark

@juanrh / No release yet / (0)

  • 1|streaming
  • 1|testing
  • 1|tools


JMS spark receiver

@tbfenet / Latest release: 0.2.1-s_2.11 (2016-11-23) / Apache-2.0 / (0)

  • 2|streaming


The Official Couchbase Spark Connector

@couchbase / Latest release: 2.2.0 (2017-09-20) / Apache-2.0 / (2)

  • 1|streaming
  • 1|library
  • 1|sql


Spark-lever is based on Spark Streaming,it is a proactive capability-aware load balancing system for batch stream processing on heterogeneous clusters.

@trueyao / No release yet / (2)

  • 2|streaming


Connects Spark to Hazelcast

@erenavsarogullari / Latest release: 1.0.0-s_2.11 (2016-03-07) / Apache-2.0 / (0)

  • 1|streaming
  • 1|spark
  • 1|scala


Adaptation of the CluStream method in Spark

@obackhoff / Latest release: 0.6.5 (2016-03-31) / Apache-2.0 / (1)

  • 1|clustering
  • 1|streaming
  • 1|machine learning


The official Riak Spark Connector for Apache Spark with Riak TS and Riak KV

@basho / Latest release: 1.6.3 (2017-03-17) / Apache-2.0 / (2)

  • 3|python
  • 3|riak
  • 3|data source


SparkSQL extension as a library for Apache Spark extending and improving its capabilities for a data federation system.

@Stratio / Latest release: 1.4.0 (2016-07-06) / Apache-2.0 / (6)

  • 3|SparkSQL
  • 3|sql
  • 2|library


C# API for Apache Spark

@Microsoft / Latest release: 1.6.100 (2016-05-02) / MIT / (2)

  • 1|streaming
  • 1|examples
  • 1|sql


Spark Receiver for SQL or NoSQL Databases like Cassandra, MongoDB, Elasticsearch or JDBC

@Stratio / Latest release: 0.1.0 (2016-06-30) / Apache-2.0 / (1)

  • 1|streaming
  • 1|library
  • 1|sql


Baryon is a library for building Spark Streaming applications that consume data from Kafka.

@groupon / Latest release: 1.0 (2016-07-29) / BSD 3-Clause / (0)

  • 1|streaming
  • 1|tools
  • 1|library


Mezzanine is a library built on Spark Streaming used to consume data from Kafka and store it into Hadoop.

@groupon / Latest release: 1.0 (2016-07-29) / BSD 3-Clause / (0)

  • 1|streaming
  • 1|tools
  • 1|library


Write your RDDs and DStreams to Kafka seamlessly

@BenFradet / Latest release: 0.4.0 (2017-07-22) / Apache-2.0 / (0)

  • 1|streaming
  • 1|data source


Rich Spark adds more to Apache Spark

@mashin-io / No release yet / (0)

  • 1|ml
  • 1|library
  • 1|streaming


Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream processing), scalable (consumes messges at Spark worker nodes), and is extremely reliable.

@jeoffreylim / No release yet / (0)

  • 1|streaming
  • 1|kafka


Stream Data analysis on IoT generated data via Apache spark

@shiv4nsh / No release yet / (0)

  • 1|streaming
  • 1|spark
  • 1|example


Spark DStream connector for Akka

@apache / Latest release: 2.2.0 (2017-09-09) / Apache-2.0 / (0)

  • 1|streaming


Spark DStream connector for MQTT

@apache / Latest release: 2.2.0 (2017-09-09) / Apache-2.0 / (0)

  • 1|python
  • 1|streaming
  • 1|pyspark


Spark Structured Streaming data source for MQTT

@apache / Latest release: 2.2.0 (2017-09-09) / Apache-2.0 / (1)

  • 1|streaming
  • 1|sql
  • 1|structured streaming


Spark DStream connector for Twitter

@apache / Latest release: 2.2.0 (2017-09-09) / Apache-2.0 / (0)

  • 1|streaming


Spark DStream connector for ZeroMQ

@apache / Latest release: 2.2.0 (2017-09-09) / Apache-2.0 / (0)

  • 1|streaming


Generic Connector for Apache Spark

@alvsanand / Latest release: 0.2.0-spark_2x-s_2.11 (2017-01-17) / Apache-2.0 / (1)

  • 1|streaming
  • 1|data source
  • 1|Google Cloud


A Nearest Neighbor Classifier for High-Speed Big Data Streams with Instance Selection

@sramirez / Latest release: 0.8 (2017-01-27) / Apache-2.0 / (0)

  • 1|streaming
  • 1|machine learning
  • 1|instance selection


Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.

@samelamin / Latest release: 0.2.5 (2018-08-08) / Apache-2.0 / (1)

  • 1|streaming
  • 1|sql
  • 1|core


Its a small application which collects tweets from twitter and process it with spark streaming and ingest it in cassandra ring

@phalodi / No release yet / (0)

  • 1|streaming
  • 1|example
  • 1|core


an example of integrating Spark Streaming with Google Pub/Sub and Google Datastore

@yu-iskw / No release yet / (0)

  • 1|streaming
  • 1|example