pipeline

Docker-based, End-to-End, Real-time, Advanced Analytics Big Data Reference Pipeline using Spark, Spark SQL, Spark Streaming, ML, MLlib, GraphX, Kafka, Cassandra, Redis, Apache Zeppelin, Spark-Notebook, IPython/Jupyter Notebook, Tableau, H2O Flow, Tachyon.

See https://github.com/fluxcapacitor/pipeline/wiki for Setup Instructions.


Tags

  • 2|streaming
  • 2|kafka
  • 1|machine learning
  • 1|avro
  • 1|SparkSQL
  • 1|ipython
  • 1|spark streaming
  • 1|cassandra
  • 1|elasticsearch
  • 1|h2o
  • 1|csv
  • 1|tools
  • 1|examples
  • 1|big data
  • 1|mllib
  • 1|Applications
  • 1|deployment
  • 1|data sources
  • 1|hive
  • 1|graph
  • 1|json
  • 1|docker
  • 1|end to end
  • 1|pipeline
  • 1|graphx
  • 1|dataframes
  • 1|zeppelin
  • 1|spark-notebook
  • 1|jupyter
  • 1|redis
  • 1|tachyon
  • 1|logstash
  • 1|kibana
  • 1|ganglia
  • 1|hdfs
  • 1|parquet
  • 1|datasources api
  • 1|catalyst optimizer

How to

This package has no releases published in the Spark Packages repo and no Maven coordinates supplied. You may have to build it from source, or it may simply be a script. To use this Spark Package, follow the instructions in the README.

Releases

No releases yet.