Spark-CluStream (homepage)

Adaptation of the CluStream method in Spark

@obackhoff / (1)

Algorithm for clustering data streams
Includes:
Online microclustering class
Offline macroclustering class


Tags

  • 1|streaming
  • 1|machine learning
  • 1|clustering
  • 1|clustream

How to

Include this package in your Spark Applications using:

spark-shell, pyspark, or spark-submit

> $SPARK_HOME/bin/spark-shell --packages obackhoff:Spark-CluStream:0.6.5

sbt

If you use the sbt-spark-package plugin, in your sbt build file, add:

spDependencies += "obackhoff/Spark-CluStream:0.6.5"

Otherwise,

resolvers += "Spark Packages Repo" at "http://dl.bintray.com/spark-packages/maven"

libraryDependencies += "obackhoff" % "Spark-CluStream" % "0.6.5"

Maven

In your pom.xml, add:
<dependencies>
  <!-- list of dependencies -->
  <dependency>
    <groupId>obackhoff</groupId>
    <artifactId>Spark-CluStream</artifactId>
    <version>0.6.5</version>
  </dependency>
</dependencies>
<repositories>
  <!-- list of other repositories -->
  <repository>
    <id>SparkPackagesRepo</id>
    <url>http://dl.bintray.com/spark-packages/maven</url>
  </repository>
</repositories>

Releases

Version: 0.6.5 ( 91c33c | zip | jar ) / Date: 2016-03-31 / License: Apache-2.0 / Scala version: 2.10

Version: 0.6 ( db961c | zip | jar ) / Date: 2016-03-26 / License: Apache-2.0 / Scala version: 2.10

Version: 0.5 ( aa644f | zip | jar ) / Date: 2016-03-21 / License: Apache-2.0 / Scala version: 2.10

Version: 0.4 ( 3def2c | zip | jar ) / Date: 2016-03-20 / License: Apache-2.0 / Scala version: 2.10

Version: 0.3 ( 3def2c | zip | jar ) / Date: 2016-03-20 / License: Apache-2.0 / Scala version: 2.10

Version: 0.2 ( d065c4 | zip | jar ) / Date: 2016-03-18 / License: Apache-2.0 / Scala version: 2.10

Version: 0.1 ( 766d06 | zip | jar ) / Date: 2016-03-13 / License: Apache-2.0 / Scala version: 2.10