Greedy-K-means

Greedy-K-means (homepage)

Greedy K-means Spark Package in Python

Greedy K-means is a variant of the classical K-means, which aims to handle the sensitivity of K-means initialization. Based on the Greedy K-means algorithm (Likas, A., Vlassis, N., & Verbeek, J. J. (2003). The global k-means clustering algorithm. Pattern recognition, 36(2), 451-461), we implement a fast version of Greedy K-means with 59 sampling strategy. Therefore it not only enjoys the theoretical guarantee, but also outperforms other initialization methods in a plenty of data sets.

Tags (No tags yet, login to add one. )

How to

This package doesn't have any releases published in the Spark Packages repo, or with maven coordinates supplied. You may have to build this package from source, or it may simply be a script. To use this Spark Package, please follow the instructions in the README.

Releases

No releases yet.