spark-MDLP-discretization (homepage)

Spark implementation of Fayyad's discretizer based on Minimum Description Length Principle (MDLP)

@sramirez / (7)

This method implements Fayyad's discretizer based on Minimum Description Length Principle (MDLP) in order to treat non discrete datasets from a distributed perspective. It supports sparse data, parallel-processing of attributes, etc.


Tags

  • 2|mllib
  • 2|discretization
  • 1|machine learning
  • 1|MDLP
  • 1|Fayyad
  • 1|feature

How to

Include this package in your Spark Applications using:

spark-shell, pyspark, or spark-submit

> $SPARK_HOME/bin/spark-shell --packages sramirez:spark-MDLP-discretization:1.4.1

sbt

If you use the sbt-spark-package plugin, in your sbt build file, add:

spDependencies += "sramirez/spark-MDLP-discretization:1.4.1"

Otherwise,

resolvers += "Spark Packages Repo" at "https://repos.spark-packages.org/"

libraryDependencies += "sramirez" % "spark-MDLP-discretization" % "1.4.1"

Maven

In your pom.xml, add:
<dependencies>
  <!-- list of dependencies -->
  <dependency>
    <groupId>sramirez</groupId>
    <artifactId>spark-MDLP-discretization</artifactId>
    <version>1.4.1</version>
  </dependency>
</dependencies>
<repositories>
  <!-- list of other repositories -->
  <repository>
    <id>SparkPackagesRepo</id>
    <url>https://repos.spark-packages.org/</url>
  </repository>
</repositories>

Releases

Version: 1.4.1 ( 014b61 | zip | jar ) / Date: 2017-09-25 / License: Apache-2.0 / Scala version: 2.11

Version: 1.4.0 ( 4e6cb7 | zip | jar ) / Date: 2017-09-22 / License: Apache-2.0 / Scala version: 2.11

Version: 1.3 ( 2db9d5 | zip | jar ) / Date: 2017-05-24 / License: Apache-2.0 / Scala version: 2.11

Version: 1.2.1 ( 039aa0 | zip | jar ) / Date: 2016-04-26 / License: Apache-2.0 / Scala version: 2.10

Version: 1.2 ( 09c4f1 | zip | jar ) / Date: 2016-02-23 / License: Apache-2.0 / Scala version: 2.10

Version: 1.1 ( a1e776 | zip | jar ) / Date: 2015-12-04 / License: Apache-2.0 / Scala version: 2.10

Spark Scala/Java API compatibility: - 53% , - 100% , - 12% , - 61% , - 48% , - 61%

Version: 1.0 ( 6b2f9a | zip | jar ) / Date: 2015-07-17 / License: Apache-2.0 / Scala version: 2.10

Spark Scala/Java API compatibility: - 100% , - 91% , - 14% , - 100% , - 57%

Version: 0.1 ( 9ed6e8 | zip | jar ) / Date: 2015-04-27 / License: Apache-2.0 / Scala version: 2.10

Spark Scala/Java API compatibility: - 14% , - 100% , - 57% , - 91%