PCARD (homepage)

PCARD ensemble method. Ensemble of decision trees based on Random Discretization and Principal Components Analysis.

@djgg / (3)

This method implements the PCARD ensemble algorithm. PCARD ensemble method is a distributed upgrade of the method presented by A. Ahmad. The algorithm performs Random Discretization and Principal Components Analysis to the input data, then joins the results and trains a decision tree on it.


Tags

  • 1|machine learning
  • 1|mllib
  • 1|ensemble

How to

Include this package in your Spark Applications using:

spark-shell, pyspark, or spark-submit

> $SPARK_HOME/bin/spark-shell --packages djgg:PCARD:1.3

sbt

If you use the sbt-spark-package plugin, in your sbt build file, add:

spDependencies += "djgg/PCARD:1.3"

Otherwise,

resolvers += "Spark Packages Repo" at "https://repos.spark-packages.org/"

libraryDependencies += "djgg" % "PCARD" % "1.3"

Maven

In your pom.xml, add:
<dependencies>
  <!-- list of dependencies -->
  <dependency>
    <groupId>djgg</groupId>
    <artifactId>PCARD</artifactId>
    <version>1.3</version>
  </dependency>
</dependencies>
<repositories>
  <!-- list of other repositories -->
  <repository>
    <id>SparkPackagesRepo</id>
    <url>https://repos.spark-packages.org/</url>
  </repository>
</repositories>

Releases

Version: 1.3 ( d6d271 | zip | jar ) / Date: 2018-04-05 / License: Apache-2.0 / Scala version: 2.11

Version: 1.2 ( 9aca69 | zip | jar ) / Date: 2018-04-05 / License: Apache-2.0 / Scala version: 2.11

Version: 1.1 ( e1b27a | zip | jar ) / Date: 2016-03-02 / License: Apache-2.0 / Scala version: 2.10

Version: 1.0 ( 75cc75 | zip | jar ) / Date: 2016-02-04 / License: Apache-2.0 / Scala version: 2.10

Spark Scala/Java API compatibility: - 8% , - 46% , - 16% , - 96% , - 37% , - 100%