sparkboost

A distributed implementation of AdaBoost.MH and MP-Boost using Apache Spark

@tizfa

This repository contains a distributed implementation, based on Apache Spark, of the AdaBoost.MH and MP-Boost algorithms. MP-Boost is an improved variant of the well-known AdaBoost.MH machine learning algorithm. It improves on the original AdaBoost.MH by building classifiers that achieve remarkably better effectiveness at a very similar computational cost at build and classification time.
The software is open source and released under the terms of the Apache License, Version 2.0.
Please see https://github.com/tizfa/sparkboost for more details about the usage of the package.


Tags

  • machine learning
  • classification
  • adaboost
  • mp-boost
  • multiclass
  • multilabel
  • binary
  • adaboost.mh

How to

Include this package in your Spark Applications using:

spark-shell, pyspark, or spark-submit

> $SPARK_HOME/bin/spark-shell --packages tizfa:sparkboost:0.6
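
The same package coordinate should work with the other two launchers named in the heading, since `--packages` is a standard option of `pyspark` and `spark-submit` as well. The sketch below only assembles and prints the commands so they can be checked before running. The application jar, the main class, and the `/opt/spark` fallback path are hypothetical placeholders, not part of sparkboost.

```shell
# Package coordinate copied from the spark-shell example (groupId:artifactId:version).
SPARKBOOST_PKG="tizfa:sparkboost:0.6"

# Assumption: fall back to /opt/spark when SPARK_HOME is not set.
SPARK_BIN="${SPARK_HOME:-/opt/spark}/bin"

# Hypothetical application jar and main class, for illustration only.
APP_JAR="target/my-app.jar"
APP_MAIN="com.example.MyApp"

# pyspark takes the same --packages option as spark-shell.
PYSPARK_CMD="$SPARK_BIN/pyspark --packages $SPARKBOOST_PKG"

# spark-submit additionally needs the application jar and its entry point.
SUBMIT_CMD="$SPARK_BIN/spark-submit --packages $SPARKBOOST_PKG --class $APP_MAIN $APP_JAR"

# Print both commands for inspection instead of launching Spark.
echo "$PYSPARK_CMD"
echo "$SUBMIT_CMD"
```

Once the printed line looks right, it can be pasted into a terminal or run with `eval "$SUBMIT_CMD"`.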

sbt

If you use the sbt-spark-package plugin, in your sbt build file, add:

spDependencies += "tizfa/sparkboost:0.6"

Otherwise,

resolvers += "Spark Packages Repo" at "https://repos.spark-packages.org/"

libraryDependencies += "tizfa" % "sparkboost" % "0.6"

Maven

In your pom.xml, add:
<dependencies>
  <!-- list of dependencies -->
  <dependency>
    <groupId>tizfa</groupId>
    <artifactId>sparkboost</artifactId>
    <version>0.6</version>
  </dependency>
</dependencies>
<repositories>
  <!-- list of other repositories -->
  <repository>
    <id>SparkPackagesRepo</id>
    <url>https://repos.spark-packages.org/</url>
  </repository>
</repositories>

Releases

Version: 0.6 ( eae7b7 | zip | jar ) / Date: 2015-07-01 / License: Apache-2.0

Spark Scala/Java API compatibility: - 14% , - 84% , - 16% , - 100% , - 80%