A distributed implementation of AdaBoost.MH and MP-Boost using Apache Spark
@tizfa / (0)
This repository contains a distributed implementation based on Apache Spark of AdaBoost.MH and MP-Boost algorithms. MP-Boost is an improved variant of the well known AdaBoost.MH machine learning algorithm. MP-Boost improves original AdaBoost.MH by building classifiers which allows to obtain remarkably better effectiveness and a very similar computational cost at build/classification time.
The software is open source and released under the terms of the Apache License, Version 2.0.
Please see https://github.com/tizfa/sparkboost for more details about the usage of the package.
Include this package in your Spark Applications using:
spark-shell, pyspark, or spark-submit
> $SPARK_HOME/bin/spark-shell --packages tizfa:sparkboost:0.6
If you use the sbt-spark-package plugin, in your sbt build file, add:
spDependencies += "tizfa/sparkboost:0.6"
resolvers += "Spark Packages Repo" at "http://dl.bintray.com/spark-packages/maven" libraryDependencies += "tizfa" % "sparkboost" % "0.6"
MavenIn your pom.xml, add:
<dependencies> <!-- list of dependencies --> <dependency> <groupId>tizfa</groupId> <artifactId>sparkboost</artifactId> <version>0.6</version> </dependency> </dependencies> <repositories> <!-- list of other repositories --> <repository> <id>SparkPackagesRepo</id> <url>http://dl.bintray.com/spark-packages/maven</url> </repository> </repositories>