SmartFiltering (homepage)
Smart Filtering framework for Big Data
@djgarcia / (2)
This framework implements four distance based Big Data preprocessing algorithms to remove noisy examples: ENN_BD, AllKNN_BD, NCNEdit_BD and RNG_BD filters, with special emphasis in their scalability and performance traits.
Tags
How to
Include this package in your Spark Applications using:
spark-shell, pyspark, or spark-submit
> $SPARK_HOME/bin/spark-shell --packages djgarcia:SmartFiltering:1.0
sbt
If you use the sbt-spark-package plugin, in your sbt build file, add:
spDependencies += "djgarcia/SmartFiltering:1.0"
Otherwise,
resolvers += "Spark Packages Repo" at "https://repos.spark-packages.org/" libraryDependencies += "djgarcia" % "SmartFiltering" % "1.0"
Maven
In your pom.xml, add:<dependencies> <!-- list of dependencies --> <dependency> <groupId>djgarcia</groupId> <artifactId>SmartFiltering</artifactId> <version>1.0</version> </dependency> </dependencies> <repositories> <!-- list of other repositories --> <repository> <id>SparkPackagesRepo</id> <url>https://repos.spark-packages.org/</url> </repository> </repositories>
Releases
Version: 1.0 ( 4f6037 | zip | jar ) / Date: 2018-04-09 / License: Apache-2.0 / Scala version: 2.11