spark-stemming (homepage)

Spark MLlib wrapper around Snowball stemming

@master / (0)

Snowball is a small string processing language designed for creating stemming algorithms for use in Information Retrieval. This package allows to use it as a part of Spark ML Pipeline API.


Tags

  • 1|machine learning

How to

Include this package in your Spark Applications using:

spark-shell, pyspark, or spark-submit

> $SPARK_HOME/bin/spark-shell --packages com.github.master:spark-stemming_2.10:0.2.1

sbt

If you use the sbt-spark-package plugin, in your sbt build file, add:

spDependencies += "master/spark-stemming:0.2.1"

Otherwise,

libraryDependencies += "com.github.master" % "spark-stemming_2.10" % "0.2.1"

Maven

In your pom.xml, add:
<dependencies>
  <!-- list of dependencies -->
  <dependency>
    <groupId>com.github.master</groupId>
    <artifactId>spark-stemming_2.10</artifactId>
    <version>0.2.1</version>
  </dependency>
</dependencies>

Releases

Version: 0.2.1 ( c5345d | zip | jar ) / Date: 2018-11-28 / License: BSD 2-Clause / Scala version: 2.10

Version: 0.2.0 ( dce4ca | zip | jar ) / Date: 2017-12-06 / License: BSD 2-Clause / Scala version: 2.10

Version: 0.2.0-s_2.10 ( 27c64c | zip | jar ) / Date: 2017-12-06 / License: BSD 2-Clause / Scala version: 2.10

Version: 0.1.1 ( 58076d | zip | jar ) / Date: 2016-03-17 / License: BSD 2-Clause / Scala version: 2.10