spark-stringmetric

Spark functions to run popular phonetic and string matching algorithms

Includes similarity metrics such as Dice / Sorensen, Hamming, Jaro, and Jaccard, as well as phonetic algorithms such as Metaphone, NYSIIS, and Soundex.

How to

Include this package in your Spark Applications using:

spark-shell, pyspark, or spark-submit

> $SPARK_HOME/bin/spark-shell --packages MrPowers:spark-stringmetric:0.2.0


If you use the sbt-spark-package plugin, in your sbt build file, add:

spDependencies += "MrPowers/spark-stringmetric:0.2.0"


Otherwise, add:

libraryDependencies += "MrPowers" % "spark-stringmetric" % "0.2.0"


In your pom.xml, add:
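
The Maven snippet is missing from this page. As a minimal sketch, assuming the Maven coordinates match the sbt libraryDependencies line (group MrPowers, artifact spark-stringmetric, version 0.2.0), the dependency entry would look roughly like this. The repository that hosts the artifact is not stated here, so no repository entry is shown; Maven will need one configured before it can resolve the dependency.

```xml
<dependencies>
  <!-- Assumption: coordinates copied from the sbt libraryDependencies line on this page -->
  <dependency>
    <groupId>MrPowers</groupId>
    <artifactId>spark-stringmetric</artifactId>
    <version>0.2.0</version>
  </dependency>
</dependencies>
```
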

Releases

Version: 0.2.0 (commit bf5419) / Date: 2019-01-27 / License: Apache-2.0 / Scala version: 2.11

Version: 2.2.0_0.1.0 (commit 9dfae1) / Date: 2017-09-12 / License: Apache-2.0 / Scala version: 2.11