spark-wilcoxon

Compute Wilcoxon-Mann-Whitney rank sum statistic in Apache Spark

spark-wilcoxon computes the scaled Wilcoxon-Mann-Whitney rank sum statistic (also known as the Mann-Whitney U statistic or the Wilcoxon rank-sum statistic) for data that fall into two classes.

The scaled rank sum statistic is equivalent to the area under the ROC curve when the data are treated as scores for the binary classification problem of distinguishing class 1 from class 0. As such, it is a fast, simple way to assess how relevant the data are to that problem. Scaled values near 1 or 0 indicate greater relevance (1 indicates positive correlation with class 1, and 0 indicates negative correlation), while values near 1/2 indicate irrelevance.
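
To make the equivalence concrete, here is a minimal sketch of the scaled statistic in plain Python. It is not this package's API or its Spark implementation. The function name scaled_rank_sum and its arguments are hypothetical, chosen only for illustration. The sketch assumes labels are 0 or 1 and gives tied scores their average rank.

```python
def scaled_rank_sum(scores, labels):
    """Scaled Wilcoxon-Mann-Whitney rank sum for class 1 (hypothetical helper).

    Returns a value in [0, 1] that equals the area under the ROC curve
    when `scores` are used to separate class 1 from class 0.
    """
    pairs = sorted(zip(scores, labels))
    n = len(pairs)
    n1 = sum(1 for _, label in pairs if label == 1)
    n0 = n - n1
    if n1 == 0 or n0 == 0:
        raise ValueError("both classes must be present")

    rank_sum_1 = 0.0
    i = 0
    while i < n:
        # Find the run of tied scores starting at position i.
        j = i
        while j < n and pairs[j][0] == pairs[i][0]:
            j += 1
        # The run occupies 1-based ranks i+1 .. j; ties share the average rank.
        avg_rank = (i + 1 + j) / 2.0
        ones_in_run = sum(1 for k in range(i, j) if pairs[k][1] == 1)
        rank_sum_1 += avg_rank * ones_in_run
        i = j

    # Subtract the smallest possible rank sum for class 1, then scale
    # by the number of (class 1, class 0) pairs.
    u = rank_sum_1 - n1 * (n1 + 1) / 2.0
    return u / (n1 * n0)


print(scaled_rank_sum([0.1, 0.2, 0.8, 0.9], [0, 0, 1, 1]))  # 1.0
print(scaled_rank_sum([0.1, 0.2, 0.8, 0.9], [1, 1, 0, 0]))  # 0.0
print(scaled_rank_sum([0.5, 0.5, 0.5, 0.5], [0, 1, 0, 1]))  # 0.5
```

The three calls show the extremes described above: perfectly separated scores give 1 or 0 depending on direction, and scores that carry no information about the class give 1/2.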


How to

This package has no releases published in the Spark Packages repo and no Maven coordinates supplied. You may have to build it from source, or it may simply be a script. To use this Spark Package, follow the instructions in the README.

Releases

No releases yet.