Graphx Overlapping Community Detection
@bhardwajank / (0)
Graphx Overlapping community detection is a spark package thats finds the overlapping communities for large networks using Graphx apis.It is based on label propagation algorithm but can detect communities that overlap.
This is an implementation of 'Finding overlapping communities in networks by label propagation(http://iopscience.iop.org/article/10.1088/1367-2630/12/10/103018/meta)'
// find the overlapping communities with maxiteration 5 and max noOfCommunities per node 4
val overlapCommunities = OverlappingCommunityDetection.run(graph,5,4)
API docs can be found at 'https://bhardwajank.github.io/graphx-overlapping-community/api/latest/#org.apache.spark.graphx.lib.OverlappingCommunityDetection$'
Include this package in your Spark Applications using:
spark-shell, pyspark, or spark-submit
> $SPARK_HOME/bin/spark-shell --packages bhardwajank:graphx-overlapping-community:1.0
If you use the sbt-spark-package plugin, in your sbt build file, add:
spDependencies += "bhardwajank/graphx-overlapping-community:1.0"
resolvers += "Spark Packages Repo" at "http://dl.bintray.com/spark-packages/maven" libraryDependencies += "bhardwajank" % "graphx-overlapping-community" % "1.0"
MavenIn your pom.xml, add:
<dependencies> <!-- list of dependencies --> <dependency> <groupId>bhardwajank</groupId> <artifactId>graphx-overlapping-community</artifactId> <version>1.0</version> </dependency> </dependencies> <repositories> <!-- list of other repositories --> <repository> <id>SparkPackagesRepo</id> <url>http://dl.bintray.com/spark-packages/maven</url> </repository> </repositories>