neo4j-spark-connector (homepage)

These are the beginnings / experiments of a Neo4j-Spark-Connector using the new binary protocol for Neo4j, Bolt.

This is a bi-directional datasource / graph integration for Neo4j <-> Spark which supports RRDs, DataFrames, GraphX and GraphFrames
It uses the binary protocol of Neo4j 3.0 for fast data transfer and wherever possible bytecode compiled Cypher queries for fast execution


Tags

  • 1|database
  • 1|data source
  • 1|graph

How to

Include this package in your Spark Applications using:

spark-shell, pyspark, or spark-submit

> $SPARK_HOME/bin/spark-shell --packages neo4j-contrib:neo4j-spark-connector:2.0.0-M2

sbt

If you use the sbt-spark-package plugin, in your sbt build file, add:

spDependencies += "neo4j-contrib/neo4j-spark-connector:2.0.0-M2"

Otherwise,

resolvers += "Spark Packages Repo" at "http://dl.bintray.com/spark-packages/maven"

libraryDependencies += "neo4j-contrib" % "neo4j-spark-connector" % "2.0.0-M2"

Maven

In your pom.xml, add:
<dependencies>
  <!-- list of dependencies -->
  <dependency>
    <groupId>neo4j-contrib</groupId>
    <artifactId>neo4j-spark-connector</artifactId>
    <version>2.0.0-M2</version>
  </dependency>
</dependencies>
<repositories>
  <!-- list of other repositories -->
  <repository>
    <id>SparkPackagesRepo</id>
    <url>http://dl.bintray.com/spark-packages/maven</url>
  </repository>
</repositories>

Releases

Version: 2.0.0-M2 ( 66fff9 | zip | jar ) / Date: 2016-09-12 / License: Apache-2.0

Version: 2.0.0-M1 ( 395d0b | zip | jar ) / Date: 2016-08-17 / License: Apache-2.0

Version: 1.0.0-RC1 ( 6268ff | zip | jar ) / Date: 2016-06-07 / License: Apache-2.0