pyspark-cassandra

Python port of the awesome DataStax Spark Cassandra connector. Compatible with Spark 2.0+.

@anguenot

This module provides Python support for building Apache Spark Resilient Distributed Datasets (RDDs) from Apache Cassandra CQL rows. It uses the DataStax Spark Cassandra connector (https://github.com/datastax/spark-cassandra-connector) within PySpark, both in the interactive shell and in Python programs submitted with spark-submit.
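
As a rough illustration of what reading CQL rows into an RDD can look like, here is a minimal sketch. It assumes the package exposes a CassandraSparkContext class with a cassandraTable(keyspace, table) method, as recalled from the project's README, so check that README before relying on it. The helper names cassandra_conf and count_rows, the default host, and the application name are hypothetical. spark.cassandra.connection.host is the connector's setting for the Cassandra contact point.

```python
def cassandra_conf(host, app_name="pyspark-cassandra-sketch"):
    """Return the Spark settings as a plain dict (no Spark import needed)."""
    return {
        "spark.app.name": app_name,
        # Connector property naming the Cassandra node to contact.
        "spark.cassandra.connection.host": host,
    }


def count_rows(keyspace, table, host="127.0.0.1"):
    """Count the rows of one Cassandra table through an RDD."""
    # Imported lazily so this file can be loaded without Spark installed.
    from pyspark import SparkConf
    # Assumed entry point of the package; verify against the project README.
    from pyspark_cassandra import CassandraSparkContext

    conf = SparkConf().setAll(list(cassandra_conf(host).items()))
    sc = CassandraSparkContext(conf=conf)
    try:
        # cassandraTable is assumed to return an RDD of CQL rows.
        return sc.cassandraTable(keyspace, table).count()
    finally:
        sc.stop()
```

A script containing this sketch would be launched with spark-submit and the package on the classpath, for example by calling count_rows("my_keyspace", "my_table"), where both names are placeholders.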


Tags

  • python
  • cassandra
  • nosql
  • pyspark
  • cql

How to

Include this package in your Spark Applications using:

spark-shell, pyspark, or spark-submit

> $SPARK_HOME/bin/spark-shell --packages anguenot:pyspark-cassandra:0.9.0
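
The --packages value above follows the group:artifact:version coordinate form. For scripted launches, the same command line can be assembled programmatically; the sketch below is one way to do it in Python. The helper names package_coordinate and launcher_command and the /opt/spark path are hypothetical and used only for illustration.

```python
import os


def package_coordinate(group, artifact, version):
    """Join the three parts into the group:artifact:version form."""
    return ":".join([group, artifact, version])


def launcher_command(tool, coordinate, spark_home=None, extra_args=()):
    """Build the argument list for spark-shell, pyspark, or spark-submit."""
    if tool not in ("spark-shell", "pyspark", "spark-submit"):
        raise ValueError("unsupported launcher: %s" % tool)
    # Fall back to the SPARK_HOME environment variable, then to PATH lookup.
    home = spark_home or os.environ.get("SPARK_HOME", "")
    executable = os.path.join(home, "bin", tool) if home else tool
    return [executable, "--packages", coordinate] + list(extra_args)


coordinate = package_coordinate("anguenot", "pyspark-cassandra", "0.9.0")
command = launcher_command("pyspark", coordinate, spark_home="/opt/spark")
print(" ".join(command))
```

The resulting list can be passed to subprocess.run(command) as is. Keeping it as a list rather than a single string avoids shell quoting problems when extra arguments contain spaces.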

sbt

If you use the sbt-spark-package plugin, in your sbt build file, add:

spDependencies += "anguenot/pyspark-cassandra:0.9.0"

Otherwise, add the resolver and the dependency directly:

resolvers += "Spark Packages Repo" at "http://dl.bintray.com/spark-packages/maven"

libraryDependencies += "anguenot" % "pyspark-cassandra" % "0.9.0"

Maven

In your pom.xml, add:
<dependencies>
  <!-- list of dependencies -->
  <dependency>
    <groupId>anguenot</groupId>
    <artifactId>pyspark-cassandra</artifactId>
    <version>0.9.0</version>
  </dependency>
</dependencies>
<repositories>
  <!-- list of other repositories -->
  <repository>
    <id>SparkPackagesRepo</id>
    <url>http://dl.bintray.com/spark-packages/maven</url>
  </repository>
</repositories>

Releases

Version: 0.9.0 ( d17e5b | zip | jar ) / Date: 2018-06-08 / License: Apache-2.0 / Scala version: 2.11

Version: 0.8.0 ( 47a57b | zip | jar ) / Date: 2018-06-01 / License: Apache-2.0 / Scala version: 2.11

Version: 0.7.0 ( dccc87 | zip | jar ) / Date: 2017-12-12 / License: Apache-2.0 / Scala version: 2.11

Version: 0.6.0 ( 408cf3 | zip | jar ) / Date: 2017-10-05 / License: Apache-2.0 / Scala version: 2.11

Version: 0.5.0 ( b284da | zip | jar ) / Date: 2017-06-19 / License: Apache-2.0 / Scala version: 2.11

Version: 0.4.0 ( ef0a31 | zip | jar ) / Date: 2017-06-09 / License: Apache-2.0 / Scala version: 2.11