spark-tensorflow-connector

Spark Tensorflow Connector

This package contains a library for loading and storing TensorFlow records with Apache Spark. The library implements data import from the standard TensorFlow record format (TFRecords) into Spark SQL DataFrames, and data export from DataFrames to TensorFlow records.
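The import and export described above can be sketched as a small round trip. This is a minimal sketch, not the library's documented usage: it assumes the connector is on the classpath and registers itself under the data source short name "tensorflow", which this page does not state. The output path, column names, and the object name `TfRecordRoundTrip` are hypothetical.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}

object TfRecordRoundTrip {
  def main(args: Array[String]): Unit = {
    // Local session for illustration; in spark-shell a `spark` session already exists.
    val spark = SparkSession.builder()
      .appName("tfrecord-round-trip")
      .master("local[*]")
      .getOrCreate()
    import spark.implicits._

    // Hypothetical output location; choose a path that does not already exist.
    val path = "/tmp/example-output.tfr"

    // A small DataFrame with integer, double, and string columns.
    val df: DataFrame = Seq((1, 10.0, "r1"), (2, 20.0, "r2"))
      .toDF("id", "score", "label")

    // ASSUMPTION: the connector's data source name is "tensorflow".
    // Export the DataFrame as TensorFlow records.
    df.write.format("tensorflow").save(path)

    // Import the TensorFlow records back into a DataFrame.
    val imported: DataFrame = spark.read.format("tensorflow").load(path)
    imported.show()

    spark.stop()
  }
}
```

If the format name differs in the version you install, only the two `format(...)` calls need to change; the rest is standard Spark SQL reader and writer usage.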


Tags: data source, tensorflow, library

How to

Include this package in your Spark applications in one of the following ways:

spark-shell, pyspark, or spark-submit

> $SPARK_HOME/bin/spark-shell --packages tapanalyticstoolkit:spark-tensorflow-connector:1.0.0-s_2.11


If you use the sbt-spark-package plugin, in your sbt build file, add:

spDependencies += "tapanalyticstoolkit/spark-tensorflow-connector:1.0.0-s_2.11"


Otherwise, add the resolver and the dependency directly:

resolvers += "Spark Packages Repo" at ""

libraryDependencies += "tapanalyticstoolkit" % "spark-tensorflow-connector" % "1.0.0-s_2.11"


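In your pom.xml, add:

  <dependencies>
    <!-- list of dependencies -->
    <dependency>
      <groupId>tapanalyticstoolkit</groupId>
      <artifactId>spark-tensorflow-connector</artifactId>
      <version>1.0.0-s_2.11</version>
    </dependency>
  </dependencies>
  <repositories>
    <!-- list of other repositories -->
  </repositories>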


Version: 1.0.0-s_2.11 (commit b561f8) / Date: 2017-02-21 / License: Apache-2.0 / Scala version: 2.11