spark-deep-learning (homepage)

Deep Learning Pipelines for Apache Spark

@databricks / (2)

Deep Learning Pipelines aims at enabling everyone to easily integrate scalable deep learning into their workflows, from machine learning practitioners to business analysts. It builds on Apache Spark's ML Pipelines for training, and on Spark DataFrames and SQL for deploying models. It includes high-level APIs for common aspects of deep learning so they can be efficiently done in a few lines of code.


  • 1|machine learning
  • 1|deep learning
  • 1|GPU

How to

Include this package in your Spark Applications using:

spark-shell, pyspark, or spark-submit

> $SPARK_HOME/bin/spark-shell --packages databricks:spark-deep-learning:0.2.0-spark2.1-s_2.11


If you use the sbt-spark-package plugin, in your sbt build file, add:

spDependencies += "databricks/spark-deep-learning:0.2.0-spark2.1-s_2.11"


resolvers += "Spark Packages Repo" at ""

libraryDependencies += "databricks" % "spark-deep-learning" % "0.2.0-spark2.1-s_2.11"


In your pom.xml, add:
  <!-- list of dependencies -->
  <!-- list of other repositories -->


Version: 0.2.0-spark2.1-s_2.11 ( 4cd6d4 | zip | jar ) / Date: 2017-10-26 / License: Apache-2.0 / Scala version: 2.11

Version: 0.1.0-spark2.1-s_2.11 ( 2bf96c | zip | jar ) / Date: 2017-06-06 / License: Apache-2.0 / Scala version: 2.11