cookie-datasets (homepage)

Popular ML Datasets for Spark ML (MNIST, IRIS, CIFAR)

@cookieai / (0)

Provides DataFrame readers for popular datasets used by the ML community. The current version supports MNIST, IRIS, and CIFAR.


Tags

  • 1|machine learning
  • 1|data source

How to

Include this package in your Spark Applications using:

spark-shell, pyspark, or spark-submit

> $SPARK_HOME/bin/spark-shell --packages ai.cookie:cookie-datasets_2.10:0.1.0

sbt

In your sbt build file, add:

libraryDependencies += "ai.cookie" % "cookie-datasets_2.10" % "0.1.0"

Maven

In your pom.xml, add:
<dependencies>
  <!-- list of dependencies -->
  <dependency>
    <groupId>ai.cookie</groupId>
    <artifactId>cookie-datasets_2.10</artifactId>
    <version>0.1.0</version>
  </dependency>
</dependencies>

Releases

Version: 0.1.0 ( 74dda5 | zip | jar ) / Date: 2015-12-22 / License: Apache-2.0 / Scala version: 2.10

Spark Scala/Java API compatibility: 1.0.0 - 14% , 1.1.0 - 53% , 1.2.0 - 59% , 1.3.0 - 60% , 1.4.0 - 72% , 1.5.0 - 100%