pyspark_dist_explore (homepage)

Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.

@Bergvca / (0)

Get quick insights on data in Spark DataFrames through histograms and density plots, where the heavy lifting is done in Spark. pyspark_dist_explore is fast to understand as it leverages matplotlib for its plotting.


Tags

  • 1|python
  • 1|pyspark
  • 1|Histogram

How to

This package doesn't have any releases published in the Spark Packages repo, or with maven coordinates supplied. You may have to build this package from source, or it may simply be a script. To use this Spark Package, please follow the instructions in the README.

Releases

Version: 0.1.4 ( b21255 | zip ) / Date: 2017-08-02 / License: Apache-2.0