spark-json-schema (homepage)

JSON schema parser for Apache Spark

This goal of the spark-json-schema library is to support input data integrity when loading json data into Apache Spark. For this purpose the library:

-- Reads in an existing json-schema file
-- Parses the json-schema and builds a Spark DataFrame schema

This generated schema can be used when loading json data into Spark. This verifies that the input data conforms to the given schema and enables to filter out corrupt input data.


Tags (No tags yet, login to add one. )


How to

This package doesn't have any releases published in the Spark Packages repo, or with maven coordinates supplied. You may have to build this package from source, or it may simply be a script. To use this Spark Package, please follow the instructions in the README.

Releases

No releases yet.