Spark Dataset on top of GoodData workspace
Published in
Jan 28, 2021
This example shows how to create Spark Dataset on top of GoodData workspace. Once you have data in a Spark Dataset, you can use all data processing power of Spark including data transformation, machine learning, etc.
The dataset uses the workspace created in this tutorial.
Setup
The example requires a .gooddata
configuration file located in your home directory. The file has this structure:
{
"host": "<your-gd-domain>.na.gooddata.com",
"username": "<your-gd-username>",
"password": "<your-gd-password>",
"workspace":"<your-gd-workspace-id>"
}
Code
Check out the example code
Tutorials and articles
- Motivation: SQL and aggregated data: is there a better way?
- Accessing GoodData workspace from Apache Zeppelin notebook
- Accessing GoodData workspace from Beaver JDBC console tutorial