Working with Data Flows Using Watson Data APIs
IBM Watson offers a collection of REST APIs for creating, running, managing, and troubleshooting data flows, so your applications can integrate easily with Data Refinery.
A flow can read data from a wide variety of sources, process that data in a runtime engine using predefined operations or custom code, and then write it to one or more targets. The runtime engine can handle large amounts of data, so it is well suited to reading, processing, and writing data at volume.
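To make the source, operations, and targets shape concrete, here is a minimal in-memory sketch in Python. It is only an illustration of the pattern and not the Watson Data API schema: the `Flow` class, the `read_source`, `drop_missing_age`, and `uppercase_name` functions, and the sample rows are all hypothetical names invented for this example.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List

Row = Dict[str, Any]


@dataclass
class Flow:
    """Hypothetical stand-in for a data flow: one source, N operations, M targets."""

    source: Callable[[], List[Row]]
    operations: List[Callable[[List[Row]], List[Row]]] = field(default_factory=list)
    targets: List[Callable[[List[Row]], Any]] = field(default_factory=list)

    def run(self) -> List[Row]:
        rows = self.source()            # read from the source
        for operation in self.operations:
            rows = operation(rows)      # apply each operation in order
        for write in self.targets:
            write(rows)                 # write the result to every target
        return rows


def read_source() -> List[Row]:
    # Hypothetical sample data standing in for a real data source.
    return [{"name": "ada", "age": 36}, {"name": "bob", "age": None}]


def drop_missing_age(rows: List[Row]) -> List[Row]:
    # Plays the role of a predefined "remove rows with missing values" operation.
    return [row for row in rows if row["age"] is not None]


def uppercase_name(rows: List[Row]) -> List[Row]:
    # Plays the role of a custom-code operation.
    return [{**row, "name": row["name"].upper()} for row in rows]


# Two plain lists act as the targets; list.extend is the "write" call.
target_a: List[Row] = []
target_b: List[Row] = []

flow = Flow(
    source=read_source,
    operations=[drop_missing_age, uppercase_name],
    targets=[target_a.extend, target_b.extend],
)
result = flow.run()
```

After `flow.run()`, both target lists hold the same refined rows, which mirrors a flow writing its output to more than one target.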
The APIs are supported by a growing set of resources, including documentation and tutorials.
API Documentation
The data flows API specification can be found in the Watson Data API documentation under Documentation > Data flows.
Tutorials and Notebooks
The Watson Studio Community is a hub of useful blogs, notebooks, tutorials and data sets to get you started.
The "Create and run a data flow using Watson Data APIs" notebook introduces you to the data flow model and shows you how to define a flow with a data source, refining operations, and a target data set. It also covers data flow run creation, status monitoring, and log retrieval for troubleshooting.
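Status monitoring of the kind the notebook covers usually comes down to a polling loop. The sketch below shows one way to write such a loop in Python. It makes several assumptions that do not come from the API specification: the `wait_for_run` helper is hypothetical, the run is assumed to be a dictionary with a `state` key, and the state names `finished`, `error`, and `stopped` are placeholders. Check the API documentation for the real field and state names. The `fetch_run` callable stands in for whatever HTTP call retrieves the run.

```python
import time
from typing import Any, Callable, Dict

# Assumed terminal state names; the real API may use different values.
TERMINAL_STATES = frozenset({"finished", "error", "stopped"})


def wait_for_run(
    fetch_run: Callable[[], Dict[str, Any]],
    poll_seconds: float = 5.0,
    max_polls: int = 120,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Poll fetch_run() until the run reaches a terminal state or polls run out."""
    for attempt in range(max_polls):
        run = fetch_run()
        if run.get("state") in TERMINAL_STATES:
            return run
        if attempt < max_polls - 1:
            sleep(poll_seconds)  # wait before asking again
    raise TimeoutError(f"run did not finish after {max_polls} polls")


# Usage with a fake fetcher, so the sketch runs without any network access.
_fake_states = iter([{"state": "queued"}, {"state": "running"}, {"state": "finished"}])
final_run = wait_for_run(lambda: next(_fake_states), poll_seconds=0, sleep=lambda _s: None)
```

Passing `sleep` as a parameter keeps the loop easy to test. In real use you would leave the default and have `fetch_run` call the run endpoint, then retrieve the logs if the final state indicates a failure.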
The "Monitor data flow usage using Watson Data APIs" notebook demonstrates how run metrics can be visualized over time and shows how lists of runs can be filtered using query parameters.
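Filtering a list of runs with query parameters amounts to appending an encoded query string to the list endpoint. The following Python sketch shows that step using only the standard library. The `build_filtered_url` helper, the `api.example.com` host, the path, and the `state` and `limit` parameter names are all hypothetical placeholders, so take the real endpoint and the supported parameters from the API documentation.

```python
from typing import Any, Mapping, Optional
from urllib.parse import urlencode


def build_filtered_url(endpoint: str, filters: Mapping[str, Optional[Any]]) -> str:
    """Append filters as a query string, skipping any filter whose value is None.

    Assumes the endpoint does not already contain a query string.
    """
    # Sort the keys so the resulting URL is deterministic.
    pairs = sorted((key, value) for key, value in filters.items() if value is not None)
    if not pairs:
        return endpoint
    return f"{endpoint}?{urlencode(pairs)}"


# Hypothetical endpoint and parameter names, for illustration only.
runs_url = build_filtered_url(
    "https://api.example.com/flows/my-flow/runs",
    {"state": "error", "limit": 10, "start": None},
)
```

Here `runs_url` is `https://api.example.com/flows/my-flow/runs?limit=10&state=error`. The `start` filter is left out because its value is `None`.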
You can find more information about Data Refinery in the announcement blog post "Self-service data preparation with Data Refinery".
Try out Data Refinery today from either Watson Studio or Watson Knowledge Catalog.