Working with Data Flows Using Watson Data APIs

Damian Cummins
IBM watsonx Assistant
Mar 20, 2018

IBM Watson offers a collection of REST APIs for creating, running, managing, and troubleshooting data flows, so your applications can integrate easily with Data Refinery.

A flow can read data from a wide variety of sources, process that data in a runtime engine using pre-defined operations or custom code, and then write it to one or more targets. The runtime engine can handle large amounts of data, so it’s well suited to reading, processing, and writing data at volume.
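To make the read, process, write shape concrete, here is a toy sketch in plain Python. It is not the Watson Data API or the Data Refinery runtime: `run_flow`, `drop_zero_quantity`, and `uppercase_name` are hypothetical names invented for illustration, and the "targets" are ordinary in-memory lists.

```python
from typing import Any, Callable, Dict, Iterable, List

Row = Dict[str, Any]
Operation = Callable[[Iterable[Row]], Iterable[Row]]


def run_flow(source: Iterable[Row],
             operations: List[Operation],
             targets: List[List[Row]]) -> int:
    """Read rows from a source, apply each operation in order,
    and write every resulting row to every target.

    Returns the number of rows written to each target.
    """
    rows: Iterable[Row] = source
    for operation in operations:
        rows = operation(rows)

    written = 0
    for row in rows:
        for target in targets:
            target.append(dict(row))  # copy so targets do not share row objects
        written += 1
    return written


# Two hypothetical "pre-defined operations", written as lazy generators.
def drop_zero_quantity(rows: Iterable[Row]) -> Iterable[Row]:
    return (row for row in rows if row["qty"] > 0)


def uppercase_name(rows: Iterable[Row]) -> Iterable[Row]:
    return ({**row, "name": row["name"].upper()} for row in rows)


if __name__ == "__main__":
    source = [{"name": "apple", "qty": 3}, {"name": "pear", "qty": 0}]
    primary, backup = [], []
    count = run_flow(source, [drop_zero_quantity, uppercase_name], [primary, backup])
    print(count, primary)
```

Because each operation consumes and returns an iterable, rows stream through the chain one at a time, which is the same reason a real runtime engine can process data at volume without holding it all in memory.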

The APIs are supported by a growing set of resources, including documentation and tutorials.

API Documentation

The data flows API specification can be found in the Watson Data API documentation under Documentation > Data flows.

Tutorials and Notebooks

The Watson Studio Community is a hub of useful blogs, notebooks, tutorials and data sets to get you started.

The "Create and run a data flow using Watson Data APIs" notebook introduces you to the data flow model and shows you how to define a flow with a data source, refining operations, and a target data set. It also covers data flow run creation, status monitoring, and log retrieval for troubleshooting.
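Status monitoring usually comes down to polling a run until it reaches a terminal state. The following is a minimal sketch of that loop, not code from the notebook: `wait_for_run` is a hypothetical helper, the state names in `TERMINAL_STATES` are assumptions rather than values confirmed by the API specification, and `get_state` stands in for whatever call your application makes to fetch the run's current state.

```python
import time
from typing import Callable

# Assumed terminal state names; check the API specification for the real values.
TERMINAL_STATES = frozenset({"finished", "error", "stopped"})


def wait_for_run(get_state: Callable[[], str],
                 interval_s: float = 5.0,
                 max_polls: int = 60,
                 sleep: Callable[[float], None] = time.sleep) -> str:
    """Poll get_state() until it returns a terminal state.

    get_state is supplied by the caller (for example, a function that
    issues a GET request for the run and extracts its state).
    Raises TimeoutError if no terminal state is seen within max_polls.
    """
    for attempt in range(max_polls):
        state = get_state()
        if state in TERMINAL_STATES:
            return state
        if attempt < max_polls - 1:
            sleep(interval_s)  # wait before asking again
    raise TimeoutError(f"run did not finish after {max_polls} polls")


if __name__ == "__main__":
    # Simulated run that passes through two non-terminal states.
    states = iter(["starting", "running", "finished"])
    print(wait_for_run(lambda: next(states), sleep=lambda _: None))
```

Injecting `get_state` and `sleep` keeps the loop independent of any HTTP client and makes it easy to test without waiting. If the final state indicates a failure, that is the point at which an application would retrieve the run's logs.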

The "Monitor data flow usage using Watson Data APIs" notebook demonstrates how run metrics can be visualized over time and shows how lists of runs can be filtered using query parameters.
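As a rough illustration of filtering a list of runs with query parameters, the sketch below builds a request URL with the standard library. The path layout, the host, and the parameter names (`state`, `limit`) are placeholders chosen for this example, not values taken from the API specification, and `build_runs_url` is a hypothetical helper.

```python
from typing import Mapping, Optional, Union
from urllib.parse import quote, urlencode

FilterValue = Optional[Union[str, int]]


def build_runs_url(base_url: str,
                   data_flow_id: str,
                   filters: Mapping[str, FilterValue]) -> str:
    """Build a URL for listing the runs of one data flow.

    Filters whose value is None are omitted. Keys are sorted so the
    resulting URL is deterministic. The path shape is an assumption.
    """
    path = f"{base_url.rstrip('/')}/data_flows/{quote(data_flow_id, safe='')}/runs"
    params = sorted((k, v) for k, v in filters.items() if v is not None)
    query = urlencode(params)
    return f"{path}?{query}" if query else path


if __name__ == "__main__":
    # Placeholder host and parameter names, for illustration only.
    print(build_runs_url("https://example.invalid/",
                         "flow-1",
                         {"state": "error", "limit": 10, "start": None}))
```

Keeping the query construction in one small function means the filters can be varied, for example by state or page size, without touching the code that sends the request.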

You can find more information about Data Refinery in the announcement blog post, "Self-service data preparation with Data Refinery".

Try out Data Refinery today from either Watson Studio or Watson Knowledge Catalog.
