By: Chinmay Chandak, Jeremy Dyer

Image for post
Image for post
https://www.istockphoto.com/photo/crazy-ride-on-the-night-by-car-gm481924646-69957833

A large percentage of production streaming pipelines today have Kafka as their source. Over the years, Apache Kafka has become one of the most popular open-source distributed stream-processing platforms for handling real-time data feeds. …


Image for post
Image for post

In the cuStreamz introduction blog, we demonstrated how to implement a classic Streaming Word Count example using RAPIDS cuStreamz on GPUs. In this blog, we show how to easily and efficiently scale that same word count job in a distributed fashion to leverage multiple GPU machines.

Here is the notebook that runs streaming word count end-to-end in a distributed mode using Dask. …

About

Software Engineer, NVIDIA

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store