Introducing Cloudera Streaming Analytics Community Edition

kennygorman
Published in Cloudera
3 min read · Oct 26, 2021

Building modern streaming data processors is inherently a development exercise: composing SQL or Java, writing aggregations, working with schemas, complex data structures, and serialization formats, inspecting result data, and moving data from sources to sinks. Our goal is to make that workflow as simple and powerful as possible.

To that end, today we are making Cloudera Streaming Analytics Community Edition (CE) available for free.

Using Docker Compose, you can run a stand-alone Cloudera Streaming Analytics stack on your laptop or any other development node. The only prerequisite is Docker. Head over to the Cloudera downloads page to get the docker-compose YAML file, and check out the documentation to get started in just a few minutes. An included tutorial walks you through building and running your first stream processor.

What does this include?

We have worked hard to ensure the entire stack works well for all of your stream processor development activities. This isn’t a time-limited trial or crippleware. It’s a nearly full-fledged version of our enterprise-grade software, so you can develop stream processors with Cloudera SQL Stream Builder and Apache Flink. When you are ready to run your stream processing job in staging or production, you can run it on Cloudera Streaming Analytics clusters in your enterprise.

As a developer, you need all of that power at your fingertips when building complex stream processors. So this release includes the same functions and capabilities that are included in CSA 1.5, such as:

  • Flink SQL grammar on Flink 1.13 with RESTful API
  • A robust Materialized View engine with pull queries by any attribute via REST or PostgreSQL wire protocol
  • Schema detection for JSON data
  • Simple connectivity to any Kafka cluster endpoint
  • Rich nested and complex structure support
  • User-defined Java and JavaScript functions
  • Input transforms
  • Match Recognize for Complex Event Processing (CEP)

Where do we go from here?

This initial GA release includes the latest version (CSA CE 1.5), and we plan to keep adding tutorials, documentation, videos, examples, flows, and more. We would love your feedback on what we can add or change to make the development experience even better. We hope this brings new productivity, power, and, frankly, fun to your development experience.

kennygorman
Product Management @mongodb, Previous Co-Founder @eventadorlabs & @objectrocket. Early @paypal. Views are my own.