Building our Data Platform: Why we have chosen FluentD

6 min read · Feb 12, 2022

A detailed analysis of why FluentD is the best solution for our needs

Originally published at https://blog.denexus.io.

This is the second article in a series describing the components of our Data Platform. If you want to read the first one:

Building our Data Platform: Why we have chosen Databricks over Snowflake (medium.com)

Selecting key technology and data partners is a central focus of the data engineering team at DeNexus. Addressing a challenge as complex as cyber risk quantification requires a data platform that is both scalable and reliable. In this article we cover the tool we chose as the data collector for our Data Platform.

DeNexus Requirements

Compatible with many use cases: we want to process syslogs, launch REST endpoints to easily gather data, execute external programs to receive or pull event logs… (and these are only some examples of use cases that we have already identified).
TLS compatible: some of the customers that send us data require compatibility with TLS authentication to trust the server that is going to receive their data.
High scalability: We want to be ready for petabytes of data as we are growing rapidly. “Design for infinity: big enough will not always be good enough.”
Portability: The tool should be easy to deploy on other systems without surprises like “It works on my machine”. Furthermore, it should be lightweight.
High availability: We do not want to lose any event/file so the solution must be reliable.
No vendor lock-in: We want to stick to the multi-cloud paradigm as much as possible. The solution must not be tied to a specific target or cloud provider.

Considering the described use cases and needs, we analyzed several tools and concluded that FluentD was the correct choice. Here’s why…

The Solution: FluentD

FluentD is an open-source data collector for a unified logging layer. It allows us to unify data collection and consumption for better use and understanding of data.

Although it is written in Ruby, its most performance-sensitive parts (like object serialization and networking layers) are written in C. FluentD trades some raw performance for access to the many plugins developed by the Ruby community, which is what allows it to serve as a unified logging layer.

FluentD is not only an event collector and aggregator; it also lets us implement log parsing, filtering, data conversion and data processing. Depending on the use case, FluentD can be deployed as an all-in-one tool that will:

Collect (actively or passively) data
Transform data to a desired/useful format (e.g. from plain text to JSON by using regex)
Filter input data (e.g. grep-like plugin to filter events by value)
Apply transformations to input data (e.g. change the timestamp format of an event time field)
Push the result to the desired storage service among a large number of options.
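To make these stages concrete, here is a minimal Python sketch of the same collect, parse, filter, transform and emit flow. It is only an illustration of the idea, not how FluentD is implemented or configured: the log format, the function names and the choice of UTC for timestamps are all assumptions made for this example.

```python
import json
import re
from datetime import datetime, timezone

# Hypothetical plain-text log format, used only for this illustration.
LINE_PATTERN = re.compile(
    r"^(?P<time>\S+ \S+) (?P<level>[A-Z]+) (?P<message>.*)$"
)


def parse(line):
    """Turn a plain-text line into a dict, or None if it does not match."""
    match = LINE_PATTERN.match(line)
    return match.groupdict() if match else None


def keep(event, levels=("WARN", "ERROR")):
    """Grep-like filter: keep only events whose level is in `levels`."""
    return event["level"] in levels


def normalize_time(event):
    """Rewrite the time field as ISO 8601 (UTC is assumed here)."""
    parsed = datetime.strptime(event["time"], "%Y-%m-%d %H:%M:%S")
    return {**event, "time": parsed.replace(tzinfo=timezone.utc).isoformat()}


def run_pipeline(lines):
    """Collect -> parse -> filter -> transform -> emit JSON strings."""
    output = []
    for line in lines:
        event = parse(line)
        if event is None or not keep(event):
            continue  # drop unparseable lines and filtered-out events
        output.append(json.dumps(normalize_time(event), sort_keys=True))
    return output


if __name__ == "__main__":
    sample = [
        "2022-02-12 10:15:00 INFO service started",
        "2022-02-12 10:15:03 ERROR connection refused",
        "not a log line",
    ]
    for record in run_pipeline(sample):
        print(record)
```

In a real deployment each of these functions would correspond to a plugin chosen in the FluentD configuration rather than to hand-written code.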

FluentD has a small memory footprint (30~40 MB), and it has a little brother named Fluent Bit, written entirely in C, whose memory footprint is about ten times smaller still.

FluentD vs. Fluent Bit comparison (Image by FluentD)

But despite being lighter and written entirely in C, Fluent Bit has far fewer plugins than FluentD (around 80 compared to FluentD’s 600+), so it gains efficiency and performance at the cost of capabilities (especially parsing).

Why Does FluentD Meet our Needs?

Proven in many use cases: The list of FluentD plugins has more than 600 entries, covers all our identified use cases, and gives us some peace of mind about new use cases that may arise in the future. One of our current use cases requires S3 compatibility, which the S3 plugin provides. It also requires keeping S3-related costs under control: S3 API calls are charged per request (one per uploaded object), not per size, so uploading 1 byte costs the same as uploading 1 GB. The plugin helps here because events are aggregated into single files based on time or file size.
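The effect of aggregating by time or size can be sketched in a few lines of Python. This is a simplified illustration and not the S3 plugin's actual buffering code: the class name, its parameters and the `uploads` list (which stands in for upload requests) are hypothetical.

```python
class ChunkBuffer:
    """Groups events into chunks so that each chunk becomes one upload."""

    def __init__(self, max_bytes, max_age_seconds):
        self.max_bytes = max_bytes              # flush once a chunk is this big
        self.max_age_seconds = max_age_seconds  # or once it is this old
        self.uploads = []                       # stands in for upload requests
        self._events = []
        self._size = 0
        self._opened_at = 0.0

    def append(self, event, now):
        """Add one event (bytes); `now` is the current time in seconds."""
        if self._events and now - self._opened_at >= self.max_age_seconds:
            self.flush()  # time-based flush of the previous chunk
        if not self._events:
            self._opened_at = now  # a new chunk starts with this event
        self._events.append(event)
        self._size += len(event)
        if self._size >= self.max_bytes:
            self.flush()  # size-based flush

    def flush(self):
        """Emit the current chunk as a single upload, if it is not empty."""
        if self._events:
            self.uploads.append(b"\n".join(self._events))
            self._events = []
            self._size = 0


if __name__ == "__main__":
    buffer = ChunkBuffer(max_bytes=10, max_age_seconds=60)
    for i in range(100):
        buffer.append(b"x", now=i * 0.1)
    buffer.flush()
    print(f"100 events sent in {len(buffer.uploads)} uploads")
```

With these example settings, 100 one-byte events end up in 10 uploads instead of 100, which is where the cost saving comes from.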

TLS support: This was the main reason we chose FluentD over Fluent Bit. Although Fluent Bit implements TLS support (by default) in all output plugins, this is not the case for all input plugins, and the one we need for one of our identified use cases (syslog) does not (yet) support TLS:

Please add TLS/SSL support to syslog input plugin. · Issue #2513 · fluent/fluent-bit (github.com)

In the case of FluentD, TLS is fully supported and is currently used in all our use cases. We even have a use case in which we use it to implement TLS Mutual Authentication:

FluentD webhook with TLS Mutual Authentication: Build your own HTTP callback with code examples! (medium.com)
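For readers unfamiliar with the term, mutual authentication means the server also verifies the client's certificate, not only the other way around. The sketch below shows that idea with Python's standard `ssl` module. It is not FluentD configuration, and the function name and file-path parameters are hypothetical.

```python
import ssl


def build_mutual_tls_context(cert_file=None, key_file=None, ca_file=None):
    """Server-side TLS context that also requires a client certificate.

    The three file paths are placeholders; a real server needs all of them.
    """
    context = ssl.SSLContext(ssl.PROTOCOL_TLS_SERVER)
    context.minimum_version = ssl.TLSVersion.TLSv1_2
    # On a server context, CERT_REQUIRED means the client must present a
    # certificate signed by a trusted CA: this is the "mutual" part.
    context.verify_mode = ssl.CERT_REQUIRED
    if cert_file and key_file:
        # The server's own identity, presented to clients.
        context.load_cert_chain(certfile=cert_file, keyfile=key_file)
    if ca_file:
        # The CA used to validate client certificates.
        context.load_verify_locations(cafile=ca_file)
    return context


if __name__ == "__main__":
    ctx = build_mutual_tls_context()
    print(ctx.verify_mode)
```

With plain (one-way) TLS the server context would keep the default `ssl.CERT_NONE`, so any client could connect as long as it trusts the server.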

High scalability: As we have seen, FluentD’s memory footprint is only 30–40 MB, and a regular PC can handle 18,000 messages/second with a single process.

Since we must design for scalability as if for infinity, if our processing needs grow we could scale horizontally or apply solutions that combine FluentD with Fluent Bit:

“The combination of Fluentd and Fluent Bit is becoming extremely popular in Kubernetes deployments because of the way they complement each other — Fluent Bit acting as a lightweight shipper collecting data from the different nodes in the cluster and forwarding the data to Fluentd for aggregation, processing and routing to any of the supported output destinations.”
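As a rough back-of-the-envelope aid for horizontal scaling, the single-process figure quoted above can be turned into a process count. This is a simplified estimate: the `headroom` margin and the target rate are hypothetical values, and real throughput depends on the plugins and message sizes involved.

```python
import math

# Single-process throughput quoted in the article (messages per second).
MESSAGES_PER_SECOND_PER_PROCESS = 18_000


def processes_needed(target_rate, headroom=0.5):
    """Estimate how many processes are needed for `target_rate` msgs/s.

    `headroom` is a hypothetical safety margin: 0.5 means each process is
    planned to run at only half of its quoted capacity.
    """
    usable_rate = MESSAGES_PER_SECOND_PER_PROCESS * (1 - headroom)
    return math.ceil(target_rate / usable_rate)


def messages_per_day(processes=1):
    """Upper bound on daily volume at the quoted per-process rate."""
    return MESSAGES_PER_SECOND_PER_PROCESS * 86_400 * processes


if __name__ == "__main__":
    print(processes_needed(100_000))  # 12 processes at 50% headroom
    print(messages_per_day())         # 1555200000 messages per day
```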

Portability: Easy installation, available as a service in many formats and systems:

msi (Windows)
dmg (macOS)
deb (Ubuntu/Debian)
Ruby Gem (without dependencies).
Docker: we create a container with the desired configuration, and we can deploy it any number of times without any further effort in its configuration.

High availability: Since FluentD is not a serverless or managed solution, the high-availability configuration depends entirely on us and on our architecture. However, the official documentation offers recommendations on how to deploy FluentD for high availability. There are also several buffer plugins that help us handle failure scenarios in forwarder/aggregator setups.
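The general idea behind a buffered forwarder with a standby aggregator can be sketched as follows. This is a conceptual illustration in Python, not FluentD's implementation: the class, its methods and the use of `ConnectionError` to signal an unreachable aggregator are assumptions made for the example.

```python
class FailoverForwarder:
    """Sends chunks to a primary aggregator, falling back to a standby.

    Chunks that cannot be delivered anywhere stay buffered for a later retry.
    """

    def __init__(self, primary, standby):
        # Each target is a callable that raises ConnectionError on failure.
        self.targets = [primary, standby]
        self.pending = []  # chunks not yet delivered

    def emit(self, chunk):
        """Buffer a chunk and try to deliver everything that is pending."""
        self.pending.append(chunk)
        self.flush()

    def flush(self):
        """Retry all buffered chunks, keeping the ones that still fail."""
        still_pending = []
        for chunk in self.pending:
            if not self._deliver(chunk):
                still_pending.append(chunk)
        self.pending = still_pending

    def _deliver(self, chunk):
        """Try each target in order; return True on the first success."""
        for send in self.targets:
            try:
                send(chunk)
                return True
            except ConnectionError:
                continue  # this aggregator is down, try the next one
        return False


if __name__ == "__main__":
    delivered = []

    def unreachable(chunk):
        raise ConnectionError("aggregator is down")

    forwarder = FailoverForwarder(unreachable, delivered.append)
    forwarder.emit("chunk-1")
    print(delivered, forwarder.pending)
```

The key property is that an event is only removed from the buffer after some aggregator has accepted it, which is what prevents data loss when one node fails.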

No vendor lock-in: Although Enterprise services are available, FluentD is a fully open-source solution with no dependency on third parties or cloud providers. Moreover, there are use cases where FluentD is a key tool for achieving a multi-cloud architecture.

Last Thoughts

More than 2,000 data-driven companies rely on FluentD (including technology leaders such as Microsoft and Google), so we can be quite sure that open-source technologies like FluentD are superior, from a technology perspective, to their proprietary peers.

FluentD in DeNexus Data Platform (Image by author)

So, this is why we have chosen FluentD. But how exactly does FluentD fit into the DeNexus data ecosystem? Stay tuned: in upcoming articles we will cover in detail how we have implemented some of our current use cases with FluentD.

“The success formula: solve your own problems and freely share the solutions.”― Naval Ravikant


Written by Iván Gómez Arnedo