Funnel Analysis with Spark and Druid — Itai Yaffe (Big Data Tech Lead) @ Nielsen

Itai Yaffe
Nielsen-TLV-Tech-Blog
1 min readFeb 11, 2020

Abstract:
Every day, millions of advertising campaigns are happening around the world.
As campaign owners, measuring the ongoing campaign effectiveness (e.g “how many distinct users saw my online ad VS how many distinct users saw my online ad, clicked it and purchased my product?”) is super important.
However, this task (often referred to as “funnel analysis”) is not an easy task, especially if the chronological order of events matters.

So, while the combination of Druid and ThetaSketch aggregators can answer some of these questions, it still can’t answer the question “how many distinct users viewed the brand’s homepage FIRST and THEN viewed product X page?”
In this talk, we will discuss how we combine Spark, Druid and ThetaSketch aggregators to answer such questions at scale.

--

--

Itai Yaffe
Nielsen-TLV-Tech-Blog

Sr. Architect @Akamai (prev. @databricks). Public speaker. @DataWomen Israeli chapter co-founder, @bigthingshere co-organizer. https://twitter.com/ItaiYaffe