Stopping Invalid Traffic using Spark Streaming, Kafka, and Science!

Xandr Engineering
Xandr-Tech
Published in
1 min readOct 28, 2015

The Signals Intelligence group (SIGINT) within AppNexus Data Science has the responsibility of identifying and stopping invalid traffic as quickly as possible. One technique they use to achieve this very quickly is collecting, aggregating and acting on streaming data using Kafka and Spark Streaming.

Watch this video to learn how AppNexus use these systems, some of the data science findings, the challenges and tribulations they’ve had to overcome, and how you can put these techniques into practice yourself.

--

--