Published in Echobox Insights

How we predicted President Macron

With the reliability of traditional opinion polls called into question, the search is on for a new way of tracking the public mood and predicting election results. Our successful prediction of the outcome of the French presidential election earlier this month shows that big data can disrupt traditional polling — if it is done right.

Earlier this year, we launched the French Election Tracker, a measure of how much interest each candidate received that was based on highly granular and comprehensive live data. Using billions of data points spanning the entire election campaign, we also issued a series of accurate predictions for the outcomes of both rounds of the election.

Ours was the only big data prediction to correctly name Macron as the winner in both rounds. We also predicted Mélenchon’s surge long before it became apparent in the polls. Moreover, the French Election Tracker (FET) predicted the correct outcome an hour before official results were published in both rounds of the election.

Round 1

Macron
Prediction: 23.7%. Result: 24.0%.

Le Pen
Prediction: 22.9%. Result: 21.3%.

Fillon
Prediction: 21.0%. Result: 20.0%.

Mélenchon
Prediction: 17.3%. Result: 19.6%.

Hamon
Prediction: 15.1%. Result: 6.4%.


Round 2

Macron
Prediction: 64.7%. Result: 65.8%.

Le Pen
Prediction: 35.3%. Result: 34.2%.
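
As a rough check on the figures above, the gap between each prediction and the result can be worked out directly. The following is a minimal sketch and not part of the tracker itself: the function names are hypothetical, and the candidate labels for any rows the listing leaves unnamed are assumptions made by matching the figures to the published results.

```python
# Predicted and actual vote shares in percent, copied from the figures above.
# Labels other than "Le Pen" are assumptions wherever the listing omits a name.
ROUND_1 = {
    "Macron": (23.7, 24.0),
    "Le Pen": (22.9, 21.3),
    "Fillon": (21.0, 20.0),
    "Mélenchon": (17.3, 19.6),
    "Hamon": (15.1, 6.4),
}
ROUND_2 = {
    "Macron": (64.7, 65.8),
    "Le Pen": (35.3, 34.2),
}


def absolute_errors(results):
    """Return the absolute prediction error per candidate, in percentage points."""
    return {
        name: round(abs(predicted - actual), 1)
        for name, (predicted, actual) in results.items()
    }


def mean_absolute_error(results):
    """Return the average absolute error across all candidates in a round."""
    errors = absolute_errors(results)
    return round(sum(errors.values()) / len(errors), 2)


if __name__ == "__main__":
    for label, results in (("Round 1", ROUND_1), ("Round 2", ROUND_2)):
        print(label, absolute_errors(results), mean_absolute_error(results))
```

On these numbers both second-round predictions land 1.1 percentage points from the result, while the first-round gaps range from 0.3 points on the first row to 8.7 points on the last.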

As people from all demographic groups spend more time online, the data that accumulates from their clicks and likes may one day render obsolete any polling based on small, representative samples.

Yet the French presidential election clearly showed the risks of relying on new methods. Most big data predictions were far off the mark, counting Macron out and variously predicting Presidents Le Pen, Fillon and even Mélenchon.

Our correct predictions were based not only on uniquely comprehensive and high-quality data, but also on a thorough understanding of French politics, the country’s history and its unusual electoral system. Moreover, we were transparent about our methods, our data and the limitations of our model.

This focus on context, quality and transparency is what turns a lot of data into big data. We think it constitutes the gold standard for big data predictions, a standard which we will continue to uphold as we move on to building our German Election Tracker.

You can sign up to receive the latest news our data showcases here. For more frequent updates, follow @EchoboxHQ on Twitter. For more about Echobox, go to


Sebastian Huempfer

Former VP Operations at @EchoboxHQ.
