Overview of the different approaches to putting Machine Learning (ML) models in production

Julien Kervizic
Hacking Analytics
Published in
14 min readApr 29, 2019

--

Photo by Mantas Hesthaven on Unsplash

There are different approaches to putting models into productions, with benefits that can vary dependent on the specific use case. Take for example the use case of churn prediction, there is value in having a static value already that can easily be looked up when someone calls a customer service, but there is some extra value that could be gained if, for specific events, the model could be re-run with the newly acquired information.

There are generally different ways to both train and serve models into production:

  • Train: one-off, batch and real-time/online training
  • Serve: Batch, Realtime (Database Trigger, Pub/Sub, web-service, inApp)

Each approach has its own set of benefits and tradeoffs that need to be considered.

One-off Training

Models don’t necessarily need to be continuously trained in order to be pushed to production. Quite often, a model can be just trained ad-hoc by a data-scientist and pushed to production until its performance deteriorates enough that they are called upon to refresh it.

--

--

Julien Kervizic
Hacking Analytics

Living at the interstice of business, data and technology | Head of Data at iptiQ by SwissRe | previously at Facebook, Amazon | julienkervizic@gmail.com