Evolving From Descriptive to Prescriptive Analytics: Part 2, Acquiring the Right Skills, Faster

Shaikh Quader
Inside Machine learning
4 min readApr 4, 2018

By Shaikh Quader and Chad Marston

Pexels.com

In the first part of this blog series, we reviewed the essential first step of acquiring leadership support to shift focus from descriptive to prescriptive analytics. Next is to acquire the skills in your team in order to succeed with that goal.

If you’re new to Machine Learning and need options to quickly master the art of Machine Learning, Coursera, edX and Udemy have a total of 319 Machine Learning courses for you — and on top of those, there’s Udacity and YouTube. It’s easy to be overwhelmed by the magnitude of learning resources available on the web and it’s difficult to decide where to begin. In fact, you can easily waste weeks or months without learning anything significant. We could call this confused state “Learner’s Block”.

In fact, when we, a team of Software Developers, stepped into Machine Learning for the first time, we had “Learner’s Block” for the first few months. At that point, we did a couple of things that moved us out of the slow lane.

Holding the Hand of a Mentor

As the team’s lead architect, I started by looking for a mentor in Machine Learning, which turned out to be the single most important decision in our journey into Machine Learning. The mentor has driven the road and knows where to turn, slow down, or accelerate. Jean-François Puget (JFP), a Distinguished Engineer for Machine Learning and Optimization at IBM, had kindly agreed to be my Mentor. We met for 30 minutes each month, and I went to each meeting with prepared questions, sometimes sending him the questions in advance. When I had a pressing question, I could also ask JFP via Slack. I then passed everything I learned on to rest of my team. The whole game of learning had changed for us. We were on an accelerated path to Machine Learning.

Starting Right — Understanding the Core Concepts

JFP advised us to stop wandering around many different learning channels, and instead focus on one course to get started. We registered for Andrew Ng’s “Machine Learning” from Coursera. Each of us was learning personally and as a group. But Machine Learning can’t be learned just by watching videos; it needs to be supplemented with hands-on exercise. Every day, we would gather for 30 minutes, watch Andrew’s lecture together for 20–25 minutes followed by discussion. From group discussion, each member in the group got comfortable with the concepts we just learned. It took us roughly 2 months to go through much of the course. Some of us went ahead to earn the course certificate. In 2 months, we had a firm grip on the core concepts of Machine Learning.

Picking up ML Arsenals — Learning Python

Andrew Ng based his course assignments on Octave, which was good for its purpose, but we knew we needed to learn either Python or R — one of the two most of Popular languages for industry-grade Machine Learning. We decided on Python and started with the following 2 courses:

  • Google’s Python Class — This is the compilation of lessons from a 2-day intensive course taught internally at Google. The course offers its lessons in two formats — recorded video lectures or text lessons. We were able to finish this course in a couple of hours. This course covers the fundamentals of Python pretty well.
  • Intro to Python for Data Science by DataCamp—DataCamp has done a good job in distilling the key concepts of Python for Data Science, with bite-size lessons, interwoven with short programming exercises. DataCamp has simplified the programming practices by embedding the notebook and Python shell in the course portal.

Swimming in the Wild — Solving Problems

According to CrowdFlower, 51% of the efforts in Machine Learning projects goes into collecting, labeling, cleaning, and organizing data. Unfortunately, the Machine Learning datasets found in the textbooks or in most online courses are simple and don’t offer the chance to practice these data preparation exercises, so they don’t help much to see, feel, and deal with real world data challenges. JFP advised us to continue practice Machine Learning with different datasets.

We took his advice and after finishing our Python courses, we started practicing Machine Learning with datasets from Kaggle and UCI repositories. Each week, we picked up one or a few Machine Learning algorithms and a dataset to try over the next week. We then got together to review each other’s models and compare the features and performance. The process gave our whole team the courage, confidence, and competence to work on real Machine Learning projects.

We now have the organizational commitment and skills. In our next blog we’ll discuss the tooling needed for efficient and scalable Machine Learning solutions.

--

--

Shaikh Quader
Inside Machine learning

A machine learning researcher who lost 50 lbs of weight and experiments with self-discipline, habits, creative thinking, learning, and wellbeing.