ML Prototyping Made Easy

One-Stop Solution for different ML Problems!

Image for post
Image for post
Photo by Edward Howell on Unsplash

Have you ever wanted a simple and easy to use python library bundled in a single package that can handle an end-to-end Machine learning journey with a consistent syntax interface? If you are working on a PoC, PyCaret can be a handy tool as it provides a one-stop solution for a variety of problems. Be it removing multi-collinearity from your dataset or impute missing values, it has got your back.

PyCaret can significantly reduce the time and efforts required to build and train ML models and all this can be achieved in only a couple of function calls. It has some interesting built-in functionality for Feature engineering, Hyper Parameter Tuning, etc. It is also capable of comparing models, stack, and ensemble them based on your requirement. Furthermore, it can also be integrated with ML Flow to track and manage ML experiments. You can also use it to deploy on different cloud platforms like Azure or AWS. Usually, we have defined the machine learning pipeline that we follow in creating a model. …

MS in the UK

Lessons learned at Real-life Hogwarts aka UoG

Image for post
Image for post
Image by Author

Are you a Harry Potter fan and want to study at real-life Hogwarts School of Witchcraft and Wizardry? Do you want to pursue higher education in the UK and not sure if it will be worth it? Are multiple offers from different Universities confusing you? Worry no more, you have landed at the right place! I’ve been there as well and I feel you. This post is dedicated to you — I have tried to answer questions that I have been asked most by different people on LinkedIn. …

SQL Questions for Data Jobs

SQL Interview Questions

Image for post
Image for post
Photo by Free To Use Sounds on Unsplash

SQL is everywhere. It is probably one of the most widely used languages today and also the primary language for retrieving data from databases. It came into existence after E.F. Codd laid the foundation of the modern relational database in his groundbreaking paper A Relational Model of Data for Large Shared Data Banks.

As a beginner, I was intimidated by different SQL( Structured Query Language) Flavors(Oracle SQL, MySQL, PostgreSQL) but I found out that the basic concept remains the same and they only differ in terms of syntax and implementation. However, In this post, we will use Oracle SQL for query and demonstration. Notably, NoSQL databases are popular nowadays as the majority of the data produced today is non-relational. Nevertheless, this does not mean SQL days are over. It’s not going anywhere. Not anytime soon. …

Model Explainability using LIME

Can you trust your Machine learning model?

Image for post
Image for post
Photo by Andy Kelly on Unsplash

Why should we trust a machine learning model blindly? Wouldn’t it be wonderful if we can get a better insight into model predictions and improve our decision making? With the advent of explainable AI techniques such as LIME and SHAP, it is no longer a challenge. Nowadays, machine learning models are ubiquitous and becoming a part of our lives more than ever. These models are usually a black box in essence that it’s hard for us to assess the model behavior. From smart speakers with inbuilt conversational agents to personalized recommendation systems, we use them daily, but do we understand why they behave in a certain way? Given their ability to influence our decision, it is of paramount and supreme importance that we should be able to trust them. …

Distributed computing for Advanced Analytics

Scale-up Pandas, Numpy and Sci-kit learn natively

Image for post
Image for post
Image Source: Dask

“Data is the new science. Big Data holds the answers.” — By Pat Gelsinger

Big Data does hold the answer. The more data we have, the more possibilities of gaining business value out of it. However, gathering data is not the only challenge, we also need to consider data storage and processing. As a Data Scientist, we often use tools like Pandas and Numpy to analyze data since these are widely trusted and efficient. However, as the size of the dataset increases, we start facing the actual limitations of these tools. What do we do next? We switch to a more scalable solution such as Spark and at times, this rework is time-consuming. Wouldn’t it be wonderful if you could do it in your system locally or scale-up to a cluster on a need basis? …

H2o Flow:

Build a Machine learning model without writing a single line of code

Image for post
Image for post
Photo by Marvin Meyer on Unsplash

Have you ever wanted to play around with Machine learning? Wish you could build a machine learning model without having to write any code? Lack of programming experience is stopping you? Happy Days are here — has a solution for you, it’s called H2o Flow.

It is widely known that the field of Data Science is highly dynamic and there is something new in DS stack every day. The advent of AutoML is certainly a commendable push to this stack. …

Develop an interactive web app — Streamlit 101

A quick and easy way to develop a web application for your machine learning models.

Image for post
Image for post

Fancy building an interactive web app? Don’t want to write HTML/CSS scripts? Wish you could write a few lines of python code to build an app and showcase your app to stakeholders gracefully. Worry, no more, Here is a solution — Streamlit.

Streamlit is a super useful tool to develop an interactive web application in a couple of function calls written in python. It lets you create apps for your machine learning models using fairly simple python code. If you can write scripts in python, you can quickly embrace streamlit by getting acquainted with a couple of streamlit functions. …

Getting Started with Docker

Docker For Absolute Beginners

Image for post
Image for post
Photo by Who’s Denilo ? on Unsplash

Have you ever been intimidated by Docker’s fancy name and wondered what it is? — Great, This post is for you.

In this post, we will cover what exactly this devil is and what it does.

First and foremost, what is Docker? 🧐

Docker is an open-source containerization tool mainly used for shipping and running applications quickly across different platforms. Docker’s way of shipping and deploying code swiftly, significantly reduces the delay between writing code and running it in production. It is developed using GO and was first released in 2013.

Why Docker?

Docker has a list of advantages that makes it super useful. …

Deploy your ML Model on the Cloud

Quickly Deploy your Web applications on the Cloud using Heroku

Image for post
Image for post

Heroku is a quick and easy to use Platform as a service (PaaS) that facilitates the deployment of web apps so that it is accessible to anyone with a URL. In other words, this will enable you to take your application from your local machine to the cloud. In this tutorial, We will discuss the steps to deploy apps built on Streamlit (Python) on Heroku. Heroku has been selected for deployment mainly because it provides free resource hours when you sign up for a new account.

Before we dive in further, some prerequisites for this would be a working streamlit app and some basic knowledge of the cloud. However, you can still deploy your application on the web even if you are not familiar with any cloud technology. …


Shashvat G

Data Scientist | Analyst who aspires to continuously learn and grow in Data Science Space. Find him on LinkedIn

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store