Soumya Ghosh – Medium

Soumya Ghosh

Soumya Ghosh

Simple Matrix Factorization example on the Movielens dataset using Pyspark

Matrix factorization works great for building recommender systems. I think it got pretty popular after the Netflix prize competition. All…

Mar 22, 2018

Simple Matrix Factorization example on the Movielens dataset using Pyspark

Mar 22, 2018

Soumya Ghosh

Basic data preparation in Pyspark — Capping, Normalizing and Scaling

In this blog, I’ll share some basic data preparation stuff I find myself doing quite often and I’m sure you do too. I’ll use Pyspark and…

Mar 21, 2018

Mar 21, 2018

Soumya Ghosh

Visualising Indian startup investments using Python — violin plots, heatmaps and sankey diagrams

Sometimes histograms and scatterplots arnt enough. Here I’ll cover some of the more complicated plots that you might need to use — violin…

Mar 18, 2018

Visualising Indian startup investments using Python — violin plots, heatmaps and sankey diagrams

Mar 18, 2018

Soumya Ghosh

Topic modelling with Latent Dirichlet Allocation (LDA) in Pyspark

In one of the projects that I was a part of we had to find topics from millions of documents. You can try doing topic modelling using two…

Mar 17, 2018

Topic modelling with Latent Dirichlet Allocation (LDA) in Pyspark

Mar 17, 2018

Soumya Ghosh

Recommender system on the Movielens dataset using an Autoencoder and Tensorflow in Python

I’m a huge fan of autoencoders. They have a ton of uses. They can be used for dimensionality reduction like I show here, they can be used…

Mar 17, 2018

Recommender system on the Movielens dataset using an Autoencoder and Tensorflow in Python

Mar 17, 2018

Soumya Ghosh

Denoising MNIST images using an Autoencoder and Tensorflow in python

Previously I had written sort of a tutorial on building a simple autoencoder in tensorflow. In that tutorial I had used the autoencoder for…

Mar 15, 2018

Denoising MNIST images using an Autoencoder and Tensorflow in python

Mar 15, 2018

Soumya Ghosh

Simple Autoencoder example using Tensorflow in Python on the Fashion MNIST dataset

Autoencoders can be used to solve a lot of problems. The one I’ll try to solve here is that of dimensionality reduction. This is a pretty…

Mar 11, 2018

Simple Autoencoder example using Tensorflow in Python on the Fashion MNIST dataset

Mar 11, 2018

Soumya Ghosh

Soumya Ghosh

Following

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams