Huy Bui

230 Followers

Published in

Towards Data Science

·Pinned

A Mathematical Breakdown of the Closed-Form Equation of Simple Linear Regression

A quick tour to strengthen your foundation in machine learning

Linear regression is widely used by professionals in business, science, engineering, and more, but not many people understand (or care about) the math under the hood. This article guides readers into the realm of math and, hopefully, helps them gain some appreciation for it along the way.

I. Introduction and Problem Statement

Linear Regression is…

Linear Regression

4 min read

Published in

Towards Data Science

·Pinned

Derive Mortgage Amortization Formula from Scratch

A Journey to Discover the Beauty of Math in Real Estate with Visualization in Python

I. Definition

If you put a down payment on your mortgage, you are most likely taking an amortizing loan. An amortizing loan is one where you pay a fixed amount each month so that, by the end of the loan term, you have paid off both the debt and the interest. The monthly amortization consists of…

Data Science

3 min read

Published in

Towards Data Science

·Jul 30, 2020

Creating Variable Factor Map (PCA) Plot with Python

In this tutorial, I will show you how to create the Variable Factor Map step by step using matplotlib

I. Introduction

One of the most joyful activities in analytics is working with beautiful visualizations. With the Variable Factor Map, you can explain Principal Component Analysis with ease.

A picture is worth a thousand words

Principal Component Analysis (PCA) is a dimensionality-reduction method that is used to reduce the dimensionality of large data…

Python

4 min read

Published in

Towards Data Science

·Apr 1, 2020

The Monty Hall Problem will drive you crazy

Solving the famous brain teaser with math, statistics, and Monte Carlo methods

The Monty Hall Problem is a famous probability puzzle in statistics. It is named after Monty Hall, the host of the television game show “Let’s Make a Deal”. The brain teaser loosely replicates the game show concept, and it goes like this: There are 3 doors. You will have to choose…

5 min read

Published in

Towards Data Science

·Mar 31, 2020

Decision Tree Fundamentals

Learning about Gini Impurity, Entropy, and how to construct a decision tree

When talking about decision trees, I always imagine a list of questions I would ask my girlfriend when she does not know what she wants for dinner: Do you want to eat something with noodles? How much do you want to spend? Asian or Western? …

Decision Tree

7 min read

Published in

Towards Data Science

·Mar 22, 2020

Introduction to Natural Language Processing with the Beatles and Taylor Swift

Manipulating unstructured data with techniques such as tokenization, lemmatization, stop words, and TF-IDF.

Natural language processing is an interesting field because disambiguating an input sentence to produce a machine representation is a thought-provoking task. Take a look at Groucho Marx’s famous joke:

One morning I shot an elephant in my pajamas. How he got into my pajamas I’ll never know.

Tf Idf

11 min read

Published in

Towards Data Science

·Mar 18, 2020

How to Be a Data Scientist in 2020

Understanding the machine learning project life cycle and demystifying the criteria for a successful project.

A data science team is at the core of every big company. A successful data scientist needs a head for business strategy, the ability to make discoveries and form a vision from data, and the skill to convince stakeholders through communication and visualization. However, the amount of data is increasing exponentially, and it makes data…

Beginner

8 min read

Published in

Towards Data Science

·Mar 3, 2020

Tracing the Oil Spill

My journey from oil leakage to applying AI to predict equipment failures

I) Motivation

While searching for a job, I found that many companies in my area are looking for data scientists with knowledge of oil and gas. My background is mostly in mathematics, so I decided to go on an adventure along the oil pipeline. One thing that I noticed is oil and gas…

Data Science

6 min read

Published in

Towards Data Science

·Feb 25, 2020

ROC Curve Transforms the Way We Look at a Classification Problem

There is no machine learning algorithm that works best for all problems

The Receiver Operating Characteristic (ROC) curve is a probability curve that illustrates how well a binary classifier separates classes, based on the true-positive and false-positive rates. The Area Under the Curve (AUC) is a metric that ranges from 0 to 1. It is the area under the ROC curve.

Motivation

Why…

Roc Curve

6 min read

Published in

Analytics Vidhya

·Feb 18, 2020

From Convolutional Neural Network to Variational Auto Encoder

The most fascinating thing about generative deep learning, such as the auto-encoder, is that the machine can teach itself to be creative. The algorithm simply mimics the way humans learn and innovate. When first encountering a new concept, one needs to read, listen, memorize what is important, and then practice. The more training…

Autoencoder

6 min read

The Bayesian boy

Following
  • ODSC - Open Data Science
  • Valentina Alto
  • Jesus Rodriguez
  • Sadrach Pierre, Ph.D.
  • Soner Yıldırım
