Open in app

Sign in

Write

Sign in

Mala Deep
Mala Deep

1K Followers

Home

Lists

About

Published in

Towards Data Science

·Pinned

Surprisingly Effective Way To Name Matching In Python

Data Matching, Fuzzy Matching, Data Deduplication — Recently I came across this dataset, where I needed to analyze the sales recording of digital products. I got the dataset of having almost 572000 rows and 12 columns. I was so excited to work on such big data. With great enthusiasm, I gave a quick view of data, and…

Data Science

9 min read

Surprisingly Effective Way To Name Matching In Python
Surprisingly Effective Way To Name Matching In Python
Data Science

9 min read


Published in

Towards Data Science

·Pinned

Little Known Ways to Make your Data Visualization Awesome

Stripping Away The Excess — A few months back, while I was surfing Instagram, I saw a comment in a post, with a remark, “You have used data-ink ratio in good order.” I immediately started exploring the term “data-ink” and realized that it was coined by “Edward Tufte.” I had read his name on some…

Data Visualization

8 min read

Little Known Ways to Make your Data Visualization Awesome
Little Known Ways to Make your Data Visualization Awesome
Data Visualization

8 min read


Pinned

Table of Contents

Simplest way to gain access to all of… —

Data Science

3 min read

Table of Contents
Table of Contents
Data Science

3 min read


Published in

Towards AI

·Oct 1

Addressing Data Leakage: Essential Considerations for Trustworthy Machine Learning Models

With the ChatGPT use case and other AI markets on the rise, the accuracy and trustworthiness of AI models are of utmost importance. The numerous moving parts in this endeavor make it difficult, including data leakage, which is frequently underrated but has serious repercussions. Before going into what data leakage…

Data Science

6 min read

Addressing Data Leakage: Essential Considerations for Trustworthy Machine Learning Models
Addressing Data Leakage: Essential Considerations for Trustworthy Machine Learning Models
Data Science

6 min read


Published in

Bootcamp

·Aug 15

How I Created an Inverted Index Search App Using Python and Streamlit

Vertical search engine for research papers — Humans have always been driven by their inherent curiosity to seek out information. This is one of the motivations behind developing search engines [1]. From WebCrawler, created by Brian Pinkerton, a computer science student at the University of Washington, to the web3.0-based search engines like Presearch, to AltaVista, Lycos, Yahoo…

Data Science

16 min read

How I Created an Inverted Index Search Engine Using Python and Streamlit
How I Created an Inverted Index Search Engine Using Python and Streamlit
Data Science

16 min read


Published in

Towards AI

·Aug 10

How to install Hadoop on MacBook M1 or M2 without Homebrew or Virtual Machine

In this article, I will walk you through the simple installation of Hadoop on your local MacBook M1 or M2. I will be using a MacBook Air M1 (arm64), 2020, with 8 GB of memory and MacOS Ventura 13.2.1. Let’s get started! Before we get started, I am confident you…

Hadoop

10 min read

How to install Hadoop on MacBook M1 or M2 without Homebrew or Virtual Machine
How to install Hadoop on MacBook M1 or M2 without Homebrew or Virtual Machine
Hadoop

10 min read


Published in

Towards Data Science

·Jun 15

Modeling EEG Signals using Polynomial Regression in R

selecting best model with polynomial regression from scratch — Introduction to EEG signals EEG stands for electroencephalogram, which is an electrical signal that measures the electrical activity of the brain [1]. To get the EEG result, electrodes consisting of small metal discs with thin wires are pasted onto the scalp. The electrodes detect tiny electrical charges that result from the activity of your…

Data Science

12 min read

Modeling EEG Signals using Polynomial Regression in R
Modeling EEG Signals using Polynomial Regression in R
Data Science

12 min read


Published in

Dev Genius

·Mar 11

Tableau Desktop Specialist Certification by Adam Mico

Longtime practitioners will find this book refreshing, and newbies will find motivation in it. If you are into the data domain, I am sure you have heard about business intelligence (BI) tools. If not, learn a few here. …

Tableau

7 min read

Tableau Desktop Specialist Certification by Adam Mico — Book Review
Tableau Desktop Specialist Certification by Adam Mico — Book Review
Tableau

7 min read


Published in

UX Planet

·Mar 5

Experiencing Zero-UI on a daily basis

No physical or visual interaction needed — If you are reading this post, then I am sure you are accessing it through Medium’s blog interface, or simply the Medium User Interface (UI). User Interface is the means by which a user controls and interacts with a software application or hardware device in a natural and intuitive way…

UI

5 min read

Experiencing Zero-UI on a daily basis
Experiencing Zero-UI on a daily basis
UI

5 min read


Published in

Artificial Intelligence in Plain English

·Feb 14

This is what ChatGPT have to say about Google Current Business

The market is constantly evolving, says ChatGPT — Recently, I had a talk with OpenAI ChatGPT. ChatGPT is the talk of the town. If you are hearing it for the first time, I suggest you do a Google search (opps!). If you are working in the data field and would like to know how I am using ChatGPT…

ChatGPT

9 min read

This is what ChatGPT have to say about Google Current Business
This is what ChatGPT have to say about Google Current Business
ChatGPT

9 min read

Mala Deep

Mala Deep

1K Followers

Top 1500 Writer in Data Science & Visualization. Also talks about HCI & Design Psychology

Following
  • Philippe Bouaziz @DataScienceMustNeededSkills

    Philippe Bouaziz @DataScienceMustNeededSkills

  • Derick David

    Derick David

  • Saurabh singh

    Saurabh singh

  • Chris Soschner

    Chris Soschner

  • Lewiscoaches

    Lewiscoaches

See all (1,065)

Help

Status

About

Careers

Blog

Privacy

Terms

Text to speech

Teams