Homepage
Open in app
Sign in
Get started
Data Dojo
Follow
Mastering Differences and Pitfalls when Switching SQL Databases: PostgreSQL vs.
Mastering Differences and Pitfalls when Switching SQL Databases: PostgreSQL vs.
Transitioning to another SQL database? This blog post is for you. Shifting from one SQL dialect to another can be a journey full of…
Heike Maria
Dec 9, 2022
From localhost to a web server — How to host your Streamlit App on Heroku (for free)
From localhost to a web server — How to host your Streamlit App on Heroku (for free)
You have built a great streamlit app. So far, you only ran it locally on your computer on localhost:8501. Now you would like to share your…
Heike Maria
Sep 5, 2021
Biases in learning to rank models and three approaches to deal with them
Biases in learning to rank models and three approaches to deal with them
Training data for Learning to Rank models in search suffers from selection and position biases. What can you do?
Heike Maria
Apr 29, 2021
Intro into APIs: How to access data from public REST APIs with `curl`
Intro into APIs: How to access data from public REST APIs with `curl`
This post will teach you the inution of REST APIs and how you can use them to get interesting datasets for your data projects. First, we…
Heike Maria
Nov 26, 2020
Pointwise, Pairswise and Listwise Learning to Rank Models — Three Approaches to Optimize Relative…
Pointwise, Pairswise and Listwise Learning to Rank Models — Three Approaches to Optimize Relative…
In many scenarios, such as a google search or a product recommendation in an online shop, we have tons of data and limited space to…
Heike Maria
Oct 15, 2020
AI-Machine-Learning-Buzzword-Bingo
AI-Machine-Learning-Buzzword-Bingo
I was recently invited to join a panel discussion among developers to dispel the myth of the typical BS Buzzword Bingo around machine…
Heike Maria
Sep 10, 2020
The Intuition of Word Embeddings: How you Teach A Computer to Understand Text
The Intuition of Word Embeddings: How you Teach A Computer to Understand Text
Humans intuitively understand the meaning of words: Which words are similar, opposites or related to each other? But our machine learning…
Heike Maria
Aug 31, 2020
Jupyter Notebooks: Boost your productivity with Extensions and Magic Commands
Jupyter Notebooks: Boost your productivity with Extensions and Magic Commands
In this blogpost I will share some tips for working with Jupyter Notebooks. Those tips greatly improved my productivity when working with…
Heike Maria
Jul 12, 2020
Mastering ElasticSearch Queries If You Have Only Worked With SQL Before
Mastering ElasticSearch Queries If You Have Only Worked With SQL Before
Elasticsearch is often the storage engine of choice for storing and querying full text data. But writing an ElasticSearch query is pretty…
Heike Maria
Jun 27, 2020
How the Inverted Index and Scoring Work in ElasticSearch
How the Inverted Index and Scoring Work in ElasticSearch
Searching through full text fields with regexes in relational database systems like PostgreSQL or MySQL is painful: The query latency is…
Heike Maria
Jun 24, 2020
Working with Complex Datatypes in Hive — Array, Structs and Maps
Working with Complex Datatypes in Hive — Array, Structs and Maps
The basic idea of complex datatypes is to store multiple values in a single column. So if you are working with a Hive database and you…
Heike Maria
Jun 7, 2020
Mastering Data Preparation with Pandas: Subsetting, Filtering and Joining DataFrames
Mastering Data Preparation with Pandas: Subsetting, Filtering and Joining DataFrames
When I started working with pandas I noticed that there were so many ways how to subset, filter and join data with pandas. But I was…
Heike Maria
Aug 18, 2019
Everything An Analyst Needs to Know to Use Git for Version Control
Everything An Analyst Needs to Know to Use Git for Version Control
Yes, Git is known as THE tool for version control tool of choice for many developers. Having a nontechnical background, I was pretty…
Heike Maria
Dec 17, 2018
About Data Dojo
Latest Stories
Archive
About Medium
Terms
Privacy
Teams