Open in app

Sign In

Write

Sign In

João Paulo Figueira
João Paulo Figueira

482 Followers

Home

About

Published in Towards Data Science

·Pinned

Geofencing with Quadkeys

This article explains how to create geofences with square partitions — A geofence is a virtual border delimiting an arbitrary geographic region. We use these constructs to classify whether an object lies inside the area of interest, like when we want to determine if a car has entered a specific parking lot. …

Geofence

14 min read

Geofencing with Quadkeys
Geofencing with Quadkeys
Geofence

14 min read


Published in Towards Data Science

·Feb 10

More on Road Network Matching

Road network matching shenanigans — The purpose of this article is to complement and correct the previous one on the same subject. In that article, I presented an approach to reconstructing missing map-matched data from the Extended Vehicle Energy Dataset¹ (EVED) [1]. My technique explored the mathematical properties of triangles to find the lost data…

Network

9 min read

More on Road Network Matching
More on Road Network Matching
Network

9 min read


Published in Towards Data Science

·Jan 3

Road Network Edge Matching With Triangles

Triangles have mighty properties for geospatial queries — Triangles are shapes with many practical geometric properties. In this article, I illustrate using such properties when performing opportunistic optimizations while solving a particular geospatial problem: the recovery of missing map-matched information. I started exploring the Extended Vehicle Energy Dataset¹ (EVED) [1] a while ago to search for compelling geospatial…

Network

13 min read

Road Network Edge Matching With Triangles
Road Network Edge Matching With Triangles
Network

13 min read


Published in Towards Data Science

·Nov 23, 2022

Trajectory Queries Using Space Partitioning

How can we quickly find overlapping trajectories? — While traveling through space, an object describes a trajectory. We can think about a trajectory as a function of time that outputs positions in space. Conceptually, trajectories are continuous functions, although we pragmatically use their discrete versions. A discrete trajectory is a time-ordered collection of points in space where we…

Trajectory

10 min read

Trajectory Queries Using Space Partitioning
Trajectory Queries Using Space Partitioning
Trajectory

10 min read


Published in Towards Data Science

·Oct 13, 2022

Stratified K-Fold Cross-Validation on Grouped Datasets

This article explains how to use optimization to perform stratified K-Fold cross-validation on a grouped dataset — Cross-validation is a common resampling technique to get more mileage from your dataset. The procedure involves taking repeated independent samples from the original dataset and fitting them to the desired model. Cross-validation is helpful for model selection, as it provides better generalization performance estimates than the holdout method. …

Optimization

8 min read

Stratified K-Fold Cross-Validation on Grouped Datasets
Stratified K-Fold Cross-Validation on Grouped Datasets
Optimization

8 min read


Published in Towards Data Science

·Sep 28, 2022

Travel Time Estimation Using Quadkeys

This article explains how to estimate travel times using known speed vectors indexed by quadkeys — How long is your road trip going to last? If you are like me, you go to Google maps or Here maps and query the route directly. You will not only get the fastest or shortest paths, but you will also get estimates of your travel times. …

Quadkey

12 min read

Travel Time Estimation Using Quadkeys
Travel Time Estimation Using Quadkeys
Quadkey

12 min read


Published in Towards Data Science

·Sep 6, 2022

A Metric for HDBSCAN-Generated Clusters

How can we determine the equivalent DBSCAN ε parameter for HDBSCAN-generated clusters? — HDBSCAN is a hierarchical density-based clustering algorithm that works under simple assumptions. At a minimum, it only requires the data points to cluster and the minimum number of observations per cluster. The algorithm accepts a distance matrix if the data has a non-obvious associated distance metric. Like its predecessor, DBSCAN…

Hdbscan

9 min read

A Metric for HDBSCAN-Generated Clusters
A Metric for HDBSCAN-Generated Clusters
Hdbscan

9 min read


Published in Towards Data Science

·Feb 23, 2021

Stratified Splitting of Grouped Datasets Using Optimization

This article explains how to perform a stratified split of a grouped dataset into train and validation sets — One of the most frequent steps on a machine learning pipeline is splitting data into training and validation sets. It is one of the necessary skills all practitioners must master before tackling any problem. The splitting process requires a random shuffle of the data followed by a partition using a…

Machine Learning

13 min read

Stratified Splitting of Grouped Datasets Using Optimization
Stratified Splitting of Grouped Datasets Using Optimization
Machine Learning

13 min read


Published in Towards Data Science

·Jan 5, 2021

Displaying Geographic Information Using Custom Map Tiles

Learn how to create custom tiles for your interactive maps — Interactive maps are now a staple of our everyday digital life. We use them to learn of our whereabouts, plan the next trip, or review our past travels. In a professional setting, maps became priceless tools for all manner of businesses in planning, operations, and analytics. An interactive map displays…

Maps

11 min read

Displaying Geographic Information Using Custom Map Tiles
Displaying Geographic Information Using Custom Map Tiles
Maps

11 min read


Published in Towards Data Science

·Sep 30, 2020

Geospatial Indexing with Quadkeys

Squaring the Earth — When you browse an interactive map online or on your phone, you see the result of a patchwork of square tiles. Each tile contains a bit of the map information and perfectly matches the eight tiles around it. …

Geospatial

6 min read

Geospatial Indexing with Quadkeys
Geospatial Indexing with Quadkeys
Geospatial

6 min read

João Paulo Figueira

João Paulo Figueira

482 Followers

Addicted to math and data, slightly off-centered. Data Scientist at tblx.io

Following
  • Angela Shi

    Angela Shi

  • TDS Editors

    TDS Editors

  • Bruno Gonçalves

    Bruno Gonçalves

  • Bex T.

    Bex T.

  • Pier Paolo Ippolito

    Pier Paolo Ippolito

Help

Status

Writers

Blog

Careers

Privacy

Terms

About

Text to speech