A THOROUGH COMPARISON

A performance comparison of three popular techniques (Contrastive, Triplet & Quadruplet loss) used to train similarity learning algorithms on the Quora dataset [1]

Image for post
Image for post
Photo by Axel Bonniot on Unsplash

Through this article, I will evaluate and compare three different losses for the task of Deep Similarity Learning. If this topic is still not perfectly understandable to you, I have written an article introducing the main concepts with code examples as well as a complete GitHub repository for you to check:

Table of Content

I. Quick overview of the task

II. Siamese Recurrent Network: similarity learning for sequences

III. Losses for Deep Similarity Learning

IV. Concrete Application: question pairs detection

I. Quick overview of the task

I used for this task the famous Quora question pairs dataset, where the main goal is to predict if two question pairs have the same intent. …


An in-depth review of a Deep Learning technique for the task of similarity classification.

Image for post
Image for post
Photo by Romain Vignes on Unsplash

In this article, I will go through my take on the general concept of Similarity Learning, which processes it involves and how it can be summarized. I will then apply these outlined concepts to the context of sequence similarity detection with question similarities.

Table of Contents

  1. Overview of Similarity Learning
  2. Text Similarity Learning
  3. Source code (PyTorch implementation)

1. Overview of Deep Similarity Learning

When one is doing similarity learning, the same process is always performed:

Image for post
Image for post
Data processing pipeline with Similarity Learning

As explained in this infographic, any process involving Similarity Learning revolves around 3 main concepts:

  1. Transformation of the data in a vector of features
  2. Comparison of the vectors using a distance metric
  3. Classification of the distance as being similar or…


A quick review of different DL architectures for the TiSeLaC challenge

Presented back in 2017 by the Université de Montpellier, the TiSeLaC challenge [1] (TiSeLaC for Time Series Land Cover) consists in predicting Land Cover class of pixels in Time Series of satellite images.

Image for post
Image for post
Photo by Kelly Lacy from Pexels

Table of Contents

  1. What is Time Series Satellite Imagery
  2. What about TiSeLaC dataset ?
  3. TiSeLaC classification task
  4. Conclusion

1. What is Time Series Satellite Imagery?

Time Series Satellite Imagery is the addition of a temporal dimension to Satellite Imagery. …

About

Thomas Di Martino

As a French PhD student, I am passionate to whatever comes close to Artificial Intelligence and Earth Observation.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store