Become a member
Sign in
Hubert Bryłkowski
Hubert Bryłkowski

Hubert Bryłkowski

2 Following
71 Followers
  • Profile

  • Claps

  • Highlights

Featured

Hubert Bryłkowski
Hubert Bryłkowski in Brainly Engineering
Oct 6, 2017 · 9 min read

Locality sensitive hashing — LSH explained

The problem of finding duplicate documents in a list may look like a simple task — use a hash table, and the job is done quickly and the algorithm is fast. However, if we need to find not only exact duplicates, but also documents with differences such as…

1.3K

17 responses

Highlighted by Hubert Bryłkowski

See more

From Locality sensitive hashing — LSH explained by Hubert Bryłkowski

We can conclude — the more common words, the bigger the Jaccard index, the more probable it is that two questions are a duplicate. So where we can set a threshold above which pairs would be marked as a duplicate? For now, let’s assume 0.5 as a threshold, but in a real life, we need to get this v…

From Designing AI: Solving Snake with Evolution by Peter Binggeser

Follow me on Twitter @peterbinggeser for updates. A web-playground for you to customize population size, reward criteria, and even the r…

Claps from Hubert Bryłkowski

See more

How to leak memory with Disposables in RxJava2

Marcin Robaczyński

Scaling engineering team without slowdown

gmiejski

Understanding How Kubernetes Readiness and Liveness Probes do Correlate — or Better How Not

Stephan Hartmann