In GoPenAI, by kirouane Ayoub: Introduction to Large-Scale Similarity Search (part one): HNSW, IVF, LSH. The world is awash in data, and finding the right information within vast stores of multimedia content is becoming increasingly… (Sep 27)
In Towards Data Science, by Eric Zhù: Finding Needles in a Haystack — Search Indexes for Jaccard Similarity. From basic concepts to exact and approximate indexes. (Aug 18, 2023)
By Wenjing Zhan: Data Preprocessing — Deduplication with MinHash and LSH. When dealing with text preprocessing, one headache a data scientist has to deal with is the duplicated or similar documents. (Nov 2, 2020)
In Mirko Peters — Data & Analytics Blog, by Mirko Peters: Revolutionizing Search with Locality-Sensitive Hashing in Machine Learning. Embark on an in-depth exploration of Locality-Sensitive Hashing (LSH) in machine learning, a technique that is transforming the speed and… (Apr 5)
By Stefano Lori: Ranking document similarity at scale with Spark NLP. Combining the power of Spark NLP sentence embeddings and LSH approximate nearest neighbors search pipelines to catch contextual and… (Jul 2, 2023)
In Mirko Peters — Data & Analytics Blog, by Mirko Peters: Locality Sensitive Hashing. Improving Efficiency and Accuracy in Machine Learning. (Oct 16, 2023)
In Geek Culture, by sid dhuri: Locality Sensitive Hashing for Fast Search in High dimension data. Real word problems. (Mar 25, 2021)