Roman Grebennikov – Medium

Roman Grebennikov

Roman Grebennikov
in
Nixiesearch

Nixiesearch: running Lucene over S3, and why we’re building a new serverless search engine.

A new search engine in 2024? Yes, but stateless — index on S3, serverless — no cluster state, and with all Lucene features out of the box.

Oct 10

Nixiesearch: running Lucene over S3, and why we’re building a new serverless search engine.

Oct 10

Roman Grebennikov
in
Nixiesearch

How to compute LLM embeddings 3X faster with model quantization

Running LLM embedding models is slow on CPU and expensive on GPU. We will make it up to 3X faster with ONNX model quantization, see how…

Nov 13, 2023

How to compute LLM embeddings 3X faster with model quantization

Nov 13, 2023

Roman Grebennikov
in
Metarank

From zero to semantic search embedding model

A series of articles on building an accurate Large Language Model for neural search from scratch. We’ll start with BERT and…

Jun 12, 2023

From zero to semantic search embedding model

Jun 12, 2023

Roman Grebennikov
in
Metarank

Solving a search cold-start problem with aggregated CTR

How do you rank a new item in search results when you have no visitor feedback? It was never displayed, so it got zero clicks. A recent…

Mar 14, 2023

Solving a search cold-start problem with aggregated CTR

Mar 14, 2023

Roman Grebennikov
in
Metarank

Search results diversification with Metarank

A recent Metarank 0.6.3 release introduced a new diversity feature extractor, which measures how a particular item differs from others in…

Mar 7, 2023

Search results diversification with Metarank

Mar 7, 2023

Roman Grebennikov
in
Metarank

Metarank 0.6.2

Only a few days ago, we’ve launched the 0.6 release that adds recommendations, and we’re back with a few fixes that our beloved users have…

Feb 7, 2023

Metarank 0.6.2

Feb 7, 2023

Roman Grebennikov
in
Metarank

Metarank 0.5.9: performance and memory improvements

We’re pleased to announce a new version of Metarank. This time we were focused on performance and overall costs of running the service.

Nov 2, 2022

Metarank 0.5.9: performance and memory improvements

Nov 2, 2022

Roman Grebennikov
in
Metarank

Why you should post your story to HackerNews on the weekend (and it should be in Rust).

We collected a dataset of HN posting and ranking activity to understand the best time to submit your own story. It’s the weekend, but we’re…

Oct 15, 2022

Why you should post your story to HackerNews on the weekend (and it should be in Rust).

Oct 15, 2022

Roman Grebennikov
in
Metarank

Your ML setup is not unique: you don’t need more data scientists

We’ve been long working on diverse set of ML projects, and we see the same decisions taken and same mistakes made again and again. But ML…

Sep 16, 2022

Your ML setup is not unique: you don’t need more data scientists

Sep 16, 2022

Roman Grebennikov
in
Metarank

Metarank RAM usage benchmark

UPDATE 14 Oct 2022: metarank 0.5.6 has optimized memory usage, check the updated numbers here.

Sep 9, 2022

Metarank RAM usage benchmark

Sep 9, 2022

Roman Grebennikov

Roman Grebennikov

Metarank maintainer

Following

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams