Mike Palei – Medium

Mike Palei

Mike Palei

Pyspark’s applyInPandas to bridge the gap between offline training and online inferencing

PySpark for offline feature extraction vs pure Python with Numba for feature extraction online

Dec 25, 2023

Pyspark’s applyInPandas to bridge the gap between offline training and online inferencing

Dec 25, 2023

Mike Palei

Streamlining CI/CD and ETL pipelines with Jenkins — part 2

This is the second post in the series. The first one can be found here.

Dec 16, 2023

Streamlining CI/CD and ETL pipelines with Jenkins — part 2

Dec 16, 2023

Mike Palei

Streamlining CI/CD and ETL pipelines with Jenkins— part 1

The dynamic realm of DevOps/MLOps hosts three distinct factions: those who bear a grudge against Jenkins, those who can’t abide Airflow…

Nov 26, 2023

Streamlining CI/CD and ETL pipelines with Jenkins— part 1

Nov 26, 2023

Mike Palei

Running Scala from Pyspark

Suppose you have a large legacy codebase written in Scala with a lot of goodies in it but your team of data scientists is, understandably…

Dec 13, 2021

Dec 13, 2021

Mike Palei

Serving a Tensorflow 2 model on AWS Lambda

So you trained your model, a real miracle in the world of AI, and now comes the time to serve it. But how exactly? What is the best way…

Aug 9, 2020

Serving a Tensorflow 2 model on AWS Lambda

Aug 9, 2020

Mike Palei

Faceted navigation for e-commerce with Elasticsearch

My colleague and I (the credit has to go to Dimitry Apter who’s done most of the actual tangible work) have recently been commissioned…

Jun 26, 2020

Jun 26, 2020

Mike Palei
in
Neo4j Developer Blog

Cosine similarity in Neo4J

This post will showcase the use of cosine similarity algorithm in Neo4J and also provide examples in addition to the available…

Mar 26, 2019

Cosine similarity in Neo4J

Mar 26, 2019

Mike Palei

Mike Palei

Following

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams