Mike PaleiPyspark’s applyInPandas to bridge the gap between offline training and online inferencingPySpark for offline feature extraction vs pure Python with Numba for feature extraction onlineDec 25, 2023Dec 25, 2023
Mike PaleiStreamlining CI/CD and ETL pipelines with Jenkins — part 2This is the second post in the series. The first one can be found here.Dec 16, 2023Dec 16, 2023
Mike PaleiStreamlining CI/CD and ETL pipelines with Jenkins— part 1The dynamic realm of DevOps/MLOps hosts three distinct factions: those who bear a grudge against Jenkins, those who can’t abide Airflow…Nov 26, 2023Nov 26, 2023
Mike PaleiRunning Scala from PysparkSuppose you have a large legacy codebase written in Scala with a lot of goodies in it but your team of data scientists is, understandably…Dec 13, 20213Dec 13, 20213
Mike PaleiServing a Tensorflow 2 model on AWS LambdaSo you trained your model, a real miracle in the world of AI, and now comes the time to serve it. But how exactly? What is the best way…Aug 9, 20201Aug 9, 20201
Mike PaleiFaceted navigation for e-commerce with ElasticsearchMy colleague and I (the credit has to go to Dimitry Apter who’s done most of the actual tangible work) have recently been commissioned…Jun 26, 2020Jun 26, 2020
Mike PaleiinNeo4j Developer BlogCosine similarity in Neo4JThis post will showcase the use of cosine similarity algorithm in Neo4J and also provide examples in addition to the available…Mar 26, 20191Mar 26, 20191