Published ingetdozerWhy you might not even need a Data PlatformEvery company I meet today has a data platform. And if they don’t have one, they want one. Problem is that building and maintaining a data…Mar 18, 2023Mar 18, 2023
Published ingetdozerBreaking down microservices silos: Building real-time cohesive APIsAs a developer, you’re no stranger to the challenges of creating a cohesive API from multiple microservices. But with Dozer, you can say…Mar 16, 2023Mar 16, 2023
Published ingetdozerAre Rust, C++ and WASM the new tools for Data Engineering?I started my career as a C/C++ developer 20 years go working on network protocols and embedded systems. Over time, I moved more and more to…Jun 24, 20221Jun 24, 20221
Published inHackerNoon.comSolving the late payment problem with the blockchainIt is no surprise that late payments between companies are a big problem (especially in Europe), and can seriously put at risk the…Jul 12, 20172Jul 12, 20172
Hadoop as a Generic Distributed Platform: Storing Application MetadataI want to go back to the topic of one of my previous articles where I described how I used Hadoop and Yarn as a generic distributed…Mar 28, 2017Mar 28, 2017
Aggregating a billion items in less than a secondRecently, I have been working a lot of time series data, and one of the common denominator of most time series is volume. It is very easy…Feb 16, 2017Feb 16, 2017
Hadoop as generic distributed application platformI have been working on developing application on top of Hadoop for a while using all the components usually provided by Hadoop…Feb 3, 2017Feb 3, 2017
Zipping multiple RDDs at onceI’m using H2O for some machine learning and, when I have to score multiple models with data sharing some features, I find it useful to…Oct 22, 2016Oct 22, 2016
Reducing memory consumption with Apache Spark and sparse DataFramesWhen analysing and doing machine learning on top of Apache Spark, very often I find myself dealing with huge pivot tables. It is very…Oct 7, 2016Oct 7, 2016