Tudor Lapusanapache zeppelin : user impersonationZeppelin is a great tool for making data analysis using Apache Spark. It’s so useful that in the end your entire team will use it.Aug 10, 20201Aug 10, 20201
Tudor LapusanYARN Job History logs issuesHave you ever received “Error getting logs at server_ip:8041” for your finished YARN applications? If yes, this short post can help you !Apr 7, 2020Apr 7, 2020
Tudor LapusanDeploying multiple Spark Job History servers on the same clusterA very useful way to monitor a Spark application is through its own Spark UI interface.Apr 3, 2020Apr 3, 2020
Tudor LapusanVisual interpretation of Decision Tree structureIn Machine Learning it’s important to understand why based on specific inputs (model hyperparameters, features or training set) your…May 20, 2019May 20, 2019
Tudor LapusanMy first experience with Kaggle kernelsWhen I’m playing on Kaggle, usually I choose Python and Sklearn. The usually default tool to write the code in is Jupyter notebook, but…Feb 20, 2018Feb 20, 2018