Kass 09 | Airflow scheduler and Broken DAGs | A common headache for users of Airflow is the fact that the scheduler seems to do its periodic runs on old versions of DAGs. As… | Feb 14, 2019
Kass 09 | MLflow: A very quick start | MLflow is introduced as an ‘open source platform for the machine learning lifecycle’. It promises to give data scientists the power to… | Dec 14, 2018
Kass 09 | Spark streaming & Kafka in Python: A test on local machine | Word count through Kafka | Jan 19, 2017
Kass 09 | Run a Python Spark application on Amazon EMR | Create a Spark cluster on Amazon EMR; if you don’t know how to do it, check here | Oct 25, 2016
Kass 09 | Apache Spark’s fundamental transformations in Python | Having set up our pyspark environment, we run some of the fundamental transformations supported by Spark. | Oct 18, 2016
Kass 09 | RabbitMQ Clustering | It is often useful to have multiple applications on different machines connected to a single AMQP broker (in our case RabbitMQ). With… | Aug 14, 2014
Kass 09 | Run Flask on Vagrant | In order to make a Flask application that runs on your VM visible to your host machine, you need to change the address that Flask binds… | Aug 13, 2014
Kass 09 | Connections to RabbitMQ are blocked | This kept me busy the whole morning. My RabbitMQ server was up but acted like it was dead. No messages going around, zero, nada! Trying to… | Aug 7, 2014
Kass 09 | RQ: Simplest job queueing in Python | Searching for an alternative to Celery, I stumbled across Redis Queue (RQ): a simple Pythonic library for queueing and processing jobs… | Jul 24, 2014