achilleusRobust Apache Airflow DeploymentInstalling and managing Apache Airflow on an RHEL Linux environmentMar 30, 2020Mar 30, 2020
achilleusEasy way to manage your Airflow setupManage Airflow services as systemd serviceMar 30, 2020Mar 30, 2020
achilleusA case against publishing your articles only on MediumRecently I noticed a trend in my Google search results that whenever I try to read something about a new tool or technology Medium is…Mar 27, 2020Mar 27, 2020
achilleusSnowflake Cloud Data WarehouseA truly elastic, scalable cloud data warehouseJul 14, 20192Jul 14, 20192
achilleusAdd code to your medium article GitHub GistA quick way to insert code for your medium stories.Jun 24, 201910Jun 24, 201910
achilleusDatabricks Koalas-Python Pandas for SparkEasy integration of Python Pandas to Spark to scale existing Python PandasMay 1, 20191May 1, 20191
achilleusGet started with Pyspark on Mac using an IDE-PyCharmI found running Spark on Python in an IDE was kinda tricky, hence writing this post to get started with development on IDE using pysparkApr 29, 2019Apr 29, 2019
achilleusDelta lake , ACID transactions for Apache SparkAn open source storage layer by Databricks, creators of Spark to create easier and reliable Enterprise Data Lakes both On prem and Cloud.Apr 27, 20194Apr 27, 20194
achilleusSpark UDFs, how to write them and some gotchas?UDFs are one exciting aspect of spark which has evolved tremendously over the spark releases. We will try to cover different aspects of it.Apr 24, 20191Apr 24, 20191