Ali Aminzadeh: How to extract large zip files in an Amazon S3 bucket by using AWS EC2 and Python. I’ve been spending a lot of time with AWS S3 recently building data pipelines and have encountered a surprisingly non-trivial challenge of… (May 22)
Ali Aminzadeh: Set up Apache Airflow on a Multi-Node Cluster With PostgreSQL and RabbitMQ. Introduction: (Aug 20, 2023)
Ali Aminzadeh: Scalable Web Crawling using StormCrawler and Apache Solr. In this post, I am going to write a web crawler that will scrape data from some websites and store their content in Apache Solr. But… (Sep 12, 2019)