Ehsan Khodadadi | Unmasking Hidden Failures: Overcoming Survivorship Bias for a Reliable System Design | Survivorship bias is an interesting cognitive phenomenon that often goes unnoticed but has significant repercussions, particularly in… | Feb 9
Ehsan Khodadadi | SRE and people culture, it is always people | With over 17 years of experience in IT, the last 7 of which have been dedicated to a focused career in Site Reliability Engineering (SRE)… | Dec 13, 2023
Ehsan Khodadadi in Techspiration | Deploying Prometheus Multi-Cluster monitoring using Prometheus Agent Mode | In the previous post I wrote about Prometheus Multi-cluster monitoring and how using Prometheus in agent mode helps create a single pane of… | Aug 8, 2022
Ehsan Khodadadi | Patterns and anti-patterns for a reliable Kubernetes infra deployment | Kubernetes has become the most popular container orchestration solution and many companies have moved forward to microservice architecture… | Jun 2, 2022
Ehsan Khodadadi | Prometheus Multi-Cluster monitoring using Prometheus Agent Mode | Prometheus is the most favoured monitoring solution for monitoring Kubernetes metrics nowadays. Prometheus allows SRE/DevOps teams to find… | Mar 25, 2022
Ehsan Khodadadi in Techspiration | Why Infrastructure as code? | What does Infrastructure as Code mean? | Jun 8, 2021