Thomas Cardenas, "My First Kubernetes project: From Containers to Deploying in Minutes" (Sep 27). I’ve been wanting to deploy a few of my personal projects online just so I don’t have to run a ton of containers locally. I had a few other…
Thomas Cardenas, "Transforming Serialized Strings to JSON DataFrames with Apache Spark" (Sep 24). Transforming a JSON string column into its own JSON DataFrame
Thomas Cardenas, "Analyzing Orders Per Second: A Quick Data Exploration with Athena and Plotly" (Sep 17). Introduction
Thomas Cardenas in AWS Tip, "Exploring Streaming Data Pipelines with Apache Flink and AWS" (Aug 2). Experience deploying my first streaming application
Thomas Cardenas, "Understanding Monthly Recurring Revenue" (Jun 26). It almost seems like everything nowadays is a subscription. This may be due to how well such services provide predictable revenue. Monthly…
Thomas Cardenas in Data Engineer Things, "Best Practices for Writing Maintainable and Testable Spark Code in Scala" (Apr 17). Enhancing Scalability and Reliability Through Structured Spark Development Practices
Thomas Cardenas in Data Engineer Things, "Managing Late-Arriving Data for Accurate Reporting" (Mar 26). Data Engineering Excellence: Best Practices for Managing Late-Arriving Data in Metrics Pipelines
Thomas Cardenas in Data Engineer Things, "The Art of Efficient Data Lake Organization" (Oct 24, 2023). Guidelines for Streamlined Data Lake Organization
Thomas Cardenas in Ancestry Product & Technology, "Harnessing Intervals in Apache Airflow for Efficient and Reliable Data Processing" (Oct 17, 2023). Introduction
Thomas Cardenas, "Calculating Daily/Monthly Active Users with Spark & Iceberg" (Oct 17, 2023). Whenever I hear about metrics, I really want to dive into understanding them and come up with a sample pipeline to demonstrate them. One of…