Pinned · Brad Caffey · How to lower job costs on EC2 instances and improve performance when using Spark’s persist method · In this blog, we’ll examine Spark’s persist method and explain how to use it in a performant manner and lower job costs. · 6 min read · Mar 27, 2024
Pinned · Brad Caffey · How to improve performance and lower Spark job costs on EC2 instances when using Spark’s coalesce… · 7 min read · Mar 27, 2024
Brad Caffey in Expedia Group Technology · Part 6: Summary of Apache Spark Cost Tuning Strategy · The step by step overview of the cost tuning strategy · 3 min read · Aug 20, 2020
Brad Caffey in Expedia Group Technology · Part 5: How to Resolve Common Errors When Switching to Cost Efficient Apache Spark Executor… · How to resolve memory issues that happen when switching to efficient executor configs · 5 min read · Aug 18, 2020
Brad Caffey in Expedia Group Technology · Part 4: How to Migrate Existing Apache Spark Jobs to Cost Efficient Executor Configurations · Steps to follow when converting existing jobs to cost efficient config · 5 min read · Aug 13, 2020
Brad Caffey in Expedia Group Technology · Part 3: Cost Efficient Executor Configuration for Apache Spark · Find the most efficient executor configuration for your node · 9 min read · Aug 11, 2020
Brad Caffey in Expedia Group Technology · Part 2: Real World Apache Spark Cost Tuning Examples · I outline the procedure for working through cost tuning · 5 min read · Aug 6, 2020
Brad Caffey in Expedia Group Technology · Part 1: Cloud Spending Efficiency Guide for Apache Spark on EC2 Instances · How I saved 60% of costs in an Apache Spark job, with no increase in job time and no decrease in data processed · 6 min read · Aug 4, 2020
Brad Caffey in HomeAway Tech Blog · Are You Sure You Have Good Data? · Best practices for detecting bad data before it spreads · 8 min read · Apr 3, 2019