PinnedBrad CaffeyHow to lower job costs on EC2 instances and improve performance when using Spark’s persist methodIn this blog, we’ll examine Spark’s persist method and explain how to use it in a performant manner and lower job costs.Mar 27Mar 27
PinnedBrad CaffeyHow to improve performance and lower Spark job costs on EC2 instances when using Spark’s coalesce…Mar 27Mar 27
Brad CaffeyinExpedia Group TechnologyPart 6: Summary of Apache Spark Cost Tuning StrategyThe step by step overview of the cost tuning strategyAug 20, 2020Aug 20, 2020
Brad CaffeyinExpedia Group TechnologyPart 5: How to Resolve Common Errors When Switching to Cost Efficient Apache Spark Executor…How to resolve memory issues that happen when switching to efficient executor configsAug 18, 2020Aug 18, 2020
Brad CaffeyinExpedia Group TechnologyPart 4: How to Migrate Existing Apache Spark Jobs to Cost Efficient Executor ConfigurationsSteps to follow when converting existing jobs to cost efficient configAug 13, 20203Aug 13, 20203
Brad CaffeyinExpedia Group TechnologyPart 3: Cost Efficient Executor Configuration for Apache SparkFind the most efficient executor configuration for your nodeAug 11, 20205Aug 11, 20205
Brad CaffeyinExpedia Group TechnologyPart 2: Real World Apache Spark Cost Tuning ExamplesI outline the procedure for working through cost tuningAug 6, 20201Aug 6, 20201
Brad CaffeyinExpedia Group TechnologyPart 1: Cloud Spending Efficiency Guide for Apache Spark on EC2 InstancesHow I saved 60% of costs in an Apache Spark job, with no increase in job time and no decrease in data processedAug 4, 20202Aug 4, 20202
Brad CaffeyinHomeAway Tech BlogAre You Sure You Have Good Data?Best practices for detecting bad data before it spreadsApr 3, 2019Apr 3, 2019