Ajay Edin, Towards Data Science
Merging too many small files into fewer large files using Apache Spark in Datalake
Increase performance of your read queries on huge datasets
Jul 15, 2021