Apache Spark: RDD Partitioning Preservation
Corentin Kerisit

Interesting reading, especially for newcomers, and even more important when you are creating your own RDD. I don’t think it is a big deal when working with the default RDDs, but when writing your own it’s a completely different story.

