Awesome post! Thank you for sharing your knowledge.
We’ve encountered a similar problem in applying DL on a synthetic dataset of financial derivatives trades for regulatory compliance (EU Directive 1286 coming to force in 2018). The dataset is a real crux. No amount of Kaggle experience prepares you for the real world “data problems” :-)