全文参照:http://lamda.nju.edu.cn/weixs/project/CNNTricks/CNNTricks.html
1. Data Augmentation
2. Pre-processing
3. Initialization
4. During training
5. Activation functions
6. Regularizations
7. Insights from figures
8. Ensemble