梯度下降综述(译文): https://blog.youkuaiyun.com/google19890102/article/details/69942970 原文: http://ruder.io/optimizing-gradient-descent/