1. tensorflow 新老版本的问题,常见的问题下面的这篇博文已经列出
http://blog.youkuaiyun.com/zeuseign/article/details/72771598
2. Tensorflow cifar10_multi_gpu问题:Variable conv1/weights/ExponentialMovingAverage/ does not exist...
这个问题的解决方案是在 for i in xrange(FLAGS.num_gpus): 的外面加上with tf.variable_scope(tf.get_variable_scope()):
with tf.variable_scope(tf.get_variable_scope()):
for i in xrange(FLAGS.num_gpus):
with tf.device('/gpu:%d' % i):
原因:
When you do tf.get_variable_scope().reuse_variables() you set the current scope to reuse variables. If you call the optimizer in such scope, it's trying to reuse slot variables, which it cannot find, so it throws an error. If you put a scope around, the tf.get_variable_scope().reuse_variables() only affects that scope, so when you exit it, you're back in the non-reusing mode, the one you want.
3. 可以用github上及时更新维护的tensorflow例子, 包括mnist, cifar-10, AlexNet, ImageNet
https://github.com/tensorflow/models/tree/master/tutorials/image
TensorFlow多GPU问题解决
本文解决了TensorFlow中使用多GPU训练模型时遇到的变量不存在问题,并提供了详细的代码修改方案。此外,还推荐了一个包含多个经典案例的GitHub仓库,帮助读者更好地理解和应用多GPU训练。
2万+

被折叠的 条评论
为什么被折叠?



