1.在本机报错:CUDA out of memory
于是用vscode配置连接组里的服务器:下载vscode;下载ssh插件;传输文件(直接从本机拖拽到左侧文件目录那里);
参考链接:vscode连接远程服务器(傻瓜式教学)_chls的博客-优快云博客https://blog.youkuaiyun.com/zhaxun/article/details/120568402
2.报错:
Traceback (most recent call last): File "/data/zhangyiming/ddpm/DDPM-main/scripts/train_cifar.py", line 8, in <module> from ddpm import script_utils ModuleNotFoundError: No module named 'ddpm'将项目根目录添加到 Python 的模块搜索路径(PYTHONPATH
)中,这样 Python 解释器就能够找到 'ddpm' 模块。可以在终端中执行以下命令:
export PYTHONPATH=/data/zym/ddpm/DDPM-main:$PYTHONPATH
python scripts/train_cifar.py
3.报错:
Traceback (most recent call last):
File "scripts/train_cifar.py", line 164,
in <module> main()
File "scripts/train_cifar.py", line 29,
in main run = wandb.init(
AttributeError: module 'wandb' has no attribute 'init'
目前的代码是在base环境中运行的,没有一个匹配的封闭的conda虚拟环境,所以首先创建一个conda虚拟环境:
(1)创建一个名称为zymddpm的conda虚拟环境
conda create --name zymddpm python=3.8
(2)激活该环境
conda activate zymddpm
(3)进入到该环境后,运行代码:
python your_script.py
4.输入了conda activate zymddpm之后 报错
CommandNotFoundError: Your shell has not been properly configured to use 'conda activate'. To initialize your shell, run $ conda init <SHELL_NAME> Currently supported shells are: - bash - fish - tcsh - xonsh - zsh - powershell See 'conda init --help' for more information and options. IMPORTANT: You may need to close and restart your shell after running 'conda init'.
对于bash shell,运行:
conda init bash
对于zsh shell,运行:
conda init zsh
对于fish shell,运行:
conda init fish
5.虚拟环境init完毕之后重启终端,开始配置虚拟环境里面的各种package:
conda install pytorch==1.12.0 torchvision==0.13.0 torchaudio==0.12.0 cudatoolkit=10.2 -c py
一般情况下直接pip install pytorch也可以,如果你需要特定版本的PyTorch,可以根据官方文档提供的安装命令来安装:Start Locally | PyTorch,如果是公用的服务器,先查看自己电脑cuda和torch的版本,以及python的版本,选择合适的pytorch版本去安装;
还未配置完,后面再继续...