[rank1]:[E830 06:13:19.780493643 ProcessGroupNCCL.cpp:632] [Rank 1] Watchdog caught collective operation timeout: WorkNCCL(SeqNum=146806, OpType=ALLREDUCE, NumelIn=3675597, NumelOut=3675597, Timeout(ms)=600000) ran for 600472 milliseconds before timing out.
████████████████ | 519/804 [18:24<09:52, 2.08s/it, loss=0.0163]
[rank1]:[E830 06:13:57.611628340 ProcessGroupNCCL.cpp:2271] [PG ID 0 PG GUID 0(default_pg) Rank 1] failure detected by watchdog at work sequence id: 146806 PG status: last enqueued work: 146806, last completed work: 146804
[rank1]:[E830 06:13:57.634486146 ProcessGroupNCCL.cpp:670] Stack trace of the failed collective not found, potentially because FlightRecorder is disabled. You can enable it by setting TORCH_NCCL_TRACE_BUFFER_SIZE to a non-zero value.
[rank1]:[E830 06:14:02.704255369 ProcessGroupNCCL.cpp:2106] [PG ID 0 PG GUID 0(default_pg) Rank 1] First PG on this rank to signal dumping.
[rank0]:[E830 06:14:02.725916508 ProcessGroupNCCL.cpp:1685] [PG ID 0 PG GUID 0(default_pg) Rank 0] Observed flight recorder dump signal from another rank via TCPStore.
[rank1]:[E830 06:14:02.768576450 ProcessGroupNCCL.cpp:1746] [PG ID 0 PG GUID 0(default_pg) Rank 1] Received a dump signal due to a collective timeout from rank 1 and we will try our best to dump the debug info. Last enqueued NCCL work: 146806, last completed NCCL work: 146804.This is most likely caused by incorrect usages of collectives, e.g., wrong sizes used across ranks, the order of collectives is not same for all ranks or the scheduled collective, for some reason, didn't run. Additionally, this can be caused by GIL deadlock or other reasons such as network errors or bugs in the communications library (e.g. NCCL), etc.
[rank0]:[E830 06:14:02.768580472 ProcessGroupNCCL.cpp:1746] [PG ID 0 PG GUID 0(default_pg) Rank 0] Received a dump signal due to a collective timeout from rank 1 and we will try our best to dump the debug info. Last enqueued NCCL work: 146804, last completed NCCL work: 146804.This is most likely caused by incorrect usages of collectives, e.g., wrong sizes used across ranks, the order of collectives is not same for all ranks or the scheduled collective, for some reason, didn't run. Additionally, this can be caused by GIL deadlock or other reasons such as network errors or bugs in the communications library (e.g. NCCL), etc.
[rank0]:[E830 06:14:08.324151488 ProcessGroupNCCL.cpp:1536] [PG ID 0 PG GUID 0(default_pg) Rank 0] ProcessGroupNCCL preparing to dump debug info. Include stack trace: 1
[rank1]:[E830 06:14:08.324155500 ProcessGroupNCCL.cpp:1536] [PG ID 0 PG GUID 0(default_pg) Rank 1] ProcessGroupNCCL preparing to dump debug info. Include stack trace: 1
[rank1]:[E830 06:16:17.610834920 ProcessGroupNCCL.cpp:684] [Rank 1] Some NCCL operations have failed or timed out. Due to the asynchronous nature of CUDA kernels, subsequent GPU operations might run on corrupted/incomplete data.
[rank1]:[E830 06:16:17.667404073 ProcessGroupNCCL.cpp:698] [Rank 1] To avoid data inconsistency, we are taking the entire process down.
[rank1]:[E830 06:16:45.611443648 ProcessGroupNCCL.cpp:1899] [PG ID 0 PG GUID 0(default_pg) Rank 1] Process group watchdog thread terminated with exception: [Rank 1] Watchdog caught collective operation timeout: WorkNCCL(SeqNum=146806, OpType=ALLREDUCE, NumelIn=3675597, NumelOut=3675597, Timeout(ms)=600000) ran for 600472 milliseconds before timing out.
Exception raised from checkTimeout at /pytorch/torch/csrc/distributed/c10d/ProcessGroupNCCL.cpp:635 (most recent call first):
frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) + 0x98 (0x7a36a8d785e8 in /root/anaconda3/envs/torch/lib/python3.12/site-packages/torch/lib/libc10.so)
frame #1: c10d::ProcessGroupNCCL::WorkNCCL::checkTimeout(std::optional<std::chrono::duration<long, std::ratio<1l, 1000l> > >) + 0x23d (0x7a36533e2a6d in /root/anaconda3/envs/torch/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so)
frame #2: c10d::ProcessGroupNCCL::watchdogHandler() + 0xc80 (0x7a36533e47f0 in /root/anaconda3/envs/torch/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so)
frame #3: c10d::ProcessGroupNCCL::ncclCommWatchdog() + 0x14d (0x7a36533e5efd in /root/anaconda3/envs/torch/lib/python3.12/site-packages/torch/lib/libtorch_cuda.so)
frame #4: <unknown function> + 0xdbbf4 (0x7a3642edbbf4 in /root/anaconda3/envs/torch/bin/../lib/libstdc++.so.6)
frame #5: <unknown function> + 0x94ac3 (0x7a36aa694ac3 in /lib/x86_64-linux-gnu/libc.so.6)
frame #6: <unknown function> + 0x126850 (0x7a36aa726850 in /lib/x86_64-linux-gnu/libc.so.6)
[rank0]:[F830 06:22:40.572585407 ProcessGroupNCCL.cpp:1557] [PG ID 0 PG GUID 0(default_pg) Rank 0] [PG ID 0 PG GUID 0(default_pg) Rank 0] Terminating the process after attempting to dump debug info, due to collective timeout or exception.
[rank1]: Traceback (most recent call last):
[rank1]: File "/root/anaconda3/envs/torch/lib/python3.12/multiprocessing/resource_sharer.py", line 139, in _serve
[rank1]: msg = conn.recv()
[rank1]: ^^^^^^^^^^^
[rank1]: File "/root/anaconda3/envs/torch/lib/python3.12/multiprocessing/connection.py", line 249, in recv
[rank1]: buf = self._recv_bytes()
[rank1]: ^^^^^^^^^^^^^^^^^^
[rank1]: File "/root/anaconda3/envs/torch/lib/python3.12/multiprocessing/connection.py", line 413, in _recv_bytes
[rank1]: buf = self._recv(4)
[rank1]: ^^^^^^^^^^^^^
[rank1]: File "/root/anaconda3/envs/torch/lib/python3.12/multiprocessing/connection.py", line 382, in _recv
[rank1]: raise EOFError
[rank1]: EOFError
W0830 06:40:48.263000 3372 site-packages/torch/distributed/elastic/multiprocessing/api.py:900] Sending process 3385 closing signal SIGTERM
E0830 06:40:48.433000 3372 site-packages/torch/distributed/elastic/multiprocessing/api.py:874] failed (exitcode: -6) local_rank: 1 (pid: 3386) of binary: /root/anaconda3/envs/torch/bin/python3.12
Traceback (most recent call last):
File "/root/anaconda3/envs/torch/bin/torchrun", line 8, in <module>
sys.exit(main())
^^^^^^
File "/root/anaconda3/envs/torch/lib/python3.12/site-packages/torch/distributed/elastic/multiprocessing/errors/__init__.py", line 355, in wrapper
return f(*args, **kwargs)
^^^^^^^^^^^^^^^^^^
File "/root/anaconda3/envs/torch/lib/python3.12/site-packages/torch/distributed/run.py", line 892, in main
run(args)
File "/root/anaconda3/envs/torch/lib/python3.12/site-packages/torch/distributed/run.py", line 883, in run
elastic_launch(
File "/root/anaconda3/envs/torch/lib/python3.12/site-packages/torch/distributed/launcher/api.py", line 139, in __call__
return launch_agent(self._config, self._entrypoint, list(args))
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/root/anaconda3/envs/torch/lib/python3.12/site-packages/torch/distributed/launcher/api.py", line 270, in launch_agent
raise ChildFailedError(
torch.distributed.elastic.multiprocessing.errors.ChildFailedError:
=====================================================
train.py FAILED
-----------------------------------------------------
Failures:
<NO_OTHER_FAILURES>
-----------------------------------------------------
Root Cause (first observed failure):
[0]:
time : 2025-08-30_06:40:48
host : ub-MS-7A93
rank : 1 (local_rank: 1)
exitcode : -6 (pid: 3386)
error_file: <N/A>
traceback : Signal 6 (SIGABRT) received by PID 3386
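The watchdog output above points at two knobs: the flight recorder hint on the ProcessGroupNCCL.cpp:670 line (TORCH_NCCL_TRACE_BUFFER_SIZE) and the 600000 ms collective timeout that was exceeded. Below is a minimal sketch of using both, assuming a single-node run of train.py with two GPUs; the --nproc_per_node value, the buffer size, and the 30-minute timeout are illustrative choices, not values taken from this log.

```python
# Enable the NCCL flight recorder when launching, so a later timeout can dump
# per-collective traces instead of the "FlightRecorder is disabled" message:
#   TORCH_NCCL_TRACE_BUFFER_SIZE=2000 torchrun --nproc_per_node=2 train.py
#
# Inside train.py, the per-collective timeout (the 600000 ms / 10 minutes seen
# above is the NCCL default) can be raised when the process group is created:
from datetime import timedelta

import torch.distributed as dist

dist.init_process_group(
    backend="nccl",
    timeout=timedelta(minutes=30),  # illustrative value, not a recommendation
)
```

A longer timeout only helps if the stalled rank eventually reaches the pending ALLREDUCE (e.g., one rank is briefly slowed by data loading or checkpointing). If the ranks have actually diverged, as the log's own diagnosis suggests (mismatched sizes, different collective order, or a collective that never ran), the flight recorder dump is the more useful of the two.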