你好,我正在尝试使用多个gpu来训练arcgis。复制行为的步骤:python - m torch。分布。启动- - nproc_per_node = 8 test。py测试。Py代码如下:learn import prepare_data, UnetClassifier, DeepLab import os import torch print ('Available devices ',火炬。cuda。device_count = r())路径“/ home / gpu_test02 /数据/ label382_chip512_rota120”data = prepare_data(路径、chip_size = 512 batch_size = 6) m = DeepLab(数据)m .解冻()m .符合0.001 (1)m .拯救(model_test)错误:文件“/ home / gpu_test02 / miniconda3 / env / arcgis / lib / python3.6 /网站/ arcgis /学习/模型/ _arcgis_model.py”,1000行,在_save文件“/ home / gpu_test02 / miniconda3 / env / arcgis / lib / python3.6 /网站/ arcgis /学习/模型/ _arcgis_model.py”,1000年线_save操作系统。makedirs(自我。 learn. path / self. learn. model_dir) File "/home/gpu_test02/miniconda3/envs/arcgis/lib/python3.6/os.py", line 220, in makedirs os. makedirs( self. learn. path / self. learn. model_dir) File "/home/gpu_test02/miniconda3/envs/arcgis/lib/python3.6/os.py", line 220, in makedirs os. makedirs( self. learn. path / self. learn. model_dir) os. makedirs( self. learn. path / self. learn. model_dir) File "/home/gpu_test02/miniconda3/envs/arcgis/lib/python3.6/os.py", line 220, in makedirs File "/home/gpu_test02/miniconda3/envs/arcgis/lib/python3.6/os.py", line 220, in makedirs os. makedirs( self. learn. path / self. learn. model_dir) File "/home/gpu_test02/miniconda3/envs/arcgis/lib/python3.6/os.py", line 220, in makedirs os. makedirs( self. learn. path / self. learn. model_dir) File "/home/gpu_test02/miniconda3/envs/arcgis/lib/python3.6/os.py", line 220, in makedirs mkdir( name, mode) mkdir( name, mode) mkdir( name, mode) mkdir( name, mode) FileExistsError: [ Errno 17] File exists: '/home/gpu_test02/data/label382_chip512_rota120/models/checkpoint_2021-06-19_15-30-17' FileExistsError: [ Errno 17] File exists: '/home/gpu_test02/data/label382_chip512_rota120/models/checkpoint_2021-06-19_15-30-17' mkdir( name, mode) FileExistsError: [ Errno 17] File exists: '/home/gpu_test02/data/label382_chip512_rota120/models/checkpoint_2021-06-19_15-30-17' FileExistsError: [ Errno 17] File exists: '/home/gpu_test02/data/label382_chip512_rota120/models/checkpoint_2021-06-19_15-30-17' FileExistsError: [ Errno 17] File exists: '/home/gpu_test02/data/label382_chip512_rota120/models/checkpoint_2021-06-19_15-30-17' mkdir( name, mode) FileExistsError: [ Errno 17] File exists: '/home/gpu_test02/data/label382_chip512_rota120/models/checkpoint_2021-06-19_15-30-17' Screenshots Expected behavior A clear and concise description of what you expected to happen. Platform (please complete the following information): OS: centos7 Browser [e.g. chrome, safari] Python API Version 1.8.4 Additional context Add any other context about the problem here, attachments etc.
...查看更多