2025-04-02 10:41:12,742 ERROR trial_runner.py:616 -- Trial CQL_ExpertGuidedEnv_5836b_00000: Error processing event.
Traceback (most recent call last):
File "/home/dwh/anaconda3/envs/egpo_a/lib/python3.7/site-packages/ray/tune/trial_runner.py", line 586, in _process_trial
results = self.trial_executor.fetch_result(trial)
File "/home/dwh/anaconda3/envs/egpo_a/lib/python3.7/site-packages/ray/tune/ray_trial_executor.py", line 609, in fetch_result
result = ray.get(trial_future[0], timeout=DEFAULT_GET_TIMEOUT)
File "/home/dwh/anaconda3/envs/egpo_a/lib/python3.7/site-packages/ray/_private/client_mode_hook.py", line 47, in wrapper
return func(*args, **kwargs)
File "/home/dwh/anaconda3/envs/egpo_a/lib/python3.7/site-packages/ray/worker.py", line 1456, in get
raise value.as_instanceof_cause()
ray.exceptions.RayTaskError(TypeError): ray::CQL.train_buffered() (pid=15131, ip=10.200.84.15)
File "python/ray/_raylet.pyx", line 480, in ray._raylet.execute_task
File "python/ray/_raylet.pyx", line 432, in ray._raylet.execute_task.function_executor
File "/home/dwh/anaconda3/envs/egpo_a/lib/python3.7/site-packages/ray/tune/trainable.py", line 167, in train_buffered
result = self.train()
File "/home/dwh/anaconda3/envs/egpo_a/lib/python3.7/site-packages/ray/rllib/agents/trainer.py", line 529, in train
raise e
File "/home/dwh/anaconda3/envs/egpo_a/lib/python3.7/site-packages/ray/rllib/agents/trainer.py", line 515, in train
result = Trainable.train(self)
File "/home/dwh/anaconda3/envs/egpo_a/lib/python3.7/site-packages/ray/tune/trainable.py", line 226, in train
result = self.step()
File "/home/dwh/anaconda3/envs/egpo_a/lib/python3.7/site-packages/ray/rllib/agents/trainer_template.py", line 157, in step
evaluation_metrics = self._evaluate()
File "/home/dwh/anaconda3/envs/egpo_a/lib/python3.7/site-packages/ray/rllib/agents/trainer.py", line 749, in _evaluate
self._sync_weights_to_workers(worker_set=self.evaluation_workers)
File "/home/dwh/anaconda3/envs/egpo_a/lib/python3.7/site-packages/ray/rllib/agents/trainer.py", line 802, in _sync_weights_to_workers
worker_set.foreach_worker(lambda w: w.restore(ray.get(weights)))
File "/home/dwh/anaconda3/envs/egpo_a/lib/python3.7/site-packages/ray/rllib/evaluation/worker_set.py", line 164, in foreach_worker
local_result = [func(self.local_worker())]
File "/home/dwh/anaconda3/envs/egpo_a/lib/python3.7/site-packages/ray/rllib/agents/trainer.py", line 802, in <lambda>
worker_set.foreach_worker(lambda w: w.restore(ray.get(weights)))
File "/home/dwh/anaconda3/envs/egpo_a/lib/python3.7/site-packages/ray/rllib/evaluation/rollout_worker.py", line 1014, in restore
self.policy_map[pid].set_state(state)
File "/home/dwh/anaconda3/envs/egpo_a/lib/python3.7/site-packages/ray/rllib/policy/torch_policy.py", line 515, in set_state
s, device=self.device)
File "/home/dwh/anaconda3/envs/egpo_a/lib/python3.7/site-packages/ray/rllib/utils/torch_ops.py", line 111, in convert_to_torch_tensor
return tree.map_structure(mapping, x)
File "/home/dwh/anaconda3/envs/egpo_a/lib/python3.7/site-packages/tree/__init__.py", line 435, in map_structure
[func(*args) for args in zip(*map(flatten, structures))])
File "/home/dwh/anaconda3/envs/egpo_a/lib/python3.7/site-packages/tree/__init__.py", line 435, in <listcomp>
[func(*args) for args in zip(*map(flatten, structures))])
File "/home/dwh/anaconda3/envs/egpo_a/lib/python3.7/site-packages/ray/rllib/utils/torch_ops.py", line 105, in mapping
tensor = torch.from_numpy(np.asarray(item))
TypeError: can't convert np.ndarray of type numpy.object_. The only supported types are: float64, float32, float16, complex64, complex128, int64, int32, int16, int8, uint8, and bool.
最新发布