FileNotFoundError: File not found
2025-07-24 05:02:17.666 INFO [Dummy-2] [document_indexing_task.py:80] - Processed dataset: 695f9f4f-da54-44f9-a5f5-5cbef1634796 latency: 0.6217305730096996
2025-07-24 05:02:17.874 INFO [Dummy-2] [trace.py:128] - Task tasks.document_indexing_task.document_indexing_task[93a86dcd-ce2c-4535-9465-fab22b79b9c7] succeeded in 0.8317920980043709s: None
2025-07-24 05:09:50.999 INFO [MainThread] [strategy.py:161] - Task tasks.retry_document_indexing_task.retry_document_indexing_task[8bd046ef-ba37-4ca4-a5ac-bd90b65763ac] received
2025-07-24 05:09:51.037 INFO [Dummy-3] [retry_document_indexing_task.py:60] - Start retry document: 21663e49-588c-4669-9338-074c46b9dea7
2025-07-24 05:09:51.112 ERROR [Dummy-3] [indexing_runner.py:96] - consume document failed
Traceback (most recent call last):
  File "/app/api/core/indexing_runner.py", line 70, in run
    text_docs = self._extract(index_processor, dataset_document, processing_rule.to_dict())
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/app/api/core/indexing_runner.py", line 356, in _extract
    text_docs = index_processor.extract(extract_setting, process_rule_mode=process_rule["mode"])
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/app/api/core/rag/index_processor/processor/paragraph_index_processor.py", line 22, in extract
    text_docs = ExtractProcessor.extract(
                ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/app/api/core/rag/extractor/extract_processor.py", line 103, in extract
    storage.download(upload_file.key, file_path)
  File "/app/api/extensions/ext_storage.py", line 97, in download
    self.storage_runner.download(filename, target_filepath)
  File "/app/api/extensions/storage/opendal_storage.py", line 67, in download
    raise FileNotFoundError("File not found")
FileNotFoundError: File not found
2025-07-24 05:09:51.134 INFO [Dummy-3] [retry_document_indexing_task.py:102] - Retry dataset: 695f9f4f-da54-44f9-a5f5-5cbef1634796 latency: 0.12441904097795486
2025-07-24 05:09:51.379 INFO [Dummy-3] [trace.py:128] - Task tasks.retry_document_indexing_task.retry_document_indexing_task[8bd046ef-ba37-4ca4-a5ac-bd90b65763ac] succeeded in 0.3706196919083595s: None
2025-07-24 05:10:08.712 INFO [MainThread] [strategy.py:161] - Task tasks.document_indexing_task.document_indexing_task[d5b949ad-ba43-4e84-8160-80aa9a38d106] received
2025-07-24 05:10:08.721 INFO [Dummy-4] [document_indexing_task.py:63] - Start process document: 7c3d3436-7164-41dc-afc6-336b0ef53969
2025-07-24 05:10:08.742 ERROR [Dummy-4] [indexing_runner.py:96] - consume document failed
Traceback (most recent call last):
  File "/app/api/core/indexing_runner.py", line 70, in run
    text_docs = self._extract(index_processor, dataset_document, processing_rule.to_dict())
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/app/api/core/indexing_runner.py", line 356, in _extract
    text_docs = index_processor.extract(extract_setting, process_rule_mode=process_rule["mode"])
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/app/api/core/rag/index_processor/processor/paragraph_index_processor.py", line 22, in extract
    text_docs = ExtractProcessor.extract(
                ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/app/api/core/rag/extractor/extract_processor.py", line 103, in extract
    storage.download(upload_file.key, file_path)
  File "/app/api/extensions/ext_storage.py", line 97, in download
    self.storage_runner.download(filename, target_filepath)
  File "/app/api/extensions/storage/opendal_storage.py", line 67, in download
    raise FileNotFoundError("File not found")
FileNotFoundError: File not found
2025-07-24 05:10:08.748 INFO [Dummy-4] [document_indexing_task.py:80] - Processed dataset: c6c02118-21ea-4f2b-9770-de6d41cb4e0a latency: 0.03286655503325164
2025-07-24 05:10:08.907 INFO [Dummy-4] [trace.py:128] - Task tasks.document_indexing_task.document_indexing_task[d5b949ad-ba43-4e84-8160-80aa9a38d106] succeeded in 0.1924347539898008s: None
2025-07-24 05:17:23.919 INFO [MainThread] [strategy.py:161] - Task tasks.document_indexing_task.document_indexing_task[c95b3b5f-79ec-4591-8a70-7eeb0be6f4f2] received
2025-07-24 05:17:23.957 INFO [Dummy-5] [document_indexing_task.py:63] - Start process document: ddf5b24a-6c5d-47b3-9f23-6fb6eb884f0c
2025-07-24 05:17:23.995 ERROR [Dummy-5] [indexing_runner.py:96] - consume document failed
Traceback (most recent call last):
  File "/app/api/core/indexing_runner.py", line 70, in run
    text_docs = self._extract(index_processor, dataset_document, processing_rule.to_dict())
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/app/api/core/indexing_runner.py", line 356, in _extract
    text_docs = index_processor.extract(extract_setting, process_rule_mode=process_rule["mode"])
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/app/api/core/rag/index_processor/processor/paragraph_index_processor.py", line 22, in extract
    text_docs = ExtractProcessor.extract(
                ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/app/api/core/rag/extractor/extract_processor.py", line 103, in extract
    storage.download(upload_file.key, file_path)
  File "/app/api/extensions/ext_storage.py", line 97, in download
    self.storage_runner.download(filename, target_filepath)
  File "/app/api/extensions/storage/opendal_storage.py", line 67, in download
    raise FileNotFoundError("File not found")
FileNotFoundError: File not found
2025-07-24 05:17:24.008 INFO [Dummy-5] [document_indexing_task.py:80] - Processed dataset: 8af61382-18c0-4651-bb1e-b17e21876af7 latency: 0.08017416414804757
2025-07-24 05:17:24.214 INFO [Dummy-5] [trace.py:128] - Task tasks.document_indexing_task.document_indexing_task[c95b3b5f-79ec-4591-8a70-7eeb0be6f4f2] succeeded in 0.28705954691395164s: None
2025-07-24 05:19:08.234 INFO [MainThread] [strategy.py:161] - Task tasks.retry_document_indexing_task.retry_document_indexing_task[b4ba321d-f402-4be3-91a4-0f2eb8df9b1e] received
2025-07-24 05:19:08.251 INFO [Dummy-6] [retry_document_indexing_task.py:60] - Start retry document: ddf5b24a-6c5d-47b3-9f23-6fb6eb884f0c
2025-07-24 05:19:08.287 ERROR [Dummy-6] [indexing_runner.py:96] - consume document failed
Traceback (most recent call last):
  File "/app/api/core/indexing_runner.py", line 70, in run
    text_docs = self._extract(index_processor, dataset_document, processing_rule.to_dict())
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/app/api/core/indexing_runner.py", line 356, in _extract
    text_docs = index_processor.extract(extract_setting, process_rule_mode=process_rule["mode"])
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/app/api/core/rag/index_processor/processor/paragraph_index_processor.py", line 22, in extract
    text_docs = ExtractProcessor.extract(
                ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/app/api/core/rag/extractor/extract_processor.py", line 103, in extract
    storage.download(upload_file.key, file_path)
  File "/app/api/extensions/ext_storage.py", line 97, in download
    self.storage_runner.download(filename, target_filepath)
  File "/app/api/extensions/storage/opendal_storage.py", line 67, in download
    raise FileNotFoundError("File not found")
FileNotFoundError: File not found
2025-07-24 05:19:08.298 INFO [Dummy-6] [retry_document_indexing_task.py:102] - Retry dataset: 8af61382-18c0-4651-bb1e-b17e21876af7 latency: 0.06023550289683044
2025-07-24 05:19:08.525 INFO [Dummy-6] [trace.py:128] - Task tasks.retry_document_indexing_task.retry_document_indexing_task[b4ba321d-f402-4be3-91a4-0f2eb8df9b1e] succeeded in 0.28708126000128686s: None
[root@clickhouse-master web]#
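
In the log above, every task is reported as "succeeded" even though the indexing run logged "consume document failed", so the failures are easy to miss. A minimal sketch for pulling the affected document IDs out of such a log follows. The helper name `failed_documents` is hypothetical, and the code assumes that each `Dummy-N` worker thread handles one document at a time and that the line formats match the ones shown here.

```python
import re

# "Start process document: <uuid>" or "Start retry document: <uuid>"
START_RE = re.compile(r"Start (process|retry) document: ([0-9a-f-]{36})")
# Worker thread tag such as "[Dummy-3]"
THREAD_RE = re.compile(r"\[(Dummy-\d+)\]")


def failed_documents(log_text):
    """Return (kind, document_id) for each run that logged 'consume document failed'.

    Assumption: a failure line belongs to the most recent "Start ... document"
    line emitted by the same worker thread.
    """
    failures = []
    current = {}  # thread name -> (kind, document_id) of the document in progress
    for line in log_text.splitlines():
        thread = THREAD_RE.search(line)
        if not thread:
            continue  # traceback lines and prompts carry no thread tag
        name = thread.group(1)
        start = START_RE.search(line)
        if start:
            current[name] = (start.group(1), start.group(2))
        elif "consume document failed" in line and name in current:
            failures.append(current[name])
    return failures


# Two events copied from the log above, without the traceback bodies.
sample = """\
2025-07-24 05:09:51.037 INFO [Dummy-3] [retry_document_indexing_task.py:60] - Start retry document: 21663e49-588c-4669-9338-074c46b9dea7
2025-07-24 05:09:51.112 ERROR [Dummy-3] [indexing_runner.py:96] - consume document failed
2025-07-24 05:10:08.721 INFO [Dummy-4] [document_indexing_task.py:63] - Start process document: 7c3d3436-7164-41dc-afc6-336b0ef53969
2025-07-24 05:10:08.742 ERROR [Dummy-4] [indexing_runner.py:96] - consume document failed
"""

for kind, doc_id in failed_documents(sample):
    print(kind, doc_id)
```

The resulting IDs can then be checked against the storage backend, since each traceback ends in `storage.download(upload_file.key, file_path)` raising `FileNotFoundError`.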