Scrapy error: [Failure instance: Traceback (failure with no frames): <class 'scrapy.pipelines.files.

Error: [Failure instance: Traceback (failure with no frames): <class 'scrapy.pipelines.files.FileException'>:

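If this error shows up, and it is the only error in the log, the most likely cause is that the domain restriction in your spider (allowed_domains) has not been turned off, so the file download fails.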

 # allowed_domains = ["www.xxx.com"]
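To see why the domain restriction can matter, here is a minimal sketch of a domain check in plain Python. It is a simplified stand-in, not Scrapy's actual code: the function name is_offsite and the file URL are hypothetical, and it assumes the files are served from a different host than the pages.

```python
from urllib.parse import urlparse


def is_offsite(url, allowed_domains):
    """Return True if the URL's host falls outside every allowed domain.

    Simplified illustration only; this is not Scrapy's own implementation.
    """
    if not allowed_domains:
        # No allowed_domains set: nothing is treated as off-site.
        return False
    host = (urlparse(url).hostname or "").lower()
    # A host is on-site if it equals an allowed domain or is a subdomain of it.
    return not any(host == d or host.endswith("." + d) for d in allowed_domains)


allowed = ["www.xxx.com"]
page_url = "https://www.xxx.com/list/1.html"
file_url = "https://img.cdn-example.com/files/1.jpg"  # hypothetical file host

print(is_offsite(page_url, allowed))  # False: the page is on the allowed domain
print(is_offsite(file_url, allowed))  # True: the file host is not allowed
print(is_offsite(file_url, []))       # False: restriction removed
```

If the file URLs in your project live on a separate host like this, a strict allowed_domains list would reject them, which matches the symptom described here.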

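Commenting out the allowed_domains line is enough. You must also disable robots.txt compliance (the "gentlemen's agreement") by setting ROBOTSTXT_OBEY = False in settings.py.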
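As a rough sketch, the relevant part of settings.py might look like the fragment below. ROBOTSTXT_OBEY, ITEM_PIPELINES and FILES_STORE are standard Scrapy settings, but the pipeline priority and the storage folder are assumptions for illustration, not values from the original project.

```python
# settings.py (fragment)

# Do not fetch or obey robots.txt.
ROBOTSTXT_OBEY = False

# Assumed for illustration: the built-in files pipeline is enabled.
ITEM_PIPELINES = {
    "scrapy.pipelines.files.FilesPipeline": 1,
}

# Assumed for illustration: folder where downloaded files are stored.
FILES_STORE = "downloads"
```

Keep whatever pipeline and storage values your project already uses; only the ROBOTSTXT_OBEY line is the change described above.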
