Yolov8中断后继续训练，resume方法，断点训练

最新推荐文章于 2024-08-11 21:25:40 发布

戊己戌

最新推荐文章于 2024-08-11 21:25:40 发布

阅读量4.9k

点赞数 10

文章标签： YOLO python 深度学习目标检测计算机视觉

本文链接：https://blog.youkuaiyun.com/weixin_61623830/article/details/139091358

版权

1.错误尝试
在训练YOLOv8的时候，因为开太多其他程序，导致在100多次的时候崩溃，查询网上相关知识如何接着训练，在yolo5中把resume改成True就可以。
在yolov8中也这样尝试，将ultralytics/yolo/cfg/default.yaml中的resume改成True发现并没有作用，感觉yolov8代码还是有很多bug

2.可行的方法

2.1 ultralytics/yolo/engine/trainer.py
找到check_resume和resume_training方法
在check_resume方法里面将resume=中断地方的last.pt
在resume_training里面将ckpt=中断地方的last.pt

    def check_resume(self, overrides):
        """Check if resume checkpoint exists and update arguments accordingly."""
        #resume = self.args.resume
        resume = 'runs/detect/train9/weights/last.pt'     #🚀🚀🚀🚀🚀手动resume修改完毕记得该回去
        if resume:
            try:
                exists = isinstance(resume, (str, Path)) and Path(resume).exists()
                last = Path(check_file(resume) if exists else get_latest_run())

                # Check that resume data YAML exists, otherwise strip to force re-download of dataset
                ckpt_args = attempt_load_weights(last).args
                if not Path(ckpt_args["data"]).exists():
                    ckpt_args["data"] = self.args.data

                resume = True
                self.args = get_cfg(ckpt_args)
                self.args.model = str(last)  # reinstate model
                for k in "imgsz", "batch":  # allow arg updates to reduce memory on

最低0.47元/天解锁文章