Tensorflow2——图像定位

最新推荐文章于 2022-03-03 22:17:59 发布

逆风飞扬fei

最新推荐文章于 2022-03-03 22:17:59 发布

阅读量2.1k

点赞数 1

分类专栏：深度学习

本文链接：https://blog.youkuaiyun.com/weixin_45392405/article/details/106486218

版权

图像定位

1、单张图片图像定位
2、随意尺度图片定位
3、批量图片定位
一级目录

给定一副图片，我们要输出四个数字（x,y,w,h），图像中某一个点的坐标（x,y），以及图像的宽度和高度，有了这四个数字，我们可以很容易的找到物体的边框。

1、单张图片图像定位

import tensorflow as tf
import matplotlib.pyplot as plt
%matplotlib inline
import numpy as np
from lxml import etree
import glob
from matplotlib.patches import Rectangle

img=tf.io.read_file("./location/images/Abyssinian_1.jpg")
img=tf.image.decode_jpeg(img)
plt.imshow(img)

在这里插入图片描述
#读取xml文件

xml=open("./location/annotations/xmls/Abyssinian_1.xml").read()
#解析
sel=etree.HTML(xml) #建立好选择器
width=int(sel.xpath("//size/width/text()")[0])
height=int(sel.xpath("//size/height/text()")[0])
xmin=int(sel.xpath("//bndbox/xmin/text()")[0])
xmax=int(sel.xpath("//bndbox/xmax/text()")[0])
ymin=int(sel.xpath("//bndbox/ymin/text()")[0])
ymax=int(sel.xpath("//bndbox/ymax/text()")[0])
#根目录下的size里的width，取出text文本
#这样解析出来的是一个列表，列表里面放置的有文本
## width,height,xmin,xmax,ymin,ymax
#(600, 400, 333, 425, 72, 158)

plt.imshow(img)
rec=Rectangle((xmin,ymin),(xmax-xmin),(ymax-ymin),fill=False,color="red")  #最下角的值就是xmin,ymin
ax=plt.gca()  #获取当前图像
ax.axes.add_patch(rec)

在这里插入图片描述

2、随意尺度图片定位

（代码紧接上）

img=tf.image.resize(img,(224,224))
img=img/255
plt.imshow(img)

在这里插入图片描述

xmin=(xmin/width)*224
xmax=(xmax/width)*224
ymin=(ymin/height)*224
ymax=(ymax/height)*224

plt.imshow

最低0.47元/天解锁文章