ADE20K
Link
leaderboard
Pytorch implementation
paper
- Train (20000) validation (2000)
- test (需要上传结果到服务器才能得到结果)
- 150类
- 场景包括室内室外各种各样的场景.
cityscapes
- 5000 high quality pixel-level image
- 50 city
- 30 class
| Group | Classes |
|---|---|
| flat | road · sidewalk · parking+ · rail track+ |
| human | person* · rider* |
| construction | building · wall · fence · guard rail+ · bridge+ · tunnel+ |
| object | pole · pole group+ · traffic sign · traffic light |
| nature | vegetation · terrain |
| sky | sky |
| void | ground+ · dynamic+ · static+ |
dataset directory
|--cityscape
|--leftImg8bit_trainvaltest
|--leftImg8bit (5,000)
|--train (2975)
|--val (500)
|--test (1525)
|--gtFine_trainvaltest (5000 * 4)
|--gtFine
|--train (2975 * 4)
|--val (500 * 4)
|--test (1525 * 4)
|--leftImg8bit_trainextra
|--leftImg8bit (20,000)
|--train_extra (20000)
|--gtCoarse
|--gtCoarse (23475 * 4)
|--train (2975 * 4)
|--train_extra (20000 * 4)
|--val (500 * 4)
pascal VOC
- 20个对象类别和1个背景类
1 PASCAL VOC2012
Link
Download
leaderboard
paper
- classification, detection and segmentation.
- 20 类
Group| Classes|
-|- |
Person| person|
Animal| bird, cat, cow, dog, horse, sheep|
Vehicle |aeroplane, bicycle, boat, bus, car, motorbike, train|
Indoor |bottle, chair, dining table, potted plant, sofa, tv/monitor|
dataset directory
|--VOCdevkit
|--VOC2012(trainval)
|--Annotations (17,125)
|--*.xml
|--ImageSets
|--Action (33)
|--*.txt
|--Layout
|--train.txt
|--trainval.txt
|--val.txt
|--Main (63)
|--*.txt
|--Segmentation
|--train.txt (1464 lines)
|--trainval.txt (2913 lines)
|--val.txt (1449 lines)
|--JPEGImages (17,125)
|--*.jpg
|--SegmentationClass (2913)
|--*.png
|--SegmentationObject (2913)
|--*.png
|--VOC2012(test)
|--Annotations (5,138)
|--*.xml
|--ImageSets
|--Action (11)
|--*.txt
|--Layout
|--test.txt
|--Main (21)
|--*.txt
|--Segmentation
|--test.txt (1456 lines)
|--JPEGImages (16,135)
|--*.jpg
2 PASCAL Context
- 基于PASCAL VOC 2010做的标记
- Training and validation ( 10,103) testing (9,637)
3 PASCAL-Part Dataset
MS-COCO
- The COCO train, validation, and test sets, containing more than 200,000 images and 80 object categories, are available on the download page.
- All object instances are annotated with a detailed segmentation mask.
- Annotations on the training and validation sets (with over 500,000 object instances segmented) are publicly available.
A2D
-
3782 videos
-
每个有效的actor-action元组至少有99个实例
-
视频被标记为像素级actor和采样帧的操作
-
适用于:视频级的单标签和多标签动作识别、实例级的对象分割/协同分割、像素级的动作语义分割
SYNTHIA
- 计算机合成的城市道路驾驶环境的像素级标注的数据集。
CamVid
- 人体肖像分割数据库
- 视频数据库
NYUDv2
- 室内场景的视频
深度学习语义分割:全面解析数据集
本文介绍了多个用于语义分割任务的数据集,包括ADE20K、Cityscapes、PASCAL VOC 2012等。ADE20K包含150类,涵盖室内室外场景;Cityscapes提供50城市的高质像素级图像,有30类;PASCAL VOC 2012则涉及20个对象类别,可用于分类、检测和分割。
2416

被折叠的 条评论
为什么被折叠?



