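The COCO dataset, released by Microsoft, provides images together with annotations for object detection, segmentation, and natural-language captions describing each image.
Links for the COCO dataset:

MS COCO API - http://mscoco.org/
GitHub repository - https://github.com/pdollar/coco
More details about the API are on the website: http://mscoco.org/dataset/#download

The dataset ships with Matlab, Python, and Lua APIs. The Matlab and Python APIs support loading, parsing, and visualizing the full annotation data. The website also provides related papers and tutorials.

Before using the COCO API and demos, you first need to download the COCO images and annotations.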

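Installation:
1. First download and unpack the data files:
  - Place the image data in the coco/images/ folder.
  - Place the annotation data in the coco/ folder.
2. Matlab: add coco/MatlabApi to the Matlab default path.
3. Python: open a terminal, change to coco/PythonAPI, and run make.

COCO annotation information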

COCO annotations include:

Category labels
Instance-level distinction within each category
Pixel-level segmentation
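
To make the items above concrete, here is a minimal sketch of how one record in an instances annotation file is laid out. The record is hand-written toy data, not taken from the dataset; only the field names follow the COCO format, and the variable names (toy, id_to_name, summary) are mine.

```python
# A toy, hand-written record in the layout of a COCO "instances" file.
# All values are made up for illustration; only the field names follow the format.
toy = {
    "images": [{"id": 1, "file_name": "toy.jpg", "height": 4, "width": 6}],
    "categories": [{"id": 18, "name": "dog", "supercategory": "animal"}],
    "annotations": [
        {
            "id": 100,
            "image_id": 1,                 # which image the object belongs to
            "category_id": 18,             # category label
            "iscrowd": 0,                  # 0: a single instance, 1: a crowd region
            "bbox": [1.0, 1.0, 3.0, 2.0],  # [x, y, width, height]
            # polygon outline as a flat list [x1, y1, x2, y2, ...]
            "segmentation": [[1.0, 1.0, 4.0, 1.0, 4.0, 3.0, 1.0, 3.0]],
            "area": 6.0,
        }
    ],
}

# Map category ids to names, then describe each annotated instance as
# (category name, image id, number of polygon vertices).
id_to_name = {c["id"]: c["name"] for c in toy["categories"]}
summary = [
    (id_to_name[a["category_id"]], a["image_id"], len(a["segmentation"][0]) // 2)
    for a in toy["annotations"]
]
print(summary)
```

Each annotation carries its own id, so two dogs in the same image appear as two separate records with the same category_id.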

import sys
sys.path.append('E:/xinlib')   # folder that holds my helper package
from data import cocox         # my helper module for working with COCO
import zipfile

List the files under the coco/images/ folder:

image_names = cocox.get_image_names()
image_names

['E:/Data/coco/images/test2017.zip',
 'E:/Data/coco/images/train2017.zip',
 'E:/Data/coco/images/unlabeled2017.zip',
 'E:/Data/coco/images/val2017.zip']

List the files in the coco/ folder:

import os
dataDir = cocox.root

os.listdir(dataDir)

['annotations',
 'annotations_trainval2017.zip',
 'cocoapi',
 'images',
 'image_info_test2017.zip',
 'image_info_unlabeled2017.zip',
 'stuff_annotations_trainval2017.zip']

We only need the annotation archives (here, every file ending in .zip):

annDir = [z_name for z_name in os.listdir(dataDir) if z_name.endswith('.zip')]
annDir

['annotations_trainval2017.zip',
 'image_info_test2017.zip',
 'image_info_unlabeled2017.zip',
 'stuff_annotations_trainval2017.zip']

Extract the annotation archives:

for ann_name in annDir:
    # open each annotation archive and extract everything into dataDir
    with zipfile.ZipFile(os.path.join(dataDir, ann_name)) as z:
        z.extractall(dataDir)

# the same steps wrapped in a helper function
cocox.unzip_annotations()

# delete the annotation archives after extraction
cocox.del_annotations()
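
The two helpers are not listed in this post, so here is a rough sketch of what such functions could look like. This is my guess at a minimal version, not the actual cocox code: the function bodies, the root parameter, and the toy archive in the demo are all assumptions.

```python
import os
import tempfile
import zipfile


def unzip_annotations(root):
    """Extract every .zip file directly under root into root.

    Hypothetical stand-in for cocox.unzip_annotations(); returns the archive names.
    """
    names = sorted(n for n in os.listdir(root) if n.endswith('.zip'))
    for name in names:
        with zipfile.ZipFile(os.path.join(root, name)) as z:
            z.extractall(root)
    return names


def del_annotations(root):
    """Delete the .zip files directly under root (hypothetical stand-in)."""
    for name in os.listdir(root):
        if name.endswith('.zip'):
            os.remove(os.path.join(root, name))


# Demo in a temporary folder with a tiny made-up archive.
root = tempfile.mkdtemp()
with zipfile.ZipFile(os.path.join(root, 'toy_annotations.zip'), 'w') as z:
    z.writestr('annotations/toy.json', '{}')

extracted = unzip_annotations(root)
del_annotations(root)
remaining = sorted(os.listdir(root))
print(extracted, remaining)
```

Only delete the archives once you have checked that the extracted annotations folder is complete, since downloading them again takes a while.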

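The image archives are large, so I do not extract them. Instead, the images can be read straight from the archives with MXNet + zipfile.

Reading image data

Taking test2017.zip as an example: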

image_names

['E:/Data/coco/images/test2017.zip',
 'E:/Data/coco/images/train2017.zip',
 'E:/Data/coco/images/unlabeled2017.zip',
 'E:/Data/coco/images/val2017.zip']

z = zipfile.ZipFile(image_names[0])

# list of entry names in the test-set archive
z.namelist()

['test2017/',
 'test2017/000000259564.jpg',
 'test2017/000000344475.jpg',
 ...]

As we can see, the first entry is the directory name and the remaining entries are images. Let's look at the first image:

from mxnet import image

r = z.read(z.namelist()[1])    # raw bytes of the first image
data = image.imdecode(r)       # decode into an NDArray, which supports numeric operations
data

[[[ 87  94  78]
  [ 85  94  77]
  [ 87  96  79]
  ..., 
  [108  63  44]
  [252 244 233]
  [253 253 253]]

 [[ 86  95  76]
  [ 88  97  78]
  [ 85  94  75]
  ..., 
  [ 55  14   0]
  [150  94  81]
  [252 245 216]]

 [[ 90  99  78]
  [ 89  98  77]
  [ 89  98  77]
  ..., 
  [ 63  37  12]
  [ 90  30   6]
  [149  83  61]]

 ..., 
 [[ 86 104  82]
  [ 89 102  82]
  [ 84 102  80]
  ..., 
  [ 50  62  40]
  [ 50  61  45]
  [ 51  58  50]]

 [[ 89 101  77]
  [ 87  96  75]
  [ 89 104  83]
  ..., 
  [ 54  63  42]
  [ 49  53  39]
  [ 53  54  48]]

 [[ 96 100  77]
  [ 94  97  76]
  [ 88 103  82]
  ..., 
  [ 44  58  32]
  [ 45  57  37]
  [ 49  57  42]]]
<NDArray 480x640x3 @cpu(0)>

x = data.asnumpy()   # convert to a NumPy array

# display the image
%pylab inline
plt.imshow(x)

[Figure: output_21_3.png]

For convenience, this can be wrapped in an iterator: cocox.data_iter(dataType)
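
The iterator itself is not listed here, so the following is only a sketch of the idea using the standard library. The signature differs from cocox.data_iter(dataType): this hypothetical version takes the archive path (or a file-like object) and yields raw bytes, leaving the decoding step (for example image.imdecode) to the caller. The demo archive contains fake bytes, not real images.

```python
import io
import zipfile


def data_iter(zip_path):
    """Yield (entry_name, raw_bytes) for every file in the archive.

    Directory entries such as 'test2017/' are skipped. Decoding is left to
    the caller, e.g. image.imdecode(raw_bytes) with MXNet.
    """
    with zipfile.ZipFile(zip_path) as z:
        for info in z.infolist():
            if info.is_dir():
                continue
            yield info.filename, z.read(info)


# Demo on a tiny in-memory archive with fake contents (no real images).
buf = io.BytesIO()
with zipfile.ZipFile(buf, 'w') as z:
    z.writestr('test2017/', '')
    z.writestr('test2017/a.jpg', b'fake-a')
    z.writestr('test2017/b.jpg', b'fake-b')
buf.seek(0)

items = list(data_iter(buf))
print([name for name, _ in items])
```

Because it is a generator, only one image's bytes are held in memory at a time when you loop over it instead of calling list().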

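Reading annotations (following the official tutorial)

Install the Python API:

pip install -U pycocotools

On Windows (where Visual Studio usually has to be installed) there are many pitfalls; see the post "Windows 10 编译 Pycocotools 踩坑记" (pitfalls when compiling Pycocotools on Windows 10).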

%pylab inline
from pycocotools.coco import COCO
import numpy as np
import skimage.io as io
import matplotlib.pyplot as plt
import pylab
pylab.rcParams['figure.figsize'] = (8.0, 10.0)

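There is one pitfall here (caused by PIL): import skimage.io as io may raise an error on Windows. My fix was:

Uninstall Pillow, then install it again.
Aside: PIL (Python Imaging Library) is a powerful, convenient, and well-known image processing library for Python. Pillow is a fork of PIL that has since become more actively developed than PIL itself.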

dataDir = cocox.root
dataType = 'val2017'
annFile = '{}/annotations/instances_{}.json'.format(dataDir, dataType)

# initialize COCO api for instance annotations
coco=COCO(annFile)

loading annotations into memory...
Done (t=0.93s)
creating index...
index created!

COCO is a class. Its constructor is documented as:

Constructor of Microsoft COCO helper class for reading and visualizing annotations.
:param annotation_file (str): location of annotation file
:param image_folder (str): location to the folder that hosts images.

display COCO categories and supercategories

cats = coco.loadCats(coco.getCatIds())
nms = [cat['name'] for cat in cats]
print('COCO categories: \n{}\n'.format(' '.join(nms)))

nms = set([cat['supercategory'] for cat in cats])
print('COCO supercategories: \n{}'.format(' '.join(nms)))

COCO categories: 
person bicycle car motorcycle airplane bus train truck boat traffic light fire hydrant stop sign parking meter bench bird cat dog horse sheep cow elephant bear zebra giraffe backpack umbrella handbag tie suitcase frisbee skis snowboard sports ball kite baseball bat baseball glove skateboard surfboard tennis racket bottle wine glass cup fork knife spoon bowl banana apple sandwich orange broccoli carrot hot dog pizza donut cake chair couch potted plant bed dining table toilet tv laptop mouse remote keyboard cell phone microwave oven toaster sink refrigerator book clock vase scissors teddy bear hair drier toothbrush

COCO supercategories: 
appliance sports person indoor vehicle food electronic furniture animal outdoor accessory kitchen

# get all images containing given categories, select one at random
catIds = coco.getCatIds(catNms=['person', 'dog', 'skateboard'])
imgIds = coco.getImgIds(catIds=catIds)
# pin one fixed image id so the result is reproducible (this replaces the query result above)
imgIds = coco.getImgIds(imgIds=[335328])
img = coco.loadImgs(imgIds[np.random.randint(0, len(imgIds))])[0]
img

{'license': 4,
 'file_name': '000000335328.jpg',
 'coco_url': 'http://images.cocodataset.org/val2017/000000335328.jpg',
 'height': 640,
 'width': 512,
 'date_captured': '2013-11-20 19:29:37',
 'flickr_url': 'http://farm3.staticflickr.com/2079/2128089396_ddd988a59a_z.jpg',
 'id': 335328}
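
A note on the first query above: as far as I understand the pycocotools implementation, passing several category ids to getImgIds returns the images that contain all of those categories (an intersection), not images that contain any one of them. The sketch below mimics that behavior on made-up data with plain Python sets; toy_anns, cat_to_imgs, and images_with_all are illustrative names, not part of the COCO API.

```python
from collections import defaultdict

# Toy annotations as (image_id, category_name) pairs. Made-up data for illustration.
toy_anns = [
    (1, 'person'), (1, 'dog'), (1, 'skateboard'),
    (2, 'person'), (2, 'dog'),
    (3, 'person'),
]

# Build a category -> image ids index, similar in spirit to the index COCO creates.
cat_to_imgs = defaultdict(set)
for img_id, cat in toy_anns:
    cat_to_imgs[cat].add(img_id)


def images_with_all(cats):
    """Return the sorted ids of images that contain every category in cats."""
    ids = None
    for cat in cats:
        ids = set(cat_to_imgs[cat]) if ids is None else ids & cat_to_imgs[cat]
    return sorted(ids or [])


print(images_with_all(['person', 'dog', 'skateboard']))
print(images_with_all(['person', 'dog']))
```

Adding more categories to the query can therefore only shrink the result, which is why a three-category query returns relatively few images.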

The official code below requires the image archive to be extracted first:

# load and display image
# use url to load image
# I = io.imread(img['coco_url'])
I = io.imread('%s/images/%s/%s' % (dataDir, dataType, img['file_name']))
plt.axis('off')
plt.imshow(I)
plt.show()

With the zipfile module we can read the image directly, without extracting the archive:

image_names[-1]

'E:/Data/coco/images/val2017.zip'

val_z = zipfile.ZipFile(image_names[-1])
I = image.imdecode(val_z.read('%s/%s' % (dataType, img['file_name']))).asnumpy()
plt.axis('off')
plt.imshow(I)
plt.show()

[Figure: output_36_0.png]

load and display instance annotations

plt.imshow(I)
plt.axis('off')
annIds = coco.getAnnIds(imgIds=img['id'], catIds=catIds, iscrowd=None)
anns = coco.loadAnns(annIds)
coco.showAnns(anns)

[Figure: output_38_0.png]
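
Each item returned by loadAnns is a plain dict. Its 'bbox' field stores the box as [x, y, width, height], with (x, y) being the top-left corner, while many drawing and training utilities expect corner coordinates instead. A small conversion sketch follows; the helper name xywh_to_xyxy and the toy annotation are mine, not part of the COCO API.

```python
def xywh_to_xyxy(bbox):
    """Convert a COCO-style [x, y, width, height] box to [x_min, y_min, x_max, y_max]."""
    x, y, w, h = bbox
    return [x, y, x + w, y + h]


# Toy annotation with made-up numbers, in the same layout as an instance annotation.
toy_ann = {'bbox': [10.0, 20.0, 30.0, 40.0], 'category_id': 1}

corners = xywh_to_xyxy(toy_ann['bbox'])
print(corners)
```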

initialize COCO api for person keypoints annotations

annFile = '{}/annotations/person_keypoints_{}.json'.format(dataDir, dataType)
coco_kps = COCO(annFile)

loading annotations into memory...
Done (t=0.43s)
creating index...
index created!

load and display keypoints annotations

plt.imshow(I)
plt.axis('off')
ax = plt.gca()
annIds = coco_kps.getAnnIds(imgIds=img['id'], catIds=catIds, iscrowd=None)
anns = coco_kps.loadAnns(annIds)
coco_kps.showAnns(anns)

[Figure: output_42_0.png]
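
In a person keypoints annotation, the 'keypoints' field is a flat list [x1, y1, v1, x2, y2, v2, ...], where v is a visibility flag: 0 means not labeled, 1 means labeled but not visible, and 2 means labeled and visible. The sketch below reshapes such a list into triples; the helper names and the shortened toy list (three keypoints instead of the full set) are my own illustration.

```python
def split_keypoints(flat):
    """Turn [x1, y1, v1, x2, y2, v2, ...] into a list of (x, y, v) triples."""
    if len(flat) % 3 != 0:
        raise ValueError('keypoints length must be a multiple of 3')
    return [tuple(flat[i:i + 3]) for i in range(0, len(flat), 3)]


def count_labeled(flat):
    """Count keypoints whose visibility flag v is greater than 0."""
    return sum(1 for _, _, v in split_keypoints(flat) if v > 0)


# Made-up example with only three keypoints to keep it short.
toy_keypoints = [120, 80, 2, 0, 0, 0, 130, 95, 1]

triples = split_keypoints(toy_keypoints)
labeled = count_labeled(toy_keypoints)
print(triples, labeled)
```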

initialize COCO api for caption annotations

annFile = '{}/annotations/captions_{}.json'.format(dataDir, dataType)
coco_caps = COCO(annFile)
loading annotations into memory...
Done (t=0.06s)
creating index...
index created!

load and display caption annotations

annIds = coco_caps.getAnnIds(imgIds=img['id'])
anns = coco_caps.loadAnns(annIds)
coco_caps.showAnns(anns)
plt.imshow(I)
plt.axis('off')
plt.show()

A couple of people riding waves on top of boards.
a couple of people that are surfing in water
A man and a young child in wet suits surfing in the ocean.
a man and small child standing on a surf board  and riding some waves
A young boy on a surfboard being taught to surf.

[Figure: output_46_1.png]
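
A caption annotation is a small dict with 'id', 'image_id', and 'caption' fields, and one image usually has several of them. If you want the texts themselves rather than printed output, a sketch like the following groups them by image. The first two captions are copied from the output above; the record ids and the second image are made up.

```python
from collections import defaultdict

# Toy caption records in the same layout as caption annotations.
toy_caps = [
    {'id': 1, 'image_id': 335328,
     'caption': 'A couple of people riding waves on top of boards.'},
    {'id': 2, 'image_id': 335328,
     'caption': 'A young boy on a surfboard being taught to surf.'},
    {'id': 3, 'image_id': 42,
     'caption': 'A made-up caption for a made-up image.'},
]

# Group the caption strings by the image they describe.
captions_by_image = defaultdict(list)
for ann in toy_caps:
    captions_by_image[ann['image_id']].append(ann['caption'].strip())

print(len(captions_by_image[335328]))
```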

GitHub demo

You can also edit it online: https://mybinder.org/v2/gh/q735613050/dataLoader/master

Explore interesting things!

For updates to this post, see "COCO 数据集的使用" (Using the COCO dataset): https://www.cnblogs.com/q735613050/p/8969452.html
