voc
Stay organized with collections
Save and categorize content based on your preferences.
This dataset contains the data from the PASCAL Visual Object Classes Challenge,
corresponding to the Classification and Detection competitions.
In the Classification competition, the goal is to predict the set of labels
contained in the image, while in the Detection competition the goal is to
predict the bounding box and label of each individual object. WARNING: As per
the official dataset, the test set of VOC2012 does not contain annotations.
FeaturesDict({
'image': Image(shape=(None, None, 3), dtype=uint8),
'image/filename': Text(shape=(), dtype=string),
'labels': Sequence(ClassLabel(shape=(), dtype=int64, num_classes=20)),
'labels_no_difficult': Sequence(ClassLabel(shape=(), dtype=int64, num_classes=20)),
'objects': Sequence({
'bbox': BBoxFeature(shape=(4,), dtype=float32),
'is_difficult': bool,
'is_truncated': bool,
'label': ClassLabel(shape=(), dtype=int64, num_classes=20),
'pose': ClassLabel(shape=(), dtype=int64, num_classes=5),
}),
})
Feature |
Class |
Shape |
Dtype |
Description |
|
FeaturesDict |
|
|
|
image |
Image |
(None, None, 3) |
uint8 |
|
image/filename |
Text |
|
string |
|
labels |
Sequence(ClassLabel) |
(None,) |
int64 |
|
labels_no_difficult |
Sequence(ClassLabel) |
(None,) |
int64 |
|
objects |
Sequence |
|
|
|
objects/bbox |
BBoxFeature |
(4,) |
float32 |
|
objects/is_difficult |
Tensor |
|
bool |
|
objects/is_truncated |
Tensor |
|
bool |
|
objects/label |
ClassLabel |
|
int64 |
|
objects/pose |
ClassLabel |
|
int64 |
|
voc/2007 (default config)
- Config description: This dataset contains the data from the PASCAL
Visual Object Classes Challenge 2007, a.k.a. VOC2007.
A total of 9963 images are included in this dataset, where each image contains a
set of objects, out of 20 different classes, making a total of 24640 annotated
objects.
Split |
Examples |
'test' |
4,952 |
'train' |
2,501 |
'validation' |
2,510 |

@misc{pascal-voc-2007,
author = "Everingham, M. and Van~Gool, L. and Williams, C. K. I. and Winn, J. and Zisserman, A.",
title = "The {PASCAL} {V}isual {O}bject {C}lasses {C}hallenge 2007 {(VOC2007)} {R}esults",
howpublished = "http://www.pascal-network.org/challenges/VOC/voc2007/workshop/index.html"}
voc/2012
- Config description: This dataset contains the data from the PASCAL
Visual Object Classes Challenge 2012, a.k.a. VOC2012.
A total of 11540 images are included in this dataset, where each image contains
a set of objects, out of 20 different classes, making a total of 27450 annotated
objects.
Split |
Examples |
'test' |
10,991 |
'train' |
5,717 |
'validation' |
5,823 |

@misc{pascal-voc-2012,
author = "Everingham, M. and Van~Gool, L. and Williams, C. K. I. and Winn, J. and Zisserman, A.",
title = "The {PASCAL} {V}isual {O}bject {C}lasses {C}hallenge 2012 {(VOC2012)} {R}esults",
howpublished = "http://www.pascal-network.org/challenges/VOC/voc2012/workshop/index.html"}
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-06-03 UTC.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-06-03 UTC."],[],[],null,["# voc\n\n\u003cbr /\u003e\n\n- **Description**:\n\nThis dataset contains the data from the PASCAL Visual Object Classes Challenge,\ncorresponding to the Classification and Detection competitions.\n\nIn the Classification competition, the goal is to predict the set of labels\ncontained in the image, while in the Detection competition the goal is to\npredict the bounding box and label of each individual object. WARNING: As per\nthe official dataset, the test set of VOC2012 does not contain annotations.\n\n- **Additional Documentation** :\n [Explore on Papers With Code\n north_east](https://paperswithcode.com/dataset/pascal-voc-2007)\n\n- **Source code** :\n [`tfds.object_detection.Voc`](https://github.com/tensorflow/datasets/tree/master/tensorflow_datasets/object_detection/voc.py)\n\n- **Versions**:\n\n - **`5.0.0`** (default): No release notes.\n- **Auto-cached**\n ([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):\n No\n\n- **Feature structure**:\n\n FeaturesDict({\n 'image': Image(shape=(None, None, 3), dtype=uint8),\n 'image/filename': Text(shape=(), dtype=string),\n 'labels': Sequence(ClassLabel(shape=(), dtype=int64, num_classes=20)),\n 'labels_no_difficult': Sequence(ClassLabel(shape=(), dtype=int64, num_classes=20)),\n 'objects': Sequence({\n 'bbox': BBoxFeature(shape=(4,), dtype=float32),\n 'is_difficult': bool,\n 'is_truncated': bool,\n 'label': ClassLabel(shape=(), dtype=int64, num_classes=20),\n 'pose': ClassLabel(shape=(), dtype=int64, num_classes=5),\n }),\n })\n\n- **Feature documentation**:\n\n| Feature | Class | Shape | Dtype | Description |\n|----------------------|----------------------|-----------------|---------|-------------|\n| | FeaturesDict | | | |\n| image | Image | (None, None, 3) | uint8 | |\n| image/filename | Text | | string | |\n| labels | Sequence(ClassLabel) | (None,) | int64 | |\n| labels_no_difficult | Sequence(ClassLabel) | (None,) | int64 | |\n| objects | Sequence | | | |\n| objects/bbox | BBoxFeature | (4,) | float32 | |\n| objects/is_difficult | Tensor | | bool | |\n| objects/is_truncated | Tensor | | bool | |\n| objects/label | ClassLabel | | int64 | |\n| objects/pose | ClassLabel | | int64 | |\n\n- **Supervised keys** (See [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)): `None`\n\nvoc/2007 (default config)\n-------------------------\n\n- **Config description**: This dataset contains the data from the PASCAL Visual Object Classes Challenge 2007, a.k.a. VOC2007.\n\nA total of 9963 images are included in this dataset, where each image contains a\nset of objects, out of 20 different classes, making a total of 24640 annotated\nobjects.\n\n- **Homepage** :\n \u003chttp://host.robots.ox.ac.uk/pascal/VOC/voc2007/\u003e\n\n- **Download size** : `868.85 MiB`\n\n- **Dataset size** : `837.73 MiB`\n\n- **Splits**:\n\n| Split | Examples |\n|----------------|----------|\n| `'test'` | 4,952 |\n| `'train'` | 2,501 |\n| `'validation'` | 2,510 |\n\n- **Figure** ([tfds.show_examples](https://www.tensorflow.org/datasets/api_docs/python/tfds/visualization/show_examples)):\n\n- **Examples** ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\n- **Citation**:\n\n @misc{pascal-voc-2007,\n author = \"Everingham, M. and Van~Gool, L. and Williams, C. K. I. and Winn, J. and Zisserman, A.\",\n title = \"The {PASCAL} {V}isual {O}bject {C}lasses {C}hallenge 2007 {(VOC2007)} {R}esults\",\n howpublished = \"http://www.pascal-network.org/challenges/VOC/voc2007/workshop/index.html\"}\n\nvoc/2012\n--------\n\n- **Config description**: This dataset contains the data from the PASCAL Visual Object Classes Challenge 2012, a.k.a. VOC2012.\n\nA total of 11540 images are included in this dataset, where each image contains\na set of objects, out of 20 different classes, making a total of 27450 annotated\nobjects.\n\n- **Homepage** :\n \u003chttp://host.robots.ox.ac.uk/pascal/VOC/voc2012/\u003e\n\n- **Download size** : `3.59 GiB`\n\n- **Dataset size** : `2.44 GiB`\n\n- **Splits**:\n\n| Split | Examples |\n|----------------|----------|\n| `'test'` | 10,991 |\n| `'train'` | 5,717 |\n| `'validation'` | 5,823 |\n\n- **Figure** ([tfds.show_examples](https://www.tensorflow.org/datasets/api_docs/python/tfds/visualization/show_examples)):\n\n- **Examples** ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\n- **Citation**:\n\n @misc{pascal-voc-2012,\n author = \"Everingham, M. and Van~Gool, L. and Williams, C. K. I. and Winn, J. and Zisserman, A.\",\n title = \"The {PASCAL} {V}isual {O}bject {C}lasses {C}hallenge 2012 {(VOC2012)} {R}esults\",\n howpublished = \"http://www.pascal-network.org/challenges/VOC/voc2012/workshop/index.html\"}"]]