# placesfull

- **Description**:
The Places dataset is designed following principles of human visual cognition.
Our goal is to build a core of visual knowledge that can be used to train
artificial systems for high-level visual understanding tasks, such as scene
context, object recognition, action and event prediction, and theory-of-mind
inference.
The semantic categories of Places are defined by their function: the labels
represent the entry level of an environment. To illustrate, the dataset has
different categories of bedrooms, streets, etc., because one does not act the
same way, or make the same predictions about what can happen next, in a home
bedroom, a hotel bedroom, or a nursery. In total, Places contains more than 10
million images comprising 400+ unique scene categories. The dataset features
5,000 to 30,000 training images per class, consistent with real-world
frequencies of occurrence. Using convolutional neural networks (CNNs), the
Places dataset enables the learning of deep scene features for various scene
recognition tasks, with the goal of establishing new state-of-the-art
performance on scene-centric benchmarks.

Here we provide the Places Database and the trained CNNs for academic research
and education purposes.
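As a brief illustration, the sketch below loads the `'train'` split through
TFDS and inspects one example. It assumes `tensorflow_datasets` is installed
and that the roughly 143 GiB archive has already been downloaded and prepared.

```python
import tensorflow_datasets as tfds

# Load the single 'train' split together with the dataset metadata.
# The full download is ~143 GiB, so this is only practical once the
# data has already been fetched and prepared on disk.
ds, info = tfds.load('placesfull', split='train', with_info=True)

# Each example is a dict with 'filename', 'image', and 'label' entries.
for example in ds.take(1):
    print(example['filename'].numpy())  # source file name as bytes
    print(example['image'].shape)       # (256, 256, 3), uint8
    print(example['label'].numpy())     # integer in [0, 435)
```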
- **Homepage**: http://places2.csail.mit.edu/

- **Source code**:
  [`tfds.datasets.placesfull.Builder`](https://github.com/tensorflow/datasets/tree/master/tensorflow_datasets/datasets/placesfull/placesfull_dataset_builder.py)

- **Versions**:
  - **`1.0.0`** (default): No release notes.

- **Download size**: `143.56 GiB`

- **Dataset size**: `136.56 GiB`

- **Auto-cached**
  ([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):
  No

- **Splits**:

| Split     | Examples   |
|:----------|:-----------|
| `'train'` | 10,653,087 |
- **Feature structure**:

```python
FeaturesDict({
    'filename': Text(shape=(), dtype=string),
    'image': Image(shape=(256, 256, 3), dtype=uint8),
    'label': ClassLabel(shape=(), dtype=int64, num_classes=435),
})
```
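The integer `label` can be mapped to a human-readable scene-category name
through the `ClassLabel` feature; a small sketch (reading only the builder
metadata, not the image data):

```python
import tensorflow_datasets as tfds

# The builder's info object exposes the label vocabulary; constructing
# it does not require downloading the image data itself.
info = tfds.builder('placesfull').info

label_feature = info.features['label']
print(label_feature.num_classes)          # 435
name = label_feature.int2str(0)           # integer id -> category string
assert label_feature.str2int(name) == 0   # and back again
```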
- **Feature documentation**:

| Feature  | Class        | Shape         | Dtype  | Description |
|:---------|:-------------|:--------------|:-------|:------------|
|          | FeaturesDict |               |        |             |
| filename | Text         |               | string |             |
| image    | Image        | (256, 256, 3) | uint8  |             |
| label    | ClassLabel   |               | int64  |             |

- **Supervised keys** (See
  [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)):
  `('image', 'label', 'filename')`

- **Figure**
  ([tfds.show_examples](https://www.tensorflow.org/datasets/api_docs/python/tfds/visualization/show_examples)):

- **Examples**
  ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):

- **Citation**:

```
@article{zhou2017places,
  title={Places: A 10 million Image Database for Scene Recognition},
  author={Zhou, Bolei and Lapedriza, Agata and Khosla, Aditya and Oliva, Aude and Torralba, Antonio},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  year={2017},
  publisher={IEEE}
}
```