scan
Stay organized with collections
Save and categorize content based on your preferences.
SCAN tasks with various splits.
SCAN is a set of simple language-driven navigation tasks for studying
compositional learning and zero-shot generalization.
Most splits are described at https://github.com/brendenlake/SCAN For the MCD
splits please see https://arxiv.org/abs/1912.09713.pdf
Basic usage:
data = tfds.load('scan/length')
More advanced example:
import tensorflow_datasets as tfds
from tensorflow_datasets.datasets.scan import scan_dataset_builder
data = tfds.load(
'scan',
builder_kwargs=dict(
config=scan_dataset_builder.ScanConfig(
name='simple_p8', directory='simple_split/size_variations')))
FeaturesDict({
'actions': Text(shape=(), dtype=string),
'commands': Text(shape=(), dtype=string),
})
Feature |
Class |
Shape |
Dtype |
Description |
|
FeaturesDict |
|
|
|
actions |
Text |
|
string |
|
commands |
Text |
|
string |
|
@inproceedings{Lake2018GeneralizationWS,
title={Generalization without Systematicity: On the Compositional Skills of
Sequence-to-Sequence Recurrent Networks},
author={Brenden M. Lake and Marco Baroni},
booktitle={ICML},
year={2018},
url={https://arxiv.org/pdf/1711.00350.pdf},
}
@inproceedings{Keysers2020,
title={Measuring Compositional Generalization: A Comprehensive Method on
Realistic Data},
author={Daniel Keysers and Nathanael Sch\"{a}rli and Nathan Scales and
Hylke Buisman and Daniel Furrer and Sergii Kashubin and
Nikola Momchev and Danila Sinopalnikov and Lukasz Stafiniak and
Tibor Tihon and Dmitry Tsarkov and Xiao Wang and Marc van Zee and
Olivier Bousquet},
note={Additional citation for MCD splits},
booktitle={ICLR},
year={2020},
url={https://arxiv.org/abs/1912.09713.pdf},
}
scan/simple (default config)
Download size: 17.82 MiB
Dataset size: 4.47 MiB
Splits:
Split |
Examples |
'test' |
4,182 |
'train' |
16,728 |
scan/addprim_jump
Download size: 17.82 MiB
Dataset size: 4.53 MiB
Splits:
Split |
Examples |
'test' |
7,706 |
'train' |
14,670 |
scan/addprim_turn_left
Download size: 17.82 MiB
Dataset size: 4.58 MiB
Splits:
Split |
Examples |
'test' |
1,208 |
'train' |
21,890 |
scan/filler_num0
Download size: 17.82 MiB
Dataset size: 3.20 MiB
Splits:
Split |
Examples |
'test' |
1,173 |
'train' |
15,225 |
scan/filler_num1
Download size: 17.82 MiB
Dataset size: 3.51 MiB
Splits:
Split |
Examples |
'test' |
1,173 |
'train' |
16,290 |
scan/filler_num2
Download size: 17.82 MiB
Dataset size: 3.84 MiB
Splits:
Split |
Examples |
'test' |
1,173 |
'train' |
17,391 |
scan/filler_num3
Download size: 17.82 MiB
Dataset size: 4.17 MiB
Splits:
Split |
Examples |
'test' |
1,173 |
'train' |
18,528 |
scan/length
Download size: 17.82 MiB
Dataset size: 4.47 MiB
Splits:
Split |
Examples |
'test' |
3,920 |
'train' |
16,990 |
scan/template_around_right
Download size: 17.82 MiB
Dataset size: 4.17 MiB
Splits:
Split |
Examples |
'test' |
4,476 |
'train' |
15,225 |
scan/template_jump_around_right
Download size: 17.82 MiB
Dataset size: 4.17 MiB
Splits:
Split |
Examples |
'test' |
1,173 |
'train' |
18,528 |
scan/template_opposite_right
Download size: 17.82 MiB
Dataset size: 4.22 MiB
Splits:
Split |
Examples |
'test' |
4,476 |
'train' |
15,225 |
scan/template_right
Download size: 17.82 MiB
Dataset size: 4.26 MiB
Splits:
Split |
Examples |
'test' |
4,476 |
'train' |
15,225 |
scan/mcd1
Download size: 17.89 MiB
Dataset size: 1.89 MiB
Splits:
Split |
Examples |
'test' |
1,045 |
'train' |
8,365 |
scan/mcd2
Download size: 17.89 MiB
Dataset size: 1.84 MiB
Splits:
Split |
Examples |
'test' |
1,045 |
'train' |
8,365 |
scan/mcd3
Download size: 17.89 MiB
Dataset size: 1.87 MiB
Splits:
Split |
Examples |
'test' |
1,045 |
'train' |
8,365 |
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2022-12-23 UTC.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2022-12-23 UTC."],[],[],null,["# scan\n\n\u003cbr /\u003e\n\n- **Description**:\n\nSCAN tasks with various splits.\n\nSCAN is a set of simple language-driven navigation tasks for studying\ncompositional learning and zero-shot generalization.\n\nMost splits are described at \u003chttps://github.com/brendenlake/SCAN\u003e For the MCD\nsplits please see \u003chttps://arxiv.org/abs/1912.09713.pdf\u003e\n\nBasic usage: \n\n data = tfds.load('scan/length')\n\nMore advanced example: \n\n import tensorflow_datasets as tfds\n from tensorflow_datasets.datasets.scan import scan_dataset_builder\n\n data = tfds.load(\n 'scan',\n builder_kwargs=dict(\n config=scan_dataset_builder.ScanConfig(\n name='simple_p8', directory='simple_split/size_variations')))\n\n- **Additional Documentation** :\n [Explore on Papers With Code\n north_east](https://paperswithcode.com/dataset/scan)\n\n- **Homepage** :\n \u003chttps://github.com/brendenlake/SCAN\u003e\n\n- **Source code** :\n [`tfds.datasets.scan.Builder`](https://github.com/tensorflow/datasets/tree/master/tensorflow_datasets/datasets/scan/scan_dataset_builder.py)\n\n- **Versions**:\n\n - **`1.1.1`** (default): No release notes.\n- **Auto-cached**\n ([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):\n Yes\n\n- **Feature structure**:\n\n FeaturesDict({\n 'actions': Text(shape=(), dtype=string),\n 'commands': Text(shape=(), dtype=string),\n })\n\n- **Feature documentation**:\n\n| Feature | Class | Shape | Dtype | Description |\n|----------|--------------|-------|--------|-------------|\n| | FeaturesDict | | | |\n| actions | Text | | string | |\n| commands | Text | | string | |\n\n- **Supervised keys** (See\n [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)):\n `('commands', 'actions')`\n\n- **Figure**\n ([tfds.show_examples](https://www.tensorflow.org/datasets/api_docs/python/tfds/visualization/show_examples)):\n Not supported.\n\n- **Citation**:\n\n @inproceedings{Lake2018GeneralizationWS,\n title={Generalization without Systematicity: On the Compositional Skills of\n Sequence-to-Sequence Recurrent Networks},\n author={Brenden M. Lake and Marco Baroni},\n booktitle={ICML},\n year={2018},\n url={https://arxiv.org/pdf/1711.00350.pdf},\n }\n @inproceedings{Keysers2020,\n title={Measuring Compositional Generalization: A Comprehensive Method on\n Realistic Data},\n author={Daniel Keysers and Nathanael Sch\\\"{a}rli and Nathan Scales and\n Hylke Buisman and Daniel Furrer and Sergii Kashubin and\n Nikola Momchev and Danila Sinopalnikov and Lukasz Stafiniak and\n Tibor Tihon and Dmitry Tsarkov and Xiao Wang and Marc van Zee and\n Olivier Bousquet},\n note={Additional citation for MCD splits},\n booktitle={ICLR},\n year={2020},\n url={https://arxiv.org/abs/1912.09713.pdf},\n }\n\nscan/simple (default config)\n----------------------------\n\n- **Download size** : `17.82 MiB`\n\n- **Dataset size** : `4.47 MiB`\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'test'` | 4,182 |\n| `'train'` | 16,728 |\n\n- **Examples** ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\nscan/addprim_jump\n-----------------\n\n- **Download size** : `17.82 MiB`\n\n- **Dataset size** : `4.53 MiB`\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'test'` | 7,706 |\n| `'train'` | 14,670 |\n\n- **Examples** ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\nscan/addprim_turn_left\n----------------------\n\n- **Download size** : `17.82 MiB`\n\n- **Dataset size** : `4.58 MiB`\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'test'` | 1,208 |\n| `'train'` | 21,890 |\n\n- **Examples** ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\nscan/filler_num0\n----------------\n\n- **Download size** : `17.82 MiB`\n\n- **Dataset size** : `3.20 MiB`\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'test'` | 1,173 |\n| `'train'` | 15,225 |\n\n- **Examples** ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\nscan/filler_num1\n----------------\n\n- **Download size** : `17.82 MiB`\n\n- **Dataset size** : `3.51 MiB`\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'test'` | 1,173 |\n| `'train'` | 16,290 |\n\n- **Examples** ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\nscan/filler_num2\n----------------\n\n- **Download size** : `17.82 MiB`\n\n- **Dataset size** : `3.84 MiB`\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'test'` | 1,173 |\n| `'train'` | 17,391 |\n\n- **Examples** ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\nscan/filler_num3\n----------------\n\n- **Download size** : `17.82 MiB`\n\n- **Dataset size** : `4.17 MiB`\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'test'` | 1,173 |\n| `'train'` | 18,528 |\n\n- **Examples** ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\nscan/length\n-----------\n\n- **Download size** : `17.82 MiB`\n\n- **Dataset size** : `4.47 MiB`\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'test'` | 3,920 |\n| `'train'` | 16,990 |\n\n- **Examples** ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\nscan/template_around_right\n--------------------------\n\n- **Download size** : `17.82 MiB`\n\n- **Dataset size** : `4.17 MiB`\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'test'` | 4,476 |\n| `'train'` | 15,225 |\n\n- **Examples** ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\nscan/template_jump_around_right\n-------------------------------\n\n- **Download size** : `17.82 MiB`\n\n- **Dataset size** : `4.17 MiB`\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'test'` | 1,173 |\n| `'train'` | 18,528 |\n\n- **Examples** ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\nscan/template_opposite_right\n----------------------------\n\n- **Download size** : `17.82 MiB`\n\n- **Dataset size** : `4.22 MiB`\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'test'` | 4,476 |\n| `'train'` | 15,225 |\n\n- **Examples** ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\nscan/template_right\n-------------------\n\n- **Download size** : `17.82 MiB`\n\n- **Dataset size** : `4.26 MiB`\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'test'` | 4,476 |\n| `'train'` | 15,225 |\n\n- **Examples** ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\nscan/mcd1\n---------\n\n- **Download size** : `17.89 MiB`\n\n- **Dataset size** : `1.89 MiB`\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'test'` | 1,045 |\n| `'train'` | 8,365 |\n\n- **Examples** ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\nscan/mcd2\n---------\n\n- **Download size** : `17.89 MiB`\n\n- **Dataset size** : `1.84 MiB`\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'test'` | 1,045 |\n| `'train'` | 8,365 |\n\n- **Examples** ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\nscan/mcd3\n---------\n\n- **Download size** : `17.89 MiB`\n\n- **Dataset size** : `1.87 MiB`\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'test'` | 1,045 |\n| `'train'` | 8,365 |\n\n- **Examples** ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples..."]]