web_questions
The description gives this dataset as 6,642 question/answer pairs; the two
splits listed on this page total 5,810 examples. The questions are intended to
be answerable using Freebase, a large knowledge graph, and most of them center
on a single named entity. They are popular questions that were asked on the web
(at least as of 2013).
| Split | Examples |
|---------|----------|
| 'test' | 2,032 |
| 'train' | 3,778 |
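The split sizes listed above can be checked and loaded with a short sketch like the one below. It assumes the `tensorflow_datasets` package is installed and that the dataset is registered under the name `web_questions` (the page title). `SPLIT_SIZES`, `total_examples`, and `load_split` are hypothetical names introduced for illustration; the import is deferred so the size check runs without TensorFlow.

```python
# Split sizes copied from the splits listed on this page.
SPLIT_SIZES = {"train": 3778, "test": 2032}


def total_examples(split_sizes):
    """Return the number of examples summed over all splits."""
    return sum(split_sizes.values())


def load_split(split_name):
    """Load one split with TensorFlow Datasets (hypothetical helper).

    Assumes `tensorflow_datasets` is installed. The import is deferred so
    the rest of this sketch works without it.
    """
    if split_name not in SPLIT_SIZES:
        raise ValueError(f"unknown split: {split_name!r}")
    import tensorflow_datasets as tfds

    # The first call downloads and prepares the data.
    return tfds.load("web_questions", split=split_name)
```

Calling `total_examples(SPLIT_SIZES)` returns 5810, the sum of the two listed splits. Calling `load_split("train")` should return a `tf.data.Dataset` once the download completes.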
FeaturesDict({
'answers': Sequence(Text(shape=(), dtype=string)),
'question': Text(shape=(), dtype=string),
'url': Text(shape=(), dtype=string),
})
| Feature | Class | Shape | Dtype | Description |
|----------|----------------|---------|--------|-------------|
| | FeaturesDict | | | |
| answers | Sequence(Text) | (None,) | string | |
| question | Text | | string | |
| url | Text | | string | |
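As a rough illustration of the feature structure, the sketch below decodes one example into plain Python strings and checks a prediction against its list of answers. It assumes examples arrive as dictionaries of byte strings, which is how `tfds.as_numpy` returns `Text` features. The helper names, the sample record, and the case-insensitive exact-match rule are all assumptions made for illustration; they are not taken from the dataset or from the cited paper.

```python
def to_text(value):
    """Decode a bytes value to str; pass other values through str()."""
    return value.decode("utf-8") if isinstance(value, bytes) else str(value)


def decode_example(example):
    """Convert one example dict into plain strings and a list of answers."""
    return {
        "question": to_text(example["question"]),
        "url": to_text(example["url"]),
        "answers": [to_text(answer) for answer in example["answers"]],
    }


def exact_match(prediction, answers):
    """Return True if the prediction equals any reference answer.

    Comparison ignores case and surrounding whitespace (an assumed,
    deliberately simple rule).
    """
    normalized = prediction.strip().lower()
    return any(normalized == answer.strip().lower() for answer in answers)


# Made-up record shaped like the feature structure; not from the dataset.
sample = {
    "question": b"what is the capital of examplia?",
    "url": b"http://example.com/examplia",
    "answers": [b"Example City", b"Examplopolis"],
}
decoded = decode_example(sample)
```

Because `answers` is a variable-length sequence (shape `(None,)`), a question can have several reference answers, so the check accepts a match against any one of them.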
@inproceedings{berant-etal-2013-semantic,
title = "Semantic Parsing on {F}reebase from Question-Answer Pairs",
author = "Berant, Jonathan and
Chou, Andrew and
Frostig, Roy and
Liang, Percy",
booktitle = "Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing",
month = oct,
year = "2013",
address = "Seattle, Washington, USA",
publisher = "Association for Computational Linguistics",
url = "https://www.aclweb.org/anthology/D13-1160",
pages = "1533--1544",
}
Additional Documentation: Explore on Papers With Code (https://paperswithcode.com/dataset/webquestions)
Homepage: https://worksheets.codalab.org/worksheets/0xba659fe363cb46e7a505c5b6a774dc8a
Source code: tfds.question_answering.WebQuestions (https://github.com/tensorflow/datasets/tree/master/tensorflow_datasets/question_answering/web_questions.py)
Versions: 1.0.0 (default): No release notes.
Download size: 1.21 MiB
Dataset size: 983.88 KiB
Auto-cached (documentation: https://www.tensorflow.org/datasets/performances#auto-caching): Yes
Supervised keys (see the as_supervised doc: https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args): None
Last updated 2022-12-06 UTC.