opinion_abstracts
Stay organized with collections
Save and categorize content based on your preferences.
There are two sub datasets:
(1) RottenTomatoes: The movie critics and consensus crawled from
http://rottentomatoes.com/ It has fields of "_movie_name", "_movie_id",
"_critics", and "_critic_consensus".
(2) IDebate: The arguments crawled from http://idebate.org/ It has fields of
"_debate_name", "_debate_id", "_claim", "_claim_id", "_argument_sentences".
See also https://web.eecs.umich.edu/~wangluxy/datasets/opinion_README.txt
@inproceedings{wang-ling-2016-neural,
title = "Neural Network-Based Abstract Generation for Opinions and Arguments",
author = "Wang, Lu and
Ling, Wang",
booktitle = "Proceedings of the 2016 Conference of the North {A}merican Chapter of the Association for Computational Linguistics: Human Language Technologies",
month = jun,
year = "2016",
address = "San Diego, California",
publisher = "Association for Computational Linguistics",
url = "https://www.aclweb.org/anthology/N16-1007",
doi = "10.18653/v1/N16-1007",
pages = "47--57",
}
opinion_abstracts/rotten_tomatoes (default config)
Split |
Examples |
'train' |
3,731 |
FeaturesDict({
'_critic_consensus': string,
'_critics': Sequence({
'key': string,
'value': string,
}),
'_movie_id': string,
'_movie_name': string,
})
Feature |
Class |
Shape |
Dtype |
Description |
|
FeaturesDict |
|
|
|
_critic_consensus |
Tensor |
|
string |
|
_critics |
Sequence |
|
|
|
_critics/key |
Tensor |
|
string |
|
_critics/value |
Tensor |
|
string |
|
_movie_id |
Tensor |
|
string |
|
_movie_name |
Tensor |
|
string |
|
opinion_abstracts/idebate
Split |
Examples |
'train' |
2,259 |
FeaturesDict({
'_argument_sentences': Sequence({
'key': string,
'value': string,
}),
'_claim': string,
'_claim_id': string,
'_debate_name': string,
})
Feature |
Class |
Shape |
Dtype |
Description |
|
FeaturesDict |
|
|
|
_argument_sentences |
Sequence |
|
|
|
_argument_sentences/key |
Tensor |
|
string |
|
_argument_sentences/value |
Tensor |
|
string |
|
_claim |
Tensor |
|
string |
|
_claim_id |
Tensor |
|
string |
|
_debate_name |
Tensor |
|
string |
|
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2023-04-15 UTC.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Missing the information I need","missingTheInformationINeed","thumb-down"],["Too complicated / too many steps","tooComplicatedTooManySteps","thumb-down"],["Out of date","outOfDate","thumb-down"],["Samples / code issue","samplesCodeIssue","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2023-04-15 UTC."],[],[],null,["# opinion_abstracts\n\n\u003cbr /\u003e\n\n- **Description**:\n\nThere are two sub datasets:\n\n(1) RottenTomatoes: The movie critics and consensus crawled from\n\u003chttp://rottentomatoes.com/\u003e It has fields of \"_movie_name\", \"_movie_id\",\n\"_critics\", and \"_critic_consensus\".\n\n(2) IDebate: The arguments crawled from \u003chttp://idebate.org/\u003e It has fields of\n\"_debate_name\", \"_debate_id\", \"_claim\", \"_claim_id\", \"_argument_sentences\".\n\nSee also [https://web.eecs.umich.edu/\\~wangluxy/datasets/opinion_README.txt](https://web.eecs.umich.edu/~wangluxy/datasets/opinion_README.txt)\n\n- **Homepage** :\n [https://web.eecs.umich.edu/\\~wangluxy/data.html](https://web.eecs.umich.edu/%7Ewangluxy/data.html)\n\n- **Source code** :\n [`tfds.datasets.opinion_abstracts.Builder`](https://github.com/tensorflow/datasets/tree/master/tensorflow_datasets/datasets/opinion_abstracts/opinion_abstracts_dataset_builder.py)\n\n- **Versions**:\n\n - **`1.0.0`** (default): No release notes.\n- **Download size** : `20.08 MiB`\n\n- **Auto-cached**\n ([documentation](https://www.tensorflow.org/datasets/performances#auto-caching)):\n Yes\n\n- **Figure**\n ([tfds.show_examples](https://www.tensorflow.org/datasets/api_docs/python/tfds/visualization/show_examples)):\n Not supported.\n\n- **Citation**:\n\n @inproceedings{wang-ling-2016-neural,\n title = \"Neural Network-Based Abstract Generation for Opinions and Arguments\",\n author = \"Wang, Lu and\n Ling, Wang\",\n booktitle = \"Proceedings of the 2016 Conference of the North {A}merican Chapter of the Association for Computational Linguistics: Human Language Technologies\",\n month = jun,\n year = \"2016\",\n address = \"San Diego, California\",\n publisher = \"Association for Computational Linguistics\",\n url = \"https://www.aclweb.org/anthology/N16-1007\",\n doi = \"10.18653/v1/N16-1007\",\n pages = \"47--57\",\n }\n\nopinion_abstracts/rotten_tomatoes (default config)\n--------------------------------------------------\n\n- **Config description**: Professional critics and consensus of 3,731 movies.\n\n- **Dataset size** : `50.10 MiB`\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'train'` | 3,731 |\n\n- **Feature structure**:\n\n FeaturesDict({\n '_critic_consensus': string,\n '_critics': Sequence({\n 'key': string,\n 'value': string,\n }),\n '_movie_id': string,\n '_movie_name': string,\n })\n\n- **Feature documentation**:\n\n| Feature | Class | Shape | Dtype | Description |\n|-------------------|--------------|-------|--------|-------------|\n| | FeaturesDict | | | |\n| _critic_consensus | Tensor | | string | |\n| _critics | Sequence | | | |\n| _critics/key | Tensor | | string | |\n| _critics/value | Tensor | | string | |\n| _movie_id | Tensor | | string | |\n| _movie_name | Tensor | | string | |\n\n- **Supervised keys** (See\n [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)):\n `('_critics', '_critic_consensus')`\n\n- **Examples**\n ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples... \n\nopinion_abstracts/idebate\n-------------------------\n\n- **Config description**: 2,259 claims for 676 debates.\n\n- **Dataset size** : `3.15 MiB`\n\n- **Splits**:\n\n| Split | Examples |\n|-----------|----------|\n| `'train'` | 2,259 |\n\n- **Feature structure**:\n\n FeaturesDict({\n '_argument_sentences': Sequence({\n 'key': string,\n 'value': string,\n }),\n '_claim': string,\n '_claim_id': string,\n '_debate_name': string,\n })\n\n- **Feature documentation**:\n\n| Feature | Class | Shape | Dtype | Description |\n|---------------------------|--------------|-------|--------|-------------|\n| | FeaturesDict | | | |\n| _argument_sentences | Sequence | | | |\n| _argument_sentences/key | Tensor | | string | |\n| _argument_sentences/value | Tensor | | string | |\n| _claim | Tensor | | string | |\n| _claim_id | Tensor | | string | |\n| _debate_name | Tensor | | string | |\n\n- **Supervised keys** (See\n [`as_supervised` doc](https://www.tensorflow.org/datasets/api_docs/python/tfds/load#args)):\n `('_argument_sentences', '_claim')`\n\n- **Examples**\n ([tfds.as_dataframe](https://www.tensorflow.org/datasets/api_docs/python/tfds/as_dataframe)):\n\nDisplay examples..."]]