- Description:
This dataset contains machine translations of MNLI into each of the XNLI languages. The translation data is provided by XTREME. Note that this is different from the machine translated data provided by the original XNLI paper.
Source code:
tfds.text.xtreme_xnli.XtremeXnliVersions:
1.1.0(default): No release notes.
Download size:
2.31 GiBDataset size:
1.59 GiBAuto-cached (documentation): No
Splits:
| Split | Examples |
|---|---|
'train' |
392,570 |
- Feature structure:
FeaturesDict({
'hypothesis': TranslationVariableLanguages({
'language': Text(shape=(), dtype=string),
'translation': Text(shape=(), dtype=string),
}),
'label': ClassLabel(shape=(), dtype=int64, num_classes=3),
'premise': Translation({
'ar': Text(shape=(), dtype=string),
'bg': Text(shape=(), dtype=string),
'de': Text(shape=(), dtype=string),
'el': Text(shape=(), dtype=string),
'en': Text(shape=(), dtype=string),
'es': Text(shape=(), dtype=string),
'fr': Text(shape=(), dtype=string),
'hi': Text(shape=(), dtype=string),
'ru': Text(shape=(), dtype=string),
'sw': Text(shape=(), dtype=string),
'th': Text(shape=(), dtype=string),
'tr': Text(shape=(), dtype=string),
'ur': Text(shape=(), dtype=string),
'vi': Text(shape=(), dtype=string),
'zh': Text(shape=(), dtype=string),
}),
})
- Feature documentation:
| Feature | Class | Shape | Dtype | Description |
|---|---|---|---|---|
| FeaturesDict | ||||
| hypothesis | TranslationVariableLanguages | |||
| hypothesis/language | Text | string | ||
| hypothesis/translation | Text | string | ||
| label | ClassLabel | int64 | ||
| premise | Translation | |||
| premise/ar | Text | string | ||
| premise/bg | Text | string | ||
| premise/de | Text | string | ||
| premise/el | Text | string | ||
| premise/en | Text | string | ||
| premise/es | Text | string | ||
| premise/fr | Text | string | ||
| premise/hi | Text | string | ||
| premise/ru | Text | string | ||
| premise/sw | Text | string | ||
| premise/th | Text | string | ||
| premise/tr | Text | string | ||
| premise/ur | Text | string | ||
| premise/vi | Text | string | ||
| premise/zh | Text | string |
Supervised keys (See
as_superviseddoc):NoneFigure (tfds.show_examples): Not supported.
Examples (tfds.as_dataframe):
- Citation:
@article{hu2020xtreme,
author = {Junjie Hu and Sebastian Ruder and Aditya Siddhant and Graham Neubig and Orhan Firat and Melvin Johnson},
title = {XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization},
journal = {CoRR},
volume = {abs/2003.11080},
year = {2020},
archivePrefix = {arXiv},
eprint = {2003.11080}
}