TFDS ora supporta il formato Croissant 🥐 ! Leggi la documentazione per saperne di più.

Questa pagina è stata tradotta dall'API Cloud Translation.

rlu_rwrl

Descrizione :

RL Unplugged è una suite di benchmark per l'apprendimento per rinforzo offline. RL Unplugged è progettato sulla base delle seguenti considerazioni: per facilitare la facilità d'uso, forniamo ai set di dati un'API unificata che rende facile per il professionista lavorare con tutti i dati nella suite una volta stabilita una pipeline generale.

I set di dati seguono il formato RLDS per rappresentare passaggi ed episodi.

Gli esempi nel set di dati rappresentano le transizioni SAR memorizzate durante l'esecuzione di un agente addestrato parzialmente online come descritto in https://arxiv.org/abs/1904.12901 Seguiamo il formato del set di dati RLDS, come specificato in https://github.com/google-research /rlds#formato-set di dati

Rilasciamo 40 set di dati su 8 attività in totale, con nessuna sfida combinata e sfida combinata facile per attività su carretto, camminatore, quadrupede e umanoide. Ogni attività contiene 5 diverse dimensioni di set di dati, 1%, 5%, 20%, 40% e 100%. Si noti che non è garantito che il set di dati più piccolo sia un sottoinsieme di quelli più grandi. Per i dettagli su come è stato generato il set di dati, fare riferimento al documento.

Pagina iniziale : https://github.com/deepmind/deepmind-research/tree/master/rl_unplugged
Codice sorgente : tfds.rl_unplugged.rlu_rwrl.RluRwrl
Versioni :
- 1.0.0 : Versione iniziale.
- 1.0.1 (impostazione predefinita): corregge un bug nel set di dati RLU RWRL in cui sono presenti ID episodio duplicati in uno dei set di dati umanoidi.
Dimensioni del download : Unknown size
Chiavi supervisionate (Vedi as_supervised doc ): None
Figura ( tfds.show_examples ): non supportato.
Citazione :

@misc{gulcehre2020rl,
    title={RL Unplugged: Benchmarks for Offline Reinforcement Learning},
    author={Caglar Gulcehre and Ziyu Wang and Alexander Novikov and Tom Le Paine
        and  Sergio Gómez Colmenarejo and Konrad Zolna and Rishabh Agarwal and
        Josh Merel and Daniel Mankowitz and Cosmin Paduraru and Gabriel
        Dulac-Arnold and Jerry Li and Mohammad Norouzi and Matt Hoffman and
        Ofir Nachum and George Tucker and Nicolas Heess and Nando deFreitas},
    year={2020},
    eprint={2006.13888},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}

rlu_rwrl/cartpole_swingup_combined_challenge_none_1_percent (configurazione predefinita)

Dimensione del set di dati: 172.43 KiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	5

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(1,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/posizione	Tensore	(3,)	galleggiante32
passi/osservazione/velocità	Tensore	(2,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_none_5_percent

Dimensione del set di dati: 862.13 KiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	25

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(1,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/posizione	Tensore	(3,)	galleggiante32
passi/osservazione/velocità	Tensore	(2,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_none_20_percent

Dimensione del set di dati: 3.37 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	100

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(1,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/posizione	Tensore	(3,)	galleggiante32
passi/osservazione/velocità	Tensore	(2,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_none_40_percent

Dimensione del set di dati: 6.74 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	200

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(1,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/posizione	Tensore	(3,)	galleggiante32
passi/osservazione/velocità	Tensore	(2,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_none_100_percent

Dimensione del set di dati: 16.84 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	500

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(1,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/posizione	Tensore	(3,)	galleggiante32
passi/osservazione/velocità	Tensore	(2,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_none_1_percent

Dimensione del set di dati: 1.77 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	5

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(12,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/stato_egocentrico	Tensore	(44,)	galleggiante32
passi/osservazione/force_torque	Tensore	(24,)	galleggiante32
passi/osservazione/imu	Tensore	(6,)	galleggiante32
passi/osservazione/torso_upright	Tensore	(1,)	galleggiante32
passi/osservazione/torso_velocity	Tensore	(3,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_none_5_percent

Dimensione del set di dati: 8.86 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	25

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(12,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/stato_egocentrico	Tensore	(44,)	galleggiante32
passi/osservazione/force_torque	Tensore	(24,)	galleggiante32
passi/osservazione/imu	Tensore	(6,)	galleggiante32
passi/osservazione/torso_upright	Tensore	(1,)	galleggiante32
passi/osservazione/torso_velocity	Tensore	(3,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_none_20_percent

Dimensione del set di dati: 35.46 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	100

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(12,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/stato_egocentrico	Tensore	(44,)	galleggiante32
passi/osservazione/force_torque	Tensore	(24,)	galleggiante32
passi/osservazione/imu	Tensore	(6,)	galleggiante32
passi/osservazione/torso_upright	Tensore	(1,)	galleggiante32
passi/osservazione/torso_velocity	Tensore	(3,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_none_40_percent

Dimensione del set di dati: 70.92 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	200

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(12,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/stato_egocentrico	Tensore	(44,)	galleggiante32
passi/osservazione/force_torque	Tensore	(24,)	galleggiante32
passi/osservazione/imu	Tensore	(6,)	galleggiante32
passi/osservazione/torso_upright	Tensore	(1,)	galleggiante32
passi/osservazione/torso_velocity	Tensore	(3,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_none_100_percent

Dimensione del set di dati: 177.29 MiB
Cache automatica ( documentazione ): solo quando shuffle_files=False (treno)
Divisioni :

Diviso	Esempi
`'train'`	500

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(12,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/stato_egocentrico	Tensore	(44,)	galleggiante32
passi/osservazione/force_torque	Tensore	(24,)	galleggiante32
passi/osservazione/imu	Tensore	(6,)	galleggiante32
passi/osservazione/torso_upright	Tensore	(1,)	galleggiante32
passi/osservazione/torso_velocity	Tensore	(3,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_none_1_percent

Dimensione del set di dati: 6.27 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	50

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(6,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/altezza	Tensore	(1,)	galleggiante32
passi/osservazione/orientamenti	Tensore	(14,)	galleggiante32
passi/osservazione/velocità	Tensore	(9,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_none_5_percent

Dimensione del set di dati: 31.34 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	250

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(6,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/altezza	Tensore	(1,)	galleggiante32
passi/osservazione/orientamenti	Tensore	(14,)	galleggiante32
passi/osservazione/velocità	Tensore	(9,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_none_20_percent

Dimensione del set di dati: 125.37 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	1.000

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(6,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/altezza	Tensore	(1,)	galleggiante32
passi/osservazione/orientamenti	Tensore	(14,)	galleggiante32
passi/osservazione/velocità	Tensore	(9,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_none_40_percent

Dimensione del set di dati: 250.75 MiB
Cache automatica ( documentazione ): No
Divisioni :

Diviso	Esempi
`'train'`	2.000

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(6,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/altezza	Tensore	(1,)	galleggiante32
passi/osservazione/orientamenti	Tensore	(14,)	galleggiante32
passi/osservazione/velocità	Tensore	(9,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_none_100_percent

Dimensione del set di dati: 626.86 MiB
Cache automatica ( documentazione ): No
Divisioni :

Diviso	Esempi
`'train'`	5.000

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(6,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/altezza	Tensore	(1,)	galleggiante32
passi/osservazione/orientamenti	Tensore	(14,)	galleggiante32
passi/osservazione/velocità	Tensore	(9,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_none_1_percent

Dimensione del set di dati: 69.40 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	200

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(21,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/com_velocity	Tensore	(3,)	galleggiante32
passi/osservazione/estremità	Tensore	(12,)	galleggiante32
passi/osservazione/altezza_testa	Tensore	(1,)	galleggiante32
passi/osservazione/joint_angles	Tensore	(21,)	galleggiante32
passi/osservazione/torso_vertical	Tensore	(3,)	galleggiante32
passi/osservazione/velocità	Tensore	(27,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_none_5_percent

Dimensione del set di dati: 346.98 MiB
Cache automatica ( documentazione ): No
Divisioni :

Diviso	Esempi
`'train'`	1.000

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(21,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/com_velocity	Tensore	(3,)	galleggiante32
passi/osservazione/estremità	Tensore	(12,)	galleggiante32
passi/osservazione/altezza_testa	Tensore	(1,)	galleggiante32
passi/osservazione/joint_angles	Tensore	(21,)	galleggiante32
passi/osservazione/torso_vertical	Tensore	(3,)	galleggiante32
passi/osservazione/velocità	Tensore	(27,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_none_20_percent

Dimensione del set di dati: 1.36 GiB
Cache automatica ( documentazione ): No
Divisioni :

Diviso	Esempi
`'train'`	4.000

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(21,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/com_velocity	Tensore	(3,)	galleggiante32
passi/osservazione/estremità	Tensore	(12,)	galleggiante32
passi/osservazione/altezza_testa	Tensore	(1,)	galleggiante32
passi/osservazione/joint_angles	Tensore	(21,)	galleggiante32
passi/osservazione/torso_vertical	Tensore	(3,)	galleggiante32
passi/osservazione/velocità	Tensore	(27,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_none_40_percent

Dimensione del set di dati: 2.71 GiB
Cache automatica ( documentazione ): No
Divisioni :

Diviso	Esempi
`'train'`	8.000

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(21,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/com_velocity	Tensore	(3,)	galleggiante32
passi/osservazione/estremità	Tensore	(12,)	galleggiante32
passi/osservazione/altezza_testa	Tensore	(1,)	galleggiante32
passi/osservazione/joint_angles	Tensore	(21,)	galleggiante32
passi/osservazione/torso_vertical	Tensore	(3,)	galleggiante32
passi/osservazione/velocità	Tensore	(27,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_none_100_percent

Dimensione del set di dati: 6.78 GiB
Cache automatica ( documentazione ): No
Divisioni :

Diviso	Esempi
`'train'`	20.000

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(21,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/com_velocity	Tensore	(3,)	galleggiante32
passi/osservazione/estremità	Tensore	(12,)	galleggiante32
passi/osservazione/altezza_testa	Tensore	(1,)	galleggiante32
passi/osservazione/joint_angles	Tensore	(21,)	galleggiante32
passi/osservazione/torso_vertical	Tensore	(3,)	galleggiante32
passi/osservazione/velocità	Tensore	(27,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_easy_1_percent

Dimensione del set di dati: 369.84 KiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	5

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(1,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/manichino-0	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-1	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-2	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-3	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-4	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-5	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-6	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-7	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-8	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-9	Tensore	(1,)	galleggiante32
passi/osservazione/posizione	Tensore	(3,)	galleggiante32
passi/osservazione/velocità	Tensore	(2,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_easy_5_percent

Dimensione del set di dati: 1.81 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	25

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(1,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/manichino-0	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-1	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-2	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-3	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-4	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-5	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-6	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-7	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-8	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-9	Tensore	(1,)	galleggiante32
passi/osservazione/posizione	Tensore	(3,)	galleggiante32
passi/osservazione/velocità	Tensore	(2,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_easy_20_percent

Dimensione del set di dati: 7.22 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	100

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(1,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/manichino-0	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-1	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-2	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-3	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-4	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-5	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-6	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-7	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-8	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-9	Tensore	(1,)	galleggiante32
passi/osservazione/posizione	Tensore	(3,)	galleggiante32
passi/osservazione/velocità	Tensore	(2,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_easy_40_percent

Dimensione del set di dati: 14.45 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	200

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(1,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/manichino-0	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-1	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-2	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-3	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-4	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-5	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-6	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-7	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-8	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-9	Tensore	(1,)	galleggiante32
passi/osservazione/posizione	Tensore	(3,)	galleggiante32
passi/osservazione/velocità	Tensore	(2,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_easy_100_percent

Dimensione del set di dati: 36.12 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	500

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(1,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/manichino-0	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-1	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-2	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-3	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-4	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-5	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-6	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-7	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-8	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-9	Tensore	(1,)	galleggiante32
passi/osservazione/posizione	Tensore	(3,)	galleggiante32
passi/osservazione/velocità	Tensore	(2,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_easy_1_percent

Dimensione del set di dati : 1.97 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	5

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(12,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/manichino-0	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-1	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-2	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-3	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-4	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-5	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-6	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-7	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-8	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-9	Tensore	(1,)	galleggiante32
passi/osservazione/stato_egocentrico	Tensore	(44,)	galleggiante32
passi/osservazione/force_torque	Tensore	(24,)	galleggiante32
passi/osservazione/imu	Tensore	(6,)	galleggiante32
passi/osservazione/torso_upright	Tensore	(1,)	galleggiante32
passi/osservazione/torso_velocity	Tensore	(3,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_easy_5_percent

Dimensione del set di dati: 9.83 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	25

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(12,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/manichino-0	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-1	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-2	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-3	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-4	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-5	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-6	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-7	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-8	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-9	Tensore	(1,)	galleggiante32
passi/osservazione/stato_egocentrico	Tensore	(44,)	galleggiante32
passi/osservazione/force_torque	Tensore	(24,)	galleggiante32
passi/osservazione/imu	Tensore	(6,)	galleggiante32
passi/osservazione/torso_upright	Tensore	(1,)	galleggiante32
passi/osservazione/torso_velocity	Tensore	(3,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_easy_20_percent

Dimensione del set di dati: 39.31 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	100

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(12,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/manichino-0	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-1	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-2	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-3	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-4	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-5	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-6	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-7	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-8	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-9	Tensore	(1,)	galleggiante32
passi/osservazione/stato_egocentrico	Tensore	(44,)	galleggiante32
passi/osservazione/force_torque	Tensore	(24,)	galleggiante32
passi/osservazione/imu	Tensore	(6,)	galleggiante32
passi/osservazione/torso_upright	Tensore	(1,)	galleggiante32
passi/osservazione/torso_velocity	Tensore	(3,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_easy_40_percent

Dimensione del set di dati: 78.63 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	200

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(12,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/manichino-0	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-1	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-2	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-3	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-4	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-5	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-6	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-7	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-8	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-9	Tensore	(1,)	galleggiante32
passi/osservazione/stato_egocentrico	Tensore	(44,)	galleggiante32
passi/osservazione/force_torque	Tensore	(24,)	galleggiante32
passi/osservazione/imu	Tensore	(6,)	galleggiante32
passi/osservazione/torso_upright	Tensore	(1,)	galleggiante32
passi/osservazione/torso_velocity	Tensore	(3,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_easy_100_percent

Dimensione del set di dati: 196.57 MiB
Cache automatica ( documentazione ): solo quando shuffle_files=False (treno)
Divisioni :

Diviso	Esempi
`'train'`	500

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(12,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/manichino-0	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-1	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-2	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-3	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-4	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-5	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-6	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-7	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-8	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-9	Tensore	(1,)	galleggiante32
passi/osservazione/stato_egocentrico	Tensore	(44,)	galleggiante32
passi/osservazione/force_torque	Tensore	(24,)	galleggiante32
passi/osservazione/imu	Tensore	(6,)	galleggiante32
passi/osservazione/torso_upright	Tensore	(1,)	galleggiante32
passi/osservazione/torso_velocity	Tensore	(3,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_easy_1_percent

Dimensione del set di dati : 8.20 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	50

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(6,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/manichino-0	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-1	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-2	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-3	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-4	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-5	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-6	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-7	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-8	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-9	Tensore	(1,)	galleggiante32
passi/osservazione/altezza	Tensore	(1,)	galleggiante32
passi/osservazione/orientamenti	Tensore	(14,)	galleggiante32
passi/osservazione/velocità	Tensore	(9,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_easy_5_percent

Dimensione del set di dati: 40.98 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	250

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(6,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/manichino-0	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-1	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-2	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-3	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-4	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-5	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-6	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-7	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-8	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-9	Tensore	(1,)	galleggiante32
passi/osservazione/altezza	Tensore	(1,)	galleggiante32
passi/osservazione/orientamenti	Tensore	(14,)	galleggiante32
passi/osservazione/velocità	Tensore	(9,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_easy_20_percent

Dimensione del set di dati: 163.93 MiB
Cache automatica ( documentazione ): solo quando shuffle_files=False (treno)
Divisioni :

Diviso	Esempi
`'train'`	1.000

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(6,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/manichino-0	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-1	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-2	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-3	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-4	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-5	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-6	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-7	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-8	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-9	Tensore	(1,)	galleggiante32
passi/osservazione/altezza	Tensore	(1,)	galleggiante32
passi/osservazione/orientamenti	Tensore	(14,)	galleggiante32
passi/osservazione/velocità	Tensore	(9,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_easy_40_percent

Dimensione del set di dati: 327.86 MiB
Cache automatica ( documentazione ): No
Divisioni :

Diviso	Esempi
`'train'`	2.000

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(6,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/manichino-0	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-1	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-2	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-3	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-4	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-5	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-6	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-7	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-8	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-9	Tensore	(1,)	galleggiante32
passi/osservazione/altezza	Tensore	(1,)	galleggiante32
passi/osservazione/orientamenti	Tensore	(14,)	galleggiante32
passi/osservazione/velocità	Tensore	(9,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_easy_100_percent

Dimensione del set di dati: 819.65 MiB
Cache automatica ( documentazione ): No
Divisioni :

Diviso	Esempi
`'train'`	5.000

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(6,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/manichino-0	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-1	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-2	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-3	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-4	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-5	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-6	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-7	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-8	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-9	Tensore	(1,)	galleggiante32
passi/osservazione/altezza	Tensore	(1,)	galleggiante32
passi/osservazione/orientamenti	Tensore	(14,)	galleggiante32
passi/osservazione/velocità	Tensore	(9,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_easy_1_percent

Dimensione del set di dati: 77.11 MiB
Auto-cache ( documentazione ): Sì
Divisioni :

Diviso	Esempi
`'train'`	200

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(21,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/com_velocity	Tensore	(3,)	galleggiante32
passi/osservazione/manichino-0	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-1	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-2	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-3	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-4	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-5	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-6	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-7	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-8	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-9	Tensore	(1,)	galleggiante32
passi/osservazione/estremità	Tensore	(12,)	galleggiante32
passi/osservazione/altezza_testa	Tensore	(1,)	galleggiante32
passi/osservazione/joint_angles	Tensore	(21,)	galleggiante32
passi/osservazione/torso_vertical	Tensore	(3,)	galleggiante32
passi/osservazione/velocità	Tensore	(27,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_easy_5_percent

Dimensione del set di dati: 385.54 MiB
Cache automatica ( documentazione ): No
Divisioni :

Diviso	Esempi
`'train'`	1.000

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(21,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/com_velocity	Tensore	(3,)	galleggiante32
passi/osservazione/manichino-0	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-1	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-2	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-3	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-4	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-5	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-6	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-7	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-8	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-9	Tensore	(1,)	galleggiante32
passi/osservazione/estremità	Tensore	(12,)	galleggiante32
passi/osservazione/altezza_testa	Tensore	(1,)	galleggiante32
passi/osservazione/joint_angles	Tensore	(21,)	galleggiante32
passi/osservazione/torso_vertical	Tensore	(3,)	galleggiante32
passi/osservazione/velocità	Tensore	(27,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_easy_20_percent

Dimensione del set di dati: 1.51 GiB
Cache automatica ( documentazione ): No
Divisioni :

Diviso	Esempi
`'train'`	4.000

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(21,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/com_velocity	Tensore	(3,)	galleggiante32
passi/osservazione/manichino-0	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-1	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-2	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-3	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-4	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-5	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-6	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-7	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-8	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-9	Tensore	(1,)	galleggiante32
passi/osservazione/estremità	Tensore	(12,)	galleggiante32
passi/osservazione/altezza_testa	Tensore	(1,)	galleggiante32
passi/osservazione/joint_angles	Tensore	(21,)	galleggiante32
passi/osservazione/torso_vertical	Tensore	(3,)	galleggiante32
passi/osservazione/velocità	Tensore	(27,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_easy_40_percent

Dimensione del set di dati: 3.01 GiB
Cache automatica ( documentazione ): No
Divisioni :

Diviso	Esempi
`'train'`	8.000

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(21,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/com_velocity	Tensore	(3,)	galleggiante32
passi/osservazione/manichino-0	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-1	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-2	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-3	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-4	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-5	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-6	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-7	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-8	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-9	Tensore	(1,)	galleggiante32
passi/osservazione/estremità	Tensore	(12,)	galleggiante32
passi/osservazione/altezza_testa	Tensore	(1,)	galleggiante32
passi/osservazione/joint_angles	Tensore	(21,)	galleggiante32
passi/osservazione/torso_vertical	Tensore	(3,)	galleggiante32
passi/osservazione/velocità	Tensore	(27,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_easy_100_percent

Dimensione del set di dati: 7.53 GiB
Cache automatica ( documentazione ): No
Divisioni :

Diviso	Esempi
`'train'`	20.000

Struttura delle caratteristiche :

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

Documentazione delle funzionalità :

Caratteristica	Classe	Forma	Tipo D
	CaratteristicheDict
episodio_ritorno	Tensore		galleggiante32
passi	Set di dati
passi/azione	Tensore	(21,)	galleggiante32
passi/sconto	Tensore	(1,)	galleggiante32
passi/è_primo	Tensore		bool
passi/è_ultimo	Tensore		bool
passi/è_terminale	Tensore		bool
passi/osservazione	CaratteristicheDict
passi/osservazione/com_velocity	Tensore	(3,)	galleggiante32
passi/osservazione/manichino-0	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-1	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-2	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-3	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-4	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-5	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-6	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-7	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-8	Tensore	(1,)	galleggiante32
passi/osservazione/manichino-9	Tensore	(1,)	galleggiante32
passi/osservazione/estremità	Tensore	(12,)	galleggiante32
passi/osservazione/altezza_testa	Tensore	(1,)	galleggiante32
passi/osservazione/joint_angles	Tensore	(21,)	galleggiante32
passi/osservazione/torso_vertical	Tensore	(3,)	galleggiante32
passi/osservazione/velocità	Tensore	(27,)	galleggiante32
passi/ricompensa	Tensore	(1,)	galleggiante32

Esempi ( tfds.as_dataframe ):

rlu_rwrl Mantieni tutto organizzato con le raccolte Salva e classifica i contenuti in base alle tue preferenze.