Module: tf_agents.agents.ppo

Proximal Policy Optimization (PPO) agents.

Modules

ppo_actor_network module: Sequential Actor Network for PPO.

ppo_agent module: A PPO Agent.

ppo_clip_agent module: A PPO Agent implementing the clipped probability ratio objective.

ppo_kl_penalty_agent module: A PPO Agent implementing the KL penalty loss.

ppo_policy module: An ActorPolicy that also returns policy_info needed for PPO training.

ppo_utils module: Utility functions for ppo_agent.py.
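
The clipped probability ratio and KL penalty variants listed above differ mainly in how they keep the updated policy close to the one that collected the data. The following is a minimal NumPy sketch of the two policy-loss terms, not the TF-Agents implementation: the function names, argument names, and default coefficients are hypothetical, and the real agents typically combine the policy loss with further terms such as a value-estimation loss and entropy regularization.

```python
import numpy as np


def clipped_surrogate_loss(log_prob_new, log_prob_old, advantages, epsilon=0.2):
    """Illustrative clipped-ratio policy loss (hypothetical helper)."""
    # Probability ratio between the updated policy and the data-collecting policy.
    ratio = np.exp(log_prob_new - log_prob_old)
    unclipped = ratio * advantages
    # Restrict the ratio to [1 - epsilon, 1 + epsilon] before weighting.
    clipped = np.clip(ratio, 1.0 - epsilon, 1.0 + epsilon) * advantages
    # Take the pessimistic (smaller) objective per sample, then negate to get a loss.
    return -np.mean(np.minimum(unclipped, clipped))


def kl_penalty_loss(log_prob_new, log_prob_old, advantages, kl, beta=1.0):
    """Illustrative KL-penalized policy loss (hypothetical helper).

    `kl` holds a per-sample KL divergence between old and new policies,
    assumed to be computed elsewhere; `beta` is the penalty coefficient.
    """
    ratio = np.exp(log_prob_new - log_prob_old)
    # Unclipped surrogate objective plus a penalty on policy divergence.
    return -np.mean(ratio * advantages) + beta * np.mean(kl)


# Small usage example with made-up numbers.
log_prob_old = np.zeros(2)
log_prob_new = np.log(np.array([1.5, 0.5]))
advantages = np.array([1.0, 1.0])
kl = np.array([0.1, 0.3])

clip_loss = clipped_surrogate_loss(log_prob_new, log_prob_old, advantages)
penalty_loss = kl_penalty_loss(log_prob_new, log_prob_old, advantages, kl, beta=2.0)
```

In the clipped variant, a sample whose ratio has already moved past the clip range in the direction favored by its advantage contributes no further gradient. The penalty variant instead leaves the ratio unclipped and discourages large policy changes through the `beta`-weighted KL term.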

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License.

Last updated 2024-04-26 UTC.