APPOSettings

Full path: schola.scripts.rllib.settings.APPOSettings

Dataclass for APPO (Asynchronous Proximal Policy Optimization) algorithm specific settings. This class inherits from both IMPALASettings and PPOSettings to combine the settings for both algorithms. This allows for the use of both V-trace for off-policy correction and PPO for policy optimization in a single algorithm.

APPOSettings(gae_lambda = 0.95, clip_param = 0.2, use_gae = True, vtrace = True,
             vtrace_clip_rho_threshold = 1.0,
             vtrace_clip_pg_rho_threshold = 1.0)

Bases: IMPALASettings, PPOSettings

Parameters

gae_lambda (Annotated)
clip_param (Annotated)
use_gae (bool)
vtrace (bool)
vtrace_clip_rho_threshold (Annotated)
vtrace_clip_pg_rho_threshold (Annotated)

Methods

init

__init__(gae_lambda = 0.95, clip_param = 0.2, use_gae = True, vtrace = True,
         vtrace_clip_rho_threshold = 1.0, vtrace_clip_pg_rho_threshold = 1.0)

Parameters

gae_lambda (Annotated)
clip_param (Annotated)
use_gae (bool)
vtrace (bool)
vtrace_clip_rho_threshold (Annotated)
vtrace_clip_pg_rho_threshold (Annotated)

get_settings_dict

get_settings_dict()

Get the settings as a dictionary keyed by the correct parameter name in Ray

Attributes

name

name

rllib_config

rllib_config