Skip to content

IMPALASettings

Full path: schola.scripts.rllib.settings.IMPALASettings

Dataclass for IMPALA (Importance Weighted Actor-Learner Architecture) algorithm specific settings. This class defines the parameters used in the IMPALA algorithm, including V-trace settings for off-policy correction.

IMPALASettings(vtrace = True, vtrace_clip_rho_threshold = 1.0,
vtrace_clip_pg_rho_threshold = 1.0)

Bases: RllibAlgorithmSpecificSettings

Parameters

  • vtrace (bool)

  • vtrace_clip_rho_threshold (Annotated)

  • vtrace_clip_pg_rho_threshold (Annotated)

Methods

init

__init__(vtrace = True, vtrace_clip_rho_threshold = 1.0,
vtrace_clip_pg_rho_threshold = 1.0)

Parameters

  • vtrace (bool)

  • vtrace_clip_rho_threshold (Annotated)

  • vtrace_clip_pg_rho_threshold (Annotated)


get_settings_dict

get_settings_dict()

Get the settings as a dictionary keyed by the correct parameter name in Ray

Attributes

vtrace

vtrace

Whether to use the V-trace algorithm for off-policy correction in the IMPALA algorithm. V-trace is a method to correct the bias introduced by using off-policy data for training. It helps to ensure that the value estimates are more accurate and stable.


vtrace_clip_pg_rho_threshold

vtrace_clip_pg_rho_threshold

The clip threshold for V-trace rho values in the policy gradient.


vtrace_clip_rho_threshold

vtrace_clip_rho_threshold

The clip threshold for V-trace rho values.


name

name

rllib_config

rllib_config