schola.scripts.ray.settings.IMPALASettings
- class schola.scripts.ray.settings.IMPALASettings(vtrace=True, vtrace_clip_rho_threshold=1.0, vtrace_clip_pg_rho_threshold=1.0)[source]
-
Bases:
RLLibAlgorithmSpecificSettings
Dataclass for IMPALA (Importance Weighted Actor-Learner Architecture) algorithm specific settings. This class defines the parameters used in the IMPALA algorithm, including V-trace settings for off-policy correction.
Methods
__init__
([vtrace, …])Add the settings to the parser or subparser
Get the settings as a dictionary keyed by the correct parameter name in Ray
Attributes
Whether to use the V-trace algorithm for off-policy correction in the IMPALA algorithm.
The clip threshold for V-trace rho values in the policy gradient.
The clip threshold for V-trace rho values.
- classmethod get_parser()[source]
-
Add the settings to the parser or subparser
- get_settings_dict()[source]
-
Get the settings as a dictionary keyed by the correct parameter name in Ray
- property name: str
- property rllib_config: Type[IMPALAConfig]
- vtrace: bool = True
-
Whether to use the V-trace algorithm for off-policy correction in the IMPALA algorithm. V-trace is a method to correct the bias introduced by using off-policy data for training. It helps to ensure that the value estimates are more accurate and stable.
- vtrace_clip_pg_rho_threshold: float = 1.0
-
The clip threshold for V-trace rho values in the policy gradient.
- vtrace_clip_rho_threshold: float = 1.0
-
The clip threshold for V-trace rho values.