Settings

Cyclopts dataclasses for Stable-Baselines3 training with Schola (PPO, SAC, checkpoints, resume).

| Name | Description |
| --- | --- |
| BaseSb3AlgorithmSettings | Shared rollout and optimizer settings for on-policy SB3 algorithms. |
| PPOTrainSettings | Dataclass for configuring the settings of the Proximal Policy Optimization (PPO) algorithm. |
| SACTrainSettings | Dataclass for configuring the settings of the Soft Actor-Critic (SAC) algorithm. |
| Sb3CheckpointSettings | SB3-specific checkpoint settings. |
| Sb3LoggingSettings | Dataclass for configuring logging settings for the training process. |
| Sb3NetworkArchitectureSettings | Network architecture settings for SB3 algorithms. |
| Sb3ResumeSettings | Dataclass for holding arguments related to resuming training from a saved state. |
| Sb3TrainingSettings | Top-level training run options for the SB3 launcher (mirrors RLlib’s TrainingSettings grouping). |
| Sb3TrainScriptSettings | Top level dataclass for configuring the script arguments used in the SB3 launcher. |
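
The grouping listed above (one dataclass per concern, collected under a top-level script settings dataclass) can be illustrated with a minimal standard-library sketch. Every class name, field name, and default below is hypothetical and chosen only for illustration; none is taken from Schola's actual definitions, and the Cyclopts wiring that turns such dataclasses into command-line arguments is deliberately left out.

```python
from dataclasses import asdict, dataclass, field


@dataclass
class ExampleCheckpointSettings:
    """Hypothetical checkpoint group; field names are illustrative only."""

    enable_checkpoints: bool = False
    save_freq: int = 10_000  # assumed meaning: steps between checkpoints


@dataclass
class ExamplePPOSettings:
    """Hypothetical PPO group; defaults are placeholders, not Schola's."""

    learning_rate: float = 3e-4
    n_steps: int = 2048
    batch_size: int = 64


@dataclass
class ExampleTrainScriptSettings:
    """Top-level container nesting one dataclass per settings group."""

    timesteps: int = 100_000
    # default_factory gives every instance its own nested settings object
    algorithm: ExamplePPOSettings = field(default_factory=ExamplePPOSettings)
    checkpoint: ExampleCheckpointSettings = field(
        default_factory=ExampleCheckpointSettings
    )


def describe(settings: ExampleTrainScriptSettings) -> dict:
    """Flatten the nested settings into plain dicts, e.g. for logging."""
    return asdict(settings)


if __name__ == "__main__":
    # Override one group and keep the defaults for everything else.
    custom = ExampleTrainScriptSettings(
        algorithm=ExamplePPOSettings(learning_rate=1e-3)
    )
    print(describe(custom))
```

Keeping each concern in its own dataclass lets a caller override a single group, such as the algorithm hyperparameters, while the remaining groups keep their defaults.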