
schola-sb3 Command

This script trains Stable Baselines3 models using Schola with various configuration options for training, checkpointing, and network architecture.

Usage

Terminal window
usage: schola-sb3 [-h] [--launch-unreal] [--executable-path EXECUTABLE_PATH] [--headless] [-p PORT] [--map MAP] [--fps FPS] [--disable-script]
[-scholav SCHOLA_VERBOSITY] [--enable-checkpoints] [--checkpoint-dir CHECKPOINT_DIR] [--save-freq SAVE_FREQ] [--name-prefix NAME_PREFIX]
[--export-onnx] [--save-final-policy] [--save-replay-buffer] [--save-vecnormalize] [--resume-from RESUME_FROM] [--load-vecnormalize LOAD_VECNORMALIZE]
[--load-replay-buffer LOAD_REPLAY_BUFFER] [--reset-timestep] [--policy-parameters POLICY_PARAMETERS [POLICY_PARAMETERS ...]]
[--critic-parameters CRITIC_PARAMETERS [CRITIC_PARAMETERS ...]]
{PPO,SAC} ...
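As a sketch of a typical invocation, the global options above come before the `{PPO,SAC}` sub-command. The executable path below is a placeholder:

```shell
# Launch a packaged Unreal build headlessly and train a PPO model
# on the documented default port (15151). Paths are placeholders.
schola-sb3 --launch-unreal \
    --executable-path /path/to/MyGame.sh \
    --headless \
    -p 15151 \
    PPO --learning-rate 0.0003 --gamma 0.99
```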

Optional Arguments

Standard command-line arguments for the schola-sb3 script.

Unreal Process Arguments

--launch-unreal

Launch Unreal Engine automatically

  • Default: False
  • Required: False

--executable-path

Path to the Unreal Engine executable

  • Type: str
  • Required: False

--headless

Run Unreal Engine in headless mode

  • Default: False
  • Required: False

-p, --port

Port for Unreal Engine communication

  • Default: 15151
  • Type: int
  • Required: False

--map

Map to load in Unreal Engine

  • Type: str
  • Required: False

--fps

Target FPS for Unreal Engine

  • Default: 60
  • Type: int
  • Required: False

--disable-script

Disable script execution in Unreal Engine

  • Default: False
  • Required: False

Logging Arguments

-scholav, --schola-verbosity

Verbosity level for the Schola environment

  • Default: 0
  • Type: int
  • Required: False

Checkpoint Arguments

--enable-checkpoints

Enable saving checkpoints

  • Default: False
  • Required: False

--checkpoint-dir

Directory to save checkpoints

  • Default: './ckpt'
  • Type: str
  • Required: False

--save-freq

Frequency with which to save checkpoints

  • Default: 100000
  • Type: int
  • Required: False

--name-prefix

Override the name prefix for the checkpoint files (e.g. SAC, PPO, etc.)

  • Type: str
  • Required: False

--export-onnx

Whether to export the model to ONNX format instead of just saving a checkpoint

  • Default: False
  • Required: False

--save-final-policy

Whether to save the final policy after training is complete

  • Default: False
  • Required: False

--save-replay-buffer

Save the replay buffer during training, if saving checkpoints

  • Default: False
  • Required: False

--save-vecnormalize

Save the VecNormalize parameters during training, if saving checkpoints

  • Default: False
  • Required: False
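Combining the checkpoint flags above, a run that saves checkpoints, exports ONNX, and keeps the final policy and VecNormalize statistics might look like this (directory and prefix are placeholders):

```shell
# Save a checkpoint every 50k steps under ./my_run/ckpt with a custom
# name prefix, plus an ONNX export and the final trained policy.
schola-sb3 --enable-checkpoints \
    --checkpoint-dir ./my_run/ckpt \
    --save-freq 50000 \
    --name-prefix my_ppo \
    --export-onnx --save-final-policy --save-vecnormalize \
    PPO
```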

Resume Arguments

--resume-from

Path to a saved model to resume training from

  • Type: str
  • Required: False

--load-vecnormalize

Path to saved VecNormalize parameters to load, if resuming from a checkpoint

  • Type: str
  • Required: False

--load-replay-buffer

Path to a saved Replay Buffer to load, if resuming from a checkpoint

  • Type: str
  • Required: False

--reset-timestep

Reset the timestep counter to 0 when resuming from a checkpoint

  • Default: False
  • Required: False
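Resuming uses the model, VecNormalize parameters, and replay buffer as separate files, per the flags above. The file names below are placeholders following Stable Baselines3's CheckpointCallback naming convention, which this tool may or may not match:

```shell
# Resume a SAC run from a saved checkpoint, restoring normalization
# statistics and the replay buffer (file names are placeholders).
schola-sb3 --resume-from ./ckpt/my_sac_100000_steps.zip \
    --load-vecnormalize ./ckpt/my_sac_vecnormalize_100000_steps.pkl \
    --load-replay-buffer ./ckpt/my_sac_replay_buffer_100000_steps.pkl \
    SAC
```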

Network Architecture Arguments

--policy-parameters

Network architecture for the policy

  • Type: int (multiple values allowed)
  • Required: False

--critic-parameters

Network architecture for the critic

  • Type: int (multiple values allowed)
  • Required: False
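Each flag takes a sequence of integers, which presumably specify the hidden-layer widths of the corresponding network (as in Stable Baselines3's `net_arch`). For example, two 256-unit hidden layers for both networks:

```shell
# Policy and critic each get two hidden layers of 256 units.
schola-sb3 --policy-parameters 256 256 \
    --critic-parameters 256 256 \
    PPO
```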

Sub-commands

PPO

Proximal Policy Optimization

Terminal window
schola-sb3 PPO [-h] [--learning-rate LEARNING_RATE] [--n-steps N_STEPS] [--batch-size BATCH_SIZE] [--n-epochs N_EPOCHS] [--gamma GAMMA] [--gae-lambda GAE_LAMBDA]
[--clip-range CLIP_RANGE] [--normalize-advantage] [--ent-coef ENT_COEF] [--vf-coef VF_COEF] [--max-grad-norm MAX_GRAD_NORM] [--use-sde]
[--sde-sample-freq SDE_SAMPLE_FREQ]

Optional Arguments

--learning-rate

The learning rate for the PPO algorithm

  • Default: 0.0003
  • Type: float
  • Required: False

--n-steps

The number of steps to take in each environment before updating the policy

  • Default: 2048
  • Type: int
  • Required: False

--batch-size

The number of samples to take from the replay buffer for each update

  • Default: 64
  • Type: int
  • Required: False

--n-epochs

The number of epochs to train the policy for each update

  • Default: 10
  • Type: int
  • Required: False

--gamma

The discount factor for the PPO algorithm

  • Default: 0.99
  • Type: float
  • Required: False

--gae-lambda

The GAE lambda value for the PPO algorithm

  • Default: 0.95
  • Type: float
  • Required: False

--clip-range

The clip range for the PPO algorithm

  • Default: 0.2
  • Type: float
  • Required: False

--normalize-advantage

Whether to normalize the advantage function

  • Default: False
  • Const: True
  • Required: False

--ent-coef

The entropy coefficient for the PPO algorithm

  • Default: 0.0
  • Type: float
  • Required: False

--vf-coef

The value function coefficient for the PPO algorithm

  • Default: 0.5
  • Type: float
  • Required: False

--max-grad-norm

The maximum gradient norm for the PPO algorithm

  • Default: 0.5
  • Type: float
  • Required: False

--use-sde

Whether to use State-Dependent Exploration for the PPO algorithm

  • Default: False
  • Const: True
  • Required: False

--sde-sample-freq

The frequency at which to sample from the SDE for the PPO algorithm

  • Default: -1
  • Type: int
  • Required: False
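Putting the PPO options together, a run that overrides a few of the defaults listed above might look like this; any flag omitted keeps its documented default:

```shell
# PPO with a lower learning rate, longer rollouts, a larger batch,
# and State-Dependent Exploration enabled.
schola-sb3 PPO \
    --learning-rate 0.0001 \
    --n-steps 4096 \
    --batch-size 128 \
    --gae-lambda 0.95 \
    --clip-range 0.2 \
    --use-sde
```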

SAC

Soft Actor-Critic

Terminal window
schola-sb3 SAC [-h] [--learning-rate LEARNING_RATE] [--buffer-size BUFFER_SIZE] [--learning-starts LEARNING_STARTS] [--batch-size BATCH_SIZE] [--tau TAU] [--gamma GAMMA]
[--train-freq TRAIN_FREQ] [--gradient-steps GRADIENT_STEPS] [--optimize-memory-usage] [--ent-coef ENT_COEF] [--target-update-interval TARGET_UPDATE_INTERVAL]
[--target-entropy TARGET_ENTROPY] [--use-sde] [--sde-sample-freq SDE_SAMPLE_FREQ]

Optional Arguments

--learning-rate

The learning rate for the SAC algorithm

  • Default: 0.0003
  • Type: float
  • Required: False

--buffer-size

The size of the replay buffer for the SAC algorithm

  • Default: 1000000
  • Type: int
  • Required: False

--learning-starts

The number of steps to take before starting to learn with the SAC algorithm

  • Default: 100
  • Type: int
  • Required: False

--batch-size

The number of samples to take from the replay buffer for each update

  • Default: 256
  • Type: int
  • Required: False

--tau

The tau value for the SAC algorithm

  • Default: 0.005
  • Type: float
  • Required: False

--gamma

The discount factor for the SAC algorithm

  • Default: 0.99
  • Type: float
  • Required: False

--train-freq

The frequency at which to train the policy for the SAC algorithm

  • Default: 1
  • Type: int
  • Required: False

--gradient-steps

The number of gradient steps to take for the SAC algorithm

  • Default: 1
  • Type: int
  • Required: False

--optimize-memory-usage

Whether to optimize memory usage for the SAC algorithm

  • Default: False
  • Const: True
  • Required: False

--ent-coef

The entropy coefficient for the SAC algorithm

  • Default: 'auto'
  • Type: str
  • Required: False

--target-update-interval

The frequency at which to update the target network for the SAC algorithm

  • Default: 1
  • Type: int
  • Required: False

--target-entropy

The target entropy for the SAC algorithm

  • Default: 'auto'
  • Type: str
  • Required: False

--use-sde

Whether to use State-Dependent Exploration for the SAC algorithm

  • Default: False
  • Const: True
  • Required: False

--sde-sample-freq

The frequency at which to sample from the SDE for the SAC algorithm

  • Default: -1
  • Type: int
  • Required: False
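Likewise for SAC, a run overriding a few of the defaults above might look like this; the entropy coefficient and target entropy stay at their 'auto' defaults:

```shell
# SAC with a smaller replay buffer and a delayed learning start;
# train frequency and gradient steps kept at their defaults of 1.
schola-sb3 SAC \
    --buffer-size 500000 \
    --learning-starts 10000 \
    --batch-size 256 \
    --tau 0.005 \
    --gamma 0.99
```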