schola-sb3 Command
This script trains Stable Baselines3 models using Schola with various configuration options for training, checkpointing, and network architecture.
Usage
usage: schola-sb3 [-h] [--launch-unreal] [--executable-path EXECUTABLE_PATH] [--headless] [-p PORT] [--map MAP] [--fps FPS] [--disable-script] [-scholav SCHOLA_VERBOSITY] [--enable-checkpoints] [--checkpoint-dir CHECKPOINT_DIR] [--save-freq SAVE_FREQ] [--name-prefix NAME_PREFIX] [--export-onnx] [--save-final-policy] [--save-replay-buffer] [--save-vecnormalize] [--resume-from RESUME_FROM] [--load-vecnormalize LOAD_VECNORMALIZE] [--load-replay-buffer LOAD_REPLAY_BUFFER] [--reset-timestep] [--policy-parameters POLICY_PARAMETERS [POLICY_PARAMETERS ...]] [--critic-parameters CRITIC_PARAMETERS [CRITIC_PARAMETERS ...]] {PPO,SAC} ...
Optional Arguments
Standard command-line arguments for the schola-sb3 script.
Unreal Process Arguments
-
--launch-unreal
- Launch Unreal Engine automatically -
Default:
False
-
Required: False
-
--executable-path
- Path to the Unreal Engine executable -
Type: str
-
Required: False
-
--headless
- Run Unreal Engine in headless mode -
Default:
False
-
Required: False
-
-p, --port
- Port for Unreal Engine communication -
Default:
15151
-
Type: int
-
Required: False
-
--map
- Map to load in Unreal Engine -
Type: str
-
Required: False
-
--fps
- Target FPS for Unreal Engine -
Default:
60
-
Type: int
-
Required: False
-
--disable-script
- Disable script execution in Unreal Engine -
Default:
False
-
Required: False
Logging Arguments
-scholav, --schola-verbosity
- Verbosity level for the Schola environment- Default:
0
- Type: int
- Required: False
Checkpoint Arguments
-
--enable-checkpoints
- Enable saving checkpoints -
Default:
False
-
Required: False
-
--checkpoint-dir
- Directory to save checkpoints -
Default:
'./ckpt'
-
Type: str
-
Required: False
-
--save-freq
- Frequency with which to save checkpoints -
Default:
100000
-
Type: int
-
Required: False
-
--name-prefix
- Override the name prefix for the checkpoint files (e.g. SAC, PPO, etc.) -
Type: str
-
Required: False
-
--export-onnx
- Whether to export the model to ONNX format instead of just saving a checkpoint -
Default:
False
-
Required: False
-
--save-final-policy
- Whether to save the final policy after training is complete -
Default:
False
-
Required: False
-
--save-replay-buffer
- Save the replay buffer during training, if saving checkpoints -
Default:
False
-
Required: False
-
--save-vecnormalize
- Save the VecNormalize parameters during training, if saving checkpoints -
Default:
False
-
Required: False
Resume Arguments
-
--resume-from
- Path to a saved model to resume training from -
Type: str
-
Required: False
-
--load-vecnormalize
- Path to a saved VecNormalize parameters to load, if resuming from a checkpoint -
Type: str
-
Required: False
-
--load-replay-buffer
- Path to a saved Replay Buffer to load, if resuming from a checkpoint -
Type: str
-
Required: False
-
--reset-timestep
- Reset the timestep counter to 0 when resuming from a checkpoint -
Default:
False
-
Required: False
Network Architecture Arguments
-
--policy-parameters
- Network architecture for the policy -
Type: int (multiple values allowed)
-
Required: False
-
--critic-parameters
- Network architecture for the critic -
Type: int (multiple values allowed)
-
Required: False
Sub-commands
PPO
Proximal Policy Optimization algorithm for Stable Baselines3.
Optional Arguments
- Algorithm-specific arguments for PPO configuration
SAC
Soft Actor-Critic algorithm for Stable Baselines3.
Optional Arguments
- Algorithm-specific arguments for SAC configuration