Skip to content

FRLlibAPPOSettings

A struct to hold Asynchronous Proximal Policy Optimization(APPO) settings for an RLLib training script.

struct FRLlibAPPOSettings : public FTrainingSettings

Methods

GenerateTrainingArgs

virtual void GenerateTrainingArgs(FScriptArgBuilder &ArgBuilder) const

Appends APPO-related CLI arguments to the training script builder.

Parameters

  • ArgBuilder (FScriptArgBuilder)

FRLlibAPPOSettings

virtual ~FRLlibAPPOSettings()

Attributes

bVTrace

bool bVTrace = true

Whether to use V-trace for off-policy correction (APPO).


VTraceClipRhoThreshold

float VTraceClipRhoThreshold = 1.0

V-trace rho clipping threshold.


VTraceClipPGRhoThreshold

float VTraceClipPGRhoThreshold = 1.0

V-trace policy-gradient rho clipping threshold.


GAELambda

float GAELambda = 0.95

GAE lambda for advantage estimation.


ClipParam

float ClipParam = 0.2

PPO-style policy clipping parameter.


bUseGAE

bool bUseGAE = true

Whether to use generalized advantage estimation.

Source: Source/ScholaTraining/Public/TrainingSettings/Ray/Algorithms/RLlibAPPOSettings.h