schola.scripts.sb3.utils.SingleEnvRewardCallback

class schola.scripts.sb3.utils.SingleEnvRewardCallback(verbose=0, id=0, frequency=10) : Bases: BaseCallback

Callback for logging rewards and steps taken by a single environment inside a vector environment.

Parameters: : - verbose (int) – Verbosity level.

id (int) – The id of the environment to log rewards and steps for.
frequency (int) – The frequency at which to log the rewards and steps taken.

episode_reward : The reward for the current episode.

Type: : float

episode_rewards : The rewards for each episode.

Type: : List[float]

episode_steps : The number of steps taken in the current episode.

Type: : int

step_count : The number of steps taken in each episode.

Type: : List[int]

last_logging_interval : The last interval that was logged.

Type: : int

logging_interval_size : The frequency at which to log the rewards and steps taken.

Type: : int

id : The id of the environment to log rewards and steps for.

Type: : int

Methods


`__init__`([verbose, id, frequency])
`get_reward_interval`()	Returns the rewards for the last logging interval.
`get_step_interval`()	Returns the steps taken for each episode in the last logging interval.
`increment_logging_interval`()	Increments the logging interval by self.logging_interval_size steps.
`init_callback`(model)	Initialize the callback by saving references to the RL model and the training environment for convenience.
`on_rollout_end`()
`on_rollout_start`()
`on_step`()	This method will be called by the model after each call to `env.step()`.
`on_training_end`()
`on_training_start`(locals_, globals_)
`update_child_locals`(locals_)	Update the references to the local variables on sub callbacks.
`update_locals`(locals_)	Update the references to the local variables.

Attributes


`ready_to_log`	Returns whether the environment is ready to log, by checking if there are more episodes completed than self.logging_interval_size since we last logged.
`model`
`logger`

__init__(verbose=0, id=0, frequency=10)

get_reward_interval() : Returns the rewards for the last logging interval.

Returns: : The rewards for the last logging interval.

Return type: : List[float]

get_step_interval() : Returns the steps taken for each episode in the last logging interval.

Returns: : The steps taken for each episode in the last logging interval.

Return type: : List[int]

increment_logging_interval() : Increments the logging interval by self.logging_interval_size steps.

Return type: : None

property ready_to_log*: bool* : Returns whether the environment is ready to log, by checking if there are more episodes completed than self.logging_interval_size since we last logged.

Returns: : Whether the environment is ready to log.

Return type: : bool