Skip to content

Velora

state

Achronus/velora

velora.state¶

Documentation

User Guide - Tutorials: Callbacks

Dataclasses for storing states used during callbacks and agent training.

`AnalyticsState` `dataclass` ¶

A storage container for the details of a Comet or Weights and Biases analytics experiment.

Parameters:

Name	Type	Description	Default
`project_name`	`str`	the name of the project to add this experiment to	required
`experiment_name`	`str`	the name of the experiment	`None`
`tags`	`List[str]`	a list of tags associated with the experiment	`None`

Source code in velora/state.py

Python
@dataclass
class AnalyticsState:
    """
    A storage container for the details of a [Comet](https://www.comet.com/) or
    [Weights and Biases](https://wandb.ai/) analytics experiment.

    Parameters:
        project_name (str): the name of the project to add this experiment to
        experiment_name (str, optional): the name of the experiment
        tags (List[str], optional): a list of tags associated with the experiment
    """

    project_name: str
    experiment_name: str | None = None
    tags: List[str] | None = None

`RecordState` `dataclass` ¶

A storage container for the video recording state.

Parameters:

Name	Type	Description	Default
`dirpath`	`Path`	the video directory path to store the videos	required
`method`	`Literal['episode', 'step']`	the recording method	required
`episode_trigger`	`Callable[[int], bool]`	the `episode` recording trigger	`None`
`step_trigger`	`Callable[[int], bool]`	the `step` recording trigger	`None`

Source code in velora/state.py

Python
@dataclass
class RecordState:
    """
    A storage container for the video recording state.

    Parameters:
        dirpath (Path): the video directory path to store the videos
        method (Literal["episode", "step"]): the recording method
        episode_trigger (Callable[[int], bool], optional): the `episode` recording
            trigger
        step_trigger (Callable[[int], bool], optional): the `step` recording trigger
    """

    dirpath: Path
    method: RecordMethodLiteral
    episode_trigger: Callable[[int], bool] | None = None
    step_trigger: Callable[[int], bool] | None = None

    def to_wrapper(self) -> Dict[str, Any]:
        """
        Converts the state into wrapper parameters.

        Returns:
            params (Dict[str, Any]): values as parameters for [Gymnasium's RecordVideo](https://gymnasium.farama.org/api/wrappers/misc_wrappers/#gymnasium.wrappers.RecordVideo) wrapper.

            Includes the following keys - `[video_folder, episode_trigger, step_trigger]`.
        """
        return {
            "video_folder": self.dirpath,
            "episode_trigger": self.episode_trigger,
            "step_trigger": self.step_trigger,
        }

`to_wrapper()` ¶

Converts the state into wrapper parameters.

Returns:

Name	Type	Description
`params`	`Dict[str, Any]`	values as parameters for Gymnasium's RecordVideo wrapper.
	`Dict[str, Any]`	Includes the following keys - `[video_folder, episode_trigger, step_trigger]`.

Source code in velora/state.py

Python
def to_wrapper(self) -> Dict[str, Any]:
    """
    Converts the state into wrapper parameters.

    Returns:
        params (Dict[str, Any]): values as parameters for [Gymnasium's RecordVideo](https://gymnasium.farama.org/api/wrappers/misc_wrappers/#gymnasium.wrappers.RecordVideo) wrapper.

        Includes the following keys - `[video_folder, episode_trigger, step_trigger]`.
    """
    return {
        "video_folder": self.dirpath,
        "episode_trigger": self.episode_trigger,
        "step_trigger": self.step_trigger,
    }

`TrainState` `dataclass` ¶

A storage container for the current state of model training.

Parameters:

Name	Type	Description	Default
`agent`	`RLModuleAgent`	the agent being trained	required
`env`	`gymnasium.Env`	a single training or evaluation environment	required
`session`	`sqlmodel.Session`	the current metric database session	required
`experiment_id`	`int`	the current experiment's unique ID	required
`total_episodes`	`int`	total number of training episodes	`0`
`total_steps`	`int`	total number of training steps	`0`
`status`	`Literal['start', 'episode', 'logging', 'step', 'complete']`	the current stage of training. `start` - before training starts. `episode` - inside the episode loop. `logging` - metric logging. `step` - inside the timestep loop. `complete` - completed training.	`'start'`
`logging_type`	`Literal['episode', 'step']`	the logging type	`'episode'`
`current_ep`	`int`	the current episode index	`0`
`current_step`	`int`	the current training timestep	`0`
`ep_reward`	`float`	the current episode reward	`0.0`
`stop_training`	`bool`	a flag to declare training termination	`False`
`saving_enabled`	`bool`	a flag for checkpoint saving	`False`
`checkpoint_dir`	`Path`	the checkpoint directory path when `saving_enabled=True`	`None`
`record_state`	`RecordState`	the video recording state	`None`
`analytics_state`	`AnalyticsState`	the analytics state	`None`

Source code in velora/state.py

Python
@dataclass
class TrainState:
    """
    A storage container for the current state of model training.

    Parameters:
        agent (RLModuleAgent): the agent being trained
        env (gymnasium.Env): a single training or evaluation environment
        session (sqlmodel.Session): the current metric database session
        experiment_id (int): the current experiment's unique ID
        total_episodes (int, optional): total number of training episodes
        total_steps (int, optional): total number of training steps
        status (Literal["start", "episode", "logging", "step", "complete"], optional): the current stage of training.

            - `start` - before training starts.
            - `episode` - inside the episode loop.
            - `logging` - metric logging.
            - `step` - inside the timestep loop.
            - `complete` - completed training.

        logging_type (Literal["episode", "step"], optional): the logging type
        current_ep (int, optional): the current episode index
        current_step (int, optional): the current training timestep
        ep_reward (float, optional): the current episode reward
        stop_training (bool, optional): a flag to declare training termination
        saving_enabled (bool, optional): a flag for checkpoint saving
        checkpoint_dir (Path, optional): the checkpoint directory path when
            `saving_enabled=True`
        record_state (RecordState, optional): the video recording state
        analytics_state (AnalyticsState, optional): the analytics state
    """

    agent: RLModuleAgent
    env: gym.Env
    session: Session
    experiment_id: int
    total_episodes: int = 0
    total_steps: int = 0
    status: StatusLiteral = "start"
    logging_type: Literal["episode", "step"] = "episode"
    current_ep: int = 0
    current_step: int = 0
    ep_reward: float = 0.0
    stop_training: bool = False
    saving_enabled: bool = False
    checkpoint_dir: Path | None = None
    record_state: RecordState | None = None
    analytics_state: AnalyticsState | None = None

    def update(
        self,
        *,
        status: StatusLiteral | None = None,
        current_ep: int | None = None,
        current_step: int | None = None,
        ep_reward: int | None = None,
        logging_type: Literal["episode", "step"] | None = None,
    ) -> None:
        """
        Updates the training state. When any input is `None`, uses existing value.

        Parameters:
            status (Literal["start", "episode", "logging", "step", "complete"], optional): the current stage of training.

                - `start` - before training start.
                - `episode` - inside the episode loop.
                - `logging` - metric logging.
                - `step` - inside the timestep loop.
                - `complete` - completed training.

            current_ep (int, optional): the current episode index
            current_step (int, optional): the current training timestep
            ep_reward (float, optional): the current episode or rollout update reward
            logging_type (Literal["episode", "step"], optional): the logging type
        """
        self.status = status if status else self.status
        self.current_ep = current_ep if current_ep else self.current_ep
        self.current_step = current_step if current_step else self.current_step
        self.ep_reward = ep_reward if ep_reward else self.ep_reward
        self.logging_type = logging_type if logging_type else self.logging_type

    def analytics_update(self) -> None:
        """
        Updates the analytics state details that are `None` dynamically, using
        the current training state.
        """
        agent_name = self.agent.__class__.__name__
        env_name = self.env.spec.name

        new_state = self.analytics_state

        new_state.experiment_name = (
            new_state.experiment_name
            if new_state.experiment_name
            else f"{agent_name}_{env_name}_{self.total_episodes}ep"
        )

        new_state.tags = new_state.tags if new_state.tags else [agent_name, env_name]

        # Update state
        self.analytics_state = new_state

`analytics_update()` ¶

Updates the analytics state details that are None dynamically, using the current training state.

Source code in velora/state.py

Python
def analytics_update(self) -> None:
    """
    Updates the analytics state details that are `None` dynamically, using
    the current training state.
    """
    agent_name = self.agent.__class__.__name__
    env_name = self.env.spec.name

    new_state = self.analytics_state

    new_state.experiment_name = (
        new_state.experiment_name
        if new_state.experiment_name
        else f"{agent_name}_{env_name}_{self.total_episodes}ep"
    )

    new_state.tags = new_state.tags if new_state.tags else [agent_name, env_name]

    # Update state
    self.analytics_state = new_state

`update(*, status=None, current_ep=None, current_step=None, ep_reward=None, logging_type=None)` ¶

Updates the training state. When any input is None, uses existing value.

Parameters:

Name	Type	Description	Default
`status`	`Literal['start', 'episode', 'logging', 'step', 'complete']`	the current stage of training. `start` - before training start. `episode` - inside the episode loop. `logging` - metric logging. `step` - inside the timestep loop. `complete` - completed training.	`None`
`current_ep`	`int`	the current episode index	`None`
`current_step`	`int`	the current training timestep	`None`
`ep_reward`	`float`	the current episode or rollout update reward	`None`
`logging_type`	`Literal['episode', 'step']`	the logging type	`None`

Source code in velora/state.py

Python
def update(
    self,
    *,
    status: StatusLiteral | None = None,
    current_ep: int | None = None,
    current_step: int | None = None,
    ep_reward: int | None = None,
    logging_type: Literal["episode", "step"] | None = None,
) -> None:
    """
    Updates the training state. When any input is `None`, uses existing value.

    Parameters:
        status (Literal["start", "episode", "logging", "step", "complete"], optional): the current stage of training.

            - `start` - before training start.
            - `episode` - inside the episode loop.
            - `logging` - metric logging.
            - `step` - inside the timestep loop.
            - `complete` - completed training.

        current_ep (int, optional): the current episode index
        current_step (int, optional): the current training timestep
        ep_reward (float, optional): the current episode or rollout update reward
        logging_type (Literal["episode", "step"], optional): the logging type
    """
    self.status = status if status else self.status
    self.current_ep = current_ep if current_ep else self.current_ep
    self.current_step = current_step if current_step else self.current_step
    self.ep_reward = ep_reward if ep_reward else self.ep_reward
    self.logging_type = logging_type if logging_type else self.logging_type