academia.utils.models module
Ready-made network architectures to use with agents and environments provided by academia package.
Note
These are just example architectures that allow users to quickly set up simple experiments. Users are encouraged to design their own network architectures that better suit their needs.
Models for BridgeBuilding environment
See also
- class academia.utils.models.bridge_building.MLPActor
Bases:
ModuleNeural network architecture for PPO (actor) compatible with Bridge Building environment with the following parameters:
n_frames_stacked=1append_step_count=Falseobs_type='array'
- forward(obs) Tensor
Defines the computation performed at every call.
- class academia.utils.models.bridge_building.MLPCritic
Bases:
ModuleNeural network architecture for PPO (critic) compatible with Bridge Building environment with the following parameters:
n_frames_stacked=1append_step_count=Falseobs_type='array'
- forward(obs) Tensor
Defines the computation performed at every call.
- class academia.utils.models.bridge_building.MLPDQN
Bases:
ModuleNeural network architecture for DQN compatible with Bridge Building environment with the following parameters:
n_frames_stacked=1append_step_count=Falseobs_type='array'
- forward(obs) Tensor
Defines the computation performed at every call.
- class academia.utils.models.bridge_building.MLPStepActor
Bases:
ModuleNeural network architecture for PPO (actor) compatible with Bridge Building environment with the following parameters:
n_frames_stacked=1append_step_count=Trueobs_type='array'
- forward(obs) Tensor
Defines the computation performed at every call.
- class academia.utils.models.bridge_building.MLPStepCritic
Bases:
ModuleNeural network architecture for PPO (critic) compatible with Bridge Building environment with the following parameters:
n_frames_stacked=1append_step_count=Trueobs_type='array'
- forward(obs) Tensor
Defines the computation performed at every call.
- class academia.utils.models.bridge_building.MLPStepDQN
Bases:
ModuleNeural network architecture for DQN compatible with Bridge Building environment with the following parameters:
n_frames_stacked=1append_step_count=Trueobs_type='array'
- forward(obs) Tensor
Defines the computation performed at every call.
Models for DoorKey environment
See also
- class academia.utils.models.door_key.MLPActor
Bases:
ModuleNeural network architecture for PPO (actor) compatible with Door Key environment with the following parameters:
n_frames_stacked=1append_step_count=False
- forward(obs) Tensor
Defines the computation performed at every call.
- class academia.utils.models.door_key.MLPCritic
Bases:
ModuleNeural network architecture for PPO (critic) compatible with Door Key environment with the following parameters:
n_frames_stacked=1append_step_count=False
- forward(obs) Tensor
Defines the computation performed at every call.
- class academia.utils.models.door_key.MLPDQN
Bases:
ModuleNeural network architecture for DQN compatible with Door Key environment with the following parameters:
n_frames_stacked=1append_step_count=False
- forward(x) Tensor
Defines the computation performed at every call.
- class academia.utils.models.door_key.MLPStepActor
Bases:
ModuleNeural network architecture for PPO (actor) compatible with Door Key environment with the following parameters:
n_frames_stacked=1append_step_count=True
- forward(obs) Tensor
Defines the computation performed at every call.
Models for LavaCrossing environment
See also
- class academia.utils.models.lava_crossing.MLPActor
Bases:
ModuleNeural network architecture for PPO (actor) compatible with Lava Crossing environment with the following parameters:
n_frames_stacked=1append_step_count=False
- forward(obs) Tensor
Defines the computation performed at every call.
- class academia.utils.models.lava_crossing.MLPCritic
Bases:
ModuleNeural network architecture for PPO (critic) compatible with Lava Crossing environment with the following parameters:
n_frames_stacked=1append_step_count=False
- forward(obs) Tensor
Defines the computation performed at every call.
- class academia.utils.models.lava_crossing.MLPDQN
Bases:
ModuleNeural network architecture for DQN compatible with Lava Crossing environment with the following parameters:
n_frames_stacked=1append_step_count=False
- forward(x) Tensor
Defines the computation performed at every call.
- class academia.utils.models.lava_crossing.MLPStepActor
Bases:
ModuleNeural network architecture for PPO (actor) compatible with Lava Crossing environment with the following parameters:
n_frames_stacked=1append_step_count=True
- forward(obs) Tensor
Defines the computation performed at every call.
- class academia.utils.models.lava_crossing.MLPStepCritic
Bases:
ModuleNeural network architecture for PPO (critic) compatible with Lava Crossing environment with the following parameters:
n_frames_stacked=1append_step_count=True
- forward(obs) Tensor
Defines the computation performed at every call.
Models for LunarLander environment
See also
- class academia.utils.models.lunar_lander.MLPActor
Bases:
ModuleNeural network architecture for PPO (actor) compatible with Lunar Lander environment with the following parameters:
n_frames_stacked=1append_step_count=False
- forward(obs) Tensor
Defines the computation performed at every call.
- class academia.utils.models.lunar_lander.MLPCritic
Bases:
ModuleNeural network architecture for PPO (critic) compatible with Lunar Lander environment with the following parameters:
n_frames_stacked=1append_step_count=False
- forward(obs) Tensor
Defines the computation performed at every call.
- class academia.utils.models.lunar_lander.MLPDQN
Bases:
ModuleNeural network architecture for DQN compatible with Lunar Lander environment with the following parameters:
n_frames_stacked=1append_step_count=False
- forward(obs) Tensor
Defines the computation performed at every call.
- class academia.utils.models.lunar_lander.MLPStepActor
Bases:
ModuleNeural network architecture for PPO (actor) compatible with Lunar Lander environment with the following parameters:
n_frames_stacked=1append_step_count=True
- forward(obs) Tensor
Defines the computation performed at every call.
Models for MsPacman environment
See also
- class academia.utils.models.ms_pacman.MLPActor
Bases:
ModuleNeural network architecture for PPO (actor) compatible with Ms Pacman environment with the following parameters:
n_frames_stacked=1append_step_count=Falseobs_type='ram'
- forward(obs) Tensor
Defines the computation performed at every call.
- class academia.utils.models.ms_pacman.MLPCritic
Bases:
ModuleNeural network architecture for PPO (critic) compatible with Ms Pacman environment with the following parameters:
n_frames_stacked=1append_step_count=Falseobs_type='ram'
- forward(obs) Tensor
Defines the computation performed at every call.
- class academia.utils.models.ms_pacman.MLPDQN
Bases:
ModuleNeural network architecture for DQN compatible with Ms Pacman environment with the following parameters:
n_frames_stacked=1append_step_count=Falseobs_type='ram'
- forward(x) Tensor
Defines the computation performed at every call.
- class academia.utils.models.ms_pacman.MLPStepActor
Bases:
ModuleNeural network architecture for PPO (actor) compatible with Ms Pacman environment with the following parameters:
n_frames_stacked=1append_step_count=Trueobs_type='ram'
- forward(obs) Tensor
Defines the computation performed at every call.
- class academia.utils.models.ms_pacman.MLPStepCritic
Bases:
ModuleNeural network architecture for PPO (critic) compatible with Ms Pacman environment with the following parameters:
n_frames_stacked=1append_step_count=Trueobs_type='ram'
- forward(obs) Tensor
Defines the computation performed at every call.