pantheonrl.envs.rpsgym.rps.RPSWeightedAgent
- class RPSWeightedAgent(r=1, p=1, s=1, np_random=<module 'numpy.random'>)[source]
Bases: Agent
Random RPS agent based on weights of each action.
Methods
get_action(obs): Return an action given an observation.
update(reward, done): Add new rewards and done information if the agent can learn.
- get_action(obs)[source]
Return an action given an observation.
- Parameters:
obs – The observation to use
- Returns:
The action to take
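As a rough illustration of what "based on weights of each action" could mean, the sketch below samples one of three actions in proportion to the weights r, p, and s. It is not the library's implementation: the function name sample_weighted_action, the 0/1/2 action encoding, and the use of Python's standard random module (in place of numpy.random) are all assumptions made to keep the example self-contained.

```python
import random


def sample_weighted_action(r=1, p=1, s=1, rng=None):
    """Return 0 (rock), 1 (paper), or 2 (scissors), chosen in proportion to r, p, s.

    Hypothetical helper; the action encoding is an assumption for illustration.
    """
    rng = rng if rng is not None else random.Random()
    weights = [r, p, s]
    if any(w < 0 for w in weights) or sum(weights) <= 0:
        raise ValueError("weights must be non-negative with a positive sum")
    # random.choices draws with probability weight / sum(weights).
    return rng.choices([0, 1, 2], weights=weights, k=1)[0]


# Draw many actions with rock weighted twice as heavily as the others.
rng = random.Random(0)
actions = [sample_weighted_action(r=2, p=1, s=1, rng=rng) for _ in range(1000)]
counts = [actions.count(a) for a in (0, 1, 2)]

# A weight of zero removes an action entirely.
always_rock = sample_weighted_action(r=1, p=0, s=0, rng=rng)
```

With equal weights (the documented defaults r=1, p=1, s=1) this reduces to a uniform random choice among the three actions.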
- update(reward, done)[source]
Add new rewards and done information if the agent can learn.
Each update corresponds to the most recent get_action.
If multiple calls to update correspond to the same get_action, their rewards are summed and the last done flag is used.
- Parameters:
reward – The reward received from the previous action step
done – Whether the game is done
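The accumulation rule described for update (rewards summed, last done flag kept) can be sketched with a small stand-in class. RewardAccumulator and its method record_action are hypothetical names used only for this illustration; they are not part of PantheonRL, and the real agent may store this information differently.

```python
class RewardAccumulator:
    """Hypothetical stand-in showing how repeated update() calls could be merged."""

    def __init__(self):
        self.transitions = []      # completed (action, total_reward, done) records
        self._last_action = None   # action awaiting its reward(s)
        self._reward = 0.0
        self._done = False

    def record_action(self, action):
        # A new action closes out whatever was accumulated for the previous one.
        self._flush()
        self._last_action = action

    def update(self, reward, done):
        # Rewards for the same action are summed; the latest done flag wins.
        self._reward += reward
        self._done = done

    def _flush(self):
        if self._last_action is not None:
            self.transitions.append((self._last_action, self._reward, self._done))
        self._last_action = None
        self._reward = 0.0
        self._done = False


acc = RewardAccumulator()
acc.record_action(0)
acc.update(1.0, False)
acc.update(0.5, True)   # second update for the same action
acc.record_action(2)    # starting a new step finalizes the previous one
```

After these calls, acc.transitions holds a single record for action 0 with a total reward of 1.5 and done set to True.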