Skip to content

Conversation

@marginloop
Copy link

env.reset() returns an observation and info. This fixes issues with the expected size of a tuple.

from gym library api https://www.gymlibrary.dev/api/core/#gym.Env.reset

RETURNS:
observation (object) – Observation of the initial state. This will be an element of observation_space (typically a numpy array) and is analogous to the observation returned by step().

info (dictionary) – This dictionary contains auxiliary information complementing observation. It should be analogous to the info returned by step().

env.step returns a tuple with an additional variable. adding a throwaway variable fixes errors with expected return lengths.

from gym library api https://www.gymlibrary.dev/api/core/#gym.Env.step

Accepts an action and returns either a tuple (observation, reward, terminated, truncated, info).

from Gym API "Accepts an action and returns a tuple (observation, reward, terminated, truncated, info)"
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant