Gymnasium/examples/agents/random_agent.py

import argparse
import logging
import sys

import gym
from gym import wrappers


class RandomAgent(object):
    """The world's simplest agent!"""
    def __init__(self, action_space):
        self.action_space = action_space

    def act(self, observation, reward, done):
        return self.action_space.sample()

if __name__ == '__main__':
    parser = argparse.ArgumentParser(description=None)
    parser.add_argument('env_id', nargs='?', default='CartPole-v0', help='Select the environment to run')
    args = parser.parse_args()

    # Call `undo_logger_setup` if you want to undo Gym's logger setup
    # and configure things manually. (The default should be fine most
    # of the time.)
    gym.undo_logger_setup()
    logger = logging.getLogger()
    formatter = logging.Formatter('[%(asctime)s] %(message)s')
    handler = logging.StreamHandler(sys.stderr)
    handler.setFormatter(formatter)
    logger.addHandler(handler)

    # You can set the level to logging.DEBUG or logging.WARN if you
    # want to change the amount of output.
    logger.setLevel(logging.INFO)

    env = gym.make(args.env_id)

    # You provide the directory to write to (can be an existing
    # directory, including one with existing data -- all monitor files
    # will be namespaced). You can also dump to a tempdir if you'd
    # like: tempfile.mkdtemp().
    outdir = '/tmp/random-agent-results'
    env = wrappers.Monitor(directory=outdir, force=True)(env)
    env.seed(0)
    agent = RandomAgent(env.action_space)

    episode_count = 100
    reward = 0
    done = False

    for i in range(episode_count):
        ob = env.reset()
        while True:
            action = agent.act(ob, reward, done)
            ob, reward, done, _ = env.step(action)
            if done:
                break
            # Note there's no env.render() here. But the environment still can open window and
            # render if asked by env.monitor: it calls env.render('rgb_array') to record video.
            # Video is not recorded every episode, see capped_cubic_video_schedule for details.

    # Close the env and write monitor result info to disk
    env.close()

    # Upload to the scoreboard. We could also do this from another
    # process if we wanted.
    logger.info("Successfully ran RandomAgent. Now trying to upload results to the scoreboard. If it breaks, you can always just try re-uploading the same results.")
    gym.upload(outdir)
Switch the Gym automated logger setup to configure the root logger rather than just the 'gym' logger Python doesn't make it easy for libraries to take responsibility for logging configuration (which we do to make simple usage much easier), and as we see more Gym plugins, we want their loggers to have an appropriate log level too. So we may as well configure the root logger level. 2016-09-21 14:55:04 -07:00			`import argparse`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00			`import logging`
Switch the Gym automated logger setup to configure the root logger rather than just the 'gym' logger Python doesn't make it easy for libraries to take responsibility for logging configuration (which we do to make simple usage much easier), and as we see more Gym plugins, we want their loggers to have an appropriate log level too. So we may as well configure the root logger level. 2016-09-21 14:55:04 -07:00			`import sys`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00
			`import gym`
Add Monitored wrapper (#434) * Add WIP Monitored wrapper * Remove irrelevant render after close monitor test * py27 compatibility * Fix test_benchmark * Move Monitored out of wrappers __init__ * Turn Monitored into a function that returns a Monitor class * Fix monitor tests * Remove deprecated test * Remove deprecated utility * Prevent duplicate wrapping, add test * Fix test * close env in tests to prevent writing to nonexistent file * Disable semisuper tests * typo * Fix failing spec * Fix monitoring on semisuper tasks * Allow disabling of duplicate check * Rename MonitorManager * Monitored -> Monitor * Clean up comments * Remove cruft 2016-12-23 16:21:42 -08:00			`from gym import wrappers`

Initial release. Hello world :). 2016-04-27 08:00:58 -07:00
			`class RandomAgent(object):`
Add Monitored wrapper (#434) * Add WIP Monitored wrapper * Remove irrelevant render after close monitor test * py27 compatibility * Fix test_benchmark * Move Monitored out of wrappers __init__ * Turn Monitored into a function that returns a Monitor class * Fix monitor tests * Remove deprecated test * Remove deprecated utility * Prevent duplicate wrapping, add test * Fix test * close env in tests to prevent writing to nonexistent file * Disable semisuper tests * typo * Fix failing spec * Fix monitoring on semisuper tasks * Allow disabling of duplicate check * Rename MonitorManager * Monitored -> Monitor * Clean up comments * Remove cruft 2016-12-23 16:21:42 -08:00			`"""The world's simplest agent!"""`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00			`def __init__(self, action_space):`
			`self.action_space = action_space`

			`def act(self, observation, reward, done):`
			`return self.action_space.sample()`

			`if __name__ == '__main__':`
Switch the Gym automated logger setup to configure the root logger rather than just the 'gym' logger Python doesn't make it easy for libraries to take responsibility for logging configuration (which we do to make simple usage much easier), and as we see more Gym plugins, we want their loggers to have an appropriate log level too. So we may as well configure the root logger level. 2016-09-21 14:55:04 -07:00			`parser = argparse.ArgumentParser(description=None)`
			`parser.add_argument('env_id', nargs='?', default='CartPole-v0', help='Select the environment to run')`
			`args = parser.parse_args()`

			# Call `undo_logger_setup` if you want to undo Gym's logger setup
			`# and configure things manually. (The default should be fine most`
			`# of the time.)`
			`gym.undo_logger_setup()`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00			`logger = logging.getLogger()`
Switch the Gym automated logger setup to configure the root logger rather than just the 'gym' logger Python doesn't make it easy for libraries to take responsibility for logging configuration (which we do to make simple usage much easier), and as we see more Gym plugins, we want their loggers to have an appropriate log level too. So we may as well configure the root logger level. 2016-09-21 14:55:04 -07:00			`formatter = logging.Formatter('[%(asctime)s] %(message)s')`
			`handler = logging.StreamHandler(sys.stderr)`
			`handler.setFormatter(formatter)`
			`logger.addHandler(handler)`

			`# You can set the level to logging.DEBUG or logging.WARN if you`
			`# want to change the amount of output.`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00			`logger.setLevel(logging.INFO)`

Switch the Gym automated logger setup to configure the root logger rather than just the 'gym' logger Python doesn't make it easy for libraries to take responsibility for logging configuration (which we do to make simple usage much easier), and as we see more Gym plugins, we want their loggers to have an appropriate log level too. So we may as well configure the root logger level. 2016-09-21 14:55:04 -07:00			`env = gym.make(args.env_id)`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00
			`# You provide the directory to write to (can be an existing`
Tweak RandomAgent comment 2016-05-26 13:44:14 -07:00			`# directory, including one with existing data -- all monitor files`
			`# will be namespaced). You can also dump to a tempdir if you'd`
			`# like: tempfile.mkdtemp().`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00			`outdir = '/tmp/random-agent-results'`
Add Monitored wrapper (#434) * Add WIP Monitored wrapper * Remove irrelevant render after close monitor test * py27 compatibility * Fix test_benchmark * Move Monitored out of wrappers __init__ * Turn Monitored into a function that returns a Monitor class * Fix monitor tests * Remove deprecated test * Remove deprecated utility * Prevent duplicate wrapping, add test * Fix test * close env in tests to prevent writing to nonexistent file * Disable semisuper tests * typo * Fix failing spec * Fix monitoring on semisuper tasks * Allow disabling of duplicate check * Rename MonitorManager * Monitored -> Monitor * Clean up comments * Remove cruft 2016-12-23 16:21:42 -08:00			`env = wrappers.Monitor(directory=outdir, force=True)(env)`
Stop seeding in monitor / uploading seeds to scoreboard 2016-10-31 22:20:02 -07:00			`env.seed(0)`
[WIP] add support for seeding environments (#135) * Make environments seedable * Fix monitor bugs - Set monitor_id before setting the infix. This was a bug that would yield incorrect results with multiple monitors. - Remove extra pid from stats recorder filename. This should be purely cosmetic. * Start uploading seeds in episode_batch * Fix _bigint_from_bytes for python3 * Set seed explicitly in random_agent * Pass through seed argument * Also pass through random state to spaces * Pass random state into the observation/action spaces * Make all _seed methods return the list of used seeds * Switch over to np.random where possible * Start hashing seeds, and also seed doom engine * Fixup seeding determinism in many cases * Seed before loading the ROM * Make seeding more Python3 friendly * Make the MuJoCo skipping a bit more forgiving * Remove debugging PDB calls * Make setInt argument into raw bytes * Validate and upload seeds * Skip box2d * Make seeds smaller, and change representation of seeds in upload * Handle long seeds * Fix RandomAgent example to be deterministic * Handle integer types correctly in Python2 and Python3 * Try caching pip * Try adding swap * Add df and free calls * Bump swap * Bump swap size * Try setting overcommit * Try other sysctls * Try fixing overcommit * Try just setting overcommit_memory=1 * Add explanatory comment * Add what's new section to readme * BUG: Mark ElevatorAction-ram-v0 as non-deterministic for now * Document seed * Move nondetermistic check into spec 2016-05-29 09:07:09 -07:00			`agent = RandomAgent(env.action_space)`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00
Change episode_count and max_step in example agent to reflect description 2016-04-29 10:47:11 +02:00			`episode_count = 100`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00			`reward = 0`
			`done = False`

Replace xrange -> range in example scripts 2016-05-01 23:17:38 -04:00			`for i in range(episode_count):`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00			`ob = env.reset()`
Unbreak random_agent (#406) The semantics of `env.reset()` have changed. It's no longer possible to reset an environment until it's done. Notably, this commit changes the behavior of random_agent. It used to have a limit on number of actions per episode. If we wanted that now, we'd have to close the environment and recreate it on each episode, which may be slow. 2016-12-06 17:07:23 -08:00			`while True:`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00			`action = agent.act(ob, reward, done)`
			`ob, reward, done, _ = env.step(action)`
			`if done:`
			`break`
Car racing (#117) * CarRacing-v0 new box2d environment 2016-05-26 21:39:57 +03:00			`# Note there's no env.render() here. But the environment still can open window and`
			`# render if asked by env.monitor: it calls env.render('rgb_array') to record video.`
			`# Video is not recorded every episode, see capped_cubic_video_schedule for details.`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00
Add Monitored wrapper (#434) * Add WIP Monitored wrapper * Remove irrelevant render after close monitor test * py27 compatibility * Fix test_benchmark * Move Monitored out of wrappers __init__ * Turn Monitored into a function that returns a Monitor class * Fix monitor tests * Remove deprecated test * Remove deprecated utility * Prevent duplicate wrapping, add test * Fix test * close env in tests to prevent writing to nonexistent file * Disable semisuper tests * typo * Fix failing spec * Fix monitoring on semisuper tasks * Allow disabling of duplicate check * Rename MonitorManager * Monitored -> Monitor * Clean up comments * Remove cruft 2016-12-23 16:21:42 -08:00			`# Close the env and write monitor result info to disk`
			`env.close()`
Initial release. Hello world :). 2016-04-27 08:00:58 -07:00
			`# Upload to the scoreboard. We could also do this from another`
			`# process if we wanted.`
			`logger.info("Successfully ran RandomAgent. Now trying to upload results to the scoreboard. If it breaks, you can always just try re-uploading the same results.")`
Make agent examples compatible with python 3 (#150) * make cem agen exaple compatible with python 2 and 3 * make the keyboard_agent example compatible with python 2 and 3 Changing `xrange` to `range` should not impact performance unless we're generating millions of elements (currently only 1000). * remove algorithm_id from the upload call 2016-06-01 16:15:18 +02:00			`gym.upload(outdir)`