* Conditionally select different arguments for ffmpeg, and add a meaningful error for the third-party application in charge of encoding
* Consistency with other logger() calls
Co-authored-by: J K Terry <justinkterry@gmail.com>
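For context, a minimal sketch of the idea behind the ffmpeg change (illustrative only; the function name, flags, and error message below are assumptions, not the recorder's actual code): pick the encoder arguments based on the output format, and raise a clear error when the encoder binary is unavailable.

```python
import shutil

def build_encoder_cmd(output_path, frames_per_sec, backend="ffmpeg"):
    """Illustrative sketch: conditional ffmpeg arguments plus a meaningful error."""
    if shutil.which(backend) is None:
        raise RuntimeError(
            f"Could not find '{backend}' on the PATH. Video recording requires it; "
            "install it (e.g. `apt-get install ffmpeg`) or disable recording."
        )
    # Common arguments: quiet output, overwrite, frame rate, frames piped on stdin.
    cmd = [backend, "-nostats", "-loglevel", "error", "-y",
           "-r", str(frames_per_sec), "-i", "-"]
    if output_path.endswith(".gif"):
        # GIF output: use a palette filter instead of an H.264 codec.
        cmd += ["-filter_complex",
                "split [a][b];[a] palettegen [p];[b][p] paletteuse"]
    else:
        # MP4 (default): encode with libx264 in a widely supported pixel format.
        cmd += ["-vcodec", "libx264", "-pix_fmt", "yuv420p"]
    return cmd + [output_path]
```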
RiverSwim, a simple hard-exploration environment, has been added to the list of third-party environments.
Co-authored-by: J K Terry <justinkterry@gmail.com>
* Fixing Warning issue
Fixed an issue causing a warning message because of the conversion from Python's standard `float64` to `np.float32`.
* car_racing.py Warning Fix
Fixed a bug generating a warning when converting from standard `float64` to `numpy.float32`.
* Fixing Warning issue
Fixed a bug generating a warning when converting from standard `float64` to `numpy.float32`.
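The shape of that fix, sketched below, is only an illustration (where exactly the cast happens in `car_racing.py` is an assumption): building a float32 `Box` from plain Python floats makes gym warn about lowered bound precision, and casting the bounds to `np.float32` up front avoids it.

```python
import numpy as np
from gym import spaces

# Plain Python floats are 64-bit, so gym warns:
# "Box bound precision lowered by casting to float32"
action_space_warns = spaces.Box(low=-1.0, high=1.0, shape=(3,), dtype=np.float32)

# Casting the bounds explicitly keeps everything in float32 and silences the warning.
action_space_ok = spaces.Box(
    low=np.float32(-1.0), high=np.float32(1.0), shape=(3,), dtype=np.float32
)
```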
* bracket missing
* Bracket misplaced
* reverting to an older Python 3.6 version to test whether the build failure is caused by it
* revert 3.7 to 3.7.3 for the build
* revert python 3.8 version to 3.8.1
* do not install mujoco on 3.8 and 3.9
* enable mujoco for 3.7
* use a regex to handle Python-version-dependent package installation
* try only one python version at a time
* switch to possibly more popular python tag for 3.6
* disable mujoco
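A rough Python illustration of the regex idea from the CI commits above (the real logic lives in the CI configuration, so the version pattern and variable names here are assumptions): match the interpreter version tag and only install mujoco-py where it is enabled.

```python
import re
import sys

# Build the "major.minor" tag for the current interpreter, e.g. "3.7".
version_tag = f"{sys.version_info.major}.{sys.version_info.minor}"

# Per the commits above, mujoco is enabled on 3.6/3.7 and skipped on 3.8/3.9.
install_mujoco = re.fullmatch(r"3\.[67]", version_tag) is not None
print(f"Python {version_tag}: install mujoco-py -> {install_mujoco}")
```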
**Issue:** The current `reward_threshold` for `FrozenLake-v0` and `FrozenLake8x8-v0` is too high to be attained.
Commit: df515de07d @joschu
**Solution:** Reduce the `reward_threshold` values to make them attainable.
**Reference:** Code to compute the theoretical optimal expected return:
```python
import gym
env = gym.make('FrozenLake-v0')
print(env.observation_space.n) # 16
print(env.action_space.n) # 4
print(env.spec.reward_threshold) # 0.78, should be smaller
print(env.spec.max_episode_steps) # 100
import numpy as np
v = np.zeros((101, 16), dtype=float)
q = np.zeros((101, 16, 4), dtype=float)
pi = np.zeros((101, 16), dtype=float)
for t in range(99, -1, -1):  # backward
    for s in range(16):
        for a in range(4):
            for p, next_s, r, d in env.P[s][a]:
                q[t, s, a] += p * (r + (1. - float(d)) * v[t+1, next_s])
        v[t, s] = q[t, s].max()
        pi[t, s] = q[t, s].argmax()
print(v[0, 0]) # ~0.74 < 0.78
```
```python
import gym
env = gym.make('FrozenLake8x8-v0')
print(env.observation_space.n) # 64
print(env.action_space.n) # 4
print(env.spec.reward_threshold) # 0.99, should be smaller
print(env.spec.max_episode_steps) # 200
import numpy as np
v = np.zeros((201, 64), dtype=float)
q = np.zeros((201, 64, 4), dtype=float)
pi = np.zeros((201, 64), dtype=float)
for t in range(199, -1, -1):  # backward
    for s in range(64):
        for a in range(4):
            for p, next_s, r, d in env.P[s][a]:
                q[t, s, a] += p * (r + (1. - float(d)) * v[t+1, next_s])
        v[t, s] = q[t, s].max()
        pi[t, s] = q[t, s].argmax()
print(v[0, 0]) # ~0.91 < 0.99
```
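One way the fix could look, sketched with illustrative numbers (the ids and thresholds below are assumptions; the real change would edit the existing registrations, and the final values are the maintainers' choice, needing only to stay below the ~0.74 and ~0.91 optima computed above):

```python
from gym.envs.registration import register

# Hypothetical re-registrations with attainable thresholds.
register(
    id='FrozenLakeLowerThreshold-v0',      # illustrative id, not a real gym id
    entry_point='gym.envs.toy_text:FrozenLakeEnv',
    kwargs={'map_name': '4x4'},
    max_episode_steps=100,
    reward_threshold=0.70,                 # below the ~0.74 optimum
)

register(
    id='FrozenLake8x8LowerThreshold-v0',   # illustrative id, not a real gym id
    entry_point='gym.envs.toy_text:FrozenLakeEnv',
    kwargs={'map_name': '8x8'},
    max_episode_steps=200,
    reward_threshold=0.85,                 # below the ~0.91 optimum
)
```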