Commit Graph

1491 Commits

Author SHA1 Message Date
J K Terry
ff8c269abb typo 2021-07-29 14:07:03 -04:00
Justin Terry
e9d2c41f2b redo black 2021-07-29 12:42:48 -04:00
Justin Terry
d5004b7ec1 fix arxiv link 2021-07-29 12:20:12 -04:00
Justin Terry
b3a88b0a46 more readme updates 2021-07-29 12:19:39 -04:00
Kevin Corder
ab5527ee47 Update README.md (#2269)
Add basic API section. Fix a couple typos toward beginning.
2021-07-29 12:14:44 -04:00
Justin Terry
3133e99a64 deprecate built in wrappers for supersuit 2021-07-28 22:21:47 -04:00
J K Terry
bc76034167 typo 2021-07-28 22:06:19 -04:00
J K Terry
f535d8ac0d typo 2021-07-28 21:50:39 -04:00
J K Terry
10d62f60d7 fix link 2021-07-28 21:48:55 -04:00
J K Terry
b7cbbaed68 typo 2021-07-28 21:47:17 -04:00
J K Terry
a32767e03a New and dramatically cleaner readme.md file 2021-07-28 21:46:11 -04:00
J K Terry
f297ed73da Delete README.rst 2021-07-28 21:45:45 -04:00
J K Terry
38095d4621 typo 2021-07-28 21:22:35 -04:00
Christian Clauss
bb81e141ea Blacken the codebase (#2265) 2021-07-28 20:26:34 -04:00
J K Terry
2f3133461c Brief readme cleanup 2021-07-28 20:26:00 -04:00
Mathieu Poliquin
31be35ecd4 Added 'stable-retro' to third party env list (#2266) 2021-07-28 12:40:56 -04:00
Zain Patel
b064313d20 Fix CI: add tests to GitHub actions (#2262)
* Initial CI fixes
2021-07-27 23:32:32 -04:00
zlig
fe1e0837e6 Fixes video recorder ffmpeg on Centos7 and RHEL7 (#1893)
* Conditionally select different arguments for ffmpeg, and add meaningful error for third party application in charge of encoding

* Consistency with other logger() calls

Co-authored-by: J K Terry <justinkterry@gmail.com>
2021-07-27 15:18:11 -04:00
Yunzhi Zhang
41e571d065 pass args to expickle (#2172) 2021-07-27 14:49:45 -04:00
Justin Terry
b60a40db6e @Indoril007 s ez pickle fix 2021-07-27 14:36:06 -04:00
Christian Clauss
29c96a9a83 GitHub Action to lint Python code (#2258)
* GitHub Action to lint Python code

* pip install -e .

* pip install -e .[all]
2021-07-27 14:16:00 -04:00
Justin Terry
0989f0d3d8 update contributing.md 2021-07-27 14:14:21 -04:00
Justin Terry
e5a093a1c5 add requirements.txt 2021-07-27 12:35:48 -04:00
Justin Terry
ee7811c26d peg atari-py version, add requirements.txt, bump numpy version 2021-07-27 12:35:35 -04:00
Christian Clauss
2ef26144a2 Fix typos discovered by codespell (#2256)
Typos
2021-07-27 01:20:29 -04:00
Iheb Haboubi
63171e352e Fix typo in docstring (#2080) 2021-07-26 16:11:20 -04:00
Won Seok Jung
1dbd97e8ed minor fix (#2113) 2021-07-26 16:09:48 -04:00
m-orsini
6b01f4fa9c Allow custom data type for multi distrete space. (#2148) 2021-07-26 16:07:58 -04:00
Justin Terry
562a8cc7ce fix qoute 2021-07-26 16:03:32 -04:00
Ikko Ashimine
065e6a5d74 Fix URL (#2244)
http -> https
2021-07-26 15:54:42 -04:00
Merve Noyan
98777c455c Fixed broken link (arcade learning environment) (#2222) 2021-07-26 15:53:24 -04:00
Nashir Janmohamed
c882703bc8 Fix broken link in docstring (#2154)
The link to the original implementation of mountain car on incompleteideas.net was broken. Replaced with the correct link.
2021-07-26 15:51:54 -04:00
ahmadF
48baefeaf5 Add description to countinous mountain car task (#2107) 2021-07-26 15:51:29 -04:00
Erfan Miahi
a766dd0a2c A third-party environment named RiverSwim (#2086)
RiverSwim which is a simple hard exploration environment has been added to the list of third-party environments.

Co-authored-by: J K Terry <justinkterry@gmail.com>
2021-07-26 15:51:08 -04:00
Akash Patel
a8153126a0 add gym-ccc third-party environment package (#2087)
Co-authored-by: J K Terry <justinkterry@gmail.com>
2021-07-26 15:50:45 -04:00
MouseAndKeyboard
985de1f017 Added gym-spoof to third party environments (#2092) 2021-07-26 15:50:21 -04:00
miluChen
9a9e68a888 close env after done (#2162)
* close env after done

* fix if

Co-authored-by: He Chen <hchen584@bloomberg.net>
Co-authored-by: J K Terry <justinkterry@gmail.com>
2021-07-26 15:49:56 -04:00
Rajkumar Ramamurthy
cbcb6668ae Update documentation for new NLP environments (#2110)
Co-authored-by: J K Terry <justinkterry@gmail.com>
2021-07-26 15:46:59 -04:00
Johnny
db80dbe4ad Add math_prog_synth_env to list of environments (#2252)
Co-authored-by: J K Terry <justinkterry@gmail.com>
2021-07-26 15:45:01 -04:00
Yang Yu
e382c939bf Update environments.md (#2127)
Co-authored-by: J K Terry <justinkterry@gmail.com>
2021-07-26 15:44:27 -04:00
Zhiqing Xiao
81aeb68906 remove constraints on package versions (#2135)
Co-authored-by: J K Terry <justinkterry@gmail.com>
2021-07-26 15:43:08 -04:00
qdevpsi3
0de634b2ce Add qas-gym to environments.md (#2226)
Co-authored-by: J K Terry <justinkterry@gmail.com>
2021-07-26 15:41:35 -04:00
Matteo Lucchi
b1217f7308 Add robo-gym to third party environment list (#2248)
Co-authored-by: J K Terry <justinkterry@gmail.com>
2021-07-26 15:40:08 -04:00
Vasco CC
8f6eb1ebd0 BUG FIX: Bipedal Walker; Car Racing; Guessing Game; Hotter Colder - Showing Warning (#2126)
* Fixing Warning issue

Fixing an issue causing a Warning message to pop because of conversion from python standard float64 to np.float32

* car_racing.py Warning Fix

Fixed a bug generating a warning with conversion from standard `float 64` to `numpy float 32`.

* Fixing Warning issue 

Fixed a bug generating a warning with conversion from standard `float 64` to `numpy float 32`.

* Fixing Warning issue

Fixed a bug generating a warning with conversion from standard `float 64` to `numpy float 32`.

* bracket missing

* Bracket misplaced
2021-07-26 15:39:04 -04:00
Hojoung (Brian) Jang
81b2ddd106 Add gym-xiangqi to third party environment list (#2230)
* Add gym-xiangqi to env list

* Update description

Co-authored-by: J K Terry <justinkterry@gmail.com>
2021-07-26 15:36:28 -04:00
Mohamad H. Danesh
7d72f9535f Update environments.md (#2251)
Add a new environment to the list.
2021-07-26 15:35:27 -04:00
pzhokhov
ee5ee3a4a5 fix build by disabling mujoco and moving to newer python patch versions (#2220)
* reverting to older version of 3.6 python to test if the build is failing because of that

* revert 3.7 to 3.7.3 for the build

* revert python 3.8 version to 3.8.1

* do not install mujoco on 3.8 and 3.9

* enable mujoco for 3.7

* .

* .

* .

* use regex to navigate python version dependent package installation

* .

* try only one python version at a time

* switch to possibly more popular python tag for 3.6

* disable mujoco
2021-05-17 16:19:33 -07:00
Daniel Ciborowski
a5a6ae6bc0 Update Pillow Python Library for CVE-2020-35653 & CVE-2021-27921 (#2189)
* Update setup.py

* Update setup.py

* Update setup.py
2021-04-06 08:18:55 -07:00
Zhiqing Xiao
151ba406eb Revise the unattainable reward_threshold to an attainable value (#2205)
**Issues:**   The current `reward_threhold` for `FrozenLake-v0` and `FrozenLake8x8-v0` is too high to be attained.

Commit: df515de07d   @joschu  

**Solution:**   Reduce the `reward_threhold` to make them attainable.

**Reference:**   Codes to calculate the theoretic optimal reward expectations:

```python
import gym
env = gym.make('FrozenLake-v0')
print(env.observation_space.n) # 16
print(env.action_space.n) # 4
print(env.spec.reward_threshold) # 0.78, should be smaller
print(env.spec.max_episode_steps) # 100

import numpy as np
v = np.zeros((101, 16), dtype=float)
q = np.zeros((101, 16, 4), dtype=float)
pi = np.zeros((101, 16), dtype=float)
for t in range(99, -1, -1): # backward
    for s in range(16):
        for a in range(4):
            for p, next_s, r, d in env.P[s][a]:
                q[t, s, a] += p * (r + (1. - float(d)) * v[t+1, next_s])
        v[t, s] = q[t, s].max()
        pi[t, s] = q[t, s].argmax()
print(v[0, 0]) # ~0.74 < 0.78
```

```python
import gym
env = gym.make('FrozenLake8x8-v0')
print(env.observation_space.n) # 64
print(env.action_space.n) # 4
print(env.spec.reward_threshold) # 0.99, should be smaller
print(env.spec.max_episode_steps) # 200

import numpy as np
v = np.zeros((201, 64), dtype=float)
q = np.zeros((201, 64, 4), dtype=float)
pi = np.zeros((201, 64), dtype=float)
for t in range(199, -1, -1): # backward
    for s in range(64):
        for a in range(4):
            for p, next_s, r, d in env.P[s][a]:
                q[t, s, a] += p * (r + (1. - float(d)) * v[t+1, next_s])
        v[t, s] = q[t, s].max()
        pi[t, s] = q[t, s].argmax()
print(v[0, 0]) # ~0.91 < 0.99
```
2021-03-31 11:02:11 -07:00
Jaekyung-Cho
c8a659369d pull request about issue #2062 (car_racing.py memory leaking issue ) (#2096)
* car_racing.py memory leaking issue #2062 solving

* additional memory leaking resolve
2021-02-16 11:31:40 -08:00