Author | Commit | Message | Date
daniel | c1700114fa | Add tutorial: qlearning with/without action masking for Taxi v3 env (#1345) Co-authored-by: Mark Towers <mark.m.towers@gmail.com> | 2025-07-02 09:38:20 +01:00
Mark Towers | 9e2b3cbd4e | Improve the tutorial rendering (#1353) | 2025-04-02 21:17:14 +01:00
Mark Towers | 17bd3ff486 | Update tutorial ci (#1344) | 2025-03-31 16:25:33 +01:00
JacopoBandoni | ddf5c6f7b0 | Fix reinforce tutorial (#1337) | 2025-03-23 15:12:39 +00:00
润泽 | 6137784f34 | Added some popular third-party beginner tutorials。 (#1271) | 2024-12-14 11:13:56 +02:00
Arshia Ilaty | 38dab5e05e | Adding links to the tutorials section #1256 (#1266) | 2024-12-05 10:33:56 +00:00
Tyler Crosse | 7e2062d04c | Fix RecordEpisodeStatistics param in Blackjack Tutorial (one line) (#1263) | 2024-11-28 10:16:23 +00:00
Baptiste Pesquet | ec7e6d71f3 | Fix parameter name in tutorial code (#1242) | 2024-11-08 15:44:43 +00:00
Ariel Kwiatkowski | 4d15c99c3d | Remove reward_range (#1167) | 2024-09-19 15:58:52 +01:00
Chen Tao | 4a85589b92 | Update vector_envs_tutorial.py (#1133) | 2024-08-05 10:18:21 +01:00
Mark Towers | 52b6878618 | Pre commit autoupdate (#1082) | 2024-06-10 17:07:47 +01:00
edelauna | 5bf72690e1 | doc(FrozenLake_tuto): update policy exploitation logic to handle variable sets of maximum Q-values (#1037) | 2024-04-29 10:25:08 +01:00
Timon Erhart | 01f7e7e5e5 | Fix missing closing single quote in environment creation tutorial (#1023) | 2024-04-16 10:31:53 +01:00
TobiasKallehauge | fd4ae52045 | Randomize LunarLander wind generation at reset to gain statistical independence between episodes (#959) | 2024-03-09 09:42:08 +00:00
Manuel Goulão | b8c542592d | Fix tutorials written in MarkDown not showing (#955) | 2024-03-07 17:15:42 +02:00
Kallinteris Andreas | 379d152ed9 | add disclaimer and categories to third-party-tutorials.md (#935) | 2024-02-23 17:37:43 +00:00
Kallinteris Andreas | d3a511f425 | load_quadruped_model tutorial typo fix (#919) | 2024-02-09 12:46:18 +00:00
Mark Towers | 0b2cd171a4 | Documentation update (#909) | 2024-02-05 15:26:56 +00:00
Kallinteris Andreas | f88a61852b | Add tutorial Load custom quadruped robot environments using Gymnasium/MuJoCo/Ant-v5 framework (#838) | 2024-02-01 13:03:16 +00:00
Mark Towers | b4ed889b0a | Add third-party tutorial page (#867) | 2024-01-08 11:59:06 +00:00
Mark Towers | 650aa5f49b | Add more introductory pages (#791) | 2023-12-08 12:46:40 +00:00
Mohammed Hamdy | 7dca6832bf | Fix typos in docs FrozenLake_tuto.py (#757) | 2023-10-30 14:43:28 +00:00
Andrea PIERRÉ | baf78078c3 | [Docs] Custom environment tutorial refactoring (#709) | 2023-09-11 10:25:48 +01:00
Zarif Aziz | fa8a982f99 | Documentation update that fixes typo REINFROCE to REINFORCE (#672) | 2023-08-18 01:23:39 +01:00
mvhensbergen | 7012d22902 | Fix scoping of variables (#670) Co-authored-by: Martin van Hensbergen <martin@mvhensbergen.com> | 2023-08-13 17:03:05 +01:00
mvhensbergen | 933d481189 | Fix explanation of training agent (#650) | 2023-08-05 11:35:20 +01:00
Ariel Kwiatkowski | d6ea183807 | Fix CI (#644) | 2023-08-02 12:43:23 +02:00
Mohamad Zamini | 552bf8c1a7 | optimize reinforce_invpend_gym_v26.py (#638) | 2023-08-01 16:16:54 +01:00
Roberto Schiavone | 985c7005c7 | docs: update custom env step return 📝 (#565) | 2023-06-23 11:14:12 +01:00
Andrea PIERRÉ | 2ac4d401a9 | Update FrozenLake tutorial (#391) | 2023-03-17 16:35:54 +00:00
D. Debnath | 6a98a26de6 | Update vector_envs_tutorial.py (#390) | 2023-03-15 10:34:09 +00:00
DYK | be0d319b77 | Update vector_envs_tutorial.py (#347) Co-authored-by: Mark Towers <mark.m.towers@gmail.com> | 2023-03-14 15:18:26 +00:00
Andrea PIERRÉ | dde6b460f0 | FrozenLake tutorial (#376) | 2023-03-14 08:41:31 +00:00
Mark Towers | f9b81acf12 | update pre-commit (#344) | 2023-02-22 15:41:40 +00:00
Luke Meyers | bb368fe75f | Doc improvement: add comment describing what convolve does (#264) | 2023-01-17 15:06:11 +00:00
Manuel Goulão | 4e6dc3e420 | Tutorials galleries (#258) | 2023-01-11 20:00:51 +00:00
Mark Towers | 1dfdf45480 | Revert "ENH: allow metadata["render_modes"] to be a set" (#251) | 2023-01-09 13:12:07 +00:00
Omar Younis | 3413b04534 | ENH: allow metadata["render_modes"] to be a set (#248) | 2023-01-08 19:56:52 +00:00
Luca Pizzini | 3599bf1d7c | docs(tutorials): fixed environment creation link (#244) | 2023-01-07 15:31:49 +00:00
Till Zemann | 300b3832d6 | Add a tutorial for vectorized envs using A2C. (#234) | 2023-01-04 15:03:43 +00:00
Markus Krimmel | af740a66ca | Add dark version of blackjack AE loop (#191) | 2022-12-07 20:04:48 +00:00
Xuehai Pan | 7e88559763 | Update and rerun pre-commit hooks for better code quality (#179) | 2022-12-04 14:24:02 +00:00
Markus Krimmel | 851b2f4be6 | Updated Wrapper docs (#173) | 2022-12-03 12:46:11 +00:00
Siddarth C | 024c05c9ca | Add REINFORCE implementation tutorial (#155) Co-authored-by: Mark Towers <mark.m.towers@gmail.com> | 2022-11-26 13:02:23 +00:00
Manuel Goulão | 22445d26c8 | Fix #118 (#121) | 2022-11-10 12:18:57 +00:00
Till Zemann | 88067cafbd | Incorporated a few spelling and logical fixes and added training logs (#109) | 2022-11-03 10:46:29 +00:00
Till Zemann | 67ab18e3d7 | Add a tutorial notebook for Blackjack-v1 (#64) | 2022-10-23 11:43:13 +01:00
Manuel Goulão | 08dacf2f7b | Updating tutorials (#63) | 2022-10-21 16:36:36 +01:00
Manuel Goulão | 1b6b0975c9 | Add Tutorials using sphinx-gallery and other template updates (#56) | 2022-10-16 14:54:03 +01:00