Files

miyaliu666 3ea2fe4f77 feat(i18n,curriculum): add Bilibili ids for Chinese (#43564 )

Co-authored-by: Kristofer Koishigawa <scissorsneedfoodtoo@gmail.com>
Co-authored-by: Oliver Eyton-Williams <ojeytonwilliams@gmail.com>

2021-10-01 09:54:12 +05:30

id, title, challengeType, videoId, bilibiliIds, dashedName

title

challengeType

videoId

bilibiliIds

dashedName

5e8f2f13c4cdbe86b5c72da5

Reinforcement Learning With Q-Learning: Example

RBBSNta234s

aid	bvid	cid
848073871	BV1uL4y187Eq	409139471

reinforcement-learning-with-q-learning-example

--question--

--text--

Fill in the blanks to complete the following Q-Learning equation:

Q[__A__, __B__] = Q[__A__, __B__] + LEARNING_RATE * (reward + GAMMA * np.max(Q[__C__, :]) - Q[__A__, __B__])

A: state

B: action

C: next_state

A: state

B: action

C: prev_state

A: state

B: reaction

C: next_state