Oliver Eyton-Williams 0bd52f8bd1

and change all the challenges to new `md` format.

2020-11-27 10:02:05 -08:00

id	title	challengeType	videoId
5e8f2f13c4cdbe86b5c72da5	Reinforcement Learning With Q-Learning: Example	11	RBBSNta234s

--question--

--text--

Fill in the blanks to complete the following Q-Learning equation:

Q[__A__, __B__] = Q[__A__, __B__] + LEARNING_RATE * (reward + GAMMA * np.max(Q[__C__, :]) - Q[__A__, __B__])

--answers--

A: state

B: action

C: next_state

---

A: state

B: action

C: prev_state

---

A: state

B: reaction

C: next_state
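
For reference, the update rule in the question can be sketched as a small, runnable NumPy example. This is a minimal illustration and not the code from the video: the helper name `q_update`, the table shape, and the values chosen for `LEARNING_RATE` and `GAMMA` are assumptions made for this sketch.

```python
import numpy as np

# Hypothetical hyperparameter values, chosen only for illustration.
LEARNING_RATE = 0.5
GAMMA = 0.9


def q_update(Q, state, action, reward, next_state):
    """Apply one tabular Q-Learning update to Q in place and return Q."""
    # Best value reachable from the state the agent lands in.
    best_next = np.max(Q[next_state, :])
    # Move the current estimate toward reward + discounted best future value.
    Q[state, action] = Q[state, action] + LEARNING_RATE * (
        reward + GAMMA * best_next - Q[state, action]
    )
    return Q


# Toy Q-table with 3 states and 2 actions (shape is an assumption).
Q = np.zeros((3, 2))
Q[1, 0] = 2.0  # pretend state 1 already has a known good action

q_update(Q, state=0, action=1, reward=1.0, next_state=1)
print(Q[0, 1])
```

The row index of the table is the current state, the column index is the action taken, and the `np.max` term looks up the row of the state reached after taking that action.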