Open
Description
In tutorial1, qlearn_mod_random.py, line 32:
    if random.random() < self.epsilon:
        minQ = min(q)
        mag = max(abs(minQ), abs(maxQ))
        # add random values to all the actions, recalculate maxQ
        q = [q[i] + random.random() * mag - .5 * mag for i in range(len(self.actions))]
        maxQ = max(q)

Why use this approach (versus the one in qlearn.py)?
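
To make the question concrete, here is a minimal sketch comparing the two exploration styles. It assumes that qlearn.py explores by picking an action uniformly at random, which the snippet above does not confirm. The function names (perturbed_choice, uniform_choice, pick_rates) and the Q-values are hypothetical, chosen only for illustration.

```python
import random


def perturbed_choice(q, rng):
    """Noise-based exploration, as in the snippet above: add uniform noise
    in [-mag/2, mag/2) to every Q-value, then take the argmax."""
    mag = max(abs(min(q)), abs(max(q)))
    noisy = [v + rng.random() * mag - 0.5 * mag for v in q]
    return noisy.index(max(noisy))


def uniform_choice(q, rng):
    """Assumed baseline (not taken from the source): pick any action
    uniformly at random, ignoring the Q-values entirely."""
    return rng.randrange(len(q))


def pick_rates(choose, q, trials=10000, seed=0):
    """Estimate how often each action is selected by a given strategy."""
    rng = random.Random(seed)
    counts = [0] * len(q)
    for _ in range(trials):
        counts[choose(q, rng)] += 1
    return [c / trials for c in counts]


q_values = [1.0, 0.9, -1.0]  # hypothetical Q-values for three actions
perturbed = pick_rates(perturbed_choice, q_values)
uniform = pick_rates(uniform_choice, q_values)
print("perturbed:", perturbed)
print("uniform:  ", uniform)
```

With these values the noise shifts each Q-value by at most mag/2 = 0.5, so the action with Q = -1.0 can never overtake the other two and is never selected, while the uniform baseline picks it about a third of the time. If that reading is right, the modified version concentrates exploration on actions whose Q-values are already close to the best one.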