|
|
|
|
|
|
|
|
• |
Updating
sequence: start at random state
|
|
|
|
and
act until it reaches absorbing goal state
|
|
|
• |
For
the first updating sequence and our grid
|
|
|
world
example, how many weights get
|
|
|
|
updated
from the first sequence?
|
|
|
• |
What
could we do if we kept the whole
|
|
|
|
sequence
in memory?
|
|