CS 104: Introduction to Computer Science


Experimentation Strategies

	(cont.)


•	Another possibility: probabilistic approach

•	Choose actions probabilistic such that there's

	always a positive probability of choose each

	action.

•	One example: P(a_i\|s) = k^Q(s,aⁱ⁾ / sum_j(k^Q(s,a^j⁾)


•	Greater k à greater greedy exploitation

•	Lesser k à greater random exploration