< previous page page_124 next page >

Page 124
2. The Robustness of Plans C0140-01.gif
It might seem that the natural first step in establishing the robustness of an adaptive plan would be to show that it will ultimately converge to an optimal structure. However, as early as section 1.5 it was possible to make a good argument against convergence as a criterion for distinguishing useful plans. Enumerative plans converge, yet in all but the most restricted circumstances they are useless either as hypotheses or algorithms. Moreover, when data can be retained by no more than M structures and M < |C0039-11.gif|, no plan for searching C0039-11.gif can yield convergence. More formally, for any M < |C0039-11.gif|, there exists C0140-03.gif such that as T ®¥,
C0140-02.gif
where C0140-05.gif is a subset of C0039-11.gif consisting of one or more structures with optimal mean performance (i.e., structures C0140-04.gif such that the mean of the random variable µE(A') is at least as high as the mean for any C0140-06.gif). This is so because for any finite sequence of trials of a suboptimal structure in C0039-11.gif, there is a non-zero probability that its observed average performance will exceed the observed performance of the optimal structure(s) (assuming overlapping distributions). Clearly, if enough of the structures being tested exhibit observed performances above that of the optimal structure(s) (again an event with a non-zero probability), the result will be the deletion of data concerning the optimal structure. Thus, unless possible convergence to a suboptimal structure is to be allowed, each structure must be repeatedly tested (infinitely often in the limit). But this repeated testing (and the law of large numbers) assures that suboptimal structures which have a finite probability of displacing an optimal structure will do so with a limiting frequency approaching that probability. Hence, for M < |C0039-11.gif|, no plan can yield (1/T)STP(C0140-05.gif, t) ® 1. At the same time, C0140-07.gif (even when M' < |C0039-11.gif|) because of two effects:
3ec098e70743fcb2f9b43be50b94c009.gif
(i) the more copies there are of a suboptimal structure A in a given generation, the smaller the variance of the associated average payoff µA(t) (making it less likely that C0140-08.gif),
3ec098e70743fcb2f9b43be50b94c009.gif
(ii) more generations are required to displace A' in the whole population (meaning that C0140-09.gif has to exceed C0140-10.gif over a longer period, a progressively less likely event).
At the cost of a small increase in the complexity of the algorithms C0140-01.gif we can assure that they converge when M > |C0039-11.gif| (as, for example, in the

 
< previous page page_124 next page >

If you like this book, buy it!