Q(λ)-learning algorithm question



Hi,

For Watkin's Q(\lambda)-learning algorithm; why would someone
prefer higher values of \lambda than lower? My experiments both in
simulation and on real robot show that \lambda=0.5 gives a solution
that tends to converge. For lower and higher \lambda values (0.25
and 0.75) learning diverges. Is it task specific?

Thanks,

Uri.

http://www.compactech.com/kartoun

[ comp.ai is moderated. To submit, just post and be patient, or if ]
[ that fails mail your article to <comp-ai@xxxxxxxxxxxxxxxxxx>, and ]
[ ask your news administrator to fix the problems with your system. ]
.



Relevant Pages

  • Re: Solution to the halting Problem?
    ... running on a computing machine or a simulation of a computing machine, ... simulated algorithm without disturbing its operation. ... >> subjects halt or not. ...
    (comp.lang.cpp)
  • Re: Humaniform robots... yea or nay?
    ... algorithm does what you think it does. ... at least the course left you with the realization that genetic algorithms/programmings are not to be trifled with if you have a lousy simulation environment/test case. ... I think there's an obvious problem of generalizing this experience to genetic algorithms/programming in general. ... But then, a lot of the things that would at one point have been considered the forefront of AI research are now becoming mainstream: facial recognition software, speech recognition and synthesis, and so on. ...
    (rec.arts.sf.science)
  • Re: Variational calculus : A drop of water
    ... Simulation of a Dripping Faucet by Nobuko Fuchikami, ... By taking a stable equilibrium shape as an initial ... we describe a variational algorithm to examine ... Our algorithm of simulation of dynamics ...
    (sci.math)
  • Re: need a good implementation of pseudorandom generators
    ... use SPSS to generate the random mumbers. ... The Marsaglia algorithm is also available as an option. ... pseudorandom generator for it. ... But if i run the simulation 100 times then on all 100 times it gives ...
    (sci.stat.math)

Loading