Re: Representationalism rescues reinforcement learning



On May 28, 8:16 pm, Neil W Rickert <rickert...@xxxxxxxxxx> wrote:
> "J.A. Legris" <jaleg...@xxxxxxxxxxxx> writes:
> > So here's the thing: in the real world, an organism that learns by
> > trial, error and (yawn) reinforcement, is likely to get eaten before
> > it is lucky enough to stumble on the appropriate response. But if it
> > has a virtual environment in its head where it can test various
> > responses before committing to any, it has a leg up on the challenges
> > of existence, which appears to be just what we mammals have managed to
> > evolve - internal representations of the real world with little
> > homunculi going at it, and just possibly, another level or two of
> > homunculi contained therein (not so much for the good of the theory,
> > but just to irritate antirepresentationalists a little bit more). And
> > get this - it's TESTABLE!
>
> That might be a bit of a strawman. It depends on what is learned,
> and on what is reinforced. The argument works against simplistic
> versions of reinforcement learning.
>
> In any case, you evade the issue of how these alleged internal models
> are built. If, for example, they are built using reinforcement
> methods, then you are only arguing for a more sophisticated version
> of reinforcement learning.

I have no problem with reinforcement learning as a general phenomenon
- I am trying to rationalize a role for internal representations that
could provide a selective advantage in a dangerous environment. In a
harmless environment, a simple reinforcement algorithm will eventually
optimize a response after a sufficient number of trials. But in a
natural environment, this may be too risky, so the "organism" reduces
the risk of stumbling into an unrecoverable state by pretesting its
responses in the safety of a simulated environment. The problem of
obtaining a useful simulation is another issue, which, as you suggest,
might be just another reinforcement learning task.
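
The pretesting idea can be sketched in a few lines of Python. This is only a toy under stated assumptions: the five-cell world and the names world_step, internal_model, run_episode and count_deaths are all made up for illustration, and the internal model is simply assumed to be a perfect copy of the world (how it would be obtained is the separate issue mentioned above).

```python
import random

# Hypothetical 1-D world: cell 0 is a cliff (unrecoverable), cell 4 is food.
CLIFF, START, FOOD = 0, 2, 4
ACTIONS = (-1, +1)  # step left or step right


def world_step(state, action):
    """Real-world dynamics: mistakes made here are permanent."""
    return state + action


def internal_model(state, action):
    """Simulated dynamics. Assumed here to be a perfect copy of the world."""
    return state + action


def run_episode(rng, pretest, max_steps=1000):
    """Run one life. Returns 'dead', 'fed', or 'timeout'."""
    state = START
    for _ in range(max_steps):
        candidates = list(ACTIONS)
        if pretest:
            # Try each response in the simulation first and discard any
            # that would lead to the unrecoverable state.
            safe = [a for a in candidates if internal_model(state, a) != CLIFF]
            candidates = safe or candidates
        # Only now commit to a response in the real world.
        state = world_step(state, rng.choice(candidates))
        if state == CLIFF:
            return "dead"
        if state == FOOD:
            return "fed"
    return "timeout"


def count_deaths(pretest, episodes=1000, seed=0):
    """Count how many of `episodes` lives end at the cliff."""
    rng = random.Random(seed)
    return sum(run_episode(rng, pretest) == "dead" for _ in range(episodes))


if __name__ == "__main__":
    print("deaths without pretesting:", count_deaths(pretest=False))
    print("deaths with pretesting:   ", count_deaths(pretest=True))
```

Under these assumptions the pretesting agent never steps onto the cliff, while the purely trial-and-error agent dies in a sizable share of its lives. Nothing here says where a good enough model comes from; a poor model would let fatal responses through.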

--
Joe
