Re: Goal of AI: Perfect or Bounded Rationality
- From: curt@xxxxxxxx (Curt Welch)
- Date: 26 May 2006 23:12:33 GMT
jalegris@xxxxxxxxxxxx wrote:
Another thing I'm not clear on is explaining classical conditioning in
terms of the operation of a reinforcement learning machine. Can you
give an example?
Classical conditioning is just the association of a behavior with a new
stimulus though pairing. Food -> salivation when paired with a bell
ringing will end up causing the bell ringing alone to trigger salivation.
However, strong reinforcement learning requires the machine to make
internal predictions about future rewards, and use those predictions as
internal reinforcers. This is what creates secondary reinforcers. This is
why money acts as a reinforcer to us even though we can't eat it, or have
sex with it, etc. Our brain has learned to recognize money as something
which leads to higher future rewards. And as such, money acts as a
reinforcer for our behavior.
In the end, a strong reinforcement learning machine must assign value to
every sensation and every behavior and use those values as reinforcers for
all behavior. Though many things have a fairly neutral value, they all must
have some value. This is done in simple computer reinforcement learning
algorithms for example when they play a board game and assign a value to
every board position. Moves in the game that lead to stronger board
positions end up being reinforced because stronger board positions are more
likely to lead to a real future reward (which might only happen when the
game is won or lost).
So, strong reinforcement learning systems are constantly learning using
internal rewards. The behavior of the machine is constantly being adjusted
based on everything that happens.
If the machine already has a behavior for salivation in response to food,
and a bell happens at the same time, that bell->salivation response is
going to be rewarded by the secondary reinforcers. And after enough
rewards, assuming some other behavior isn't being associated with the bell,
then the machine will start to produce the salivation response for the
bell.
All classical conditioning can be explained in terms of the actions of
internal rewards used by a reinforcement learning machine which is
attempting to maximize not just current rewards, but all future rewards.
In order to do that, it must be making internal predictions about potential
future rewards, and using those predictions to condition behavior instead
of using only current rewards.
The language of classical conditioning is just a simple way of explaining
the more complex effects of secondary reinforcers.
--
Curt Welch http://CurtWelch.Com/
curt@xxxxxxxx http://NewsReader.Com/
.
- Follow-Ups:
- Re: Goal of AI: Perfect or Bounded Rationality
- From: jalegris
- Re: Goal of AI: Perfect or Bounded Rationality
- From: emotioncube
- Re: Goal of AI: Perfect or Bounded Rationality
- References:
- Goal of AI: Perfect or Bounded Rationality
- From: adityar7
- Re: Goal of AI: Perfect or Bounded Rationality
- From: JGCASEY
- Re: Goal of AI: Perfect or Bounded Rationality
- From: adityar7
- Re: Goal of AI: Perfect or Bounded Rationality
- From: Curt Welch
- Re: Goal of AI: Perfect or Bounded Rationality
- From: jalegris
- Re: Goal of AI: Perfect or Bounded Rationality
- From: Curt Welch
- Re: Goal of AI: Perfect or Bounded Rationality
- From: jalegris
- Re: Goal of AI: Perfect or Bounded Rationality
- From: Curt Welch
- Re: Goal of AI: Perfect or Bounded Rationality
- From: jalegris
- Goal of AI: Perfect or Bounded Rationality
- Prev by Date: Re: The animal, not the environment, controls behavior.
- Next by Date: Re: Seeing is behavior
- Previous by thread: Re: Goal of AI: Perfect or Bounded Rationality
- Next by thread: Re: Goal of AI: Perfect or Bounded Rationality
- Index(es):
Relevant Pages
|