Re: Curtnetrons Don't Do Parity
- From: curt@xxxxxxxx (Curt Welch)
- Date: 28 Jun 2006 23:15:36 GMT
Michael Olea <oleaj@xxxxxxxxxxxxx> wrote:
Curt Welch wrote:
And I really don't grasp the connections between thermodynamic entropy
and information entropy. But I'll figure that out as some point
here... :)
Think about the game of clue. At the start of the game the possibilities,
who, where, and with what, are least constrained. When you ask a question
and get an answer you gain information to the extent that it narrows the
possibilities. You have complete information, all the information there
is to be had, when the possibilities have been narrowed to one: Colonel
Mustard, in the library, with the lead pipe. If you ask a question and
get an answer that does not narrow the possibilities then it was useless
- you wasted your turn.
Ok, well, that's a problem of transferring information. And I did see some
comments about thermodynamic entropy being looked at as transferring
information about internal structure. So I kinda get how there could be a
link there.
And in relation to reinforcement learning, I could see how the problem
might be looked at in terms of the learning system trying to get
information about rewards from the environment. And how some ability to
quantify the transfer of information about rewards into the system would be
a way to quantify it's learning ability. But I'm having a harder time
seeing how it would be useful to look at the information flow from sensory
inputs to behavior output in terms of entropy. This is because the goal of
a reinforcement learning machine is not to perform some type of data
compression or optimization on the sensory information. Instead, it goal
is to calculate the value of the data so that it can throw out what is not
important and make use of what is important in the generation of behavior.
I see as a programmable information filter. The entropy of the input can
be measured, but that's just a measure of the bandwidth of the sensory
data. It seems to just be a way to quantify the rate of data flowing into
the filter. You can likewise quantify the flow of the data out of the
filter, but you can't determine anything about how close to optimal the
system is performing by only looking at those numbers. This is because
"good" is not defined by information flow. It's defined by the rewards
generated by the current configuration of the "filter". And the filter
configuration which is mostly heavily rewarded might have a very low
entropy, or it might have a very high entropy, or something in between. So
you can't tell anything about how "well" the filter is functioning by only
looking at its entropy output. But there does seem to be value in using
these concepts to understand what the system is doing. So I've got lots to
work on. :)
--
Curt Welch http://CurtWelch.Com/
curt@xxxxxxxx http://NewsReader.Com/
.
- References:
- Curtnetrons Don't Do Parity
- From: Michael Olea
- Re: Curtnetrons Don't Do Parity
- From: feedbackdroid
- Re: Curtnetrons Don't Do Parity
- From: Michael Olea
- Re: Curtnetrons Don't Do Parity
- From: Curt Welch
- Re: Curtnetrons Don't Do Parity
- From: Michael Olea
- Re: Curtnetrons Don't Do Parity
- From: Curt Welch
- Re: Curtnetrons Don't Do Parity
- From: Michael Olea
- Curtnetrons Don't Do Parity
- Prev by Date: Re: Curtnetrons Don't Do Parity
- Next by Date: Re: Curtnetrons Don't Do Parity
- Previous by thread: Re: Curtnetrons Don't Do Parity
- Next by thread: Re: Curtnetrons Don't Do Parity
- Index(es):
Relevant Pages
|
|