Re: Gradual Learning, not Reinforcement Learning




Glen M. Sizemore wrote:

"J.A. Legris" <jalegris@xxxxxxxxxxxx> wrote in message
news:1153167830.117066.205590@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

Michael Olea wrote:
J.A. Legris wrote:

What you've described so far sounds like the Bayesian model that
Michael Olea has been describing, where an estimate of the posterior
probability of an event is updated afer each observation of the
evidence. Is this the sort of thing you have in mind? At some point,
perhaps depending on a threshold probability level, a decision would
have to be made about whether the corresponding alarm should be
triggered.

That would be where the "utility model" comes in (moving from Bayesian
Inference into Bayesian Decision Theory) - the cost and gain functions
over
consequences. So you pick the thresold to maximize expected utility. That
is, of course, a normative theory, not a descriptive one - what an agent
should do, not what particular agents do in fact do. Even so it is often
a
good model of behavior under experimental conditions. There is a
consistent
difference, I've mentioned a few times, between the normative model and a
descriptive model of "matching law" like behavior. Suppose you have two
choices A and B, and that the expected utility is 90 for A and 10 for B.
The optimal choice is pick A every time. The observed behavior is more
like
pick A 90% of the time, pick B 10% of the time. The discrepancy arises
only
if the probability distribution is known, and stationary. If the
distribution is unknown (i.e. being estimated, or "learned"), and if it
might be changing then the matching law makes more sense, has been shown
to
be optimal under some idealized conditions, and is a form of "importance
sampling", very much like particle filtering methods of approximate
Bayesian inference.

It seems like a big jump from predicting outcomes, even thousands of
them, to running interactive experiments to test the predictions. How
might that work?

That, "intervention", gets a lot of attention in Judea Pearl's second
major
book, the one on "Causality". It also has been studied in terms of "value
of information". Bayesian medical expert systems do a limited form of
this
by suggesting tests to perform in order to arrive at a diagnosis. The
role
of intervention in learning has also been studied in, for example,
developmental psychology. Discounting evidence ("let me try it, you just
aren't doing it right") is one example. It is a major theme in Allison
Gopnik's work:

http://ihd.berkeley.edu/gopnik.htm

For example:

A.Gopnik, C. Glymour, D. Sobel, L. Schulz, T. Kushnir, & D. Danks (2004).
A
theory of causal learning in children: Causal maps and Bayes nets.
Psychological Review, 111, 1, 1-31.

T. Kushnir, A. Gopnik, L Schulz, & D. Danks. (in press). Inferring hidden
causes. Proceedings of the Twenty-Fourth Annual Meeting of the Cognitive
Science Society

-- Michael

The experimental results in the first paper (starting on p.64) are
fascinating. Required reading for behaviourists! Thanks for the links.

--
Joe Legris


I looked over the paper (no, I didn't read it), and my first impression is
that this is not "must" reading for behaviorists. Or rather, it is far less
"must" reading than some of the tutorials on Bayesian analyses of coin
tosses and paper-frog jumps. But let's cut to the quick, Joe. Why do you
think it is "must" reading for behaviorists? Pitch me. After all, you can
argue that I can't be persuaded, but you know that you can get a rise out of
me.



I was referring to the first paper: A theory of causal learning in
children: Causal maps and Bayes nets. Read pages 64-71 in particular.

--
Joe Legris

.



Relevant Pages

  • Re: Gradual Learning, not Reinforcement Learning
    ... perhaps depending on a threshold probability level, ... That would be where the "utility model" comes in (moving from Bayesian ... book, the one on "Causality". ... Discounting evidence ("let me try it, ...
    (comp.ai.philosophy)
  • Re: Gradual Learning, not Reinforcement Learning
    ... I looked over the paper, and my first impression is ... that this is not "must" reading for behaviorists. ... That would be where the "utility model" comes in (moving from Bayesian ... book, the one on "Causality". ...
    (comp.ai.philosophy)
  • Re: Gradual Learning, not Reinforcement Learning
    ... that this is not "must" reading for behaviorists. ... That would be where the "utility model" comes in (moving from Bayesian ... So you pick the thresold to maximize expected utility. ... book, the one on "Causality". ...
    (comp.ai.philosophy)
  • Re: Correlation and Causation
    ... Say we have a variable A. If I control it, the variable will be do, and then you can interpret anything that's correlated with doas caused by do*IF* there is no Simpson's paradox. ... basis of observational data WITHOUT a controlled experiment designed to test the causality part. ... probability theory is a foundation for causality ... Or, as a Bayesian, the causal diagrams are irrelevant to that fact that data cannot support a FALSE model, if you analyze it properly the Bayesian way -- this fact is discussed in length Edwards, Lindman, and Savage paper. ...
    (sci.stat.math)
  • Re: What is dialectics?
    ... causality in principle. ... "mediation". ... mediation as a structure which a) constrains the probability ... were merely another entity within a causal chain, ...
    (soc.politics.marxism)