Re: Combining Probabilities?
- From: Matt Mahoney <matmahoney@xxxxxxxxx>
- Date: 1 May 2007 16:54:26 -0700
On May 1, 10:32 am, moogie <budgetan...@xxxxxxxxxxxxxx> wrote:
I am wondering about the best (correct?) way of combining probabilities to
predict the next symbol in a sequence.
I have two different predictors which generate a probability between
0..1 for each symbol in a set to be the next symbol in the sequence.
One predictor has a limited set of symbols it will generate
probabilities for while the other predictor generates probability for
all known symbols. Thus some symbols will have a probability from
only one predictor.
This leads me to my potentially erroneous combination algorithm:
Currently I am using a weighted sum of these two probabilities to
generate a combined probability. This seems to work but is this
correct?
Should I use the mean of the probabilities?
Thanks
Nick
There is no best way. One technique is to average: p = w1*p1 + w2*p2,
then adjust the weights in the direction that favors the more accurate
model. You can also use sets of weights selected by some small
context.
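As a rough sketch of the weighted average p = w1*p1 + w2*p2 with adaptive weights, assuming a binary outcome (y is 0 or 1) and weights constrained to sum to 1. The class name TwoModelMixer, the learning rate, and the squared-error gradient step are illustrative assumptions, not something taken from paq:

```python
class TwoModelMixer:
    """Linear mix of two probabilities with one adaptive weight (w2 = 1 - w1)."""

    def __init__(self, lr=0.05):
        self.w1 = 0.5   # start by trusting both models equally
        self.lr = lr    # hypothetical learning rate

    def mix(self, p1, p2):
        # p = w1*p1 + w2*p2 with w2 = 1 - w1
        return self.w1 * p1 + (1.0 - self.w1) * p2

    def update(self, p1, p2, y):
        # Gradient step on squared error (y - p)^2 / 2.
        # dp/dw1 = p1 - p2, so the weight moves toward whichever
        # model was closer to the actual outcome y.
        p = self.mix(p1, p2)
        self.w1 += self.lr * (y - p) * (p1 - p2)
        self.w1 = min(1.0, max(0.0, self.w1))  # keep the weight in [0, 1]


mixer = TwoModelMixer()
p = mixer.mix(0.9, 0.1)     # combined prediction before seeing the outcome
mixer.update(0.9, 0.1, 1)   # outcome was 1, so w1 grows
```

To select weights by a small context, one could keep a separate TwoModelMixer per context value (for example in a dict) and use the one matching the current context.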
In paq I predict one bit at a time. I experimented with lots of
methods. In paq1 I counted 0 and 1 bits in each context, then did a
weighted sum of the 0 and 1 counts. I tuned the weights by hand. I
found that for an order n context, a weight of n^2 worked pretty
well. For this method, you can't allow the 0 and 1 counts to both get
large. After a 1 bit, you discard some 0 counts and vice versa.
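A minimal sketch of the count-based scheme described above, assuming each context order n holds a pair of counts (n0, n1) and gets weight n^2. The exact discard rule (halving the opposite count once it exceeds 2) and the function names are assumptions chosen for illustration, not necessarily what paq1 does:

```python
def update_counts(n0, n1, bit):
    """Increment the count for the observed bit and shrink the opposite count."""
    if bit:
        n1 += 1
        if n0 > 2:              # assumed rule: discard about half of the 0 counts
            n0 = n0 // 2 + 1
    else:
        n0 += 1
        if n1 > 2:              # and vice versa after a 0 bit
            n1 = n1 // 2 + 1
    return n0, n1


def mix_counts(counts_by_order):
    """Return P(next bit = 1) from a dict mapping order n to (n0, n1)."""
    s0 = s1 = 0.0
    for order, (n0, n1) in counts_by_order.items():
        w = order ** 2          # weight n^2 for an order n context
        s0 += w * n0
        s1 += w * n1
    total = s0 + s1
    return 0.5 if total == 0 else s1 / total  # no evidence: fall back to 0.5


# Order 1 has seen three 0s and one 1; order 2 has seen two 1s.
p = mix_counts({1: (3, 1), 2: (0, 2)})
```

Note that a weight of n^2 gives an order 0 context no influence, so a real implementation would need an offset or a separate rule for it.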
Starting in paq4 I used a method of adjusting the weights to favor the
models that made the most accurate prediction. I use different sets
of weights in different contexts.
Starting in paq7 each model outputs a prediction instead of 0,1
counts. I convert each probability into the logistic domain,
x = log(p/(1-p)), then combine by weighted averaging, and convert back to the
linear domain, p = 1/(1+exp(-x)). After each prediction the weights
are adjusted to favor the best predictors.
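A sketch of mixing in the logistic domain, using stretch(p) = ln(p/(1-p)) and its inverse squash(x) = 1/(1+exp(-x)). The weight update shown (a gradient step on coding cost, w_i += lr * (y - p) * stretch(p_i)) is my assumption of one reasonable rule; the class name, learning rate, and clamping constant are hypothetical:

```python
import math


def stretch(p):
    """Linear domain -> logistic domain: ln(p / (1 - p))."""
    return math.log(p / (1.0 - p))


def squash(x):
    """Logistic domain -> linear domain: 1 / (1 + exp(-x))."""
    return 1.0 / (1.0 + math.exp(-x))


class LogisticMixer:
    def __init__(self, n, lr=0.02):
        self.w = [1.0 / n] * n   # start as a plain average
        self.lr = lr             # hypothetical learning rate

    def _inputs(self, probs):
        # Clamp away from 0 and 1 so stretch() stays finite.
        eps = 1e-6
        return [stretch(min(max(p, eps), 1.0 - eps)) for p in probs]

    def mix(self, probs):
        x = self._inputs(probs)
        return squash(sum(w * xi for w, xi in zip(self.w, x)))

    def update(self, probs, y):
        # Assumed rule: gradient descent on coding cost -ln P(y).
        x = self._inputs(probs)
        err = y - squash(sum(w * xi for w, xi in zip(self.w, x)))
        self.w = [w + self.lr * err * xi for w, xi in zip(self.w, x)]


mixer = LogisticMixer(2)
p = mixer.mix([0.9, 0.1])    # two models that disagree
mixer.update([0.9, 0.1], 1)  # the outcome was 1: the first weight grows
```

Because confident predictions have large magnitude in the logistic domain, a model that is both confident and wrong gets its weight changed the most.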
This is for combining hundreds of models. If you only have 2, you can
use a 2-D lookup table with interpolation, then adjust the entry after
the prediction in the direction of the actual outcome.
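For the two-model case, a sketch of a 2-D lookup table with bilinear interpolation. The grid size, the update rate, the initialisation to the plain average, and the choice to update only the nearest entry are all assumptions made for illustration:

```python
class Table2D:
    """Maps (p1, p2) to a combined probability via an n x n grid of entries."""

    def __init__(self, n=17, rate=0.02):
        self.n = n
        self.rate = rate  # hypothetical adaptation rate
        step = 1.0 / (n - 1)
        # Start each entry at the average of the two inputs it represents.
        self.t = [[(i * step + j * step) / 2.0 for j in range(n)]
                  for i in range(n)]

    def _locate(self, p):
        # Return the lower grid index and the fractional offset within the cell.
        x = min(max(p, 0.0), 1.0) * (self.n - 1)
        i = min(int(x), self.n - 2)
        return i, x - i

    def predict(self, p1, p2):
        i, fi = self._locate(p1)
        j, fj = self._locate(p2)
        t = self.t
        lo = t[i][j] * (1 - fj) + t[i][j + 1] * fj
        hi = t[i + 1][j] * (1 - fj) + t[i + 1][j + 1] * fj
        return lo * (1 - fi) + hi * fi

    def update(self, p1, p2, y):
        # Move the nearest entry toward the actual outcome y (0 or 1).
        i, fi = self._locate(p1)
        j, fj = self._locate(p2)
        if fi >= 0.5:
            i += 1
        if fj >= 0.5:
            j += 1
        self.t[i][j] += self.rate * (y - self.t[i][j])


table = Table2D()
p = table.predict(0.3, 0.7)  # equals the plain average until the table adapts
table.update(0.3, 0.7, 1)
```

Unlike a weighted sum, the table can learn arbitrary relationships between the two inputs, at the cost of needing enough data to fill its entries.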
See also http://en.wikipedia.org/wiki/PAQ
-- Matt Mahoney