Re: question about entropy and compression method



cp wrote:
) I have many files and I need a method , this was just an example , but this
) example is ok for all of the files !
)
) There are no sequences of more than six consecutive "1"s in any of these
) files, and there are many more "1" sequences than "11" sequences, and so on...

Maybe you should generate statistics on how many times certain length
sequences occur. Check if these statistics match the expected statistics
for independent data. An easy way to do this is to calculate the entropy
as if a sequence of ones is a single symbol and compare that with the
entropy you got for the binary sequence.
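
A rough sketch of that comparison, in Python. The names (run_symbols,
compare, expected_run_probability) are made up for illustration, and I
am assuming one particular way of cutting the data into symbols: each
symbol is the number of "1"s in front of a "0". For independent bits
with P(1) = p, a run of length k should turn up with probability
(1-p) * p^k, which is what you would check your counts against.

```python
from collections import Counter
from math import log2

def entropy(counts):
    """Shannon entropy, in bits per symbol, of a frequency table."""
    total = sum(counts.values())
    return -sum(c / total * log2(c / total) for c in counts.values() if c)

def run_symbols(bits):
    """Cut a '0'/'1' string into run lengths: one symbol per '0', whose
    value is the number of '1's in front of it. A trailing run of '1's
    with no closing '0' is kept as a final symbol (an assumption)."""
    parts = bits.split("0")
    if parts and parts[-1] == "":
        parts.pop()  # the data ended on a '0', so there is no trailing run
    return [len(p) for p in parts]

def expected_run_probability(p_one, k):
    """Probability of a run of exactly k '1's if the bits were independent."""
    return (1 - p_one) * p_one ** k

def compare(bits):
    """Return (total bits as a binary source, total bits as a run source)."""
    symbols = run_symbols(bits)
    binary_total = len(bits) * entropy(Counter(bits))
    run_total = len(symbols) * entropy(Counter(symbols))
    return binary_total, run_total
```

If the run-based total comes out clearly below the binary total on a
large file, the bits are not independent and a model that knows about
runs should pay off. On small files the numbers mean very little.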

By the way, how do you know that a sequence of 7 "1"s cannot occur?

) no, of course, but I need a method, some explanations about what you told
)
) if you have the time, can you explain to me how you do the static order-7 (or,
) as an easier example, the order-4) model?

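Well, an order-7 model is a model that 'looks back' at the previous 7
symbols and uses them as the context that determines the probabilities
for the next symbol. A static model means that the probabilities are
precalculated rather than adapted while compressing, and hardcoded means
they are programmed into the compressor.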
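
To make the 'precalculated' part concrete, here is a minimal Python
sketch of how such a table could be built from sample data. The function
name build_static_model is hypothetical, and storing P(next is "1") per
context is just one possible layout, not necessarily what a real
compressor would use.

```python
from collections import defaultdict

def build_static_model(training_bits, order):
    """Map every context (the previous `order` bits, as a string) seen in
    the training data to the observed probability that the next bit is '1'."""
    counts = defaultdict(lambda: [0, 0])  # context -> [zeros seen, ones seen]
    for i in range(order, len(training_bits)):
        context = training_bits[i - order:i]
        counts[context][int(training_bits[i])] += 1
    return {ctx: ones / (zeros + ones) for ctx, (zeros, ones) in counts.items()}

# Order-1 example: after a '0' the next bit was always '1', and vice versa.
model = build_static_model("0101010", 1)
```

The resulting table is what you would then paste into the compressor
(and the decompressor) as a constant. Contexts that never occur in the
training data get no entry, so a real coder needs some fallback for them.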

See below.

) And how do you use the two probability sets ?

Well, as long as you haven't just seen 6 "1"s in a row, the probabilities
are 75/25, but once you have, the probabilities are 100/0: the next
symbol has to be a "0", so it costs nothing to code.
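
As a sketch of how the two sets get switched, the Python below adds up
the ideal code length, -log2(p) per symbol, which is what an arithmetic
coder would approach. Note the assumptions: I am reading 75/25 as
75% "0" and 25% "1", and ideal_cost_bits is a made-up name, not part of
any existing compressor.

```python
from math import log2

MAX_RUN = 6    # no run of more than six '1's is supposed to occur
P_ONE = 0.25   # assumption: the "25" in 75/25 is the probability of a '1'

def ideal_cost_bits(bits):
    """Total ideal code length in bits under the two-set model."""
    run = 0        # number of consecutive '1's seen just before this symbol
    total = 0.0
    for b in bits:
        if run == MAX_RUN:
            # Second set: 100/0, the next symbol must be a '0'.
            if b == "1":
                raise ValueError("seventh '1' in a row has probability 0")
            p = 1.0
        else:
            # First set: 75/25.
            p = P_ONE if b == "1" else 1.0 - P_ONE
        total += -log2(p)
        run = run + 1 if b == "1" else 0
    return total
```

For example, ideal_cost_bits("1111110") charges 2 bits for each of the
six "1"s and nothing for the forced "0". The decoder tracks the same run
counter, so it always knows which set is in use.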


SaSW, Willem
--
Disclaimer: I am in no way responsible for any of the statements
made in the above text. For all I know I might be
drugged or something..
No I'm not paranoid. You all think I'm paranoid, don't you !
#EOT