Re: Athlon question revisited



David Kanter wrote:
Bill Davidsen wrote:

Let's keep asking, has anyone run similar Athlon X2 chips with 512k and
1M cache on a similar integer workload, and if so was there any
measurable difference?

I'm looking at the 4{2,4,6}00+ chips trying to guess if the larger cache
will reduce memory contention. If I read the description correctly all
of the memory is on one core, so it should make a difference.


That's not right.  The K8 dual core chips all have private caches, but
share the memory controller, I/O and router.  Much smarter than a
shared package chip, but not as good as sharing your last level of
cache, like the POWER4/5, SPARC-IV+, Fujitsu SPARC64-V/VI and Intel's
next gen systems.

More than share the memory controller, does't the memory controller hang off one core and the other use a NUMA-like connection to access the RAM? But essentially the core are still competing for the same bandwidth to memory, however it's laid out.


I'm trying to determine if 1MB cache at 9% slower clock is more or less than a faster chip with 512k cache. Thought someone might have a benchmark.

--
bill davidsen (davidsen@xxxxxxxxxxxxxxxxxxxx)
  SBC/Prodigy Yorktown Heights NY data center
  Project Leader, USENET news
  http://newsgroups.news.prodigy.com
.



Relevant Pages

  • Re: a dozen cpus on a chip
    ... Neither does the core kernel for an operating system. ... They'd surround a shared cache. ... common memory model to its knees would be to set a few small tasks ... One thing I've always thought that CPUs should have is hardware task ...
    (sci.electronics.design)
  • Re: Cached memory never gets released
    ... Stock linux 2.4.26 kernel. ... Due to flash bug 3M of memory gets lost due to font memory getting lost ... The output of "free" cache number steadily grows. ... longer to exhaust all of system memory with the cache. ...
    (Linux-Kernel)
  • Re: Problem: Creating a raw binary string
    ... > While its true that a 64-bit cpu will move twice the data per instruction it ... > Memory bus width plays an important role here and unless it too is widened / ... You are forgetting the two levels of cache in the processor. ... The memory chips are addressed in Row col fashion. ...
    (alt.comp.lang.borland-delphi)
  • Re: Is Greenspun enough?
    ... Most OSes memory map executables directly from the file system so code doesn't pollute the file cache or swap space. ...
    (comp.lang.lisp)
  • Re: Superstitious learning in Computer Architecture
    ... Without a LOT of logic or some other better approach, re-executing the instructions requires re-decoding and it ties up the cache memory bus transferring more data as instructions than the instructions are working on. ... The concept of cache is fundamentally flawed in that it STILL restricts access to one word per clock cycle, when a single modern ALU can easily use 5 plus whatever is eaten up with instruction accesses. ... The size of an optimizing compiler is proportional to the SQUARE of the size of the language times the SQUARE of the complexity of the machine - because all interactions must be considered. ...
    (comp.arch.arithmetic)