Re: Highest possible speed algo in Vb or C or Pseudocode
- From: "Steven G. Johnson" <stevenj@xxxxxxxxxxxx>
- Date: Thu, 3 Jan 2008 14:46:54 -0800 (PST)
On Jan 1, 10:06 pm, "gebe" <g...@xxxxxxxxxxxx> wrote:
I am trying to implement a fast 256 to 2048 (in 2's)algorithm in ASM
(DLL)not to compete with FFTW for versatility but SPEED in specialised
medias .
I am at the moment (using the best I could find) getting 36 usecs(1024
pts) on a 1800Mhz clocked AMD 32 bits and 78 uSec for 2048 points
That's not bad at all for a first try, but you still have a fairly
long way to go.
Let me assume you are using single precision, which should be more
than sufficient for most applications involving such small
transforms. In that case, according to the benchmark numbers on our
web site (http://www.fftw.org/speed/), FFTW can do a 1024-point
complex-data transform in about 10usecs on a 2.4GHz AMD Opteron 275
(32-bit mode), and a 2048-point transform in about 23usecs; naively
scaling by the clock speed to 1.8GHz gives 13 and 31usecs,
respectively. On a newer machine, a 3GHz Core Duo in 64-bit mode, the
times are a bit under 4usecs and 10usecs, respectively. And if your
data are purely real, then you can gain an additional factor of 1.5 to
2.
(It's fairly hard to get anywhere close to optimal performance on a
general-purpose CPU these days if you start with a textbook radix-2
FFT algorithm and try to optimize it, if that is what you are doing.)
Regards,
Steven G. Johnson
.
- Follow-Ups:
- Re: Highest possible speed algo in Vb or C or Pseudocode
- From: Rune Allnor
- Re: Highest possible speed algo in Vb or C or Pseudocode
- References:
- Prev by Date: Re: how to remove such kind of noise
- Next by Date: Using Visual Studio 8 Editor instead of CCS 3.3
- Previous by thread: Re: Highest possible speed algo in Vb or C or Pseudocode
- Next by thread: Re: Highest possible speed algo in Vb or C or Pseudocode
- Index(es):
Relevant Pages
|