Re: jpeg decoding speeds: x86, ARM, dsp, fpga, GPU



amygdala.jones@xxxxxxxxx schrieb:

We're doing some image processing on standard PCs (dual core, 3+ Ghz
x86, etc), and we're looking for more performance.

Image processing in general or JPEG compression/decompression?

We're looking at
various jpeg libraries, and considering hotter boxes (quad core), but
we also need to investigate more specialized hardware, such as a GPU
or an fpga.

I'd say it really depends on what you want to invest. You can buy very
fast software-only solutions (e.g. www.jpg.com has AFAIK the fastest
JPEG on the market), latest NVIDIA hardware supporting CUDA would be
able to run the DCT on the chip, and if you design your own FPGA, that's
of course fastest. Concerning GPUs I'm not so clear how much that will
buy you. While I haven't done any DCT on a GPU, I did test with DWT and
that wasn't worth it (my CPU was faster). The bottleneck is that you
need to load your data into the GPU first before performing the
operation, and that requires a larger overhead than to just do the
transform in the CPU right away. That is, unless you have specific
conditions where one or the other overhead can be avoided (i.e. the data
is on the GPU anyhow, or has to go there anyhow). This would also be my
next question.

Anyone know of any benchmarks that compare the speed of
jpeg decompression on an x86 cpu, an ARM cpu, a DSP chip, an fpga, an
GPU, etc? I know each generation of each type of chip gets more
powerful, and other system factors can make or break performance, but
I'd like to get an idea of what's possible for each chip type and
determine is specialized hardware wil be enough of a win to justify
the investment.

I think it would be worth discussing your constraints as well. My
thinking is that unless very specific conditions hold, your best bet is
likely a software solution with a specialized codec, but maybe you do
not want to run it on PC hardware?

So long,
Thomas
.



Relevant Pages

  • Re: Flickering Toshiba Satellites
    ... freeze can. ... first he says to uncover the chip and hit it with some freeze when the ... That is impossible as likely the GPU will have shut ... the CPU and one for the GPU, however I don't know specifics for the Tosh ...
    (comp.sys.laptops)
  • New Inflection point in comp arch?
    ... one of Nvidia's chip architects gave a talk about their ... highly parallel code on a GPU. ... CPU that can run C-code for problems that can be solved using the SIMD ... They can get married (Intel buys nvidia) ...
    (comp.arch)
  • Re: jpeg decoding speeds: x86, ARM, dsp, fpga, GPU
    ... x86, etc), and we're looking for more performance. ... various jpeg libraries, and considering hotter boxes, but ... jpeg decompression on an x86 cpu, an ARM cpu, a DSP chip, an fpga, an ... I know each generation of each type of chip gets more ...
    (comp.compression)
  • Re: jpeg decoding speeds: x86, ARM, dsp, fpga, GPU
    ... x86, etc), and we're looking for more performance. ... various jpeg libraries, and considering hotter boxes, but ... I know each generation of each type of chip gets more ... never done this myself (I get plenty fast decode speeds using traditional ...
    (comp.compression)
  • Re: Astonishing technological achievement
    ... >> codec hardware or the CPU horsepower required? ... > intensive than Motion JPEG. ... MPEG to compress, but then again, MPEG has a temporal component JPEG ...
    (sci.lang.japan)