PDF converter



Hello everybody,

I have been given the task of working out how to convert PDF documents
to another proprietary vector format.
The main difficulty I have run into is how to extract the text content
(with all metrics and positions) and the corresponding font
information from a PDF file. I think the most convenient way to solve
this is to convert the PDF to some intermediate file format rather
than parse the raw PDF stream myself.
Does anybody know of a suitable, simple file format (XML or something
else) that I could convert PDF to using stable software (preferably
from Adobe), and what software that would be?
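
To make the requirement concrete, here is a rough sketch of the kind of
intermediate XML I have in mind and how I would read it. The element
and attribute names (document, font, page, text, x, y and so on) are
invented for illustration and are not taken from any existing tool; the
names TextRun and read_text_runs are likewise just placeholders.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass
from typing import List

# Hypothetical intermediate format: every element and attribute name
# here is an assumption made for illustration only.
SAMPLE = """\
<document>
  <font id="f0" name="Helvetica" size="12"/>
  <page number="1" width="612" height="792">
    <text x="72" y="720" width="30.5" height="12" font="f0">Hello</text>
    <text x="108" y="720" width="33" height="12" font="f0">world</text>
  </page>
</document>
"""


@dataclass
class TextRun:
    """One piece of text with its position, size and font."""
    page: int
    x: float
    y: float
    width: float
    height: float
    font_name: str
    font_size: float
    content: str


def read_text_runs(xml_source: str) -> List[TextRun]:
    """Collect every <text> element together with its font details."""
    root = ET.fromstring(xml_source)
    # Index the font declarations by id so text runs can look them up.
    fonts = {font.get("id"): font for font in root.iter("font")}
    runs = []
    for page in root.iter("page"):
        page_number = int(page.get("number"))
        for text in page.iter("text"):
            font = fonts[text.get("font")]
            runs.append(TextRun(
                page=page_number,
                x=float(text.get("x")),
                y=float(text.get("y")),
                width=float(text.get("width")),
                height=float(text.get("height")),
                font_name=font.get("name"),
                font_size=float(font.get("size")),
                content=text.text or "",
            ))
    return runs


if __name__ == "__main__":
    for run in read_text_runs(SAMPLE):
        print(run)
```

Anything that gives me roughly this information per text run (position,
extent, font name and size) would be enough to drive the conversion to
the target format.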

Best regards,
Serg.
