Re: Converting PDF to text



Larry Kilgallen wrote:
What would people recommend for converting PDF to Text that:

a) can be purchased on CDROM (no downloading)

b) is compatible with MacOS X 10.5.1

c) can do this to a document that is 400 pages long

I realize the result will be ugly compared to PDF.

I am hoping for something less ugly than reading the PDF file with
a text editor.

I have looked at the Adobe.com website and I do not know enough about
their various offerings that have Acrobat in the name. Besides, you
folks might be less biased.

If the .pdf was made from a text editor then the imbedded text can be extracted. *IF* the .pdf was made by scanning an original paper document, then the .pdf is really just a series of .tiff page pictures. Extracting text from the .tiffs is not easy or maybe not even possible.
.



Relevant Pages

  • Re: OT: Copying text from a PDF
    ... >>Quite often I have trouble extracting text from a PDF. ... >>tool, copy, but on then pasting into my text editor I get garbage. ...
    (sci.electronics.design)
  • Re: OT: Copying text from a PDF
    ... >>Quite often I have trouble extracting text from a PDF. ... >>tool, copy, but on then pasting into my text editor I get garbage. ...
    (sci.electronics.design)
  • Re: OT: Copying text from a PDF
    ... >Quite often I have trouble extracting text from a PDF. ... >tool, copy, but on then pasting into my text editor I get garbage. ... Problems in extracting text are mostly a function of the application ...
    (sci.electronics.design)
  • Re: fillable PDF forms
    ... I just tried it and it is doable and everyone with a computer, running any OS has a text editor available. ... for family group sheets. ... return them as PDF files. ... But you mention family group sheets produced my genealogy programs as reports. ...
    (soc.genealogy.computing)
  • Re: Converting PDF to text
    ... breyfogle wrote: ... then the .pdf is really just a series of .tiff page ... Extracting text from the .tiffs is not easy or maybe not ... pages out as tiff and then used acrobat to import and ocr. ...
    (comp.sys.mac.system)