Re: GSViewer - PS2ASCII Pstotext Unsuccessful pdf_page failed



On Aug 1, 4:38 pm, pstl...@xxxxxxxxxxxxxxx wrote:
On Aug 1, 2:51 pm, Ross Presser <rpres...@xxxxxxxxx> wrote:



On Aug 1, 3:15 pm, pstl...@xxxxxxxxxxxxxxx wrote:

I am new to Ghostscript and GSView. I searched for posts about
PS2ASCII and found a hefty 354. However, I have not yet found any
discussion of the error message that was generated when I attempted
to convert a PDF to text within the GSView application.
Boiled down, the message is:

GSview 4.8 2006-02-25
GPL Ghostscript 8.56 (2007-03-14)
Scanning PDF file
**** Warning: File has a corrupted %%EOF marker, or garbage after %%EOF.

Ghostscript returns error code -8
9278
QS 2 47856 -49278 1 e 48388 -49278
QS 2 48389 -49278 1 d 48989 -49278
QS 2 48989 -49278 1 : 49322 -49278
QM 3
**** Warning: An error occurred while reading an XREF table.
**** The file has been damaged. This may have been caused
**** by a problem while converting or transfering the file.
**** Ghostscript will attempt to recover the data.
**** Warning: There are objects with matching object and generation
**** numbers. The accuracy of the resulting image is unknown.
Ghostscript returns error code -8
3 35421 -22776 1 : 35754 -22776
QM 5

Has anyone experienced this? I wanted to test the application before I
jumped into running a batch process on multiple files. Meanwhile, I'll
read the Ghostscript manual. That may be a darn good place to start.

Are you sure the PDF is not corrupt, as the top of the message says?
Can you post the PDF somewhere? Have you tried it with other PDFs?

What do you get when you try the commandline script pdftotext.cmd?

Hi Ross,

I suspect that either the PDF is corrupt, or the problem has to do
with several embedded Word tables. Unfortunately, I have no way to
post the document on the web.

I had not tried pdftotext. I downloaded a copy. It works fine. Now
I need to learn whether it is possible to call it from within a batch
file and pass it a macro containing 100 file names. Thanks for the
lead... learning one small step at a time.

pdftotext works fine in a batch file, but only takes one file at a
time, so you'll have to loop.
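
As a rough illustration of that loop, here is a minimal Python sketch (a plain batch-file FOR loop would do the same job). It assumes pdftotext is on the PATH and relies only on its basic form, pdftotext input.pdf output.txt. The helper names build_commands and convert_all are made up for this example and are not part of xpdf.

```python
import subprocess
from pathlib import Path


def build_commands(pdf_paths, exe="pdftotext"):
    """Return one pdftotext command (as an argument list) per input PDF."""
    commands = []
    for pdf in pdf_paths:
        pdf = Path(pdf)
        txt = pdf.with_suffix(".txt")  # report.pdf -> report.txt
        commands.append([exe, str(pdf), str(txt)])
    return commands


def convert_all(pdf_paths, exe="pdftotext"):
    """Run pdftotext once per file and return the PDFs that failed."""
    failed = []
    for cmd in build_commands(pdf_paths, exe):
        # pdftotext takes a single input file, so each PDF gets its own call.
        result = subprocess.run(cmd)
        if result.returncode != 0:
            failed.append(cmd[1])
    return failed


# Example use (assumption: the PDFs sit in the current directory):
#   failed = convert_all(sorted(Path(".").glob("*.pdf")))
#   print("could not convert:", failed)
```

Collecting the failures instead of stopping at the first one means a single damaged PDF will not halt a run over 100 files.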

Confession: I mistyped my helpful leading question. I meant to ask
what happens when you use ps2ascii.bat -- the command-line version of
converting to text using Ghostscript. As you now know, pdftotext is a
separate program (part of the xpdf package) and works quite
differently.

You might also want to take a look at pdftohtml, which, despite its
name, has an option for XML output. (pdftohtml is also derived from
xpdf.)
http://pdftohtml.sourceforge.net
