Re: GSViewer - PS2ASCII Pstotext Unsuccessful pdf_page failed
- From: Ross Presser <rpresser@xxxxxxxxx>
- Date: Thu, 02 Aug 2007 12:58:12 -0700
On Aug 1, 4:38 pm, pstl...@xxxxxxxxxxxxxxx wrote:
On Aug 1, 2:51 pm, Ross Presser <rpres...@xxxxxxxxx> wrote:
On Aug 1, 3:15 pm, pstl...@xxxxxxxxxxxxxxx wrote:
I am new to Ghostscript and GSView. I searched for posts about
PS2ASCII and found a hefty 354. However, I have not yet found any
discussion of the error message that was generated when
I attempted to convert a PDF to text within the GSView application.
Boiled down, the message is:
GSview 4.8 2006-02-25
GPL Ghostscript 8.56 (2007-03-14)
Scanning PDF file
**** Warning: File has a corrupted %%EOF marker, or garbage after %%EOF.
Ghostscript returns error code -8
9278
QS 2 47856 -49278 1 e 48388 -49278
QS 2 48389 -49278 1 d 48989 -49278
QS 2 48989 -49278 1 : 49322 -49278
QM 3
**** Warning: An error occurred while reading an XREF table.
**** The file has been damaged. This may have been caused
**** by a problem while converting or transfering the file.
**** Ghostscript will attempt to recover the data.
**** Warning: There are objects with matching object and generation
**** numbers. The accuracy of the resulting image is unknown.
Ghostscript returns error code -8
3 35421 -22776 1 : 35754 -22776
QM 5
Has anyone experienced this? I wanted to test the application before I
jumped into using a batch process on multiple files. Meanwhile, I'll
read the Ghostscript manual. May be a darn good place to start.
Are you sure the PDF is not corrupt, as the top of the message says?
Can you post the PDF somewhere? Have you tried it with other PDFs?
What do you get when you try the commandline script pdftotext.cmd?
Hi Ross,
I suspect that either the PDF is corrupt, or the situation has to do
with several embedded Word tables. Unfortunately, I have no way to
post the document to the web.
I had not tried pdftotext. I downloaded a copy. It works fine. Now
I'll need to learn whether it is possible to call it from a batch file
and pass a macro containing 100 file names. Thanks for the lead...
learning one small step at a time.
pdftotext works fine in a batch file, but only takes one file at a
time, so you'll have to loop.
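For what it's worth, the loop can be sketched in a few lines of Python rather than a batch file. This is only a minimal sketch under assumptions: it presumes pdftotext is on your PATH, that each text file should sit next to its PDF with a .txt extension, and the helper names build_command and convert_all are made up for illustration. It relies only on the basic "pdftotext PDF-file text-file" calling form.

```python
import subprocess
from pathlib import Path


def build_command(pdf_path):
    """Return the pdftotext argument list for one PDF.

    Assumption: the output goes next to the input, with a .txt extension.
    """
    pdf_path = Path(pdf_path)
    return ["pdftotext", str(pdf_path), str(pdf_path.with_suffix(".txt"))]


def convert_all(pdf_paths, run=subprocess.run):
    """Run pdftotext once per file; return the inputs whose conversion failed."""
    failed = []
    for pdf in pdf_paths:
        result = run(build_command(pdf))
        # A non-zero exit status means pdftotext reported a problem.
        if result.returncode != 0:
            failed.append(str(pdf))
    return failed
```

To try it on a folder, something like convert_all(sorted(Path(".").glob("*.pdf"))) should do; the returned list tells you which files to look at by hand.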
Confession: I mistyped my helpful leading question. I meant to ask
what happens when you use ps2ascii.bat -- the command-line version of
converting to text using Ghostscript. As you now know, pdftotext is a
separate program (part of the xpdf package) and works quite
differently.
You might also want to take a look at pdftohtml, which despite its
name has the option for XML output. (pdftohtml is also derived from
xpdf.)
http://pdftohtml.sourceforge.net