Re: Converting .doc files to .xml
- From: "Steve Ball" <Steve.Ball@xxxxxxxxxxxxxx>
- Date: 6 Oct 2005 05:34:04 -0700
How about using MS Word 2003? The "File->Save As" dialog box allows
you to select XML as a file type. This saves the document as WordML,
which has all of the metadata you are looking for.
If you need this to be command line, then your best bet is to probably
write some VBA or C# code to fire up Word in the background.
HTHs,
Steve Ball
Explain
http://www.explain.com.au/
Jordi Cuenca wrote:
> Hi,
>
> I am looking for an already developped tool that could convert Word .doc
> files (I do not mind the version) to .xml format.
>
> That tool should be a command line tool and I should be able to convert
> several files from .doc to .xml and, very important, I should be able to
> get the non-text data of the file (I mean, for example, author, last
> printing time and so on).
>
> The idea I have of it is somethink like:
>
> c:\> doc2xml [list of parameters] *.doc *.xml
>
> I've been trying with antiword but I did not succeed because it is
> asking me for a DTD file that I do not know from where to get it.
>
> Thank you.
>
> Jordi.
> despertaferro@xxxxxxxxxxxxxxxx
.
- Follow-Ups:
- Re: Converting .doc files to .xml
- From: Harrie
- Re: Converting .doc files to .xml
- References:
- Converting .doc files to .xml
- From: Jordi Cuenca
- Converting .doc files to .xml
- Prev by Date: Re: At least one child
- Next by Date: Defining different elements in XML Schema
- Previous by thread: Converting .doc files to .xml
- Next by thread: Re: Converting .doc files to .xml
- Index(es):
Relevant Pages
|