Fixing mangled mbox 'From ' header lines?
- From: Mark <mark@xxxxxxxxxxx>
- Date: Thu, 07 Jul 2005 21:42:31 GMT
Hello,
I have an archive of a 10 year old public mailing list that I plan to
import into GoogleGroups for archival and retrieval. There are over
27000 messages in the archive. It is in standard 'mbox' format.
In preparation for uploading to Google, I've been doing a lot of
cleanup of the archive -- finding duplicate and off-topic posts,
fixing some mangled headers, removing excess EOL spaces, etc. The
tools I've used for this cleanup are 'vi' and 'The Bat' email client.
One problem I notice is that over 2000 messages have badly misdated
'From ' header fields (the first line in the header). The date in the
field is essentially bogus (however, the data in the 'Date:' and the
various 'Received:' fields look correct.)
So, is there a tool or script which will fix the 'From ' lines?
If you can, post your reply to this newsgroup.
Thanks!
Mark
.
- Follow-Ups:
- Re: Fixing mangled mbox 'From ' header lines?
- From: Frank Slootweg
- Re: Fixing mangled mbox 'From ' header lines?
- From: AK
- Re: Fixing mangled mbox 'From ' header lines?
- From: Alan Connor
- Re: Fixing mangled mbox 'From ' header lines?
- Prev by Date: Re: How Can I Track Down a Spammer?
- Next by Date: Re: Fixing mangled mbox 'From ' header lines?
- Previous by thread: domain registration + disposable email options?
- Next by thread: Re: Fixing mangled mbox 'From ' header lines?
- Index(es):
Relevant Pages
|