Re: Fixing mangled mbox 'From ' header lines?



On comp.mail.misc, in
<ib8rc19n5v5g9ia7jojtsmta1nunklor99@xxxxxxx>, "Mark" wrote:

> Hello,
>
> I have an archive of a 10-year-old public mailing list
> that I plan to import into Google Groups for archival and
> retrieval. There are over 27000 messages in the archive. It is
> in standard 'mbox' format.
>
> In preparation for uploading to Google, I've been doing a lot
> of cleanup of the archive -- finding duplicate and off-topic
> posts, fixing some mangled headers, removing excess EOL spaces,
> etc. The tools I've used for this cleanup are 'vi' and 'The
> Bat' email client.
>
> One problem I notice is that over 2000 messages have badly
> misdated 'From ' header fields (the first line in the
> header). The date in the field is essentially bogus (however,
> the data in the 'Date:' and the various 'Received:' fields look
> correct.)
>
> So, is there a tool or script which will fix the 'From ' lines?
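
For what it's worth, the fix described above (trust 'Date:', distrust
the 'From ' line) can be sketched in a few lines of Python. This is
only a rough sketch under some assumptions, not a tested tool: the
function names are made up for illustration, each 'Date:' header is
assumed to sit on a single unfolded line, a 'From ' line only counts
as a separator at the start of the file or after a blank line, and
the rebuilt timestamp is written in UTC. A 'From ' line whose message
has no parseable 'Date:' is left as it was.

```python
import time
from datetime import timezone
from email.utils import parsedate_to_datetime


def rebuild_from_line(from_line, date_value):
    """Return from_line with its timestamp replaced by date_value (in UTC)."""
    parts = from_line.split(None, 2)
    if len(parts) < 2:
        return from_line  # no envelope sender to keep; leave untouched
    try:
        dt = parsedate_to_datetime(date_value)
    except (TypeError, ValueError):
        return from_line  # unparseable Date: value; leave untouched
    if dt.tzinfo is not None:
        dt = dt.astimezone(timezone.utc)
    # Keep the original line ending so the rest of the file is unchanged.
    eol = "\r\n" if from_line.endswith("\r\n") else "\n"
    # time.asctime() yields the 'Tue Nov 15 08:12:31 1994' style used in mbox.
    return "From %s %s%s" % (parts[1], time.asctime(dt.timetuple()), eol)


def fix_from_lines(lines):
    """Rewrite each 'From ' separator using the Date: header that follows it."""
    out = []
    n = len(lines)
    prev_blank = True  # the first line of the file may be a separator
    for i, line in enumerate(lines):
        if prev_blank and line.startswith("From "):
            # Scan this message's header block, up to the first blank line.
            j = i + 1
            while j < n and lines[j].strip():
                if lines[j].lower().startswith("date:"):
                    line = rebuild_from_line(line, lines[j][5:].strip())
                    break
                j += 1
        out.append(line)
        prev_blank = not line.strip()
    return out
```

To run it over a whole archive, one option is to read the file with
open(path, encoding="latin-1", newline="").readlines(), pass the list
to fix_from_lines(), and write the result to a new file opened the
same way; latin-1 maps every byte to one character, so messages in
other character sets should pass through unchanged. Work on a copy.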
>
> If you can, post your reply to this newsgroup.

That's how it's normally done, and it is obvious that you know
that full well.

>
> Thanks!
>
> Mark
>

Why would someone need to cover their tracks in Google Groups
by using a common first name as an alias, a news server that
doesn't give their IP, and a forged Message-ID, for something
like this?

Not to mention that there are DNS problems resolving
nowhere.com, which is (marginally) registered by Tucows.

Piss off.

AC

--
http://angel.1jh.com./nanae/kooks/alanconnor.html
http://home.earthlink.net/~alanconnor/