Re: What is with the From field containing UTF-8 stuff?



In news.software.readers on Mon, 10 Sep 2007 09:51:17 -0300, Shmuel
(Seymour J.) Metz <spamtrap@xxxxxxxxxxxxxxxxxxxxxxxxxx> wrote:

In <slrnfea9tl.itq.pjr@xxxxxxxxxxxxxx>, on 09/10/2007
at 11:17 AM, Peter J Ross <pjr@xxxxxxxxxxxxxxx> said:

RFC 3977, from which I quote part of section 10.2, page 101:

That does not permit Google to use UTF-8 in headers; au contraire, it says
not to.

It rightly discourages Google, and the rest of us, from doing so, but
"SHOULD [NOT]" remains different from "MUST [NOT]".

Worse, that RFC relates to NNTP, not to article formats. The
actual rules are the more restrictive of those in RFC 3977 and RFC1036++.

RFC 1036 can be read as forbidding even MIME-encoded headers, which
you presumably don't want to do.

Incidentally, RFC 3977 doesn't recognise any standard for article
formats later than RFC 1036, so your reference to RFC 1036++ is
questionable. "Son-of-RFC-1036" and later drafts have no authority.

So UTF-8 is permitted to the extent that SHOULD and SHOULD NOT are more
permissive than MUST and MUST NOT.

FSVO permitted. Failure to use MUST NOT is not a grant of permission.

RFC 2119 states:

1. MUST This word, or the terms "REQUIRED" or "SHALL", mean that the
definition is an absolute requirement of the specification.

2. MUST NOT This phrase, or the phrase "SHALL NOT", mean that the
definition is an absolute prohibition of the specification.

3. SHOULD This word, or the adjective "RECOMMENDED", mean that there
may exist valid reasons in particular circumstances to ignore a
particular item, but the full implications must be understood and
carefully weighed before choosing a different course.

4. SHOULD NOT This phrase, or the phrase "NOT RECOMMENDED" mean that
there may exist valid reasons in particular circumstances when the
particular behavior is acceptable or even useful, but the full
implications should be understood and the case carefully weighed
before implementing any behavior described with this label.


I don't think the loophole should be left open, and RFC 3977 certainly
envisages that a future replacement for RFC 1036 MAY close it but, at
present, if Google or anybody else wants to allow 8-bit header fields,
all they have to say is that they've "understood the implications" and
"weighed the case" before departing from the recommendation.

Anyway, I don't think you and I disagree about the fundamentals.

1. There was nothing technically wrong with the post quoted at the
beginning of this thread.

2. Any person or system that sends non-ASCII characters in Usenet
headers without encoding them into ASCII is likely to cause problems,
and should be discouraged from doing so.

All we disagree about is whether we can use the RFCs to support what
ought to be obvious even without consulting the RFCs.


Incidentally, where are your References: headers? Your replies to me
are starting new threads, which they used not to do.

--
PJR :-)
.



Relevant Pages

  • Re: sed usage question
    ... Standardizing Network Mail Headers from September 1973 ... I know it is in the Mail and Http RFC having had to debug some agents. ... carriage or printer to start of line and feeds the paper to the next ...
    (uk.comp.sys.mac)
  • Re: NAI Webshield SMTP for WinNT MIME header vuln that allows BadTrans to pass]
    ... I'd be real interested to know how you determined that the boundary field ... According to the RFC you referenced, ... it isn't WebShield's job to correctly parse headers. ... incorrectly formed header is all it takes to bypass virus detection, ...
    (Bugtraq)
  • Re: Remove Internal Hops from Header
    ... Just because it is an external process, ... An MTA should deliver mail. ... Removal of headers is considered bad because it hampers diagnosing routing ... Please be aware of this clause in RFC 2821: ...
    (comp.mail.sendmail)
  • Re: list of (almost) all HTTP headers?
    ... A quick google ("rfc cookie") suggests that RFC 2965 defines ... HTTP specs, but X- headers in HTTP seem to be used in the same way as ...
    (comp.infosystems.www.servers.unix)
  • Re: anyone confirm header info please?
    ... >> the spamcop Web Page/Form, ... >> You need to only Paste in the Headers ... RFC 2822 Internet Message Format. ...
    (uk.people.silversurfers)