OT: HTML Problem -- anyone help?



I just put up this week some MP3s of the recording the NYU Collegium
did at the end of the semester here:

http://www.dfenton.com/Collegium/Tenebrae/

It's holy week music of Couperin and Charpentier.

The problem is that with 5 of the 6 files being Lessons of Tenebrae,
I named them with the word "Leçon" in the file names.

Now, in all Mozilla-based browsers, the page works great, but in IE
and Opera, something wrong is being sent to the web server, and it
can't open the files.

I've encoded all of them is HTML with the HTML entity, as you can
see from viewing source:

<LI><a href="01-Couperin-Magnificat.mp3">
Couperin-Magnificat, 11:36 (16.7MB)</a></li>
<LI><a href="02-Charpentier-Le&ccedil;on 1.mp3">
Charpentier-Le&ccedil;on 1, 8:28 (12.2MB)</a></li>
<LI><a href="03-Charpentier-Le&ccedil;on 2.mp3">
Charpentier-Le&ccedil;on 2, 9:00 (12.9MB)</a></li>

and so forth.

It works just fine on all Firefox-based browsers and NS4.x, but
fails in IE and Opera. The second URL gives this 404 in IE6:

The requested URL /Collegium/Tenebrae/02-Charpentier-Leçon 1.mp3
was not found on this server.

and that's the same 404 that comes through with Opera.

Should filenames not be HTML entity encoded?

Is there something I can do with the document type declaration to
fix this? I just checked and added the content-type header in
http://www.dfenton.com/Collegium/Tenebrae/index_utf8.html:

<meta http-equiv="Content-Type" content="text/html; charset=utf-8">

and now the problem happens on all browsers. So, it seems IE and
Opera assume utf-8, whereas Mozilla doesn't.

What *should* the content-type header be?

Here are the examples:

1. my original, with no content-type header and HTML encoding of the
c cedillas:

http://www.dfenton.com/Collegium/Tenebrae/index_HTML.html

2. here's the HTML encoded version with UTF-8 content-type:

http://www.dfenton.com/Collegium/Tenebrae/index_utf8.html

3. here's the version with no content-type and with the HREFs having
raw "ç" instead of HTML entities:

http://www.dfenton.com/Collegium/Tenebrae/index_cedilla.html

4. and here's raw c cedilla (no HTML encoding), as in 3), but with
the UTF-8 content-type:

http://www.dfenton.com/Collegium/Tenebrae/index_utf8_cedilla.html

That one produces *this* 404 message in IE:

The requested URL /Collegium/Tenebrae/02-Charpentier-Le篮 1.mp3
was not found on this server.

(the square block that I see in my email reader after the c cedilla
is rendered in IE as a vertical bar, top aligned)

This is the 404 message for the same file in Mozilla and Opera:

The requested URL /Collegium/Tenebrae/02-Charpentier-Le�on 1.mp3
was not found on this server.

And that last file works just FINE in NS4.x!

What do people in France do? Not use c cedilla in file names?

--
David W. Fenton http://www.bway.net/~dfenton
dfenton at bway dot net http://www.bway.net/~dfassoc
.



Relevant Pages

  • Re: Simple question
    ... formatting in an email message generally ... message as having 'text/html' for the Content-type header. ... HTML in the message, rather than the formatted text inside the tags. ... and a formatted html branch for everyone else. ...
    (perl.beginners)
  • Oxygen3 24h-365d [SquirrelMail 1.4.9a update fixes multiple vulnerabilities - 12/7/06]
    ... (On December 7, 1732, The Royal Opera House opens in London) ... from incorrect filtering of certain parameters before they are sent to ... An attacker could inject HTML code or scripts through these ... vulnerabilities and run it on the user's mail client. ...
    (Fedora)
  • Re: Opera doesnt like Javascript object, but IE, Firefox, Safari do
    ... which allows a variable number of files to be uploaded from an HTML ... But Opera does. ... which defines the MultiSelector object. ... In the head script, you are not using defer flag? ...
    (comp.lang.javascript)
  • [NEWS] Opera FTP View Cross-Site Scripting Vulnerability
    ... Opera allows running Malicious Scripts due to a bug in 'FTP view' feature. ... allowing embedding of malicious HTML and JavaScript code. ... The information in this bulletin is provided "AS IS" without warranty of any kind. ... In no event shall we be liable for any damages whatsoever including direct, indirect, incidental, consequential, loss of business profits or special damages. ...
    (Securiteam)
  • Re: Diacritical marks in array dont translate
    ... >>> encoding you specified in your HTML. ... > header transmitted by a server. ... MUST or MAY allow the Content-Type header to be overridden by the `meta' ... There are used UAs out there which does not support Unicode, ...
    (comp.lang.javascript)