Re: Spidering home page - and nothing else



Paul B <lamewolf2004@xxxxxxxxx> wrote:

On 16 May 2006 11:20:16 GMT, John Bokma <john@xxxxxxxxxxxxxxx> wrote:

Paul B <lamewolf2004@xxxxxxxxx> wrote:

On 16 May 2006 10:39:55 GMT, John Bokma <john@xxxxxxxxxxxxxxx>
wrote:


[ .. ]

Nope. Please read:
<http://johnbokma.com/windows/apache-virtual-hosts-xp.html>

[ .. ]

1: I use a hosting company

So do I. Note that I am not saying to host your site on your computer
at home (read the article). What I am saying, if your local copy has
issues when you use / instead of index.html it's because you're using
a file system, and not a webserver. Despite an URL looking like a file
path, those are two different things. For example, your filesystem is
not going to map / to the index.html and hence people "fix" this by
using index.html.

My local copy does not have issues. Works fine for me.

Then you either use a webserver, or have to use weird paths and refer to
index.html to make it work. I wouldn't call that "fine". It it worked
fine, I wouldn't have several of my projects running under Apache.

I use
dreamweaver for sites as well as by hand, so I could alter it myself,
but why should I ...........*no one* here has proved that by having
both .com and .com/index.html is bad - either for SERPS or otherwise.

Then you haven't read what I wrote. For the outside world .com/ and
..com/index.html are two different objects, and hence they are (AFAIK)
*both* cached. This means that if someone goes to http://example.com/,
next goes to http://example.com/foo.html and then clicks on
http://example.com/index.html the browser will fetch (again AFAIK, but I
can check it tomorrow) the index.html because the browser doesn't know
that index.html is *indentical* to / since the webserver does this
mapping, not your browser. Nor is it a "rule". Like I said, you can have
both a / and an index.html without them looking identical.

Probably not going to convince you, but it's bad bandwidth wise (and
user experience, since he/she has to wait again for the same page).


2: you don't use mod_rewrite for html

I do use it because [a] I now and then make a mistake with naming my
HTML pages (for example I called frogs toads recently :-) ) and [b]
now and then I get people (or software) making links with mistakes in
them. I rather test at home first. Note that I don't say that you
*have to* use mod_write, I only stated: you can test things like
mod_rewrite. Since the article describes how to install and configure
a web server, you can bypass all limitations a file system has when
testing a website.

Yes. Apache ......... I am not on an Apache server.

For serving HTML and images that doesn't matter much. And for several
other things ditto.

You don't *have to* use a webserver, I mean, if you prefer it the hard
way, be my guest. But don't give arguments like "hosting company" and
"don't use mod_rewrite" for not using it. Both are irrelevant IMO :-D.

"prefer it the hard way" LOL
So you are suggesting that I :
1: Install an apache server
2: get my head round it all

If you read the aforementioned article you are able to do so. I did
quite my best to explain the most important things in very great detail.


just to alter /index to .us .......... sounds like I will be heating
up a swimming pool to boil an egg ........... and for what ?

I can give you plenty of reasons why it's a good thing to read that
article, but they are my reasons (but I think most are generally valid).

No proof that it affects anything.

If you want it, I can give it on Wednesday. I am quite sure that I am
right (the caching story given above), but when I am not 100% sure I
write IIRC/AFAIK. OTOH I am afraid you will say it's not important.

--
John

Net::Google and Perl: http://johnbokma.com/perl/net-google.html
.



Relevant Pages

  • Re: Re-enable disabled form elements when user goes Back
    ... Object inferences are still browser sniffing. ... avoiding object inference *completely* seems ... go without saying) that my ideas about browser scripting have evolved ... What causes memory leaks is unfortunately a consequence of certain ...
    (comp.lang.javascript)
  • Re: Re-enable disabled form elements when user goes Back
    ... Object inferences are still browser sniffing. ... You sure can avoid closing element over and so creating a leak. ... go without saying) that my ideas about browser scripting have evolved ... the library creates expandos in some modules. ...
    (comp.lang.javascript)
  • Re: Validating file names
    ... running Novel, from your G: running Linux file system, from your H: running ... I am not saying one does not exist! ... file file and handle any exceptions that were thrown. ... If an exception is thrown then the specified ...
    (microsoft.public.dotnet.languages.vb)
  • Re: Do antivirus programs work with all browsers
    ... exploit code prior to be handed off to the Internet Browser. ... it seems you are saying that McAfee has some special functional ... Retail Mcafee will not install. ...
    (alt.comp.anti-virus)
  • Re: I cant see my Web site from Internet Explorer because I use LINUX!!!!!!!!!!!!!!!!
    ... >> I'm not saying that I agree with a IE only site, ... > remote browser? ... those customers are pure idiots. ... > you're not part of this category as you recognise standards.. ...
    (comp.lang.php)