sks-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Sks-devel] sks not serving robots.txt?


From: Jason Harris
Subject: Re: [Sks-devel] sks not serving robots.txt?
Date: Tue, 31 Aug 2004 23:13:16 -0400
User-agent: Mutt/1.4.2.1i

On Tue, Aug 31, 2004 at 04:45:34PM -0700, Jack Cummings wrote:
 
> Upon seeing this, I went to create a robots.txt file for the sks server,
> and noticed that I already had one. It just wasn't being served. 
> 
> In fact, if I try and get it, sks won't:
> 
> 2004-08-31 16:33:20 Get request: <ADDR_INET xx.xx.xx.xx:57064> => /robots.txt
> 2004-08-31 16:33:20 Page not found: /robots.txt

OK, as a dirty hack, you can add a rule to dbserver.ml around line 364
to let SKS recognize .txt files:

                 if extension = ".jpg" || extension = ".gif" || extension = 
".png" || extension = ".txt"
                 then
                   let base = base </> (1,0) in
                   let image = read_file ~binary:true (convert_web_fname base) i
n
                   cout#write_string image;
                   (match extension with
                      | ".jpg" -> "image/jpeg" | ".gif" -> "image/gif"
                      | ".png" -> "image/png"  | ".txt" -> "text/plain"
                      | _ -> raise (Wserver.Misc_error
                                      ("internal error: no mimetype " ^

If the crawlers need CR+LF endings, you can cut and paste the code just
above line 364 that handles index.html or add the carriage returns into
the robots.txt file itself.

The code can be made more generic to handle all .html and .txt files in
./web/ as well.

-- 
Jason Harris           |  NIC:  JH329, PGP:  This _is_ PGP-signed, isn't it?
address@hidden _|_ web:  http://keyserver.kjsl.com/~jharris/
          Got photons?   (TM), (C) 2004

Attachment: pgp9JqQrz8xuk.pgp
Description: PGP signature


reply via email to

[Prev in Thread] Current Thread [Next in Thread]