[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: lynx-dev URL Case Sensitivity in Crawler
From: |
Thomas Dickey |
Subject: |
Re: lynx-dev URL Case Sensitivity in Crawler |
Date: |
Fri, 28 Dec 2001 12:55:04 -0500 |
User-agent: |
Mutt/1.2.5i |
On Fri, Dec 28, 2001 at 10:43:46AM -0700, address@hidden wrote:
> In a recent note, Thomas Dickey said:
>
> > Date: Fri, 28 Dec 2001 12:07:21 -0500
> >
> > On Fri, Dec 07, 2001 at 10:24:52AM -0800, Brian S Queen wrote:
> > > Version: 2.8.4
> > >
> > > Problem: Capitalization changes in the hostname of a URL make the URL
> > > acceptable when the URL should be rejected.
> > >
> > > Solution: Force case insensitivity on the host part of the URL during
> > > URL
> > > comparison.
> >
> > I understand that a pathname on a host is case-sensitive (usually).
> > But I'd thought hostnames were not. Anyone know the applicable RFC?
> >
> I believe that's what he's reporting/requesting. I haven't used
at first glance, it sounded like that, but the "should be rejected" is
confusing me...
> crawl. I assume that if he selects a filter which is supposed
> to prune away http://foo-bar/ he rightly sees a problem if Lynx
> traverses http://Foo-Bar/
--
Thomas E. Dickey <address@hidden>
http://invisible-island.net
ftp://invisible-island.net
; To UNSUBSCRIBE: Send "unsubscribe lynx-dev" to address@hidden