
Re: [Bug-wget] really no "wget --list http://..." ?


From: Micah Cowan
Subject: Re: [Bug-wget] really no "wget --list http://..." ?
Date: Fri, 20 Mar 2009 11:14:44 -0700
User-agent: Thunderbird 2.0.0.19 (X11/20090105)

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Denis wrote:
> Micah,
>   not to be dense, but is there really no way to "wget --list http://..."
> a directory without downloading all its files ?
> To browse any file system, local or remote, I want to be able to LIST it 
> first.
> I gather that there's no www variant of a Unix-like file system
> (tree structure independent of file contents => very fast ls -R)
> but a WFS, web file system, would sure simplify life

HTTP has no concept of a directory, and provides no way to list it, so
no. The WebDAV extensions _do_ provide such a thing, but they're not
commonly implemented on web servers (especially without authentication),
so there'd be little point in making Wget use that.
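
For servers that do speak WebDAV, a listing comes from a PROPFIND
request with a "Depth: 1" header, answered by a 207 Multi-Status XML
body with one DAV: "response" element per entry. The following is only a
rough sketch under that assumption, not something Wget does; the names
parse_multistatus and list_collection are made up for illustration, and
many servers will simply refuse the request or demand authentication.

```python
import urllib.request
import xml.etree.ElementTree as ET


def parse_multistatus(xml_bytes):
    """Return the href of every <response> in a WebDAV multistatus body."""
    root = ET.fromstring(xml_bytes)
    # Only take the href that is a direct child of each DAV: response
    # element, so hrefs nested inside property values are ignored.
    hrefs = root.findall("{DAV:}response/{DAV:}href")
    return [el.text.strip() for el in hrefs if el.text]


def list_collection(url):
    """Send PROPFIND with Depth: 1 to url (hypothetical helper)."""
    request = urllib.request.Request(
        url, method="PROPFIND", headers={"Depth": "1"}
    )
    with urllib.request.urlopen(request) as response:
        return parse_multistatus(response.read())
```

The first href returned is normally the collection itself, followed by
its immediate members.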

It _could_ be useful for wget to download a given URL, parse out its
links, and spit them out as a list, but wget doesn't currently do that
either. Even if it did, there would be no way to guarantee that the
list represents the complete contents of the "directory": all wget
would see is whatever links happen to be on that one page. Unless the
page is an automatically generated index, it's unlikely to be a very
good representation of directory contents. But implementing that
would not be a high priority for me at this time (patch, anyone?).

In the meantime, the usual suggestion is to have wget download the
single HTML page, and then parse out the links yourself with a suitable
perl/awk/sed script.
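
As one possible shape for such a script, here is a minimal sketch in
Python rather than perl/awk/sed. It assumes the page has already been
saved with something like "wget -q -O index.html URL"; the names
LinkCollector and extract_links are invented for this example. It
reports only the links found in <a href="..."> tags on that one page,
with the same caveat about completeness as above.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect href values from <a> tags, resolved against a base URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Turn relative links into absolute URLs.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html_text, base_url):
    """Return the links on one HTML page, in document order."""
    parser = LinkCollector(base_url)
    parser.feed(html_text)
    parser.close()
    return parser.links
```

Reading the saved index.html and calling extract_links(text, URL) with
the URL the page was fetched from gives the list to print or filter.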

- --
Micah J. Cowan
Programmer, musician, typesetting enthusiast, gamer.
Maintainer of GNU Wget and GNU Teseq
http://micah.cowan.name/
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.9 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iEYEARECAAYFAknD3RQACgkQ7M8hyUobTrHFlgCfQTcSoCAkgVGPEcnBMI0GlojL
jqAAn0cK+PcKDEuZwFKyEdCoA9EFQn3N
=ujth
-----END PGP SIGNATURE-----



