wget-dev

Re: wget2 | Super slow recursive download when using filter (#686)


From: @rockdaboot
Subject: Re: wget2 | Super slow recursive download when using filter (#686)
Date: Sun, 01 Dec 2024 18:11:41 +0000



Tim Rühsen commented: 
https://gitlab.com/gnuwget/wget2/-/issues/686#note_2235285390


There seem to be two issues.

1. The `fnmatch` pattern only matches if you use `--accept=*/kernel-firmware-*.txz`
(this needs to be amended to be wget1-compatible).

2. Wget2 is very thorough - despite `--accept`, it checks every file it finds
to see whether it is CSS, HTML, or Atom/RSS. This requires a HEAD request and
a MIME type check for each file. The old wget1 just looks at the filename
extension (`.html` or `.css`).
If you want to `fnmatch` without a HEAD request, use `--filter-urls`.
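
As a rough illustration of issue 1, the sketch below uses Python's `fnmatch` module as a stand-in for the C `fnmatch` call. It assumes the pattern is applied to the whole URL path rather than to the file name alone, which would explain why the `*/` prefix is needed. The path is made up for illustration, and Python's matching rules are not guaranteed to be identical to what Wget2 does internally.

```python
from fnmatch import fnmatchcase

# Hypothetical URL path, used only for illustration.
path = "/pub/example/kernel-firmware-20241125-noarch-1.txz"

bare_pattern = "kernel-firmware-*.txz"
prefixed_pattern = "*/kernel-firmware-*.txz"

# Matching against the full path: the bare pattern must match from the
# first character, so the leading directories make it fail.
bare_matches = fnmatchcase(path, bare_pattern)

# The "*/" prefix absorbs the leading directories.
prefixed_matches = fnmatchcase(path, prefixed_pattern)

# Matching against the file name only (assumed here to be the
# wget1-compatible behaviour): the bare pattern is enough.
basename = path.rsplit("/", 1)[-1]
basename_matches = fnmatchcase(basename, bare_pattern)

print(bare_matches, prefixed_matches, basename_matches)
```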

While 1 needs to be checked, 2 means that Wget2 potentially looks much deeper
into websites to find your matching files. The downside is that it takes
longer and makes more requests (mostly HEAD requests).
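
The trade-off in issue 2 can be sketched as two ways of deciding which found files are worth parsing for further links. This is a simplified model, not Wget2's actual code: `FAKE_SERVER`, the paths, and both function names are hypothetical, and the HEAD requests are simulated with a dictionary lookup instead of real network traffic.

```python
from pathlib import PurePosixPath

PARSEABLE_EXTENSIONS = {".html", ".css"}
PARSEABLE_MIME_TYPES = {
    "text/html",
    "text/css",
    "application/atom+xml",
    "application/rss+xml",
}

# Hypothetical server: path -> Content-Type that a HEAD request would report.
FAKE_SERVER = {
    "/index.html": "text/html",
    "/style.css": "text/css",
    "/listing": "text/html",          # HTML page without a telltale extension
    "/feed": "application/atom+xml",
    "/kernel-firmware-1.txz": "application/octet-stream",
}


def parseable_by_extension(paths):
    """wget1-like check: look only at the file name, no requests needed."""
    found = {p for p in paths if PurePosixPath(p).suffix in PARSEABLE_EXTENSIONS}
    return found, 0


def parseable_by_mime(paths):
    """Wget2-like check: one (simulated) HEAD request per found file."""
    found = set()
    head_requests = 0
    for p in paths:
        head_requests += 1
        if FAKE_SERVER[p] in PARSEABLE_MIME_TYPES:
            found.add(p)
    return found, head_requests


by_ext, ext_requests = parseable_by_extension(FAKE_SERVER)
by_mime, mime_requests = parseable_by_mime(FAKE_SERVER)
print(sorted(by_ext), ext_requests)
print(sorted(by_mime), mime_requests)
```

In this model the MIME-based check also finds `/listing` and `/feed`, which the extension check misses, at the cost of one extra request per file.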
