[Top][All Lists]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] wget source code about -p option

From: Ángel González
Subject: Re: [Bug-wget] wget source code about -p option
Date: Wed, 26 Jun 2013 01:13:59 +0200
User-agent: Thunderbird

On 26/06/13 00:20, Yiwei Yang wrote:
   I'm trying to understand how wget -p find out "everything that supports
the web page" to be downloaded . Could someone refer to me where I could
find this part of code in wget source code? Thank you!

See src/html-url.c
The interesting tags are described at known_tags (line 91), whose attributes
are listed at tag_url_attributes (line 140).
The real work is done by src/html-parse.c, but you can treat it as a black box
implementing map_html_tags().
get_urls_html calls map_html_tags(), which then calls collect_tags_mapper()
for each tag we marked.


reply via email to

[Prev in Thread] Current Thread [Next in Thread]