bug-wget

Re: [Bug-wget] handling javascript links


From: Micah Cowan
Subject: Re: [Bug-wget] handling javascript links
Date: Wed, 24 Dec 2008 10:34:50 -0800
User-agent: Thunderbird 2.0.0.18 (X11/20081125)

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Andre Majorel wrote:
> On 2008-12-16 10:08 -0500, Allan Spiegel wrote:
> 
>> Is there any way to convince wget to follow links in JavaScript? For
>> example, I have a lot of links like:
>>
>> <a href="javascript:openPopupWindow('help/login_help.html',520,380)">Help</a>
>>
>> It would be great if wget could either
>>
>> - be told to treat anything that looks like a URL inside an href
>>   attribute as an ordinary link, or
>> - be provided with a set of sed scripts to run against the JavaScript
>>   that spit out links
> 
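
The first option above can be approximated outside wget. What follows is
only a minimal sketch in Python, not anything wget provides: the names
JsLinkExtractor, extract_js_links and QUOTED_PATH are hypothetical, and
the rule "a quoted string ending in a common page extension is a link" is
an assumed heuristic that will miss or misread many real scripts.

```python
import re
from html.parser import HTMLParser

# Assumed heuristic (not wget behaviour): a quoted string that ends in a
# common page extension is treated as a link target.
QUOTED_PATH = re.compile(r"""['"]([^'"]+\.(?:html?|php|pdf))['"]""", re.IGNORECASE)


class JsLinkExtractor(HTMLParser):
    """Collect URL-like strings found inside javascript: href values."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            # Only inspect href attributes that carry a javascript: pseudo-URL.
            if name == "href" and value and value.lower().startswith("javascript:"):
                self.links.extend(QUOTED_PATH.findall(value))


def extract_js_links(html):
    """Return the candidate links found in javascript: hrefs, in page order."""
    parser = JsLinkExtractor()
    parser.feed(html)
    return parser.links
```

Fed the anchor quoted above, extract_js_links should return the single
path help/login_help.html, which could then be handed to wget as an
ordinary relative URL.
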
> Wget should have a way to filter all href= and/or src= through an
> arbitrary command. That would solve in one go two classes of
> problems:
> - with commands like grep, that of specifying what links are to be
>   followed in a more flexible way than -A, -R, -np and friends,
> - with commands like awk and sed, that of rewriting links (like
>   you are proposing).
> 
> The cost would be one fork/exec per page. IMHO quite acceptable,
> given the very high power/development-time ratio.

We plan to do this at some point. 1.13, maybe.
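
Until then, the filter idea quoted above can be tried in a wrapper
script. The following is a rough sketch in Python under stated
assumptions, not a description of any wget option: filter_links and
PASS_THROUGH are hypothetical names, the links for one page are assumed
to be already extracted, and the filter is assumed to read one link per
line on stdin and write the links to keep (possibly rewritten) on stdout.
It spawns one child process per page, matching the cost estimated above.

```python
import subprocess
import sys

# Hypothetical identity filter, used here so the sketch does not depend on
# grep, sed or awk being installed. It copies stdin to stdout unchanged.
PASS_THROUGH = [sys.executable, "-c",
                "import sys; sys.stdout.write(sys.stdin.read())"]


def filter_links(links, command=PASS_THROUGH):
    """Pipe one page's links through `command`, one link per line.

    `command` is an argument list such as ["grep", "help/"] or
    ["sed", "s/^http:/https:/"]. The lines it prints are returned as the
    links to follow, so it can both select and rewrite them.
    """
    if not links:
        return []
    result = subprocess.run(
        command,
        input="\n".join(links) + "\n",
        capture_output=True,
        text=True,
        check=False,  # grep exits with status 1 when nothing matches
    )
    # Drop empty lines so a filter that prints nothing yields no links.
    return [line for line in result.stdout.splitlines() if line]
```

With a grep-style command this covers the "which links to follow" case,
and with a sed- or awk-style command the rewriting case; error handling
for a filter that fails outright is left out of the sketch.
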

- --
Micah J. Cowan
Programmer, musician, typesetting enthusiast, gamer.
GNU Maintainer: wget, screen, teseq
http://micah.cowan.name/
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.9 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iEYEARECAAYFAklSgMoACgkQ7M8hyUobTrE8SACdHOO2tOvzyGFOImwj3bkpmjgs
3VkAnjhOkf8fJ28vNPCvq9GwJ0DVzzM7
=jJBf
-----END PGP SIGNATURE-----



