bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] download that require login/password form submit


From: Voytek Eymont
Subject: Re: [Bug-wget] download that require login/password form submit
Date: Sat, 10 Apr 2010 01:42:59 +1000 (EST)
User-agent: SquirrelMail/1.5.1

<quote who="Keisial">
> Voytek Eymont wrote:

> Just surround it with double " " or single ' ' quotes.
> If & is not quoted your shell thinks you want to execute a program called
> wget and then assign active_list to a shell variable called _module (if
> there wasn't a = it would try to run a program called _module, which would
> give you an error message you could notice)


thnaks, Keisial

>> next question: the resulting file has lots and lots of bumpf like
>> space.gif galore, etc,
>> how do I make into text as much as possible, is there a wget function,
>> or ?

> Remove anything between < and >, then unescape the entities. That should
> give you quite clean text with a minimal effort.

how to remove, what with ? how to unescape ?

meanwhile, I tried 'links -dump', that seems to give reasonable results




-- 
Voytek





reply via email to

[Prev in Thread] Current Thread [Next in Thread]