bug-wget

[Bug-wget] ot: extracting content from d/l pages


From: voytek
Subject: [Bug-wget] ot: extracting content from d/l pages
Date: Tue, 7 Jan 2014 14:40:21 +1100
User-agent: SquirrelMail/1.5.2 [SVN]

thanks for all the tips, snippets and encouragement!

I eventually succeeded in 'pushing buttons' and 'got in', and got my page

this is somewhat off topic, but perhaps there is an option that can help me?

I have something like:

wget -O page.html url
links -dump page.html > page.txt

that worked well until the server got redeveloped

when I run the script, page.html DOES contain the desired data, BUT page.txt does NOT

looking at page.html, it has something like [1]:

readonly? is this some sort of attempt to prevent copying of data?

thanks for any pointers

[1]/snip/
<label class="pfbc-label">Suburb</label><input type="text"
name="SYS_Addresses_e_address_i_0_e_district_tx" value="SYDNEY"
readonly="readonly" class="ro pfbc-textbox"/>

<label class="pfbc-label">State</label><input type="hidden" value="NSW"
name="SYS_Addresses_e_address_i_0_e_state_cd"><input type="text"
name="SYS_Addresses_e_address_i_0_e_state_cd_d" value="NSW"
readonly="readonly" class="ro pfbc-textbox"/>

<label class="pfbc-label">Postcode</label><input type="text"
name="SYS_Addresses_e_address_i_0_e_postcode_tx" value="2000"
readonly="readonly" class="ro pfbc-textbox"/>
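
A note on the snippet above: readonly="readonly" only stops a browser user from editing the field; it does not hide the data, which sits in each value attribute. Whether a text dump of the page shows those attribute values depends on the renderer. One possible workaround, sketched here in Python with the standard html.parser module, is to read the labels and value attributes straight out of page.html. The names LabelledInputParser, extract_fields and page_to_text are made up for illustration, and skipping type="hidden" inputs is an assumption based only on the three fields shown above.

```python
from html.parser import HTMLParser


class LabelledInputParser(HTMLParser):
    """Pair each <label> text with the value of the next visible <input>."""

    def __init__(self):
        super().__init__()
        self.in_label = False   # True while inside <label>...</label>
        self.label = ""         # text of the most recent label
        self.fields = []        # collected (label, value) pairs

    def handle_starttag(self, tag, attrs):
        # Self-closing tags such as <input ... /> also arrive here,
        # because the default handle_startendtag calls handle_starttag.
        if tag == "label":
            self.in_label = True
            self.label = ""
        elif tag == "input":
            attributes = dict(attrs)
            # Assumption: hidden inputs duplicate a visible field, so skip them.
            if attributes.get("type") != "hidden":
                value = attributes.get("value") or ""
                self.fields.append((self.label.strip(), value))

    def handle_endtag(self, tag):
        if tag == "label":
            self.in_label = False

    def handle_data(self, data):
        if self.in_label:
            self.label += data


def extract_fields(html):
    """Return a list of (label, value) tuples found in the HTML text."""
    parser = LabelledInputParser()
    parser.feed(html)
    parser.close()
    return parser.fields


def page_to_text(html):
    """Format the extracted fields as 'Label: value' lines."""
    return "\n".join(f"{label}: {value}" for label, value in extract_fields(html))
```

To use it after the wget step, something like page_to_text(open("page.html").read()) could be written out to page.txt in place of the links -dump output. If the real page has inputs without a preceding label, the label part of the line would repeat the last label seen or be empty, so the pairing logic may need adjusting.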





