bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] question related to download file in Plant Journal


From: Ray Satiro
Subject: Re: [Bug-wget] question related to download file in Plant Journal
Date: Tue, 18 Jan 2011 10:24:17 -0800 (PST)

--- On Mon, 1/17/11, jinxiang wang <address@hidden> wrote:
[...]
> 
>    My problem is that I just download the
> html file not the PDF file. But if
> I use firefox, I can save the linked file to PDF file. So
> please help me.
> Thank you in advance!
> 
> Jinxiang
> 

Hello,

You have to login to download PDF files from that site. Please review the FAQ:
http://wget.addictivecode.org/FrequentlyAskedQuestions?action=show&redirect=Faq#How_do_I_use_wget_to_download_pages_or_files_that_require_login.2BAC8-password.3F

As it says the easiest way is to use your browser's plaintext cookies file 
after logging in. If you are using Firefox 3 or later you'll have to export 
your cookies to plaintext. There is an extension available that will export 
your cookies:
https://addons.mozilla.org/en-US/firefox/addon/cookie-exporter/
That extension will put an 'Export Cookies' entry in one of your menus; the 
menu labeled Tools or its chinese equivalent. See the screenshot on the page.

Then invoke like this (all one line):
wget --load-cookies=cookies.txt 
http://onlinelibrary.wiley.com/doi/10.1111/j.1365-313X.2010.04411.x/pdf

If it fails you might have to pass in your user agent and/or referer as well 
and invoke like this (all one line):
wget --load-cookies=cookies.txt --user-agent="Mozilla/5.0 (Windows; U; Windows 
NT 5.1; en-US; rv:1.9.2.13) Gecko/20101203 Firefox/3.6.13" 
--referer="http://onlinelibrary.wiley.com/doi/10.1111/j.1365-313X.2010.04411.x/full";
 "http://onlinelibrary.wiley.com/doi/10.1111/j.1365-313X.2010.04411.x/pdf";






reply via email to

[Prev in Thread] Current Thread [Next in Thread]