bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] Guidance needed


From: Ronald F. Guilmette
Subject: Re: [Bug-wget] Guidance needed
Date: Thu, 12 Nov 2015 11:09:10 -0800

In message <address@hidden>, 
Jookia <address@hidden> wrote:

>On Wed, Nov 11, 2015 at 01:07:09PM -0800, Ronald F. Guilmette wrote:
>> Thanks a lot for the reply.
>>
>> I tried both of your two suggestions and alas, neither one really
>> helped the appearance of one of the pages I've been having trouble
>> with, so I guess that I'm just out of luck on that.  I'm almost
>> totally ignorance about modern HTML, so I have to assume that
>> you're correct, and that there is some sort of JavaScript and/or
>> dynamic page generation going on.
>
>Hey there,
>
>Sorry if I didn't catch the first email you sent, but are you using
>--page-requisites ?

Well, yes,  I started out using both -p and -k.

Those options alone are good enough for some cases, but there are
other pages where they just aren't enough to get the local copy
of some of the pages I'm interested in to display properly.

>That said if there's dynamic page generation going on you might be able to have
>some luck with the 'run a web browser headless' crowd. I'm not sure if there's
>any archiving solution but some good leads are PhantomJS or the WebDriver
>(work in progress?) standard. These could be used to build one.

I dont' know the first thing about this.  What the heck is "web browser
headless"?  I guess I'll need to go do some googling.

Even if I don't find anything useful, you've given me an idea.  I suppose
that if worse comes to worse, I might just capture the pages I'm interested
in by using a regular browser and then just grabbing a .png image capture
of the browser window (as I am looking at the page of interest in an
actual browser).  That won't be nearly as good as actually being able to
save all the bits and pieces (.html, .js) needed to fully and properly
render the page in question, but it will probably do for my purposes.


Regards,
rfg



reply via email to

[Prev in Thread] Current Thread [Next in Thread]