bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Bug-wget] download inline elements from remote hosts (page requisites)


From: Jonas Gasser
Subject: [Bug-wget] download inline elements from remote hosts (page requisites)
Date: Thu, 29 Dec 2011 04:59:50 -0800 (PST)

Hello everybody

As I use wget quite often for mirrors, it's very annoying to be limited to the 
host for inline elements (without the span-host option).
So I modified recur.c with 2-3 lines

  /* 7. */
  if (schemes_are_similar_p (u->scheme, parent->scheme)) {
    if (!opt.spanhost && 0 != strcasecmp (parent->host, u->host))
      {
    if (!(opt.page_requisites && upos->link_inline_p))
    {
      DEBUGP (("This is not the same hostname as the parent's (%s and %s).\n",
                 u->host, parent->host));
      goto out;
    }
      }
  }
  /* 8. */

and I use it like:

$ ./wget --output-file=test.log -k -E -K --no-check-certificate -p -e 
robots=off -m URL

First tests are done (simple examples - inline images and iframes included from 
amazon) and seem to work.
My question : Is there interest in this and would an additional option be the 
better way?


Greets from Switzerland, Jonas




reply via email to

[Prev in Thread] Current Thread [Next in Thread]