bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] Fwd: Trying to download HTML from Google's Cache. Pls hel


From: Yan Grossman
Subject: Re: [Bug-wget] Fwd: Trying to download HTML from Google's Cache. Pls help
Date: Mon, 10 Nov 2008 20:22:20 -0800

Hi, is there anybody that can help me with that?

Anyway, I did read the manual and got a pretty good understanding of how it works. I could probably run it now pointed to a regular domain and download the files.

But in this case, I am trying to download from Googles Cache, so I can't use my domain. I think I need to go through Google domain to mine. Do you know what I mean?

here is how I see all my pages on google cache

site:www.snowbrasil.com/fotos

so you see there are about 500 pages. But I can't do a wget on www.snowbrasil.com/fotos cause those are the exact pages I lost. They are not on my server anymore. So I am trying to get from google cache.

I would like to save and download those files. The HTML files only.
As if I was going into each cached page and saving the HTML, but instead of doing one at a time I would use wget to do it in batch.

Thanks


On Mon, Nov 10, 2008 at 3:22 PM, Micah Cowan <address@hidden> wrote:
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Yan Grossman wrote:
> Hi, sorry, I didn't get it myself and didn't find on archives so I sent
> it again. Sorry, I will just wait next time.
>
> Anyway, I did read the manual and got a pretty good understanding of how
> it works. I could probably run it now pointed to a regular domain and
> download the files.
>
> But in this case, I am trying to download from Googles Cache, so I can't
> use my domain. I think I need to go through Google domain to mine. Do
> you know what I mean?
>
> here is how I see all my pages on google cache
>
> site:www.snowbrasil.com/fotos <http://www.snowbrasil.com/fotos>
>
> so you see there are about 500 pages. But I can't do a wget
> on www.snowbrasil.com/fotos <http://www.snowbrasil.com/fotos> cause
> those are the exact pages I lost. They are not on my server anymore. So
> I am trying to get from google cache.

That sounds more like a Google question than a Wget question, then.

Just find the cache pages you want, and see what Google's URL for them
is, and feed that to Wget.

> I would appreciate if you can suggest what command options to use.

That depends greatly on what you want Wget to do with your pages; I
can't really help you there without more information.

- --
Micah J. Cowan
Programmer, musician, typesetting enthusiast, gamer.
GNU Maintainer: wget, screen, teseq
http://micah.cowan.name/
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.7 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFJGMJE7M8hyUobTrERAspMAJoD9XzdbteHavQDD+2C2vxCF7DT2ACdHgsA
1M4iypEUaLMwUBNEMFT/G0w=
=dd0x
-----END PGP SIGNATURE-----





reply via email to

[Prev in Thread] Current Thread [Next in Thread]