bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-wget] Memory Leakage


From: Tim Ruehsen
Subject: Re: [Bug-wget] Memory Leakage
Date: Mon, 14 Mar 2016 16:28:51 +0100
User-agent: KMail/4.14.10 (Linux/4.4.0-1-amd64; KDE/4.14.14; x86_64; ; )

Hi Thomas,

as you say, wget stores the links to traverse in memory. Also the links 
already traversed. That sums up after a while... but what are talking about 
here ? One million links with ~100 bytes sum to ~100MB. That's not very much 
for a modern machine.

To get a better idea:
- what version of wget do you use (wget --version)
- how long is wget running resp. how many pages are referred (rough estimate) 
?
- did you run a top (with pressing 'm' to sort for memory usage) ? if not, 
please do and report memory usage and speed of eating up memory.

You could send me the real URL you are using via PM so I can try to reproduce 
the behavior here. 

Tim

On Monday 14 March 2016 10:59:25 thomas hood wrote:
>   Hey,
> 
> Using wget recently to handle large amounts of data. Finding it frequently
> eats into memory quite a bit.Mar 12 06:32:08 ubuntu kernel:
> [1439386.493486] Out of memory: Kill process 63125 (wget) score 661 or
> sacrifice childMar 12 06:32:08 ubuntu kernel: [1439386.494595] Killed
> process 63125 (wget) total-vm:4206452kB, anon-rss:3161380kB,
> file-rss:0kBThese are the parameters I'm using; wget -c -nd -b -r -e
> robots=off --header="Accept: text/html" --user-agent="Mozilla/5.0 (Windows
> NT 6.1; WOW64; rv:40.0) Gecko/20100101 Firefox/40.1" http://$URL/$3 2>&1 |
> grep '^--' 2>&1 | awk '{ print $3 }' | grep -v
> '\.\(css\|js\|png\|gif\|jpg\|JPG\)$' &Doing a bit of research on Google
> I've found other people have experienced this in the past one or two years.
> Can anyone shed light on the cause? Is it just wget storing the relative
> links it has to traverse in main memory? Obviously there's ways to manage
> that but I'm unsure if it's an issue of memory management.
> 
> Regards,Thomas Hood

Attachment: signature.asc
Description: This is a digitally signed message part.


reply via email to

[Prev in Thread] Current Thread [Next in Thread]