bug-wget
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Bug-wget] possible bug? pathconf: Not a directory


From: Ed Summers
Subject: [Bug-wget] possible bug? pathconf: Not a directory
Date: Fri, 5 Jun 2015 12:23:38 -0400

Hi all,

While using wget to mirror and warc a website I noticed a bunch of errors like 
this in my log:

pathconf: Not a directory

I narrowed it down to a particular request for a set of resources, e.g.:

http://politwoops.sunlightfoundation.com/tweet/599258910776754176

which have page requisites like:

http://politwoops.sunlightfoundation.com/tweet/599258910776754176/thumb/599258910776754176-0.jpg

It seems that the fetch for the HTML creates a file at:

politwoops.sunlightfoundation.com/tweet/599258910776754176

But then the fetch for the image is failing because that path is already  file 
and not a directory, so there is nowhere to save the jpg?

Here’s a command you can use to see for replicating the error:

wget --page-requisites 
http://politwoops.sunlightfoundation.com/tweet/599258910776754176

At first I didn’t mind because I actually can make do with just the WARC file. 
But it seems that the representation is not written to the WARC after 
encountering the write error.

Is this a known bug, or perhaps I’ve overlooked a wget option that will help 
here.

Thanks for any assistance you can provide. And since this is my first time 
writing to bug-wget, thanks for an incredibly useful tool!

//Ed

Attachment: signature.asc
Description: Message signed with OpenPGP using GPGMail


reply via email to

[Prev in Thread] Current Thread [Next in Thread]