[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: missing something to download mp3 files from host
From: |
Tim Rühsen |
Subject: |
Re: missing something to download mp3 files from host |
Date: |
Mon, 26 Dec 2022 14:28:35 +0100 |
User-agent: |
Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.6.0 |
On 17.12.22 20:32, Paolo Dista via Primary discussion list for GNU Wget
wrote:
Hello,
I am trying to download all the radio programmes from this page:
https://www.radiofrance.fr/personnes/gilles-deleuze with:
wget -r -l 1 -H -nd -np -A '*.mp3' -D media.radiofrance-podcast.net
https://www.radiofrance.fr/personnes/gilles-deleuze
In vain.
The terminal prints this:
--2022-12-17 20:06:36-- https://www.radiofrance.fr/personnes/gilles-deleuze
Resolving www.radiofrance.fr (www.radiofrance.fr)... 23.210.120.113,
2a02:26f0:300:192::3658, 2a02:26f0:300:1a7::3658
Connecting to www.radiofrance.fr (www.radiofrance.fr)|23.210.120.113|:443...
connected.
HTTP request sent, awaiting response... 200 OK
Length: unspecified [text/html]
Saving to: ‘gilles-deleuze.tmp’
gilles-deleuze.tmp [ <=> ] 157.09K --.-KB/s in 0.1s
2022-12-17 20:06:36 (1.21 MB/s) - ‘gilles-deleuze.tmp’ saved [160862]
Removing gilles-deleuze.tmp since it should be rejected.
FINISHED --2022-12-17 20:06:36--
Total wall clock time: 0.3s
Downloaded: 1 files, 157K in 0.1s (1.21 MB/s)
Could you please help me see what I am missing to?
As random.coder wrote, the URLs are part of Javascript.
But it's possible to grep the URLs from there.
You can do this in several (very hacky) steps:
wget https://www.radiofrance.fr/personnes/gilles-deleuze
cat gilles-deleuze | sed 's/http/\nhttp/g' | grep ^http | cut -d'\' -f1
| grep mp3 >mp3.list
wget -i mp3.list
Regards, Tim
OpenPGP_signature
Description: OpenPGP digital signature