bug-wget
Re: [Bug-wget] Fwd: Regarding wget to download webpage


From: Bhargavi N
Subject: Re: [Bug-wget] Fwd: Regarding wget to download webpage
Date: Tue, 31 Jan 2012 10:06:33 +0530

Hello Henrik,

   Thanks Thanks Thanks a LOT!!

Thanks to the entire wget team for helping me so generously, even with your
busy schedules.

The issue is now resolved.

I used the Live HTTP Headers add-on as you suggested, and it really helped me.

Initially I passed all the headers displayed in the capture window to wget
with the "--header" option.

Later I understood that the session details have to be captured.

I used wget's "--keep-session-cookies" option, since the session details
have to be saved.

I saved the session details in cookies.txt, which I later used to load the
ad hoc query page and run the query.

Below are the commands I used to get the query results.

wget -O logon.html --keep-session-cookies --save-cookies cookies.txt \
     --post-data 'loginID=mylogon&password=mypasswd' \
     https://www.collabnet.timeinc.net

wget -O query_res.html --load-cookies cookies.txt \
     "https://www.collabnet.timeinc.net/servlets/AdHocQuery?query=select+count%28*%29+from+HELM_USER&Button=Run+query"

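The query string in the second command has to be URL-encoded by hand, since wget does not encode it itself. As a rough sketch only (this is not something that was run as part of the commands above), the encoding and the wget argument list could be generated with Python's standard library. The helper names build_query_url and build_wget_command are made up for illustration; the form fields "query" and "Button" are taken from the URL above, and the file names are the ones used in the commands.

```python
from urllib.parse import urlencode
import shlex

# Servlet URL copied from the second wget command above.
BASE_URL = "https://www.collabnet.timeinc.net/servlets/AdHocQuery"


def build_query_url(sql, base_url=BASE_URL):
    """Return base_url with the SQL text form-encoded into the query string."""
    # urlencode() form-encodes each value: spaces become '+',
    # reserved characters such as '(' and ')' become %XX escapes.
    params = {"query": sql, "Button": "Run query"}
    return base_url + "?" + urlencode(params)


def build_wget_command(sql, cookie_file="cookies.txt", out_file="query_res.html"):
    """Return the wget argument list that runs one query with saved cookies."""
    return ["wget", "-O", out_file, "--load-cookies", cookie_file,
            build_query_url(sql)]


if __name__ == "__main__":
    # Print a shell-quoted command line instead of running it,
    # so it can be inspected before it is sent to the server.
    print(shlex.join(build_wget_command("select count(*) from HELM_USER")))
```

Note that urlencode() writes "*" as %2A, while the hand-written URL above leaves it literal; both forms should decode to the same character on the server side.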

Special thanks to you and the whole team for taking the time to help me
with this issue.

Please let me know if I can be part of your team or be of any help to
you.

HAVE A NICE DAY!!

With wishes,
Bhargavi

On Tue, Jan 24, 2012 at 3:20 PM, Henrik Holst
<address@hidden> wrote:

> I looked at your web site and it does not perform standard HTTP
> authentication, so --user and --password cannot be used to log on to
> that page.
>
> You have to supply the username and password using --post-data there
> as well. If you had followed my advice to use the Live HTTP Headers
> extension with Firefox, you would have seen exactly what to do. Please
> use that tool and learn some HTTP basics, and you will soon work out how
> to do what you want. Since we do not have access to that site of yours
> (no username or password), we as a community will have a hard time
> telling you exactly how to proceed, because we cannot test things at
> our end.
>
> Anyway, as I wrote, I tested a logon attempt against the site, and with
> the Live HTTP Headers extension active I could see that the
> authentication should be performed like this:
>
> wget --post-data "detour=https%3A%2F%2Fwww.collabnet.timeinc.net%2F&loginID=username&password=password&Login=Login" \
>      --save-cookies cookies.txt "https://www.collabnet.timeinc.net/servlets/TLogin"
>
> Replace the "username" and "password" in the post data with your account
> details.
>
> However, since I have no account on that site of yours, I do not know
> whether this really works, whether the detour=xxx part is really needed,
> or whether you also have to add a Referer: header to the request.
>
> So if I were you, I would:
> 1. Install the Live HTTP Headers extension in Firefox
> 2. Open its capture window
> 3. Perform a logon and an SQL query
> 4. Study the results in the capture window and try to mimic what you see
> with wget.
>
> /HH
>
>
> On Tue, 2012-01-24 at 10:11 +0530, Bhargavi N wrote:
> > Hello Henrik,
> >
> >
> >    I am saving the logon details in the cookie file the same way you
> > described.
> >
> >
> > But the cookie file looks empty.
> >
> >
> > Next I loaded cookies.txt with the
> > URL: https://www.collabnet.timeinc.net/servlets/AdHocQuery
> >
> >
> > I saved the results page and then opened it in Firefox, but I get the
> > same collabnet logon page instead of the results page.
> >
> >
> > Steps to run the query:
> >
> >
> > 1) I need to log on to the collabnet website:
> > https://www.collabnet.timeinc.net
> >
> >
> > 2) The collabnet website has an option to run a query:
> >
> >
> > https://www.collabnet.timeinc.net/servlets/AdHocQuery
> >
> >
> > I need to go to this page and enter the SQL query in the query text
> > area.
> >
> >
> > Then I need to click the "Run Query" button, which submits the query.
> >
> >
> > 3) Next I need to download the results page to my local directory on
> > the Linux box.
> >
> >
> > Please help me regarding this.
> >
> >
> > Thanks !
> >
> >
> > Regards,
> > Bhargavi
> >
> >
> > On Mon, Jan 23, 2012 at 12:43 PM, Henrik Holst
> > <address@hidden> wrote:
> >         Ok,
> >
> >           most probably the first site (where you logon) returns a
> >         cookie which you must present to the other site (where you
> >         perform the sql-query). So
> >
> >         wget --user xx --password yy "https://collabnet website" \
> >              -O /dev/null --save-cookies cookies.txt
> >
> >         That will logon and save the resulting cookie in the
> >         cookies.txt file. Next you have to send this cookie and your
> >         query to the other site:
> >
> >         wget --load-cookies cookies.txt "https://the query site" \
> >              --post-data "the query" -O result
> >
> >         And the result should be in the result file.
> >
> >         Now, as Angel Gonzales wrote, you probably have to send the
> >         query as a form field, like
> >         --post-data "query=Select%20*%20From%20table", where the SQL
> >         query has to be URL-encoded (something that wget cannot do for
> >         you). That mostly means replacing all spaces with %20 and all
> >         occurrences of +, &, " and ' with their respective %hex codes.
> >
> >         An easy way to see exactly which form field to use in the
> >         post-data is to install the Live HTTP Headers extension in
> >         Firefox, open its capture window and perform the query on the
> >         site using Firefox. You will then see the exact request: what
> >         the GET string is, what the post-data is, etc. It is a very
> >         good place to start from.
> >
> >
> >         /HH
> >
> >         2012/1/20 Bhargavi N <address@hidden>
> >                 I meant I am unable to log on to the website.
> >
> >
> >                 Sorry to confuse you with a lot of questions.
> >
> >
> >                 I have to log on to the collabnet website first, providing
> >                 a username and password.
> >
> >
> >                 Next I want to go to the servlet page where I can provide
> >                 an SQL query in the text area of the form.
> >
> >
> >                 Then I need to submit the form and download the
> >                 results page to my local directory on the Linux box.
> >
> >
> >                 I will be running the wget command on the Linux box.
> >
> >
> >                 Please help me regarding this.
> >
> >
> >                 Thanks !
> >
> >
> >                 Regards,
> >                 Bhargavi
> >
> >
> >
> >                 On Fri, Jan 20, 2012 at 7:20 PM, Bhargavi N
> >                 <address@hidden> wrote:
> >                         Hi  Henrik,
> >
> >
> >                            Thanks for the reply.
> >
> >
> >                         But when I run it as you suggested, I get
> >                         the login page of the collabnet website
> >                         instead of the servlet page where I can enter
> >                         SQL.
> >
> >
> >                         Please help me!
> >
> >
> >                         I am unable to log on to that page in the first
> >                         place.
> >
> >
> >                         Thanks!
> >
> >
> >                         Regards,
> >                         Bhargavi
> >
> >
> >                         On Fri, Jan 20, 2012 at 4:44 PM, Henrik Holst
> >                         <address@hidden> wrote:
> >                                 Well, it normally should work with
> >                                 this:
> >                                   wget "https://collabnet.net/servlets/adhocquery" \
> >                                        --user="username" --password="password" \
> >                                        --post-data "select * from emp" -O output_filename
> >
> >                                 But then, according to the man page,
> >                                 wget only supports post-data that is
> >                                 in urlencoded format like
> >                                 "key=value&key2=value2", so wget might
> >                                 not support SQL queries at all.
> >                                 Perhaps cURL works better for your use
> >                                 case.
> >
> >                                 /HH
> >
> >
> >                                 2012/1/20 Bhargavi N
> >                                 <address@hidden>
> >                                         Including the team to this
> >                                         mail chain..
> >
> >                                         ---------- Forwarded message
> >                                         ----------
> >                                         From: Bhargavi N
> >                                         <address@hidden>
> >                                         Date: Fri, Jan 20, 2012 at
> >                                         4:30 PM
> >                                         Subject: Re: [Bug-wget]
> >                                         Regarding wget to download
> >                                         webpage
> >                                         To: Dayanand Kumbhar
> >                                         <address@hidden>,
> >                                         address@hidden,
> >                                         address@hidden
> >
> >
> >
> >                                         Hello Everyone,
> >
> >                                          I am extremely thankful to
> >                                         all of you for the help
> >                                         regarding wget, but I am
> >                                         still unable to get the
> >                                         right page downloaded.
> >
> >                                         Requirement:
> >
> >                                          I need to run an ad hoc SQL
> >                                         query on a remote web page,
> >                                         i.e. collabnet. The web page
> >                                         has a text area where I can
> >                                         enter an SQL query and then
> >                                         click submit.
> >
> >                                         Once I submit, the SQL runs
> >                                         and the results are displayed
> >                                         on the page.
> >
> >                                         This remote page is a call to
> >                                         a servlet, for example
> >                                         https://collabnet.net/servlets/adhocquery
> >
> >                                         I want to pass the SQL query
> >                                         as form data through the
> >                                         POST/GET method and get the
> >                                         SQL results page downloaded to
> >                                         my local directory on the
> >                                         Linux box. I am running wget
> >                                         on a Linux host, from the
> >                                         command line or a shell script.
> >
> >                                         I will invoke wget with the
> >                                         URL
> >                                         https://collabnet.net/servlets/adhocquery
> >                                         and the form data, i.e. the
> >                                         SQL, as
> >                                         --post-data "select * from emp".
> >                                         Finally I want to download the
> >                                         SQL query results page to my
> >                                         local directory.
> >
> >                                         I tried all the options that
> >                                         you suggested, but I am still
> >                                         unable to get the right page.
> >                                         Every time it displays the
> >                                         index page for collabnet,
> >                                         which I do not need. I need to
> >                                         run the servlet page with the
> >                                         form data passed to it.
> >
> >                                         Then I want to get the results
> >                                         page downloaded to my local
> >                                         directory on my Linux host.
> >
> >                                         Please help me regarding this.
> >
> >                                         Again thanking all of you for
> >                                         your reply and helping me out.
> >
> >                                         Thanks !
> >
> >                                         Regards,
> >                                         Bhargavi
> >
> >                                         On Wed, Jan 18, 2012 at 12:40
> >                                         PM, Dayanand Kumbhar
> >                                         <address@hidden> wrote:
> >
> >                                         > Hi
> >                                         >
> >                                         > I am assuming you have to
> >                                         > post the query string.
> >                                         > You can use this:
> >                                         >
> >                                         > wget -qO- http://example.com/page.php --post-data "SQL QUERY" | cat
> >                                         >
> >                                         >
> >                                         > On Wed, Jan 18, 2012 at
> >                                         12:24 PM, Bhargavi N
> >                                         <address@hidden> wrote:
> >                                         >
> >                                         >> Hello Team,
> >                                         >>
> >                                         >>   My name is Bhargavi and I
> >                                         >> have been working on UNIX for
> >                                         >> over 7 years.
> >                                         >>
> >                                         >> I work on a project that
> >                                         >> needs to download results from
> >                                         >> a remote web page.
> >                                         >> This web page has an ad hoc SQL
> >                                         >> querying option. I need to run
> >                                         >> an SQL query
> >                                         >> on the web page and get the
> >                                         >> results downloaded using wget.
> >                                         >>
> >                                         >> How can I do that?
> >                                         >>
> >                                         >> Please suggest a way to
> >                                         >> download the results page
> >                                         >> using wget.
> >                                         >>
> >                                         >> Thanks !
> >                                         >>
> >                                         >> Warm Regards,
> >                                         >> Bhargavi
> >                                         >>
> >                                         >
> >                                         >
> >                                         >
> >                                         > --
> >                                         > Dayanand Kumbhar
> >                                         >

