webbot -title file-title.log

Jan Hutař
Hello,
hopefully this is my last question for some time :) When running:

webbot -n -depth 10 -prefix http://localhost/test/ -title file-title.log http://localhost/test/index.html

I get a file with only <none> fields, like this one:

<none> `<none>' http://localhost/test/index.html

But when I add the -format or -charset option:

webbot -n -depth 10 -prefix http://localhost/test/ -format del_me.log -title file-title.log http://localhost/test/index.html

I get the right results, like:

<none> `This is index' http://localhost/test/index.html

Is it OK?
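
To compare the two runs without reading the logs by eye, the title log can be scanned for entries whose title is still the placeholder. This is only a sketch: the line shape (a first field, a title between a backtick and an apostrophe, then the URL) is inferred from the two sample lines above, not from any webbot documentation, and the helper name untitled_urls is hypothetical.

```python
import re

# Assumed line shape, inferred only from the two sample lines in this post:
#   <first-field> `<title>' <url>
LINE_RE = re.compile(r"^(?P<first>\S+) `(?P<title>.*)' (?P<url>\S+)$")


def untitled_urls(lines):
    """Return the URLs whose title field is the literal placeholder <none>."""
    missing = []
    for line in lines:
        match = LINE_RE.match(line.strip())
        # Lines that do not fit the assumed shape are skipped, not reported.
        if match and match.group("title") == "<none>":
            missing.append(match.group("url"))
    return missing


if __name__ == "__main__":
    sample = [
        "<none> `<none>' http://localhost/test/index.html",
        "<none> `This is index' http://localhost/test/index.html",
    ]
    print(untitled_urls(sample))
```

Run against file-title.log from each command, a non-empty result for the first run and an empty one for the second would confirm the difference described above.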

Regards,
Jan



--
Jan Hutar   Quality Assurance Engineer
[hidden email]        Red Hat, Inc.

Re: webbot -title file-title.log

Vic Bancroft-2

Jan Hutař wrote:

>hopefully this is my last question for some time :)
>
No, please feel free to ask about the behavior of the robots and such.

>When running:
>
>webbot -n -depth 10 -prefix http://localhost/test/ -title file-title.log http://localhost/test/index.html
>
>I get a file with only <none> fields, like this one:
>
><none> `<none>' http://localhost/test/index.html
>  
>
What would you expect to see?

>But when I add the -format or -charset option:
>
>webbot -n -depth 10 -prefix http://localhost/test/ -format del_me.log -title file-title.log http://localhost/test/index.html
>
>I get the right results, like:
>
><none> `This is index' http://localhost/test/index.html
>
>Is it OK?
>  
>
Is there some change in the code or documentation that might make it
more intuitive?

more,
l8r,
v

--
"The future is here. It's just not evenly distributed yet."
 -- William Gibson, quoted by Whitfield Diffie


Re: webbot -title file-title.log

Jan Hutař
Hello,
I guess there is the same issue with the "-lm" option: I have to add the "-format" or "-charset" option to get any output. The command

webbot -n -depth 10 -prefix http://localhost.localdomain/test/ -lm file-lm.log http://localhost.localdomain/test/index.html

lists only the robots.txt file, while

webbot -n -depth 10 -prefix http://localhost.localdomain/test/ -format del_me.log -lm file-lm.log http://localhost.localdomain/test/index.html

lists all files.

Regards,
Jan



--
Jan Hutar   Quality Assurance Engineer
[hidden email]        Red Hat, Inc.

Re: webbot -title file-title.log

Vic Bancroft-2

Jan Hutař wrote:

>Hello,
>I guess there is the same issue with the "-lm" option: I have to add the "-format" or "-charset" option to get any output. The command
>
>webbot -n -depth 10 -prefix http://localhost.localdomain/test/ -lm file-lm.log http://localhost.localdomain/test/index.html
>
>lists only the robots.txt file, while
>
>webbot -n -depth 10 -prefix http://localhost.localdomain/test/ -format del_me.log -lm file-lm.log http://localhost.localdomain/test/index.html
>
>lists all files.
>  
>
I am not sure how keeping all the metadata around in the anchors has
this effect; nevertheless, the last-modified (-lm) option is fixed in the
new revision 1.17 of RobotMain.c.

Hmm, looking at the code, it would appear that -hit, -relation and
-rellog may have the same issue.
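
For builds that predate the fix, the workaround reported in this thread (adding -format with a throwaway log file) could be applied mechanically. The following is a rough sketch under stated assumptions, not part of webbot: the helper name with_workaround and the scratch file name are hypothetical, only -title and -lm were actually observed to need the extra option, and -hit, -relation and -rellog are included purely on the suspicion voiced above.

```python
# Options reported in this thread as producing empty output on their own.
# -title and -lm were observed; -hit, -relation and -rellog are only suspected.
AFFECTED = {"-title", "-lm", "-hit", "-relation", "-rellog"}

# Options whose presence made the logs come out right in the reports above.
TRIGGERS = {"-format", "-charset"}


def with_workaround(argv, scratch_log="del_me.log"):
    """Return a copy of argv with '-format <scratch_log>' inserted before the
    first affected option, unless -format or -charset is already present."""
    argv = list(argv)
    if any(arg in TRIGGERS for arg in argv):
        return argv
    for index, arg in enumerate(argv):
        if arg in AFFECTED:
            return argv[:index] + ["-format", scratch_log] + argv[index:]
    # No affected option found: leave the command line unchanged.
    return argv


if __name__ == "__main__":
    command = (
        "webbot -n -depth 10 -prefix http://localhost/test/ "
        "-title file-title.log http://localhost/test/index.html"
    ).split()
    print(" ".join(with_workaround(command)))
```

The extra option is placed just before the first affected one only because that mirrors the command lines quoted in this thread; whether option order matters to webbot is not something the thread establishes.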

more,
l8r,
v


--
"The future is here. It's just not evenly distributed yet."
 -- William Gibson, quoted by Whitfield Diffie