[ILUG] Re: Grabbing text from a web site

Sean Rima sean at tcob1.net
Fri Apr 23 22:00:28 IST 2004


ajh writes:

> * Sean Rima (sean at tcob1.net) [040423 18:55]:
>> I am trying to grab text from a web site but using lynx --dump is not the 
>> answer. I ant to grab text that starts after a <!-------------Text Starts 
>> and finishes with a <!-------------Test End
>> 
>> Is this possible
> 
> Depends. Can you use perl? Have you looked at the perl module
> www::mechanise? Ideal. Take a look at
> http://search.cpan.org/~petdance/WWW-Mechanize-1.02/
> 
> Ideal for this kind of stuff, maybe a little overkill for what you are
> doing though.
> 

I will have a look at it and see if I can work it out :)

Sean

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 226 bytes
Desc: not available
Url : http://mail.linux.ie/pipermail/ilug/attachments/20040423/907217be/attachment.pgp


More information about the ILUG mailing list