[ILUG] Gawk query

Brendan Halpin brendan.halpin at ul.ie
Thu Aug 23 20:31:03 IST 2007


Brian Foster <blf at utvinternet.ie> writes:

>  after a bit of head-scratching, the easiest approach
>  seems to be a bit of pre-processing; that is, make
>  the two types of spaces unique.

Frankly, the easiest approach is to bite the bullet and go for a
regexp approach. Digging out a bit of old Perl code that did this
job (if for slightly differently formatted log lines):

while (<STDIN>) {
    /^([0-9.]+) ([^ ]) ([^ ]) (\[.+\]) \"([^\"]+)\" ([0-9]+) ([0-9]+) \"([^\"]+)\" \"([^\"]+)\"/;

    $hitter = $1;
    $datestr = $4;
    $httpcmd = $5;
    $httpres = $6;
    $size = $7;
    $referer = $8;
    $agentid = $9;

}

Brendan
-- 
Brendan Halpin,  Department of Sociology,  University of Limerick,  Ireland
Tel: w +353-61-213147 f +353-61-202569 h +353-61-338562; Room F2-025 x 3147
mailto:brendan.halpin at ul.ie  http://www.ul.ie/sociology/brendan.halpin.html



More information about the ILUG mailing list