close Warning: Can't synchronize with repository "(default)" (Unsupported version control system "svn": /usr/lib/python2.7/dist-packages/libsvn/_core.so: failed to map segment from shared object: Cannot allocate memory). Look in the Trac log for more information.

Changes between Version 6 and Version 7 of jazz/13-08-14


Ignore:
Timestamp:
Aug 14, 2013, 10:38:43 PM (11 years ago)
Author:
jazz
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • jazz/13-08-14

    v6 v7  
    22
    33 * http://blog.cloudera.com/blog/2009/06/analyzing-apache-logs-with-pig/
     4
    45 *
     6
    57{{{
    68#!html
     
    911
    1012 *
     13
    1114{{{
    1215#!html
     
    1518
    1619 *
     20
     21{{{
     22# pig -x mapreduce -f scripts/blogparse.pig -param LOGS='/mirror.cloudera.com/logs/access_log.*'
     23}}}
     24
     25 *
     26
    1727{{{
    1828#!html
     
    2131<p>while (<>) {<br /> chomp;<br /> if (/([^\t]*)\t(.*)/) {<br /> my ($ip, $rest) = ($1, $2);<br /> my ($country_code, undef, $country_name, $region, $city)<br /> = $gi->get_city_record($ip);<br /> print join("\t", $country_code||'', $country_name||'',<br /> $region||'', $city||'', $ip, $rest), "\n";<br /> }<br /> }<br />
    2232}}}
     33
    2334 *
     35
    2436{{{
    2537#!html