Analog
The most popular logfile analyser in the world.
LOGFILE /path/to/access_log
OUTFILE /path/to/analog/reports/report.html
HOSTNAME "geekfarm.org"
HOSTURL "http://www.geekfarm.org/"
IMAGEDIR "http://www.geekfarm.org/images/"
LOGO "mylogo.jpg"
STYLESHEET myanalogstyle.css
analog -G +gmyconfig.cfg
# expand subdirs of /wu/
SUBDIR /wu/*
# collapse all GET variations of a cgi
FILEALIAS /cgi-bin/script.pl* /cgi-bin/script.pl
HOSTEXCLUDE 207.224.86.76
from http://www.analog.cx/docs/cache.html
Analog has the ability to archive some of the data in your logfile into a cache file so that the logfile can be thrown away without losing the most important data. (This is sometimes known as incremental processing.)
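As a rough sketch of how this might look in practice, the two fragments below first write a cache file from an old logfile and then, on a later run, read that cache alongside a newer logfile. The CACHEOUTFILE and CACHEFILE commands are the ones described on the cache page cited above; every path here is a hypothetical placeholder, and the exact behaviour should be checked against the docs for your Analog version.

```
# Run 1: summarise the old logfile into a cache file (paths are placeholders)
LOGFILE /path/to/access_log.old
CACHEOUTFILE /path/to/analog/cache/old.dat

# Run 2 (a separate config): read the cache back in place of the old logfile
CACHEFILE /path/to/analog/cache/old.dat
LOGFILE /path/to/access_log
OUTFILE /path/to/analog/reports/report.html
```

Each run would be started with its own configuration file, so the old logfile is only needed for the first one.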
For most people, the cache file will not be needed: compressing the logfile using a standard compression utility such as gzip will be sufficient. Compressing a logfile is very efficient owing to the large number of repeated strings: I find about 12 times compression in practice. That in itself may solve your filespace problems, without needing to throw away any information.
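If the logfiles are kept compressed, Analog can be told how to decompress them on the fly with the UNCOMPRESS command. The fragment below is a minimal sketch that assumes gzip is on the PATH and that the compressed logs end in .gz; the logfile path is a placeholder.

```
# Pipe any logfile matching *.gz through gzip before analysing it
UNCOMPRESS *.gz "gzip -cd"
LOGFILE /path/to/access_log.*.gz
```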
The cache file is also not the best format for post-processing the data or feeding it into a spreadsheet. For that you should use the computer-readable output style.
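A minimal sketch of selecting that output style, assuming the OUTPUT command with the COMPUTER value as described in the Analog docs; the output path is a placeholder.

```
# Write the report in computer-readable form instead of HTML
OUTPUT COMPUTER
OUTFILE /path/to/analog/reports/report.dat
```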