Oracle9iAS Clickstream Intelligence Administrator's Guide Release 2 (9.0.2) Part Number A90500-02 |
|
Oracle9iAS Clickstream Intelligence supports the collection and analysis of data from the following standard log file types:
The log formats above contain some fields that are case-insensitive. Case-insensitive log fields are converted to lower-case when loaded into the Clickstream Intelligence database. Case-insensitive log fields include the following:
All other log fields are treated in a case-sensitive manner - that is, the original case of the field is not changed as it is loaded into the database.
This appendix provides information about each log file type supported by Oracle9iAS Clickstream Intelligence. Log format fields are defined in a summary table at the end of each section.
Note: Unlike Oracle9i Application Server, Oracle9iAS Clickstream Intelligence does not support Netscape Web Server log files. |
Apache is an open-source Web server supported by the Apache Software Foundation. The Apache HTTP Server Project develops and maintains the Apache HTTP server, one of the most popular servers on the internet. The two formats used most often with Apache are Common and Combined.
Apache Common format displays each request as a separate line in the Web log. Fields are separated with a space, and may be enclosed in quotation marks (" "), as with the "Request Line" field in the Apache Log File Format Example that follows. Time in the "Date and Time" field includes an offset from Greenwich Mean Time (GMT). Null values for log fields are represented by a dash ( - ).
Apache Common Log Format contains all basic Web log parameters (see Apache Log Format Details), but does not include information about the Referrer, Agent, Time to Serve (transfer time), Domain Name, or Cookie string.
Apache Combined format is simply an extension of Common Log Format. It contains the same fields as Apache Common Log Format, with the addition of two fields: Referrer and User Agent. Apache Combined Log Format does not provide Domain Name, Time to Serve (transfer time), or Cookie information.
Consider the following example of a Web log entry in the Apache Combined Log Format:
203.93.245.97 - oracleuser [28/Sep/2000:23:59:07 -0700] "GET /files/search/search.jsp?s=driver&a=10 HTTP/1.0" 200 2374 "http://datawarehouse.us.oracle.com/datamining/contents.htm" "Mozilla/4.7 [en] (WinNT; I)"
In the entry above, the following parameters can be identified:
The User Agent field is comprised of the following data:
Apache log files may contain some or all of the following fields.
W3C Extended is a flexible, highly configurable Web log format developed by the World Wide Web Consortium (W3C) as a common standard to support the needs of servers, clients, and proxies. Each W3C Extended format log file is self-identifying -- it displays at the beginning of each log file a header containing information about the data types recorded, as well as the version of the extended log file format used. As with Microsoft IIS Extended Log File Format, field directives may begin with:
W3C Extended log fields are separated by a whitespace (although tabs are most commonly used and are encouraged by the W3C specification). A null value for a field is typically represented by a dash ( - ); the "Date" and "Time" fields are expressed as GMT time.
Consider the following example of a Web log entry generated in W3C Extended Log File Format:
#Version: 1.0 #GMT-Offset: -0800 #Software: Oracle9iAS Web Cache/2.0.0.2.0 #Start-Date: 2001-10-31 00:00:18 #Fields: c-ip c-dns c-auth-id date time cs-method cs-uri sc-status bytes cs(Cookie) cs(Referrer) time-taken cs(User-Agent) #Date: 2001-10-31 00:00:18 64.103.37.2 client_joaz7 DMS.user 2001-10-31 00:00:18 GET /admin/images/oc_bottomleft.gif 200 350 "BIGipServerwww_ webcache_pool=1443321748.19460.0000;ORA_UCM_ AGID=%2fMP%2f8M7%3etSHPV%40%2fS%3f%3fDh3VHO" "http://www.oracle.com/nl/partner/content.html" 370879 "Mozilla/4.5 [en] (WinNT; I)"
In the entry above, the following parameters can be identified. The #Fields header indicates the order of fields that appear in the log file entry.
The User Agent field is comprised of the following data:
W3C Extended Format log files may contain the following fields:
Microsoft IIS Extended Web logs are loosely based upon W3C Web log format. Microsoft IIS Extended logs contain the basic information found in both Apache and W3C format Web logs, in addition to other unique fields (such as "Win32 Status"). The beginning of every Microsoft IIS Extended log file (IIS 4.0 and higher) displays a header indicating the fields in the order that they were recorded by the Web server. Most field indentifiers begin with one of the following prefixes:
Fields in Microsoft IIS Extended format logs can be separated by a space. To ensure that spaces contained in the data are not misinterpreted as field delimiters, fields are URL-encoded. Null values are displayed as a dash ( - ). The "Date" field is expressed in local time.
Consider the following example of a Web log entry generated in Microsoft IIS Extended Log File Format:
2000-09-28 06:59:07 203.93.245.97 oracleuser W3SVC1 DATAWAREHOUSE 144.25.86.192 GET /files/search/search.jsp s=driver&a=10 200 0 2374 369 2938 80 HTTP/1.0 Mozilla/4.7+[en]+(WinNT;+I) - http://datawarehouse.us.oracle.com/datamining/contents.htm
In the entry above, the following parameters can be identified:
The User Agent field is comprised of the following data:
Microsoft IIS Extended Format log files may contain the following fields:
|
Copyright © 2002 Oracle Corporation. All Rights Reserved. |
|