Any request
made to the server which is logged, is considered a 'hit'.
The requests can be for anything... html pages, graphic
images, audio files, CGI scripts, etc... Each valid line
in the server log is counted as a hit. This number represents
the total number of requests that were made to the server
during the specified report period.
Files
Some requests
made to the server, require that the server then send
something back to the requesting client, such as a html
page or graphic image. When this happens, it is considered
a 'file' and the files total is incremented. The relationship
between 'hits' and 'files' can be thought of as 'incoming
requests' and 'outgoing responses'.
| Top
|
Pages
Pages are, well,
pages! Generally, any HTML document, or anything that generates
an HTML document, would be considered a page. This does
not include the other stuff that goes into a document, such
as graphic images, audio clips, etc... This number represents
the number of 'pages' requested only, and does not include
the other 'stuff' that is in the page. What actually constitutes
a 'page' can vary from server to server. The default action
is to treat anything with the extension '.htm', '.html'
or '.cgi' as a page. A lot of sites will probably define
other extensions, such as '.phtml', '.php3' and '.pl' as
pages as well. Some people consider this number as the number
of 'pure' hits... I'm not sure if I totally agree with that
viewpoint. Some other programs (and people :) refer to this
as 'Pageviews'.
| Top
|
Sites
Each request
made to the server comes from a unique 'site', which can
be referenced by a name or ultimately, an IP address. The
'sites' number shows how many unique IP addresses made requests
to the server during the reporting time period. This DOES
NOT mean the number of unique individual users (real people)
that visited, which is impossible to determine using just
logs and the HTTP protocol (however, this number might be
about as close as you will get).
| Top
|
Visits
Whenever a request
is made to the server from a given IP address (site), the
amount of time since a previous request by the address is
calculated (if any). If the time difference is greater than
a pre-configured 'visit timeout' value (or has never made
a request before), it is considered a 'new visit', and this
total is incremented (both for the site, and the IP address).
The default timeout value is 30 minutes (can be changed),
so if a user visits your site at 1:00 in the afternoon,
and then returns at 3:00, two visits would be registered.
Note: in the 'Top Sites' table, the visits total should
be discounted on 'Grouped' records, and thought of as the
"Minimum number of visits" that came from that grouping
instead. Note: Visits only occur on PageType requests, that
is, for any request whose URL is one of the 'page' types
defined with the PageType option. Due to the limitation
of the HTTP protocol, log rotations and other factors, this
number should not be taken as absolutely accurate, rather,
it should be considered a pretty close "guess".
| Top
|
KBytes
The KBytes (kilobytes)
value shows the amount of data, in KB, that was sent out
by the server during the specified reporting period. This
value is generated directly from the log file, so it is
up to the web server to produce accurate numbers in the
logs (some web servers do stupid things when it comes to
reporting the number of bytes). In general, this should
be a fairly accurate representation of the amount of outgoing
traffic the server had, regardless of the web servers reporting
quirks.
Note: A kilobyte is 1024 bytes, not 1000 :)
| Top
|
Top
Entry and Exit Pages
The Top Entry
and Exit tables give a rough estimate of what URL's are
used to enter your site, and what the last pages viewed
are. Because of limitations in the HTTP protocol, log rotations,
etc... this number should be considered a good "rough guess"
of the actual numbers, however will give a good indication
of the overall trend in where users come into, and exit,
your site.
| Top
|
Referrers
Referrers
take many shapes and forms. What is contained in the referrer
field varies depending on many factors, such as what site
did the referral, what type of system it comes from and
how the actual referral was generated. For instance, a referrer
may have your site bookmarked in their browser, they may
hae simply type your sites URL field in their browser, they
could have clicked on a link on some remote web page or
they may have found your site from one of the many search
engines and site indexes found on the web.
If the
user just typed your URL into their browser or clicked on
a bookmark, there won't be any information in the referrer
field and will take the form "-" (Direct Request).
Referrers
that came from search engines, dynamic HTML documents, CGI
scripts and other external programs usually tack on additional
information that it used to create the page. A common example
of this can be found in referrals that come from search
engines and site indexes common on the web. Sometimes, these
referrers URL's can be several hundred characters long and
include all the information that the user typed in to search
for your site. The Webalizer deals with this type of referrer
by stripping off all the query information, which starts
with a question mark '?'. The Referrer "http://search.yahoo.com/search?
p=usa%26global%26link" will be converted to just "http://search.yahoo.com/search".
| Top
|
Search
Strings
The
Webalizer will do an analysis on referrer strings that it
finds, looking for well known search string patterns. Most
of the major search engines are supported, such as Yahoo!,
Altavista, Lycos, etc... Unfortunately, search engines are
always changing their internal/CGI query formats, new search
engines are coming on line every day, and the ability to
detect _all_ search strings is nearly impossible. However,
it should be accurate enough to give a good indication of
what users were searching for when they stumbled across
your site.
| Top
|
Hits
by Response Codes
| Code
200 - OK |
The
request was fulfilled |
| Code
206 - Partial Content |
The
transfer was aborted, cancelled or there was an internet
error. |
| Code
304 - Not Modified |
Since
the document has not been modified since the last date
and time of the last request, the document was not sent. |
| Code
404 - Not Found |
The
server has not found anything matching the URL given The
file is not found or was mis-spelled. |
| Top
|
|