|
Look at your log files (a pastime of mine). I've found several bot-like behaviours that I exclude from my reports when I'm able to. The problem is identifying the IPs and user agents to filter out: some bots look like real visitors but get through the site far too quickly to be human, even for someone using a high-powered computer with unlimited windows open.
Here is a list of bot-like behaviours I've identified as part of my research into a guide on site statistics.
Bot-like activity, meaning IPs that make:
* Requests for the robots.txt file.
* Requests for several pages that are old or never existed.
* Requests for pages without supplying referrer or browser information.
* Requests for pages but not for supporting content such as images or style sheets.
* Requests for pages at regular intervals.
* Requests for a large number of pages in a short time frame.
* Requests for the same page using several browser strings in a short time frame.
* Requests for all the internal links on a page in the order they appear in the source code.
Just remember that there are instances when a real visitor can show some of these traits, but most typical visitors don't.
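For anyone who wants to try this on their own logs, here is a rough Python sketch of how a few of the traits above could be checked. It is only an illustration, not the method I use for my reports. It assumes the log is in Apache/NCSA Combined Log Format, and every name and threshold in it (`bot_traits`, `BURST_PAGES`, `MIN_AGENTS` and so on) is made up for the example and would need tuning against real traffic. It covers robots.txt requests, repeated 404s, missing referrer and browser information, pages without supporting content, bursts of page requests, and several browser strings on the same page (without the time-frame check); it does not attempt the regular-interval or link-order checks.

```python
import re
from collections import defaultdict
from datetime import datetime

# Assumed input: Combined Log Format,
# ip - user [time] "METHOD path proto" status size "referrer" "agent"
LOG_RE = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+)[^"]*" (?P<status>\d{3}) \S+ '
    r'"(?P<referrer>[^"]*)" "(?P<agent>[^"]*)"'
)
ASSET_EXTS = (".css", ".js", ".png", ".jpg", ".jpeg", ".gif", ".ico")

# Hypothetical thresholds; tune them against your own traffic.
BURST_PAGES = 10               # this many page requests...
BURST_SECONDS = 10             # ...within this many seconds is a burst
MIN_PAGES_FOR_ASSET_CHECK = 3  # pages seen before "no assets" is meaningful
MIN_404S = 3                   # requests for pages that do not exist
MIN_AGENTS = 3                 # distinct browser strings on one page


def parse(lines):
    """Yield one dict per line that matches the combined log format."""
    for line in lines:
        m = LOG_RE.match(line)
        if m:
            hit = m.groupdict()
            hit["time"] = datetime.strptime(hit["time"], "%d/%b/%Y:%H:%M:%S %z")
            yield hit


def bot_traits(lines):
    """Return {ip: set of trait names} for IPs showing at least one trait."""
    by_ip = defaultdict(list)
    for hit in parse(lines):
        by_ip[hit["ip"]].append(hit)

    flagged = {}
    for ip, hits in by_ip.items():
        traits = set()
        page_times = []
        agents_by_path = defaultdict(set)
        assets = 0
        for h in hits:
            path = h["path"].split("?")[0].lower()
            agents_by_path[path].add(h["agent"])
            if path.endswith(ASSET_EXTS):
                assets += 1
            elif path == "/robots.txt":
                traits.add("robots.txt")
            else:
                page_times.append(h["time"])
        page_times.sort()

        if sum(h["status"] == "404" for h in hits) >= MIN_404S:
            traits.add("missing_pages")
        # Flag only when every request lacks both referrer and agent.
        if all(h["referrer"] in ("", "-") and h["agent"] in ("", "-") for h in hits):
            traits.add("no_referrer_or_agent")
        if len(page_times) >= MIN_PAGES_FOR_ASSET_CHECK and assets == 0:
            traits.add("no_assets")
        # Sliding window: BURST_PAGES page requests inside BURST_SECONDS.
        if any(
            (page_times[i + BURST_PAGES - 1] - page_times[i]).total_seconds()
            <= BURST_SECONDS
            for i in range(len(page_times) - BURST_PAGES + 1)
        ):
            traits.add("burst")
        if any(len(agents) >= MIN_AGENTS for agents in agents_by_path.values()):
            traits.add("many_agents_same_page")
        if traits:
            flagged[ip] = traits
    return flagged
```

You would call it with something like `bot_traits(open("access.log"))` (the file name is just an example) and then review the flagged IPs by hand before filtering them out, since a single trait on its own proves very little.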
HTH. Perhaps I should work a bit more on this guide now, but it's just one of my ongoing projects.
|