Web crawler: Crawler identification

Web crawlers about analyze themselves to a Web server by application the User-agent acreage of an HTTP request. Web armpit administrators about appraise their Web servers' log and use the user abettor acreage to actuate which crawlers accept visited the web server and how often. The user abettor acreage may accommodate a URL area the Web armpit ambassador may acquisition out added advice about the crawler. Spambots and added awful Web crawlers are absurd to abode anecdotic advice in the user abettor field, or they may affectation their character as a browser or added acclaimed crawler.

It is important for Web crawlers to analyze themselves so that Web armpit administrators can acquaintance the buyer if needed. In some cases, crawlers may be accidentally trapped in a crawler allurement or they may be overloading a Web server with requests, and the buyer needs to stop the crawler. Identification is additionally advantageous for administrators that are absorbed in alive back they may apprehend their Web pages to be indexed by a accurate chase engine.

Web crawler

Thursday, 8 March 2012

Crawler identification

No comments:

Post a Comment