Identification and characterization of crawlers through analysis of web logs

Algiriyage, N; Jayasena, VSD; Dias, G; Perera, A; Dayananda, K; Sharma, K

UoM IR
→
Research Publications
→
Conference Proceedings
→
Workshops, Seminars, Symposiums & Conferences
→
Workshops, Seminars, Symposiums & Conferences
→
View Item

dc.contributor.author	Algiriyage, N
dc.contributor.author	Jayasena, VSD
dc.contributor.author	Dias, G
dc.contributor.author	Perera, A
dc.contributor.author	Dayananda, K
dc.contributor.author	Sharma, K
dc.date.accessioned	2014-06-18T16:57:07Z
dc.date.available	2014-06-18T16:57:07Z
dc.date.issued	2014-06-18
dc.identifier.uri	http://dl.lib.mrt.ac.lk/handle/123/10041
dc.description.abstract	Web crawlers are software programs that automatically traverse the hyperlink structure of the world-wide web in order to locate and retrieve information. In addition to crawlers from search engines, we observed many other crawlers which may gather business intelligence, confidential information or even execute attacks based on gathered information while camouflaging their identity. Therefore, it is important for a website owner to know who has crawled his site, and what they have done. In this study we have analyzed crawler patterns in web server logs, developed a methodology to identify crawlers and classified them into three categories. To evaluate our methodology we used seven test crawler scenarios. We found that approximately 53.25% of web crawler sessions were from 'known' crawlers and 34.16% exhibit suspicious behavior.	en_US
dc.language.iso	en	en_US
dc.source.uri	www.iciis.org	en_US
dc.title	Identification and characterization of crawlers through analysis of web logs	en_US
dc.type	Conference-Abstract	en_US
dc.identifier.faculty	Engineering	en_US
dc.identifier.department	Department of Computer Science and Engineering	en_US
dc.identifier.year	2013	en_US
dc.identifier.conference	International Conference on Industrial and Information System [8th ms] - ICIIS 2013	en_US
dc.identifier.place	Peradeniya	en_US
dc.identifier.pgnos	pp. 150-155	en_US