Show simple item record

dc.contributor.author Udantha, M
dc.contributor.author Ranathunga, S
dc.contributor.author Dias, G
dc.date.accessioned 2016-03-08T02:38:14Z
dc.date.available 2016-03-08T02:38:14Z
dc.date.issued 2016-03-08
dc.identifier.uri http://dl.lib.mrt.ac.lk/handle/123/11671
dc.description.abstract Mining web access log data is a popular technique to identify frequent access patterns of website users. There are many mining techniques such as clustering, sequential pattern mining and association rule mining to identify these frequent access patterns. Each can find interesting access patterns and group the users, but they cannot identify the slight differences between accesses patterns included in individual clusters. But in reality these could refer to important information about attacks. This paper introduces a methodology to identify these access patterns at a much lower level than what is provided by traditional clustering techniques, such as nearest neighbour based techniques and classification techniques. This technique makes use of the concept of episodes to represent web sessions. These episodes are expressed in the form of regular expressions. To the best of our knowledge, this is the first time to apply the concept of regular expressions to identify user access patterns in web server log data. In addition to identifying frequent patterns, we demonstrate that this technique is able to identify access patterns that occur rarely, which would have been simply treated as noise in traditional clustering mechanisms. en_US
dc.description.sponsorship INSTICC - Institute for Systems and Technologies of Information, Control and Communication en_US
dc.language.iso en en_US
dc.subject Web Usage Mining en_US
dc.subject Pattern Mining en_US
dc.subject Regular Expressions en_US
dc.subject Anomaly Detection en_US
dc.title An Episode-based approach to Identify Website user access patterns en_US
dc.type Conference-Full-text
dc.identifier.faculty Engineering en_US
dc.identifier.year 2016 en_US
dc.identifier.conference International Conference on Pattern Recognition Applications and Methods en_US
dc.identifier.place Rome - Italy en_US
dc.identifier.pgnos 24-26
dc.identifier.proceeding International Conference on Pattern Recognition Applications and Methods en_US
dc.identifier.email madhuka@nic.lk en_US
dc.identifier.email surangika@cse.mrt.ac.lk en_US
dc.identifier.email gihan@cse.mrt.ac.lk en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record