Modelling website user behaviors by combining the EM and DBSCAN algorithms

dc.contributor.author: Udantha, M
dc.contributor.author: Ranathunga, S
dc.contributor.author: Dias, G
dc.contributor.editor: Jayasekara, AGBP
dc.contributor.editor: Bandara, HMND
dc.contributor.editor: Amarasinghe, YWR
dc.date.accessioned: 2022-09-08T04:19:37Z
dc.date.available: 2022-09-08T04:19:37Z
dc.date.issued: 2016-04
dc.description.abstract: Web logs can provide a wealth of information on user access patterns of a corresponding website when they are properly analyzed. However, finding interesting patterns hidden in the low-level log data is non-trivial due to large log volumes and the distribution of the log files in cluster environments. This paper presents a novel technique, the application of Density-Based Spatial Clustering of Applications with Noise (DBSCAN) and Expectation Maximization (EM) algorithms in an iterative manner for clustering web user sessions. Each cluster corresponds to one or more web user activities. The unique user access pattern of each cluster is identified by frequent pattern mining and sequential pattern mining techniques. When compared with the clustering output of the EM, DBSCAN, and k-means algorithms, this technique shows better accuracy in web session mining, and it is more effective in identifying cluster changes with time. We demonstrate that the implemented system is capable of not only identifying common user behaviors, but also of identifying cyber-attacks. [en_US]
dc.identifier.citation: M. Udantha, S. Ranathunga and G. Dias, "Modelling website user behaviors by combining the EM and DBSCAN algorithms," 2016 Moratuwa Engineering Research Conference (MERCon), 2016, pp. 168-173, doi: 10.1109/MERCon.2016.7480134. [en_US]
dc.identifier.conference: 2016 Moratuwa Engineering Research Conference (MERCon) [en_US]
dc.identifier.department: Engineering Research Unit, University of Moratuwa [en_US]
dc.identifier.doi: 10.1109/MERCon.2016.7480134 [en_US]
dc.identifier.email: madhuka@nic.lk [en_US]
dc.identifier.email: surangika@cse.mrt.ac.lk [en_US]
dc.identifier.email: gihan@cse.mrt.ac.lk [en_US]
dc.identifier.faculty: Engineering [en_US]
dc.identifier.pgnos: pp. 168-173 [en_US]
dc.identifier.place: Moratuwa, Sri Lanka [en_US]
dc.identifier.proceeding: Proceedings of 2016 Moratuwa Engineering Research Conference (MERCon) [en_US]
dc.identifier.uri: http://dl.lib.uom.lk/handle/123/18969
dc.identifier.year: 2016 [en_US]
dc.language.iso: en [en_US]
dc.publisher: IEEE [en_US]
dc.relation.uri: https://ieeexplore.ieee.org/document/7480134 [en_US]
dc.subject: clustering [en_US]
dc.subject: web usage mining [en_US]
dc.title: Modelling website user behaviors by combining the EM and DBSCAN algorithms [en_US]
dc.type: Conference-Full-text [en_US]