Clustering Sinhala news articles using corpus-based similarity measures

dc.contributor.author: Nanayakkara, P
dc.contributor.author: Ranathunga, S
dc.contributor.editor: Chathuranga, D
dc.date.accessioned: 2022-08-22T10:04:42Z
dc.date.available: 2022-08-22T10:04:42Z
dc.date.issued: 2018-05
dc.description.abstract: News aggregators help readers handle large numbers of news items conveniently by collecting them in a single place with meaningful groupings. Such news aggregators/clusters are available for English and some other popular languages. However, no such tools are available for the Sinhala language. To address this void, this paper presents a system that collects news articles published across the web and groups related articles using corpus-based similarity measures. Despite the simplicity of the technique and the morphological richness of Sinhala, we achieved very promising results that demonstrate the viability of the presented technique. [en_US]
dc.identifier.citation: P. Nanayakkara and S. Ranathunga, "Clustering Sinhala News Articles Using Corpus-Based Similarity Measures," 2018 Moratuwa Engineering Research Conference (MERCon), 2018, pp. 437-442, doi: 10.1109/MERCon.2018.8421890. [en_US]
dc.identifier.conference: 2018 Moratuwa Engineering Research Conference (MERCon) [en_US]
dc.identifier.doi: 10.1109/MERCon.2018.8421890 [en_US]
dc.identifier.email: nanayakkara.purnima@gmail.com [en_US]
dc.identifier.email: surangika@cse.mrt.ac.lk [en_US]
dc.identifier.faculty: Engineering [en_US]
dc.identifier.pgnos: pp. 437-442 [en_US]
dc.identifier.place: Moratuwa, Sri Lanka [en_US]
dc.identifier.proceeding: Proceedings of 2018 Moratuwa Engineering Research Conference (MERCon) [en_US]
dc.identifier.uri: http://dl.lib.uom.lk/handle/123/18670
dc.identifier.year: 2018 [en_US]
dc.language.iso: en [en_US]
dc.publisher: IEEE [en_US]
dc.relation.uri: https://ieeexplore.ieee.org/document/8421890 [en_US]
dc.subject: document clustering [en_US]
dc.subject: Corpus-based similarity measurement [en_US]
dc.subject: Sinhala [en_US]
dc.title: Clustering Sinhala news articles using corpus-based similarity measures [en_US]
dc.type: Conference-Full-text [en_US]
