Ananya - a named-entity-recognition (ner) system for sinhala language

dc.contributor.authorManamini, SAPM
dc.contributor.authorAhamed, AF
dc.contributor.authorRajapakshe, RAEC
dc.contributor.authorReemal, GHA
dc.contributor.authorJayasena, S
dc.contributor.authorDias, GV
dc.contributor.authorRanathunga, S
dc.contributor.editorJayasekara, AGBP
dc.contributor.editorBandara, HMND
dc.contributor.editorAmarasinghe, YWR
dc.date.accessioned2022-09-08T09:16:35Z
dc.date.available2022-09-08T09:16:35Z
dc.date.issued2016-04
dc.description.abstractNamed-Entity-Recognition (NER) is one of the major tasks under Natural Language Processing, which is widely used in the fields of Computer Science and Computational Linguistics. However, the amount of prior research done on NER for Sinhala is very minimal. In this paper, we present data-driven techniques to detect Named Entities in Sinhala text, with the use of Conditional Random Fields (CRF) and Maximum Entropy (ME) statistical modeling methods. Results obtained from experiments indicate that CRF, which provided the highest accuracy for the same task for other languages outperforms ME in Sinhala NER as well. Furthermore, we identify different linguistic features such as orthographic word level and contextual information that are effective with both CRF and ME Algorithms.en_US
dc.identifier.citationS. A. P. M. Manamini et al., "Ananya - a Named-Entity-Recognition (NER) system for Sinhala language," 2016 Moratuwa Engineering Research Conference (MERCon), 2016, pp. 30-35, doi: 10.1109/MERCon.2016.7480111.en_US
dc.identifier.conference2016 Moratuwa Engineering Research Conference (MERCon)en_US
dc.identifier.departmentEngineering Research Unit, University of Moratuwaen_US
dc.identifier.doi10.1109/MERCon.2016.7480111en_US
dc.identifier.emailprabushi.11@cse.mrt.ac.lken_US
dc.identifier.emaileranda.11@cse.mrt.ac.lken_US
dc.identifier.emailachintha.11@cse.mrt.ac.lken_US
dc.identifier.emailfarazath.11@cse.mrt.ac.lken_US
dc.identifier.emailsanath@cse.mrt.ac.lken_US
dc.identifier.emailgihan@cse.mrt.ac.lken_US
dc.identifier.emailsurangika@cse.mrt.ac.lken_US
dc.identifier.facultyEngineeringen_US
dc.identifier.pgnospp. 30-35en_US
dc.identifier.placeMoratuwa, Sri Lankaen_US
dc.identifier.proceedingProceedings of 2016 Moratuwa Engineering Research Conference (MERCon)en_US
dc.identifier.urihttp://dl.lib.uom.lk/handle/123/18992
dc.identifier.year2016en_US
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.relation.urihttps://ieeexplore.ieee.org/document/7480111en_US
dc.subjectNamed Entity Recognitionen_US
dc.subjectConditional Random Fieldsen_US
dc.subjectMaximum Entropy Modelen_US
dc.subjectSinhala languageen_US
dc.subjectNatural Language Processingen_US
dc.titleAnanya - a named-entity-recognition (ner) system for sinhala languageen_US
dc.typeConference-Full-texten_US

Files

Collections