2014-9th
Permanent URI for this collectionhttp://192.248.9.226/handle/123/13514
Browse
Browsing 2014-9th by Conference "Moratuwa Engineering Research Conference - MERCon 2017"
Now showing 1 - 1 of 1
- Results Per Page
- Sort Options
- item: Conference-AbstractShort Tamil sentence similarity calculation using knowledge-based and corpus-based similarity measuresSelvarasa, A; Thirunavukkarasu, N; Rajendran, N; Yogalingam, C; Ranathunga, S; Dias, GSentence similarity calculation plays an important role in text processing-related research. Many unsupervised techniques such as knowledge-based techniques, corpus-based techniques, string similarity based techniques, and graph alignment techniques are available to measure sentence similarity. However, none of these techniques have been experimented with Tamil. In this paper, we present the first-ever system to measure semantic similarity for Tamil short phrases using a hybrid approach that makes use of knowledge-based and corpus-based techniques. We tested this system with 2000 general sentence pairs and 100 mathematical sentence pairs. For the dataset of 2000 sentence pairs, this approach achieved a Mean Squared Error of 0.195 and a Pearson Correlation factor of 0.815. For the 100 mathematical sentence pairs, this approach achieved an 85% of accuracy.