Early detection of Sinhala language fake news in social media networks

dc.contributor.advisor: Ahangama S
dc.contributor.advisor: Adikari S
dc.contributor.author: Hathnapitiya, H.G.H.S
dc.date.accept: 2024
dc.date.accessioned: 2024-10-10T08:25:35Z
dc.date.available: 2024-10-10T08:25:35Z
dc.date.issued: 2024
dc.description.abstract: With human evolution, people invented new technologies to make life easier. In the early twentieth century, people read newspapers, listened to the radio, and watched television to gather information. As technologies matured, social media platforms were introduced to connect people. Busy modern users began to browse and rely on these platforms for news and lost interest in traditional media. Social media is easy to access and cost-effective, but these platforms can also be used effortlessly to propagate fake news and mislead people for personal, political, or religious gain. Society therefore needs a proper mechanism to prevent the spread of false information. Human experts can address the issue by investigating news content manually, but this requires many experts and is time-consuming. This study introduces an automated system that detects Sinhala fake news on social media at the time the content is published. The dataset was created by gathering news from Facebook that had been proven fake by Sri Lankan fact-checkers or confirmed legitimate by Sri Lankan news broadcasting channels. The proposed method uses content-related features with deep learning and machine learning techniques. The deep learning model combines Sinhala POS tags and their TF-IDF values with XLM-R embeddings and achieved 86% accuracy. The machine learning approach uses TF-IDF values of Sinhala POS tags, FastText embeddings, and punctuation count and achieved 85% accuracy. The proposed methods can identify fake news early, helping to prevent its spread. Performance could be further improved by collecting more data to enlarge the dataset. Keywords: Sinhala fake news, social media, content-related features, natural language processing (NLP), deep learning (DL), machine learning (ML) [en_US]
dc.identifier.accno: TH5543 [en_US]
dc.identifier.citation: Hathnapitiya, H.G.H.S. (2024). Early detection of Sinhala language fake news in social media networks [Master's thesis, University of Moratuwa]. Institutional Repository University of Moratuwa. http://dl.lib.uom.lk/handle/123/22899
dc.identifier.degree: MSc in Information Technology by Research [en_US]
dc.identifier.department: Department of Information Technology [en_US]
dc.identifier.faculty: IT [en_US]
dc.identifier.uri: http://dl.lib.uom.lk/handle/123/22899
dc.language.iso: en [en_US]
dc.subject: SOCIAL MEDIA
dc.subject: SINHALA FAKE NEWS
dc.subject: NATURAL LANGUAGE PROCESSING (NLP)
dc.subject: CONTENT-RELATED FEATURES
dc.subject: MACHINE LEARNING (ML)
dc.subject: DEEP LEARNING (DL)
dc.subject: INFORMATION TECHNOLOGY COMPUTER SCIENCE- Dissertation
dc.subject: MSc (Major Component Research)
dc.title: Early detection of Sinhala language fake news in social media networks [en_US]
dc.type: Thesis-Abstract [en_US]

Files

Original bundle

Name: TH5543-1.pdf
Size: 200.11 KB
Format: Adobe Portable Document Format
Description: Pre-text

Name: TH5543-2.pdf
Size: 268 KB
Format: Adobe Portable Document Format
Description: Post-text

Name: TH5543.pdf
Size: 1.1 MB
Format: Adobe Portable Document Format
Description: Full-thesis

License bundle

Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed upon to submission