Assessing information quality of wikipedia articles through google's E-A-T model

dc.contributor.authorSugandhika, C
dc.contributor.authorAhangam, S
dc.date.accessioned2023-06-26T05:56:38Z
dc.date.available2023-06-26T05:56:38Z
dc.date.issued2022
dc.description.abstractAlong with the emergence of Web 2.0, User Generated Content (UGC) is becoming increasingly important for knowledge sharing. Wikipedia being the world’s largest-ever community-based collaborative encyclopedia, is also one of the biggest UGC databases in the world. Wikipedia is dealing with a significant problem of Information Quality (IQ) because of its open-source and collaborative nature. When carrying out attacks such as link spamming, malicious users take advantage of Wikipedia’s popularity on the World Wide Web (WWW). As a result, Wikipedia is generally not recommended for academic-related work. There are, however, some articles that are both rich in information and quality. Existing approaches for assessing Wikipedia’s IQ involve statistical models and machine learning algorithms; however, the existing models do not produce satisfactory results. In this study, a novel theoretical model based on Google’s E-A-T framework is introduced to assess Wikipedia’s IQ. The model comprises three IQ constructs Expertise, Authority and Trustworthiness. Based on the empirical findings and study results, a set of IQ dimensions that influence the above three IQ constructs, as well as 45 IQ attributes to measure the IQ dimensions, were identified. The IQ attributes were automatically and inexpensively extracted from the content and meta-data statistics of Wikipedia articles using a Selenium 3.14 web automation script. A sample of 2000 articles comprising 1000 Featured Articles (FA) and 1000 non-FA articles from six WikiProjects was used for the data analysis. The proposed model was compared with three previously published models in terms of classification and clustering accuracy. It received classification and clustering accuracies of 95% and 93% respectively, which is a drastic improvement over the existing models. Furthermore, an average inter-rater agreement of 84% was observed. Thus, the proposed model’s effectiveness is fairly validated by this extensive experiment. This study contributes to the related knowledge area by introducing a novel framework to assess Wikipedia articles’ IQ.en_US
dc.identifier.citationSugandhika, C., & Ahangama, S. (2022). Assessing information quality of wikipedia articles through google’s E-A-T model. IEEE Access, 10, 52196-52209. https://doi.org/10.1109/ACCESS.2022.3172962en_US
dc.identifier.databaseIEEE Xploreen_US
dc.identifier.doi10.1109/ACCESS.2022.3172962en_US
dc.identifier.issn2169-3536(Online)en_US
dc.identifier.journalIEEE Accessen_US
dc.identifier.pgnos52196-52209en_US
dc.identifier.urihttp://dl.lib.uom.lk/handle/123/21164
dc.identifier.volume10en_US
dc.identifier.year2022en_US
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.subjectAuthorityen_US
dc.subjectexpertiseen_US
dc.subjectGoogle's E-A-Ten_US
dc.subjecthybrid approachen_US
dc.subjectinformation qualityen_US
dc.subjecttrustworthinessen_US
dc.subjectweb 2.0en_US
dc.subjectWikipedia.en_US
dc.titleAssessing information quality of wikipedia articles through google's E-A-T modelen_US
dc.typeArticle-Full-texten_US

Files