Please use this identifier to cite or link to this item: https://hdl.handle.net/1959.11/61394
Full metadata record
DC FieldValueLanguage
dc.contributor.authorBudhi, Gregorius Satiaen
dc.contributor.authorChiong, Raymonden
dc.contributor.authorWang, Zulien
dc.date.accessioned2024-07-10T01:01:12Z-
dc.date.available2024-07-10T01:01:12Z-
dc.date.issued2021-
dc.identifier.citationMultimedia Tools and Applications, v.80, p. 13079-13097en
dc.identifier.issn1573-7721en
dc.identifier.issn1380-7501en
dc.identifier.urihttps://hdl.handle.net/1959.11/61394-
dc.description.abstract<p>Fraudulent online sellers often collude with reviewers to garner fake reviews for their products. This act undermines the trust of buyers in product reviews, and potentially reduces the effectiveness of online markets. Being able to accurately detect fake reviews is, therefore, critical. In this study, we investigate several preprocessing and textual-based featuring methods along with machine learning classifiers, including single and ensemble models, to build a fake review detection system. Given the nature of product review data, where the number of fake reviews is far less than that of genuine reviews, we look into the results of each class in detail in addition to the overall results. We recognise from our preliminary analysis that, owing to imbalanced data, there is a high imbalance between the accuracies for different classes (e.g., 1.3% for the fake review class and 99.7% for the genuine review class), despite the overall accuracy looking promising (around 89.7%). We propose two dynamic random sampling techniques that are possible for textual-based featuring methods to solve this class imbalance problem. Our results indicate that both sampling techniques can improve the accuracy of the fake review class—for balanced datasets, the accuracies can be improved to a maximum of 84.5% and 75.6% for random under and over-sampling, respectively. However, the accuracies for genuine reviews decrease to 75% and 58.8% for random under and over-sampling, respectively. We also discover that, for smaller datasets, the Adaptive Boosting ensemble model outperforms other single classifiers" whereas, for larger datasets, the performance improvement from ensemble models is insignificant compared to the best results obtained by single classifiers.</p>en
dc.languageenen
dc.publisherSpringer New York LLCen
dc.relation.ispartofMultimedia Tools and Applicationsen
dc.titleResampling imbalanced data to detect fake reviews using machine learning classifiers and textual-based featuresen
dc.typeJournal Articleen
dc.identifier.doi10.1007/s11042-020-10299-5en
local.contributor.firstnameGregorius Satiaen
local.contributor.firstnameRaymonden
local.contributor.firstnameZulien
local.profile.schoolSchool of Science & Technologyen
local.profile.emailrchiong@une.edu.auen
local.output.categoryC1en
local.record.placeauen
local.record.institutionUniversity of New Englanden
local.publisher.placeUnited States of Americaen
local.format.startpage13079en
local.format.endpage13097en
local.peerreviewedYesen
local.identifier.volume80en
local.contributor.lastnameBudhien
local.contributor.lastnameChiongen
local.contributor.lastnameWangen
dc.identifier.staffune-id:rchiongen
local.profile.orcid0000-0002-8285-1903en
local.profile.roleauthoren
local.profile.roleauthoren
local.profile.roleauthoren
local.identifier.unepublicationidune:1959.11/61394en
dc.identifier.academiclevelAcademicen
dc.identifier.academiclevelAcademicen
dc.identifier.academiclevelAcademicen
local.title.maintitleResampling imbalanced data to detect fake reviews using machine learning classifiers and textual-based featuresen
local.output.categorydescriptionC1 Refereed Article in a Scholarly Journalen
local.search.authorBudhi, Gregorius Satiaen
local.search.authorChiong, Raymonden
local.search.authorWang, Zulien
local.uneassociationNoen
dc.date.presented2021-
local.atsiresearchNoen
local.sensitive.culturalNoen
local.year.published2021en
local.year.presented2021en
local.fileurl.closedpublishedhttps://rune.une.edu.au/web/retrieve/8abc80ae-a366-4a3e-8435-f73165055c42en
local.subject.for20204602 Artificial intelligenceen
local.profile.affiliationtypeExternal Affiliationen
local.profile.affiliationtypeExternal Affiliationen
local.profile.affiliationtypeExternal Affiliationen
local.date.moved2024-07-26en
Appears in Collections:Journal Article
School of Science and Technology
Files in This Item:
1 files
File SizeFormat 
Show simple item record

SCOPUSTM   
Citations

38
checked on Sep 28, 2024
Google Media

Google ScholarTM

Check

Altmetric


Items in Research UNE are protected by copyright, with all rights reserved, unless otherwise indicated.