Logo

Using a hybrid content-based and behaviour-based featuring approach in a parallel environment to detect fake reviews

Budhi, Gregorius Satia and Chiong, Raymond and Wang, Zuli and Dhakal, Sandeep (2021) Using a hybrid content-based and behaviour-based featuring approach in a parallel environment to detect fake reviews. [UNSPECIFIED]

[img] PDF
Download (7Mb)
    [img] PDF
    Download (5Mb)
      [img]
      Preview
      PDF (paper - Gregorius)
      Download (8Mb) | Preview

        Abstract

        The financial impact of positive reviews has prompted some fraudulent sellers to generate fake product reviews for either promoting their products or discrediting competing products. Many e-commerce portals have implemented measures to detect such fake reviews, and these measures require excellent detectors to be effective. In this work, we propose 133 unique features from the combination of content and behaviour-based features to detect fake reviews using machine learning classifiers. Preliminary results show that these features can provide good results for all datasets tested. Detailed analysis of the results, however, reveals the existence of class imbalance issues for two of the bigger datasets - there is a high imbalance between the accuracies of different classes (e.g., 7.73% for the fake class and 99.3% for the genuine class using a Multilayer Perceptron classifier). We therefore introduce two sampling methods that can improve the accuracy of the fake review class on balanced datasets. The accuracies can be improved to a maximum of 89% for both random under and oversampling on Convolutional Neural Networks. Additionally, we propose a parallel cross-validation method that can speed up the validation process in a parallel environment.

        Item Type: UNSPECIFIED
        Additional Information: -
        Uncontrolled Keywords: fake review detection, featuring approach, machine learning, deep learning, imbalanced data, parallel processing
        Subjects: Q Science > QA Mathematics > QA76 Computer software
        Q Science > QA Mathematics > QA75 Electronic computers. Computer science
        Divisions: Faculty of Industrial Technology > Informatics Engineering Department
        Depositing User: Admin
        Date Deposited: 25 May 2021 04:04
        Last Modified: 21 Oct 2025 19:14
        URI: https://repository.petra.ac.id/id/eprint/20063

        Actions (login required)

        View Item