Logo

A textual-based featuring approach for depression detection using machine learning classifiers and social media texts

Chiong, Raymond and Budhi, Gregorius Satia and Dhakal, Sandeep and Chiong, Fabian (2021) A textual-based featuring approach for depression detection using machine learning classifiers and social media texts. [UNSPECIFIED]

[img] PDF
Download (2812Kb)
    [img] PDF
    Download (4Mb)

      Abstract

      Depression is one of the leading causes of suicide worldwide. However, a large percentage of cases of depression go undiagnosed and, thus, untreated. Previous studies have found that messages posted by individuals with major depressive disorder on social media platforms can be analysed to predict if they are suffering, or likely to suffer, from depression. This study aims to determine whether machine learning could be effectively used to detect signs of depression in social media users by analysing their social media posts—especially when those messages do not explicitly contain specific keywords such as ‘depression’ or ‘diagnosis’. To this end, we investigate several text preprocessing and textual-based featuring methods along with machine learning classifiers, including single and ensemble models, to propose a generalised approach for depression detection using social media texts. We first use two public, labelled Twitter datasets to train and test the machine learning models, and then another three non-Twitter depression-class only datasets (sourced from Facebook, Reddit, and an electronic diary) to test the performance of our trained models in other social media sources. Experimental results indicate that the proposed approach is able to effectively detect depression via social media texts even when the training datasets do not contain specific keywords (such as ‘depression’ and ‘diagnose’), as well as when unrelated datasets are used for testing.

      Item Type: UNSPECIFIED
      Additional Information: -
      Uncontrolled Keywords: depression detection, social media, textual-based featuring, machine learning, imbalanced data
      Subjects: Q Science > QA Mathematics > QA76 Computer software
      Divisions: Faculty of Industrial Technology > Informatics Engineering Department
      Depositing User: Admin
      Date Deposited: 25 May 2021 04:46
      Last Modified: 23 Sep 2022 21:49
      URI: https://repository.petra.ac.id/id/eprint/20062

      Actions (login required)

      View Item