Detecting depression from social media data as a multiple-instance learning task

Published in 2022 10th International Conference on Affective Computing and Intelligent Interaction (ACII), 2022

Major Depression Disorder (MDD) is a mental condition that causes severe impairments in a person's life. Early detection of such a condition is imperative, especially to detect cases where the person does not even know that they are affected by it. Social media provide a ubiquitous platform with self-generated data that Machine Learning models enhanced with Natural Language Processing mechanisms can extensively explore. Nonetheless, we advocate that previous Machine Learning approaches to early detection of MDD do not adequately model the problem at hand. Most assume that the isolated publications on social media are enough for early detection. This paper proposes a new problem formulation for this task using the multiple-instance learning paradigm. Furthermore, we include a theoretical and experimental analysis of the method on a dataset of Brazilian university students using the transformer and LSTM architectures. The results indicate that the proposed approach yields better accuracy and explanation capabilities than previous studies that do not rely on multiple-instance learning.

Download Full Paper