Interpretability of Fine-grained Classification
of Sadness and Depression
Abstract
While sadness is a human emotion that people experience at certain times throughout their lives, causing emotional disappointment and pain, depression is a longer-term mental illness that impairs social, occupational, and other vital areas of functioning, making it a far more serious issue that needs to be addressed at the earliest. NLP techniques can be utilized for the detection and subsequent diagnosis of these emotions. Most open-sourced data on the web treat sadness as a part of depression, even though the two differ vastly in severity. Thus, we create our own novel dataset illustrating the difference between the two. In this paper, we aim to highlight this difference and show how interpretable our models are in distinctly labelling sadness and depression. Due to the sensitive nature of such information, privacy measures need to be taken when handling and training on such data. Hence, we also explore the effect of Federated Learning (FL) on contextualised language models. The code for this paper can be found at: https://github.com/tiasa2/Interpretability-of-Federated-Learning-for-Fine-grained-Classification-of-Sadness-and-Depression
Tiasa Singha Roy*, Priyam Basu* and Aman Priyanshu Manipal Insitute of Technology {tiasa.singharoy, priyam.basu1, aman.priyanshu}@learner.manipal.edu
Rakshit Naidu Carnegie Mellon University [email protected]
1 Introduction
Mental health is defined by The World Health Organization (WHO) as a "state of well-being in which individuals realize their potential, cope with the normal stresses of life, work productively, and contribute to their communities". Depression is a very common mental illness that a large number of people throughout the world suffer from. According to research conducted by the WHO, in 2020 more than 280 million people all over the world suffered from depression (WHO). It is a leading cause of disability and the most common form of neuro-psychiatric disorder (WHO). According to the Substance Abuse and Mental Health Services Administration, in 2018 adolescents aged 12 to 17 had the highest rate of major depressive episodes (14.4%), followed by young adults aged 18 to 25 (13.8%); older adults aged 50 and above had the lowest rate (4.5%). Two-thirds of those who commit suicide struggle with depression Team (2022).
Sadness, on the other hand, is an emotional state characterized by feelings of unhappiness and low mood. A person may say they are feeling 'depressed,' but if the feeling goes away on its own and does not impact life in a big way, it probably is not the illness of depression Canadian Mental Health Association (2022); American Psychiatric Association. Unlike depression, which is persistent and longer-lasting, sadness is temporary and transitory Holmes (2021). Hence, it is very important for us to be able to detect the difference between the two conditions, as depression requires urgent care and treatment.
Most of the open-sourced data related to depression also contain texts that imply mere "sadness". Likewise, sadness-related data comprises depressing texts, as seen in the GoEmotions dataset Demszky et al. (2020): sentences such as "I feel so much lonely" and "I'm so depressed" are found under the "sad" label. The converse also holds, i.e., sad texts are often found under depression corpora. In the Benchmarking dataset Basu et al. (2021), sentences like "Forgot my cheese cake at work" and "Wish I had took holidays instead of being at work", which imply the narrator is sad but not depressed, appear under the "depression" label. Motivated by such problems, we propose a novel dataset that tackles this issue by providing separately labelled samples of sad and depressing data, enabling private training of machine learning models to detect the presence of, as well as differentiate between, the two emotions. In this paper, we explore how interpretable the detection of sadness and depression is using common language models, given that the definitions of the two are very close to each other and yet distinctly different. We also investigate whether Federated Learning (FL) could be a potential solution for training models on our sensitive dataset, and whether FL models are more interpretable than the baseline models, since FL involves collective aggregation of model updates.
2 Related Works
With the rapid increase in social media usage, efforts at depression and other emotion detection are being made as part of sentiment analysis. Park et al. (2015) indicated that depressed Twitter users tend to post tweets containing negative emotional sentiments more than healthy users. AlSagri and Ykhlef (2020) explored multiple ML classification techniques for detecting depression on Twitter using content and activity features. Random forest techniques have also been used for detecting signs of sadness Cacheda et al. (2019). Deep learning approaches are popular for sentiment analysis tasks Babu and Kanaga (2022) and are explored later in this paper. However, none of these works target the contrast between sadness and depression; hence, we create our own dataset, discussed in the next section. Alongside their powerful performance, deep learning models act as black-box prediction systems, so explainability methods become important for interpretable communication of model inference Ribeiro et al. (2016). LIME, a model-agnostic local interpretability method for text classification systems, has become well known for its impactful performance Mardaoui and Garreau (2021). We employ this XAI method to further substantiate our study.
3 Sad-Depression Dataset Creation
3.1 Depression
3.1.1 Data Collection
1. SDCNL dataset: This paper Haque et al. (2021), presented at ICANN 2021, proposed a dataset of depression vs. suicide posts, where the authors scraped data from the subreddits r/SuicideWatch and r/Depression, labelled as suicidal and depressed respectively. Most of the data-points are in a paragraph format with more than 2000 characters. We utilized the depression-labelled posts in this dataset to create the initial set of depression-labelled texts in our dataset.
2. Goodbye World dataset: This work aimed at identifying individuals at risk of suicide Hesamuel. Data collection was similar to that of the SDCNL dataset, using the Reddit API to collect posts from r/SuicideWatch and r/Depression. Unlike SDCNL, the data-points here were mostly single sentences or sentence couplets and thereby better suited to the aim of our study. We used the posts from the depression-labelled file that were not identical to the ones we had already collected.
3. Benchmarking dataset: This dataset was used to benchmark transformer models Basu et al. (2021). It contains two labels: depression/sad (label 1) and non-depression or neutral (label 0). For the purpose of this work, we utilized the data under the depression label.
3.1.2 Paraphrasing
As mentioned earlier, the data collected from the SDCNL dataset mostly contained paragraphs, each with more than 2000 characters, describing the depression context. Since our work aims to understand the distinction of context at the sentence level, we use a paraphrasing approach to obtain a sentence-level representation of each paragraph while retaining the equivalent contextual meaning, with the intention of preserving the depression context. Google's Pegasus transformer model Zhang et al. (2020) was used for this task because of its abstractive summarization capabilities.
3.1.3 Semantic Matching
After paraphrasing, a few of the paraphrased sentences were unable to capture the context of their respective paragraphs accurately. To retain only the accurate sentences, we utilized S-BERT Reimers and Gurevych (2019) to perform semantic matching between the paraphrased sentences and their original paragraphs. An arbitrary threshold value was set for selecting the correct sentences, which were then added to the data from the SDCNL and Goodbye World datasets to form the depression section of our dataset.
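The semantic-matching filter above can be sketched as follows. In the actual pipeline the embeddings come from S-BERT; here the toy vectors and the 0.7 threshold are purely illustrative stand-ins.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def filter_paraphrases(pairs, threshold=0.7):
    """Keep only paraphrased sentences whose embedding is similar enough
    to the embedding of their original paragraph.

    `pairs` maps a paraphrased sentence to a tuple of
    (sentence_embedding, paragraph_embedding), e.g. produced by S-BERT.
    """
    return [sent for sent, (e_s, e_p) in pairs.items()
            if cosine_similarity(e_s, e_p) >= threshold]

# Toy embeddings standing in for S-BERT outputs:
pairs = {
    "good paraphrase": ([1.0, 0.0, 1.0], [0.9, 0.1, 1.1]),
    "bad paraphrase":  ([1.0, 0.0, 0.0], [0.0, 1.0, 0.0]),
}
kept = filter_paraphrases(pairs, threshold=0.7)
```

With real S-BERT embeddings the threshold would be tuned on a held-out sample, as the paper's "arbitrary threshold" suggests.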
3.1.4 Data Annotation and Cleaning
For the final data under the "depression" label, we manually reviewed and annotated the processed data. This was carried out to remove ambiguous samples and to maintain the distinction between the two classes by re-labelling samples that fit better under the "sadness" label. To differentiate between depression and sadness, the authors examined the context of the sentences in question based on the distinction made by the American Psychiatric Association (APA) American Psychiatric Association. After this, cleaning was carried out on the data to remove special characters, external links and '@' tags. The cleaned data was labelled as 1 for depression.
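The cleaning step (removing external links, '@' tags, and special characters) can be sketched with simple regular expressions; the exact patterns used in our pipeline may differ, so these are illustrative.

```python
import re

def clean_text(text):
    """Illustrative cleaning pass: strip links, '@' tags, and special
    characters, then normalize whitespace."""
    text = re.sub(r"https?://\S+|www\.\S+", " ", text)  # external links
    text = re.sub(r"@\w+", " ", text)                   # '@' user tags
    text = re.sub(r"[^A-Za-z0-9' ]+", " ", text)        # special characters
    return re.sub(r"\s+", " ", text).strip()            # collapse whitespace

cleaned = clean_text("@user I can't cope... see https://example.com #help")
```

Apostrophes are kept so that contractions such as "can't" survive cleaning.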
3.2 Sadness
3.2.1 Data Collection
1. GoEmotions Dataset: Created by Google Research Demszky et al. (2020), GoEmotions is a human-annotated dataset of 58k Reddit comments extracted from popular English-language subreddits and labelled with 28 emotion categories: 12 positive, 11 negative, 4 ambiguous, and 1 "neutral". Within the negative categories, we picked comments labelled under sadness for the "sadness" label of our dataset.
2. NLP-text-emotion dataset: This dataset Lukasgarbas combines dailydialog Li et al. (2017), isear uni (2022), and emotion-stimulus Ghazi et al. (2015) into a balanced dataset with 5 labels. The texts mainly consist of short messages and dialogue utterances. We utilized it for the sadness-related data in our dataset.
3.2.2 Pseudo-Labelling
Upon inspecting the collected data, we found an issue unique to the sadness label: apart from sadness, it contained sentences of neutral or even positive sentiment. To improve the quality of data under this label, we trained a BERT Devlin et al. (2018) based classification model on the benchmarking dataset to assemble the correct samples. We picked this dataset as it contains neutral or non-depressing/sad samples (label 0) as well as depressing/sad samples (label 1). The trained model was then used to pseudo-label our collected "sadness" data, removing data points closer in sentiment to the neutral data and retaining those with a closer relation to the depression/sad data. This essentially allows us to extract potentially "sad" data from the originally collected samples. Finally, all sentences pseudo-labelled as 1 were collected under the sadness label in our final dataset.
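The pseudo-labelling filter can be sketched as follows. Here `toy_classifier` is a hypothetical keyword heuristic standing in for the BERT model trained on the benchmarking dataset; only its 0/1 output interface matters.

```python
def pseudo_label_filter(sentences, classifier):
    """Keep sentences the auxiliary classifier pseudo-labels as
    sad/depressed (label 1), discarding those closer to neutral (label 0)."""
    return [s for s in sentences if classifier(s) == 1]

# Stand-in for the trained BERT classifier; a keyword heuristic is used
# purely for illustration.
def toy_classifier(sentence):
    sad_markers = ("lonely", "miss", "cry", "lost")
    return 1 if any(m in sentence.lower() for m in sad_markers) else 0

collected = ["I feel so lonely tonight", "The weather is fine today"]
kept = pseudo_label_filter(collected, toy_classifier)
```

Swapping `toy_classifier` for the real model's predict function yields the filtering step described above.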
3.2.3 Data Annotation and Cleaning
For the final data under the "sadness" label, we again manually reviewed and annotated the processed data. As with the depression data, this was done to re-label samples that are closer to depression than sadness and to remove samples with ambiguous context. For uniformity, the same APA source used for the depression data served as the baseline definition while annotating. To retain only "sad" samples, we removed any remaining samples with neutral or positive context, as well as those that did not align with the American Psychiatric Association's definition of sadness. Finally, cleaning was carried out to remove special characters, external links and '@' tags. The cleaned data was labelled as 0 for sadness.
3.3 Final Dataset
Finally, we combined all the depression-labelled and sadness-labelled data-points into a single dataset of 3256 samples, of which 1914 samples are labelled "sadness" (label 0) and 1342 samples "depression" (label 1). This corresponds roughly to a 0.58-0.42 class ratio, favouring effective training of classification models.
4 Interpretability of Federated Models
The rise in the use of machine learning models such as deep neural networks for intent mining has led to more accurate predictions. Explainable AI provides a set of methods to help us understand and interpret a model's decisions: in order to trust a model's predictions, one must be able to interpret the reasons behind them. Several attempts have been made to explain text classification by language models. State-of-the-art model-agnostic explanation methods such as LIME (Local Interpretable Model-Agnostic Explanations) have been used to enable better visualization and analysis of AI models.
5 Experimental Results
LIME is model-agnostic in nature; that is, one can use it with any machine learning model, allowing interpretability to be supplemented to benchmark models. In our venture to integrate explainable AI for sadness-depression classification, we evaluated multiple models for performance comparison.
5.1 Baseline Results
We initially conduct a detailed benchmark inference over BERT and RoBERTa classification models. We provide these results in Table 1.
| Model Name | Accuracy |
| --- | --- |
| BERT | 91.9% |
| RoBERTa | 96.62% |
From the performance presented in Table 1, we can see that RoBERTa is the best-performing model and thus the appropriate candidate for further exploration using explainable AI. Alongside RoBERTa, we also examine the performance of BERT as a control for our analysis.
5.2 Federated Setting Results
We further reproduce these results in a federated setting, employing the federated averaging (FedAvg) algorithm; these results are presented in Table 2. The table reports results in both an IID setting and a non-IID setting, allowing us to substantiate the use of federated learning for privacy-enabled classification of this sensitive data.
| Model Name | IID Setting | Non-IID Setting |
| --- | --- | --- |
| BERT | 75.07% | 73.6% |
| RoBERTa | 81.85% | 83.66% |
For our experiments, we synthetically simulate K clients (the total number of clients), with a fraction C of the clients randomly chosen to perform computation in each round, and E training passes made by each client over its local dataset per round. These parameters were fixed for our inferential setting. Having produced successful convergence with satisfactory performance, we can claim the utility of federated learning and explainable AI for sadness-depression classification.
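The FedAvg aggregation step at the heart of this setting can be sketched as follows; the flattened parameter vectors and client sample counts below are toy stand-ins for the transformer weights and local dataset sizes.

```python
def federated_average(client_weights, client_sizes):
    """FedAvg aggregation: average each parameter across clients,
    weighted by the number of local samples each client trained on."""
    total = sum(client_sizes)
    num_params = len(client_weights[0])
    return [
        sum(w[i] * n for w, n in zip(client_weights, client_sizes)) / total
        for i in range(num_params)
    ]

# Two clients' (flattened) parameter vectors after local training;
# client B holds three times as much data as client A.
w_a, w_b = [1.0, 2.0], [3.0, 4.0]
global_w = federated_average([w_a, w_b], client_sizes=[1, 3])
```

Each communication round, the server sends `global_w` back to the sampled fraction of clients for the next E local epochs.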
We also demonstrate the utility and inference that LIME brings to this federated system. In a study on topics as delicate as depression classification, it becomes integral that overseeing authorities/deployers understand the inferential steps the model may be taking. We supplement samples of explainability for sadness and depression detection in Figure 2 and Figure 3. Figure 2 deliberates on the case of sadness detection for the sentence "I feel so emotional today," where LIME interprets the model's decision through perturbed-word analysis. As one can see, the words "today" and "i" hold significant weightage in the model's inference of class label 0, or SAD. On the other hand, Figure 3 deliberates on the case of depression detection for the sentence "If I had a gun, I'd blow my fu*king brains out right now," where LIME distinctly highlights the words "brains", "gun", and "fu*king", which significantly push the model's prediction towards class label 1, or DEPRESSION. Certain words like "now," "had," and "if" had an opposing effect; however, their impact was not as consequential as that of the aforementioned highlighted words. Employing interpretability therefore allows more accessible and reliable affirmation of model predictions during deployment, even within a federated setting, thereby substantiating the experimental setting.
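The perturbation idea underlying LIME can be illustrated with a minimal word-omission sketch (this is not the LIME algorithm itself, which fits a local linear surrogate over many random perturbations): each word is removed in turn and the drop in the classifier's positive-class probability is recorded. `toy_predict` is a hypothetical stand-in for the trained RoBERTa classifier's probability output.

```python
def word_importance(sentence, predict):
    """Perturbation sketch: score each word by the drop in the model's
    positive-class probability when that word is removed."""
    words = sentence.split()
    base = predict(sentence)
    importance = {}
    for i, w in enumerate(words):
        perturbed = " ".join(words[:i] + words[i + 1:])
        importance[w] = base - predict(perturbed)
    return importance

# Toy probability function standing in for the RoBERTa classifier.
def toy_predict(text):
    score = 0.1
    for marker in ("gun", "brains"):
        if marker in text:
            score += 0.4
    return score

scores = word_importance("had a gun", toy_predict)
```

Under this toy scorer, removing "gun" lowers the predicted probability while removing "had" or "a" does not, mirroring how LIME separates consequential words from opposing or neutral ones.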
6 Conclusion
Sadness and depression are both sensitive attributes of one's life. Social media has given people a new platform to vent their feelings and, at the same time, an opportunity to detect impactful signs early on. The application of SOTA interpretability methods to this assortment of data will pave a path towards their adoption in real-world implementations. Our use of LIME, which quantifies local model explanations, allows us to display their importance and relevance in depression-sadness classification. For future work, we would like to investigate further explainability techniques, as well as techniques such as lexical normalization to reduce noise, and measure their impact on our predictions. We believe that our work serves as a valuable resource for sadness and depression classification, and its integration with explainable as well as privacy-preserving methods paves a new path for future research and development.
References
- uni (2022) 2022. Swiss Center for Affective Sciences.
- AlSagri and Ykhlef (2020) Hatoon S AlSagri and Mourad Ykhlef. 2020. Machine learning-based approach for depression detection in twitter using content and activity features. IEICE Transactions on Information and Systems, 103(8):1825–1832.
- American Psychiatric Association (APA). What is depression?
- Babu and Kanaga (2022) Nirmal Varghese Babu and E Kanaga. 2022. Sentiment analysis in social media data for depression detection using artificial intelligence: A review. SN Computer Science, 3(1):1–20.
- Basu et al. (2021) Priyam Basu, Tiasa Singha Roy, Rakshit Naidu, Zumrut Muftuoglu, Sahib Singh, and Fatemehsadat Mireshghallah. 2021. Benchmarking differential privacy and federated learning for bert models. arXiv preprint arXiv:2106.13973.
- Cacheda et al. (2019) Fidel Cacheda, Diego Fernandez, Francisco J Novoa, Victor Carneiro, et al. 2019. Early detection of depression: social network analysis and random forest techniques. Journal of medical Internet research, 21(6):e12554.
- Canadian Mental Health Association (2022) BC Division Canadian Mental Health Association. 2022. Difference between sadness and depression.
- Demszky et al. (2020) Dorottya Demszky, Dana Movshovitz-Attias, Jeongwoo Ko, Alan Cowen, Gaurav Nemade, and Sujith Ravi. 2020. Goemotions: A dataset of fine-grained emotions. arXiv preprint arXiv:2005.00547.
- Devlin et al. (2018) Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805.
- Ghazi et al. (2015) Diman Ghazi, Diana Inkpen, and Stan Szpakowicz. 2015. Detecting emotion stimuli in emotion-bearing sentences. In International Conference on Intelligent Text Processing and Computational Linguistics, pages 152–165. Springer.
- Haque et al. (2021) Ayaan Haque, Viraaj Reddi, and Tyler Giallanza. 2021. Deep learning for suicide and depression identification with unsupervised label correction. In International Conference on Artificial Neural Networks, pages 436–447. Springer.
- Hesamuel. Hesamuel/goodbye_world.
- Holmes (2021) Leonard Holmes. 2021. Differences between sadness and clinical depression.
- Li et al. (2017) Yanran Li, Hui Su, Xiaoyu Shen, Wenjie Li, Ziqiang Cao, and Shuzi Niu. 2017. Dailydialog: A manually labelled multi-turn dialogue dataset. In Proceedings of The 8th International Joint Conference on Natural Language Processing (IJCNLP 2017).
- Lukasgarbas. Lukasgarbas/nlp-text-emotion: Multi-class sentiment analysis lstm, finetuned bert.
- Mardaoui and Garreau (2021) Dina Mardaoui and Damien Garreau. 2021. An analysis of lime for text data. In Proceedings of The 24th International Conference on Artificial Intelligence and Statistics, volume 130 of Proceedings of Machine Learning Research, pages 3493–3501. PMLR.
- Park et al. (2015) Gregory Park, H Andrew Schwartz, Johannes C Eichstaedt, Margaret L Kern, Michal Kosinski, David J Stillwell, Lyle H Ungar, and Martin EP Seligman. 2015. Automatic personality assessment through social media language. Journal of personality and social psychology, 108(6):934.
- Praveen (2020) Praveen. 2020. Emotions dataset for nlp.
- Reimers and Gurevych (2019) Nils Reimers and Iryna Gurevych. 2019. Sentence-bert: Sentence embeddings using siamese bert-networks. arXiv preprint arXiv:1908.10084.
- Ribeiro et al. (2016) Marco Tulio Ribeiro, Sameer Singh, and Carlos Guestrin. 2016. "Why should i trust you?": Explaining the predictions of any classifier. arXiv preprint arXiv:1602.04938.
- Saravia et al. (2018) Elvis Saravia, Hsien-Chi Toby Liu, Yen-Hao Huang, Junlin Wu, and Yi-Shin Chen. 2018. CARER: Contextualized affect representations for emotion recognition. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 3687–3697. Association for Computational Linguistics.
- Team (2022) SingleCare Team. 2022. Statistics about depression in the u.s.
- World Health Organization (WHO). Depression data.
- Zhang et al. (2020) Jingqing Zhang, Yao Zhao, Mohammad Saleh, and Peter Liu. 2020. Pegasus: Pre-training with extracted gap-sentences for abstractive summarization. In International Conference on Machine Learning, pages 11328–11339. PMLR.