Skip to content

This repository contains the dataset 'DEPTWEET' published in the journal of Computers in Human Behavior.

License

Notifications You must be signed in to change notification settings

mohsinulkabir14/DEPTWEET

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 

Repository files navigation

DEPTWEET: A Typology for Social Media Texts to Detect Depression Severities

This repository contains the code, data, and models of the paper titled "DEPTWEET: A Typology for Social Media Texts to Detect Depression Severities" published in the journal of Computers in Human Behavior (Q1, H-Index: 226, Impact Factor: ~10).

arXiv GoogleScholar ResearchGate

Data

The data folder contains tweet IDs corresponding to severities of depression collected from Twitter. Owing to Twitter's policy on data sharing, we can not directly upload the dataset and are only restricted to sharing tweet-id's here. Users can rehydrate this dataset using hydrator using the tweet-id's.
In case of missing tweets or difficulties in extracting tweets, readers may fill up and sign the scanned copy of the Data Usage Agreement (DUA) available in this link and send to (mohsinulkabir@iut-dhaka.edu). Authors are committed to sharing the dataset in the shortest possible time once they receive the signed DUA.

Code

Code for obtaining the baseline result is available in this repo.

Citation

If you find our work and dataset useful, please cite our Paper.

@article{kabir2022deptweet,
title = {{DEPTWEET: A typology for social media texts to detect depression severities}},
journal = {{Computers in Human Behavior}},
pages = {107503},
year = {2022},
issn = {0747-5632},
doi = {10.1016/j.chb.2022.107503},
url = {https://www.sciencedirect.com/science/article/pii/S0747563222003235},
author = {Mohsinul Kabir and Tasnim Ahmed and Md. Bakhtiar Hasan and Md Tahmid Rahman Laskar and Tarun Kumar Joarder and Hasan Mahmud and Kamrul Hasan},
keywords = {Social media, Mental health, Depression severity, Dataset},
abstract = {Mental health research through data-driven methods has been hindered by a lack of standard typology and scarcity of adequate data. In this study, we leverage the clinical articulation of depression to build a typology for social media texts for detecting the severity of depression. It emulates the standard clinical assessment procedure Diagnostic and Statistical Manual of Mental Disorders (DSM-5) and Patient Health Questionnaire (PHQ-9) to encompass subtle indications of depressive disorders from tweets. Along with the typology, we present a new dataset of 40191 tweets labeled by expert annotators. Each tweet is labeled as ‘non-depressed’ or ‘depressed’. Moreover, three severity levels are considered for ‘depressed’ tweets: (1) mild, (2) moderate, and (3) severe. An associated confidence score is provided with each label to validate the quality of annotation. We examine the quality of the dataset via representing summary statistics while setting strong baseline results using attention-based models like BERT and DistilBERT. Finally, we extensively address the limitations of the study to provide directions for further research.}
}

About

This repository contains the dataset 'DEPTWEET' published in the journal of Computers in Human Behavior.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published