Skip navigation
Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01s4655k03t
Title: Identifying Signs of Depression on Twitter
Authors: Tang, Eugene
Advisors: Fellbaum, Christiane
Department: Computer Science
Class Year: 2016
Abstract: Affecting 350 million people worldwide, depression is a source of serious costs to both personal and public well-being. However, most individuals with depression do not receive proper treatment. One possible way to address this issue is through automatic screening of individuals for depression. In this study we explore the possibility of automatically detecting individuals with depression through their behavior on Twitter. We build off of previous studies in the following ways. First, we identify a group of depressed and non-depressed users through Twitter’s API. Next, we explore the effect of additional features—topic-based features, removing retweets, and standardization of tweets—on the predictive ability of the logistic regression and SVM models to differentiate between depressed and non-depressed users. Using data collected at two different time points, we test the robustness of our model over time and over different incidence rates. Finally, we explore the possible implementations of a screening tool based on our work. Our findings demonstrate, first, the feasibility of discriminating depressed from non-depressed users through Twitter’s API, and second, the relative robustness of our model over time, although our additional features do not make a large difference in predictive power. However, our classifier’s performance significantly decreases when the incidence rate of the testing dataset is decreased to a more realistic level of 7.6%. This finding indicates that the incidence rate of depression in the training and testing datasets is an important additional factor to consider in future studies. Overall, although much work remains to be done before such a tool could be implemented, our work provides additional evidence that it may indeed be feasible to identify individuals struggling with depression through their behavior on Twitter.
Extent: 95 pages
URI: http://arks.princeton.edu/ark:/88435/dsp01s4655k03t
Type of Material: Princeton University Senior Theses
Language: en_US
Appears in Collections:Computer Science, 1988-2016

Files in This Item:
File SizeFormat 
Tan_Eugene_2016_Thesis.pdf1.67 MBAdobe PDF    Request a copy


Items in Dataspace are protected by copyright, with all rights reserved, unless otherwise indicated.