Creating screening questionnaires from archival social media data
In recent years, the internet has become a primary source of medical information. Reddit
is an example of this. Reddit is a popular social networking site that allows sharing text and media as posts and commenting on them through topic-specific forums. The forums span a gamut of topics including politics, health, and pictures of cute cats.
In my thesis, I utilized the data from Reddit to create a generic semi-automatic pipeline that, given a disease with long diagnosis time, creates medical screening questionnaires for doctors and laypeople that can be used for early detection of this disease.