COVID-19 coronavirus In the news Tools & methods

What Twitter can tell us about the health of a community

Adi Gaskell21 Jul 2020

395 2 minutes read

Originally posted on The Horizons Tracker.

Social media has often been regarded as a wholly imperfect means of understanding any event or community, if for no other reason than a relatively small proportion of any community will be using the platform. Nonetheless, new research¹ from Stanford University advocates using Twitter and AI to reveal the psychological health of a community.

The researchers acknowledge that Twitter doesn’t provide a representative sample of the population, but nonetheless believe it’s useful. The researchers analyzed around a billion geo-tagged tweets sent between 2009 and 2015. The tweets were compared to 1.7 million responses sent to the Gallup-Sharecare Wellbeing Index.

Surveys such as that done by Gallup have historically been a central part of attempts to understand a population’s wellbeing. They are accurate, but require a lot of time and money to undertake, with some surveys taking years to garner a robust response. The hope is that data from social media can help augment this data and alleviate some of the burden involved in collecting survey data.

Reliable data

The researchers trained an algorithm to assess both the responses to the survey and posts made on social media from the same people to try and understand key similarities in style and content. This has traditionally been difficult, as online slang, such as LOL, might ordinarily be seen as a positive expression, yet is also associated with lower income areas.

Similarly, words such as ‘homework’ and ‘taxes’ can appear negative, yet are also commonly associated with higher income areas. As such, it’s important when using language to measure wellbeing that these cultural differences are understood.

The machine learning algorithm helps to do that, and the AI found that phrases such as LOL were not a good indicator of our wellbeing, and instead proposed words such as ‘fun’ and ‘excited’.

“Having the computer learn the words may be the best way to find words that measure well-being,” the researchers say. “Differences in language use can be quite complex.”

This is important, as wellbeing is a complex thing, and can be associated with a wide range of other factors, including our overall health. For instance, the researchers note that stress or depression are strongly linked to excessive drinking or smoking, which have clear implications for overall health.

They also believe that the approach they’re taking could be useful in such fast moving crises as the current COVID-19 pandemic, and help provide researchers with real-time insights into the health of a community.

“COVID-19 is a natural disaster that interrupts our social norms and routines at an unprecedented scale,” they conclude. “With this real-time Twitter-based technology, psychologists can monitor if loneliness and anxiety are taking hold in communities, and how our well-being is impacted by social distancing.”

Article source: What Twitter Can Tell Us About The Health Of A Community.

Header image source: Edar on Pixabay, Public Domain.

Reference:

Jaidka, K., Giorgi, S., Schwartz, H. A., Kern, M. L., Ungar, L. H., & Eichstaedt, J. C. (2020). Estimating geographic subjective well-being from Twitter: A comparison of dictionary and data-driven language methods. Proceedings of the National Academy of Sciences, 117(19), 10165-10171. ↩

Rate this post

Cookie	Duration	Description
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.

Reliable data

Adi Gaskell

Related Articles

How Twitter can help track the spread of COVID-19

Global smartphone ownership and internet usage

ChatGPT is a data privacy nightmare. If you’ve ever posted online, you ought to be concerned

Certification program supports US local governments in using data and evidence