March 27, 2014

Estimating county health statistics by looking at tweets

by Illinois Institute of Technology

A researcher at Illinois Institute of Technology (IIT) has found that Twitter knows if you're obese—or at least, if your county is. Tweets can accurately predict a county's rates of obesity, diabetes, teen births, health insurance coverage, and access to health foods, according to Aron Culotta, assistant professor of computer science and director of the Text Analysis in the Public Interest Lab. As a result, Twitter and other social media may complement other data sources for public health officials to identify at-risk communities and offer support. Culotta will report his findings in a paper, "Estimating County Health Statistics with Twitter," to be given at CHI 2014, the ACM (Association for Computing Machinery) CHI Conference on Human Factors in Computing Systems, April 26-May 1 in Toronto.

For each of the 100 most populous counties in the U.S., Culotta collected 27 health-related statistics. He also collected more than 1.4 million Twitter user proﬁles and 4.3 million Tweets over a nine-month span from the same 100 counties. He then performed a statistical analysis to identify how accurately the health outcomes can be predicted from the Twitter data and which linguistic markers are most predictive of each statistic.

Among other things, Culotta found the Tweets predicted county-level health statistics for 6 of 27 statistics, including obesity, diabetes, teen births, health insurance coverage, and access to healthy foods. Models that augmented demographic variables (race, age, gender, income) with linguistic variables (from Twitter) were more accurate than models using demographic variables alone for 20 of the 27 health statistics considered. That is, the Twitter data helped to make the traditional models more accurate, suggesting that this new methodology can complement existing approaches. For two statistics—limited access to health foods and prevalence of fast foods—the Twitter model alone was actually more accurate than the demographic variable model.

Analysis of social media for most health concerns such as influenza focus on detecting specific mentions of a symptom of interest—e.g., "Staying home from work today with a sore throat." But Culotta investigated more nuanced linguistic cues that correlate with the overall health of a population. He identified the linguistic indicators that are most predictive of each outcome. For example, references to religion and certain pronouns ("we", "her") correlate with better socio-emotional support. References to money and inhibition correlate with lower unemployment. References to family and love correlate with higher rates of teen births. For obesity, indicators include what are known as Negative Engagement words (e.g., "tired", "bored", "sleepy"), as well as profanity.

"Twitter activity provides a more ﬁne-grained representation of a community's health than demographics alone," Culotta said. "The reason for this appears to come from the insights Twitter provides into personality, attitudes, and behavior, which in turn correlate health outcomes.

The U.S. Centers for Disease Control and Prevention lead community health data collection and intervention efforts such as the Behavioral Risk Factor Surveillance System to identify vulnerable populations to better target intervention strategies. But such programs take considerable time and often are limited in sample size or geographic specificity. Culotta's research suggests that social media could be a complementary data source to identify at-risk communities.

Culotta said, "While this new methodology requires further experimentation, we believe it can aid public health researchers by providing (1) a more nuanced alternative to demographic proﬁles for identifying at-risk populations; (2) a low-cost method to measure risk across different subpopulations; (3) a process to help formulate new hypotheses about the relationship between environment, behaviors, and health outcomes, which can then be tested in a more controlled setting."

Provided by Illinois Institute of Technology

Citation: Estimating county health statistics by looking at tweets (2014, March 27) retrieved 11 July 2024 from https://medicalxpress.com/news/2014-03-county-health-statistics-tweets.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Twitter 'big data' can be used to monitor HIV and drug-related behavior, study shows

Feedback to editors

Coordinated activity of mossy cells contributes to encoding of spatial and contextual memories, study finds

2 hours ago

Blood fat profiles confirm health benefits of replacing butter with high-quality plant oils

3 hours ago

Major trial looks at most effective speech therapy for people with Parkinson's disease

14 hours ago

Models show promise in predicting cognitive decline in early Alzheimer's

16 hours ago

New material derived from graphene improves the performance of neuroprostheses

18 hours ago

Discovery could help with early detection of vision loss in age-related macular degeneration

18 hours ago

New Co-STAR T cells show promise for treating cancers in laboratory study

18 hours ago

Microproteins exclusively produced in liver tumors could lead to cancer vaccines

18 hours ago

Scientists demonstrate a combination treatment can increase human insulin-producing cells in vivo

18 hours ago

Cognitive skills in early toddlerhood: Study demonstrates importance of 16-months

19 hours ago

Load comments (0)

Estimating county health statistics by looking at tweets

Coordinated activity of mossy cells contributes to encoding of spatial and contextual memories, study finds

Blood fat profiles confirm health benefits of replacing butter with high-quality plant oils

Major trial looks at most effective speech therapy for people with Parkinson's disease

Models show promise in predicting cognitive decline in early Alzheimer's

New material derived from graphene improves the performance of neuroprostheses

Discovery could help with early detection of vision loss in age-related macular degeneration

New Co-STAR T cells show promise for treating cancers in laboratory study

Microproteins exclusively produced in liver tumors could lead to cancer vaccines

Scientists demonstrate a combination treatment can increase human insulin-producing cells in vivo

Cognitive skills in early toddlerhood: Study demonstrates importance of 16-months

Twitter 'big data' can be used to monitor HIV and drug-related behavior, study shows

IBM researchers' algorithm explores tweets for home location cues

Tweets can help track national health trends—and now local ones too

Suicidal talk on Twitter mirrors suicide rates

Study finds more tweets means more votes for political candidates

Apple buys analytics firm for $200 mn: report (Update)

Blood fat profiles confirm health benefits of replacing butter with high-quality plant oils

Feedback plus cash incentives reduce phone use while driving, researchers discover

New period product offers progress in women's health

Maintaining prediabetic status after diagnosis results in better long-term health, study finds

Study: American diets got briefly healthier, more diverse during COVID-19 pandemic

Gut microbe could hold key to help people benefit from healthy foods

Phys.org

Tech Xplore

Science X

Estimating county health statistics by looking at tweets

Coordinated activity of mossy cells contributes to encoding of spatial and contextual memories, study finds

Blood fat profiles confirm health benefits of replacing butter with high-quality plant oils

Major trial looks at most effective speech therapy for people with Parkinson's disease

Models show promise in predicting cognitive decline in early Alzheimer's

New material derived from graphene improves the performance of neuroprostheses

Discovery could help with early detection of vision loss in age-related macular degeneration

New Co-STAR T cells show promise for treating cancers in laboratory study

Microproteins exclusively produced in liver tumors could lead to cancer vaccines

Scientists demonstrate a combination treatment can increase human insulin-producing cells in vivo

Cognitive skills in early toddlerhood: Study demonstrates importance of 16-months

Related Stories

Twitter 'big data' can be used to monitor HIV and drug-related behavior, study shows

IBM researchers' algorithm explores tweets for home location cues

Tweets can help track national health trends—and now local ones too

Suicidal talk on Twitter mirrors suicide rates

Study finds more tweets means more votes for political candidates

Apple buys analytics firm for $200 mn: report (Update)

Recommended for you

Blood fat profiles confirm health benefits of replacing butter with high-quality plant oils

Feedback plus cash incentives reduce phone use while driving, researchers discover

New period product offers progress in women's health

Maintaining prediabetic status after diagnosis results in better long-term health, study finds

Study: American diets got briefly healthier, more diverse during COVID-19 pandemic

Gut microbe could hold key to help people benefit from healthy foods

Newsletter sign up

Donate and enjoy an ad-free experience