Research Shows Twitter Activity Can Predict Rates Of Heart Disease

January 25, 2015

165

Twitter has broken news stories, launched and ended careers, started social movements and toppled governments, all by being an easy, direct and immediate way for people to share what’s on their minds. Now, researchers have shown that the social media platform has another use: Twitter can serve as a dashboard indicator of a community’s psychological well being and can predict rates of heart disease.

Previous studies have identified many factors that contribute to the risk of heart disease: traditional ones, like low income or smoking but also psychological ones, like stress. In the new study, researchers from the University of Pennsylvania demonstrated that Twitter can capture more information about heart disease risk than many traditional factors combined, as it also characterizes the psychological atmosphere of a community.

They found that expressions of negative emotions — such as anger, stress, and fatigue — in a county’s tweets were associated with higher heart disease risk. On the other hand, positive emotions like excitement and optimism were associated with lower risk. The team published their findings this week in the journal Psychological Science.

Language choices on Twitter provide rich source of data

Researchers have long assumed that the psychological well being of communities is important for physical health, but is hard to measure. Using Twitter as a window into a community’s collective mental state may provide a useful tool in epidemiology and for measuring the effectiveness of public health interventions.

With billions of users writing daily about their daily experiences, thoughts and feelings, the world of social media represents a new frontier for psychological research. Such data could be an invaluable public health tool if able to be tied to real-world outcomes. With this in mind, the researchers from the World Well-Being Project have long been studying the degree to which the language people use online represents their inner thoughts and feelings.

As there is no way to directly measure peoples’ inner emotional lives, the team drew on traditions in psychological research that glean this information from the words people use when speaking or writing. Earlier research from the group has shown that such linguistic analysis can work as well as traditional questionnaires in assessing an individual’s personality.

“Getting this data through surveys is expensive and time consuming, but, more important, you’re limited by the questions included on the survey,” says lead author Johannes Eichstaedt, a graduate student in the Department of Psychology at Penn. “You’ll never get the psychological richness that comes with the infinite variables of what language people choose to use.”

Connections between Twitter language, emotions, and heart disease risk

Having seen correlations between language and emotional states, the researchers went on to see if they could show connections between those emotional states and physical outcomes rooted in them. They had an ideal candidate in coronary heart disease, the leading cause of death worldwide.

“Psychological states have long been thought to have an effect on coronary heart disease,” said co-author Dr. Margaret Kern, an assistant professor at the University of Melbourne, Australia. “For example, hostility and depression have been linked with heart disease at the individual level through biological effects. But negative emotions can also trigger behavioral and social responses; you are also more likely to drink, eat poorly and be isolated from other people which can indirectly lead to heart disease.”

As a common cause of early mortality, public health officials carefully count when heart disease is identified as the underlying cause on death certificates. They also collect meticulous data about possible risk factors, such as rates of smoking, obesity, hypertension and lack of exercise. This data is available on a county-by-county level in the United States, so the research team aimed to match this physical epidemiology with their digital Twitter version.

Drawing on a set of public tweets made between 2009 and 2010, the researchers used established emotional dictionaries, as well as automatically generated clusters of words reflecting behaviors and attitudes, to analyze a random sample of tweets from individuals who had made their locations available. The analysis included tweets and health data from about 1,300 counties, which contained 88 percent of the country’s population.

Positive and negative emotional language predict heart disease deaths

They found that negative emotional language and topics, such as words like “hate” or expletives, remained strongly correlated with heart disease mortality, even after variables like income and education were taken into account. Positive emotional language showed the opposite correlation, suggesting that optimism and positive experiences, words like “wonderful” or “friends,” may be protective against heart disease. (Indeed, a recent study found that people with upbeat outlooks on life have significantly better cardiovascular health than those with less optimistic outlooks).

The researchers compared the language of tweets and CDC data on a county-by-county level. As you can see, Twitter data provided a close estimate of heart disease deaths.

The researchers compared the language of tweets and CDC data on a county-by-county level. As you can see, Twitter data very accurately predicted the geographical distribution of heart disease deaths.

“The relationship between language and mortality is particularly surprising,” noted Eichstaedt, “since the people tweeting angry words and topics are in general not the ones dying of heart disease. But that means if many of your neighbors are angry, you are more likely to die of heart disease.”

This finding fits into existing sociological research that suggests that the combined characteristics of communities can be more predictive of physical health than the reports of any one individual.

“We believe that we are picking up more long-term characteristics of communities,” said Eichstaedt. “The language may represent the ‘drying out of the wood’ rather than the ‘spark’ that immediately leads to mortality. We can’t predict the number of heart attacks a county will have in a given timeframe, but the language may reveal places to intervene.”

‘Predictions from Twitter can be more accurate than traditional indicators’

Other caveats to the method’s predictive power include the social factors that influence what kinds of messages people choose to share on Twitter. “If everyone is a little more positive on Twitter than they are in real life, however, we would still see variation from location to location, which is what we’re most interested in,” Eichstaedt noted.

This variation could be used to marshal evidence of the effectiveness of public health interventions on the community level, rather than on an individual level. The team’s findings show that these tweets are aggregating information about people that can’t be readily accessed in other ways.

“Twitter seems to capture a lot of the same information that you get from health and demographic indicators,” said Dr. Kern, “but it also adds something extra. So predictions from Twitter can actually be more accurate than using a set of traditional variables.”

Indeed, researchers from a variety of scientific disciplines are increasingly turning to Twitter as a source of valuable quantitative and qualitative data. In one of the first studies of its kind, published in 2011, a team from Johns Hopkins analyzed more than two billion tweets for health-related terms and found that the social media platform can serve as a sort of window into the health of its users, yielding information with potentially important public health implications.

Although the JHU study was intended primarily as a “proof of concept,” in order to show that filtering information from Twitter could produce valuable data, the researchers said they uncovered some intriguing patterns about everything from allergies and the flu to other illnesses and ailments such as cancer, obesity and depression. And because many people posted details of the medications they were using to treat themselves, they discovered a number of users were taking antibiotics to treat the flu, even though antibiotics don’t work on the flu — something that could be a potential public-health issue.

Twitter is also useful during a crisis, when information needs to be moved quickly and efficiently around social networks — researchers can even detect emergency events like earthquakes by monitoring Twitter. The social media platform has also proven useful for economists, who have developed programs to predict individual stock trends based on tweets, and it could even help gamblers make more accurate bets on next weekend’s Superbowl game.