Regional personality assessment through social media language

Salvatore Giorgi, Khoa Le Nguyen, Johannes C. Eichstaedt, Margaret L. Kern, David B. Yaden, Michal Kosinski, Martin E.P. Seligman, Lyle H. Ungar, H. Andrew Schwartz, Gregory Park

Research output: Contribution to journal › Article › peer-review


Objective: We explore the personality of counties as assessed through linguistic patterns on social media. Such studies were previously limited by the cost and feasibility of large-scale surveys; however, language-based computational models applied to large social media datasets now allow for large-scale personality assessment.

Method: We applied a language-based assessment of the five factor model of personality to 6,064,267 U.S. Twitter users. We aggregated the Twitter-based personality scores to 2,041 counties and compared them with political, economic, social, and health outcomes measured through surveys and by government agencies.

Results: There was significant personality variation across counties. Openness to experience was higher on the coasts, conscientiousness was uniformly spread, extraversion was higher in southern states, agreeableness was higher in western states, and emotional stability was highest in the south. Across 13 outcomes, language-based personality estimates replicated patterns that have been observed in individual-level and geographic studies. These include higher Republican vote share in less agreeable counties and increased life satisfaction in more conscientious counties.

Conclusions: Results suggest that regions vary in their personality and that these differences can be studied through computational linguistic analysis of social media. Furthermore, these methods may be used to explore other psychological constructs across geographies.

Original language: English (US)
Journal: Journal of Personality
State: Accepted/In press - 2021


Keywords

  • big data
  • language
  • measurement
  • personality assessment
  • social media

ASJC Scopus subject areas

  • Social Psychology

