Capturing distinctions while mining text data: Toward low-tech formalization for text analysis

Ronald L Breiger, Robin Wagner-Pacifici, John W. Mohr

Research output: Contribution to journal > Article

4 Citations (Scopus)

Abstract

In this article we consider some low-tech approaches to text mining. Our goal is to articulate a RiCH (Reader in Control of Hermeneutics) style of text analysis that takes advantage of the digital affordances of modern reading practices and easily deployable computational tools while also preserving the primacy of the interpretive lens of the human reader. In the article we offer three analytical interventions that are suitable to the low-tech formalizations we propose: the first and most developed intervention tracks the (normally computationally ignored) “stop” words; the second identifies the use of strategic anxiety terms in the texts; and the third (less developed in this article) introduces the grammatical features of modality (including modalization statements of probability and usuality, and modulation statements regarding degrees of obligation and inclination). All three analytical interventions provide a productive tracking of various modes and degrees of strategic decisiveness, contradiction, uncertainty and indeterminacy in a corpus of recent U.S. National Security Strategy reports.
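
As a rough illustration of the first intervention described in the abstract (tracking "stop" words rather than discarding them), here is a minimal Python sketch. It is not the authors' procedure: the stop list, the tokenizer, the per-1,000-token normalization, and the sample sentences are all assumptions chosen only to show the general idea of keeping stop-word counts as data for the human reader to interpret.

```python
import re
from collections import Counter

# Hypothetical stop list for illustration only; not the list used in the article.
STOP_WORDS = {"the", "of", "and", "we", "must", "will", "not", "but"}

def stop_word_profile(text, stop_words=STOP_WORDS):
    """Return {stop_word: occurrences per 1,000 tokens} for one document."""
    # Simple lowercase tokenizer (an assumption; any tokenizer could be swapped in).
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens:
        return {}
    counts = Counter(t for t in tokens if t in stop_words)
    # Normalize by document length so documents of different sizes are comparable.
    return {w: 1000 * counts[w] / len(tokens) for w in sorted(stop_words)}

# Invented sample sentences standing in for documents in a corpus.
docs = {
    "report_a": "We must act, but we will not act alone.",
    "report_b": "The strategy of the nation and the world.",
}
profiles = {name: stop_word_profile(text) for name, text in docs.items()}
```

Comparing such profiles across documents gives the reader a simple table to inspect alongside the texts themselves; the interpretation is left to the reader.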

Original language: English (US)
Journal: Poetics
DOI: 10.1016/j.poetic.2018.02.005
State: Accepted/In press - Jan 1 2018

Keywords

  • Big data
  • Close reading
  • Computational sociology
  • Hermeneutics
  • National security
  • Text mining

ASJC Scopus subject areas

  • Cultural Studies
  • Language and Linguistics
  • Communication
  • Sociology and Political Science
  • Linguistics and Language
  • Literature and Literary Theory

Cite this

Capturing distinctions while mining text data: Toward low-tech formalization for text analysis. / Breiger, Ronald L; Wagner-Pacifici, Robin; Mohr, John W.

In: Poetics, 01.01.2018.

@article{fb1a3b948d3d48cfacd895355f36a1c6,
title = "Capturing distinctions while mining text data: Toward low-tech formalization for text analysis",
abstract = "In this article we consider some low-tech approaches to text mining. Our goal is to articulate a RiCH (Reader in Control of Hermeneutics) style of text analysis that takes advantage of the digital affordances of modern reading practices and easily deployable computational tools while also preserving the primacy of the interpretive lens of the human reader. In the article we offer three analytical interventions that are suitable to the low-tech formalizations we propose: the first and most developed intervention tracks the (normally computationally ignored) “stop” words; the second identifies the use of strategic anxiety terms in the texts; and the third (less developed in this article) introduces the grammatical features of modality (including modalization statements of probability and usuality, and modulation statements regarding degrees of obligation and inclination). All three analytical interventions provide a productive tracking of various modes and degrees of strategic decisiveness, contradiction, uncertainty and indeterminacy in a corpus of recent U.S. National Security Strategy reports.",
keywords = "Big data, Close reading, Computational sociology, Hermeneutics, National security, Text mining",
author = "Breiger, {Ronald L} and Wagner-Pacifici, Robin and Mohr, {John W.}",
year = "2018",
month = "1",
day = "1",
doi = "10.1016/j.poetic.2018.02.005",
language = "English (US)",
journal = "Poetics",
issn = "0304-422X",
publisher = "Elsevier",

}

TY - JOUR
T1 - Capturing distinctions while mining text data
T2 - Toward low-tech formalization for text analysis
AU - Breiger, Ronald L
AU - Wagner-Pacifici, Robin
AU - Mohr, John W.
PY - 2018/1/1
Y1 - 2018/1/1
N2 - In this article we consider some low-tech approaches to text mining. Our goal is to articulate a RiCH (Reader in Control of Hermeneutics) style of text analysis that takes advantage of the digital affordances of modern reading practices and easily deployable computational tools while also preserving the primacy of the interpretive lens of the human reader. In the article we offer three analytical interventions that are suitable to the low-tech formalizations we propose: the first and most developed intervention tracks the (normally computationally ignored) “stop” words; the second identifies the use of strategic anxiety terms in the texts; and the third (less developed in this article) introduces the grammatical features of modality (including modalization statements of probability and usuality, and modulation statements regarding degrees of obligation and inclination). All three analytical interventions provide a productive tracking of various modes and degrees of strategic decisiveness, contradiction, uncertainty and indeterminacy in a corpus of recent U.S. National Security Strategy reports.

KW - Big data
KW - Close reading
KW - Computational sociology
KW - Hermeneutics
KW - National security
KW - Text mining
UR - http://www.scopus.com/inward/record.url?scp=85045022909&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85045022909&partnerID=8YFLogxK
U2 - 10.1016/j.poetic.2018.02.005
DO - 10.1016/j.poetic.2018.02.005
M3 - Article
AN - SCOPUS:85045022909
JO - Poetics
JF - Poetics
SN - 0304-422X
ER -