A hybrid approach to web forum interactional coherence analysis

Tianjun Fu, Ahmed Abbasi, Hsinchun Chen

Research output: Contribution to journalArticle

24 Citations (Scopus)

Abstract

Despite the rapid growth of text-based computer-mediated communication (CMC), its limitations have rendered the media highly incoherent. This poses problems for content analysis of online discourse archives. Interactional coherence analysis (ICA) attempts to accurately identify and construct CMC interaction networks. In this study, we propose the Hybrid Interactional Coherence (HIC) algorithm for identification of web forum interaction. HIC utilizes a bevy of system and linguistic features, including message header information, quotations, direct address, and lexical relations. Furthermore, several similarity-based methods including a Lexical Match Algorithm (LMA) and a sliding window method are utilized to account for interactional idiosyncrasies. Experiments results on two web forums revealed that the proposed HIC algorithm significantly outperformed comparison techniques in terms of precision, recall, and F-measure at both the forum and thread levels. Additionally, an example was used to illustrate how the improved ICA results can facilitate enhanced social network and role analysis capabilities.

Original languageEnglish (US)
Pages (from-to)1195-1209
Number of pages15
JournalJournal of the American Society for Information Science and Technology
Volume59
Issue number8
DOIs
StatePublished - Mar 2008

Fingerprint

computer-mediated communication
Communication
Social Role
interaction
Linguistics
quotation
content analysis
social network
coherence
Hybrid approach
Interaction
World Wide Web
linguistics
discourse
experiment
Experiments
Computer-mediated communication

ASJC Scopus subject areas

  • Information Systems
  • Library and Information Sciences

Cite this

A hybrid approach to web forum interactional coherence analysis. / Fu, Tianjun; Abbasi, Ahmed; Chen, Hsinchun.

In: Journal of the American Society for Information Science and Technology, Vol. 59, No. 8, 03.2008, p. 1195-1209.

Research output: Contribution to journalArticle

@article{6e68191d0e4147a18b1b99139eb63d23,
title = "A hybrid approach to web forum interactional coherence analysis",
abstract = "Despite the rapid growth of text-based computer-mediated communication (CMC), its limitations have rendered the media highly incoherent. This poses problems for content analysis of online discourse archives. Interactional coherence analysis (ICA) attempts to accurately identify and construct CMC interaction networks. In this study, we propose the Hybrid Interactional Coherence (HIC) algorithm for identification of web forum interaction. HIC utilizes a bevy of system and linguistic features, including message header information, quotations, direct address, and lexical relations. Furthermore, several similarity-based methods including a Lexical Match Algorithm (LMA) and a sliding window method are utilized to account for interactional idiosyncrasies. Experiments results on two web forums revealed that the proposed HIC algorithm significantly outperformed comparison techniques in terms of precision, recall, and F-measure at both the forum and thread levels. Additionally, an example was used to illustrate how the improved ICA results can facilitate enhanced social network and role analysis capabilities.",
author = "Tianjun Fu and Ahmed Abbasi and Hsinchun Chen",
year = "2008",
month = "3",
doi = "10.1002/asi.20827",
language = "English (US)",
volume = "59",
pages = "1195--1209",
journal = "Journal of the Association for Information Science and Technology",
issn = "2330-1635",
publisher = "John Wiley and Sons Ltd",
number = "8",

}

TY - JOUR

T1 - A hybrid approach to web forum interactional coherence analysis

AU - Fu, Tianjun

AU - Abbasi, Ahmed

AU - Chen, Hsinchun

PY - 2008/3

Y1 - 2008/3

N2 - Despite the rapid growth of text-based computer-mediated communication (CMC), its limitations have rendered the media highly incoherent. This poses problems for content analysis of online discourse archives. Interactional coherence analysis (ICA) attempts to accurately identify and construct CMC interaction networks. In this study, we propose the Hybrid Interactional Coherence (HIC) algorithm for identification of web forum interaction. HIC utilizes a bevy of system and linguistic features, including message header information, quotations, direct address, and lexical relations. Furthermore, several similarity-based methods including a Lexical Match Algorithm (LMA) and a sliding window method are utilized to account for interactional idiosyncrasies. Experiments results on two web forums revealed that the proposed HIC algorithm significantly outperformed comparison techniques in terms of precision, recall, and F-measure at both the forum and thread levels. Additionally, an example was used to illustrate how the improved ICA results can facilitate enhanced social network and role analysis capabilities.

AB - Despite the rapid growth of text-based computer-mediated communication (CMC), its limitations have rendered the media highly incoherent. This poses problems for content analysis of online discourse archives. Interactional coherence analysis (ICA) attempts to accurately identify and construct CMC interaction networks. In this study, we propose the Hybrid Interactional Coherence (HIC) algorithm for identification of web forum interaction. HIC utilizes a bevy of system and linguistic features, including message header information, quotations, direct address, and lexical relations. Furthermore, several similarity-based methods including a Lexical Match Algorithm (LMA) and a sliding window method are utilized to account for interactional idiosyncrasies. Experiments results on two web forums revealed that the proposed HIC algorithm significantly outperformed comparison techniques in terms of precision, recall, and F-measure at both the forum and thread levels. Additionally, an example was used to illustrate how the improved ICA results can facilitate enhanced social network and role analysis capabilities.

UR - http://www.scopus.com/inward/record.url?scp=45849134844&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=45849134844&partnerID=8YFLogxK

U2 - 10.1002/asi.20827

DO - 10.1002/asi.20827

M3 - Article

AN - SCOPUS:45849134844

VL - 59

SP - 1195

EP - 1209

JO - Journal of the Association for Information Science and Technology

JF - Journal of the Association for Information Science and Technology

SN - 2330-1635

IS - 8

ER -