Correspondence and canonical analysis of relational data

Stanley Wasserman, Katherine Faust, Joseph J Galaskiewicz

Research output: Contribution to journalArticle

16 Citations (Scopus)

Abstract

Correspondence analysis, a data analytic technique used to study two-way cross-classifications, is applied to social relational data. Such data are frequently termed “sociometric” or “network” data. The method allows one to model forms of relational data and types of empirical relationships not easily analyzed using either standard social network methods or common scaling or clustering techniques. In particular, correspondence analysis allows one to model: —two-mode networks (rows and columns of a sociomatrix refer to different objects) —valued relations (e.g. counts, ratings, or frequencies). In general, the technique provides scale values for row and column units, visual presentation of relationships among rows and columns, and criteria for assessing “dimensionality” or graphical complexity of the data and goodness-of-fit to particular models. Correspondence analysis has recently been the subject of research by Goodman, Haberman, and Gilula, who have termed their approach to the problem “canonical analysis” to reflect its similarity to canonical correlation analysis of continuous multivariate data. This generalization links the technique to more standard categorical data analysis models, and provides a much-needed statistical justification. We review both correspondence and canonical analysis, and present these ideas by analyzing relational data on the 1980 monetary donations from corporations to nonprofit organizations in the Minneapolis St. Paul metropolitan area. We also show how these techniques are related to dyadic independence models, first introduced by Holland, Leinhardt, Fienberg, and Wasserman in the early 1980's. The highlight of this paper is the relationship between correspondence and canonical analysis, and these dyadic independence models, which are designed specifically for relational data. The paper concludes with a discussion of this relationship, and some data analyses that illustrate the fact that correspondence analysis models can be used as approximate dyadic independence models.

Original languageEnglish (US)
Pages (from-to)11-64
Number of pages54
JournalThe Journal of Mathematical Sociology
Volume15
Issue number1
DOIs
StatePublished - Jan 1 1990
Externally publishedYes

Fingerprint

Canonical Analysis
Correspondence Analysis
correspondence analysis
Model
Canonical Correlation Analysis
data network
Nominal or categorical data
Multivariate Data
non-profit-organization
donation
Goodness of fit
scaling
Justification
Social Networks
Dimensionality
corporation
agglomeration area
Data analysis
social network
Count

ASJC Scopus subject areas

  • Algebra and Number Theory
  • Social Sciences (miscellaneous)
  • Sociology and Political Science

Cite this

Correspondence and canonical analysis of relational data. / Wasserman, Stanley; Faust, Katherine; Galaskiewicz, Joseph J.

In: The Journal of Mathematical Sociology, Vol. 15, No. 1, 01.01.1990, p. 11-64.

Research output: Contribution to journalArticle

Wasserman, Stanley ; Faust, Katherine ; Galaskiewicz, Joseph J. / Correspondence and canonical analysis of relational data. In: The Journal of Mathematical Sociology. 1990 ; Vol. 15, No. 1. pp. 11-64.
@article{db0d54f617164b7294d996c1010a4074,
title = "Correspondence and canonical analysis of relational data",
abstract = "Correspondence analysis, a data analytic technique used to study two-way cross-classifications, is applied to social relational data. Such data are frequently termed “sociometric” or “network” data. The method allows one to model forms of relational data and types of empirical relationships not easily analyzed using either standard social network methods or common scaling or clustering techniques. In particular, correspondence analysis allows one to model: —two-mode networks (rows and columns of a sociomatrix refer to different objects) —valued relations (e.g. counts, ratings, or frequencies). In general, the technique provides scale values for row and column units, visual presentation of relationships among rows and columns, and criteria for assessing “dimensionality” or graphical complexity of the data and goodness-of-fit to particular models. Correspondence analysis has recently been the subject of research by Goodman, Haberman, and Gilula, who have termed their approach to the problem “canonical analysis” to reflect its similarity to canonical correlation analysis of continuous multivariate data. This generalization links the technique to more standard categorical data analysis models, and provides a much-needed statistical justification. We review both correspondence and canonical analysis, and present these ideas by analyzing relational data on the 1980 monetary donations from corporations to nonprofit organizations in the Minneapolis St. Paul metropolitan area. We also show how these techniques are related to dyadic independence models, first introduced by Holland, Leinhardt, Fienberg, and Wasserman in the early 1980's. The highlight of this paper is the relationship between correspondence and canonical analysis, and these dyadic independence models, which are designed specifically for relational data. The paper concludes with a discussion of this relationship, and some data analyses that illustrate the fact that correspondence analysis models can be used as approximate dyadic independence models.",
author = "Stanley Wasserman and Katherine Faust and Galaskiewicz, {Joseph J}",
year = "1990",
month = "1",
day = "1",
doi = "10.1080/0022250X.1990.9990060",
language = "English (US)",
volume = "15",
pages = "11--64",
journal = "Journal of Mathematical Sociology",
issn = "0022-250X",
publisher = "Taylor and Francis Ltd.",
number = "1",

}

TY - JOUR

T1 - Correspondence and canonical analysis of relational data

AU - Wasserman, Stanley

AU - Faust, Katherine

AU - Galaskiewicz, Joseph J

PY - 1990/1/1

Y1 - 1990/1/1

N2 - Correspondence analysis, a data analytic technique used to study two-way cross-classifications, is applied to social relational data. Such data are frequently termed “sociometric” or “network” data. The method allows one to model forms of relational data and types of empirical relationships not easily analyzed using either standard social network methods or common scaling or clustering techniques. In particular, correspondence analysis allows one to model: —two-mode networks (rows and columns of a sociomatrix refer to different objects) —valued relations (e.g. counts, ratings, or frequencies). In general, the technique provides scale values for row and column units, visual presentation of relationships among rows and columns, and criteria for assessing “dimensionality” or graphical complexity of the data and goodness-of-fit to particular models. Correspondence analysis has recently been the subject of research by Goodman, Haberman, and Gilula, who have termed their approach to the problem “canonical analysis” to reflect its similarity to canonical correlation analysis of continuous multivariate data. This generalization links the technique to more standard categorical data analysis models, and provides a much-needed statistical justification. We review both correspondence and canonical analysis, and present these ideas by analyzing relational data on the 1980 monetary donations from corporations to nonprofit organizations in the Minneapolis St. Paul metropolitan area. We also show how these techniques are related to dyadic independence models, first introduced by Holland, Leinhardt, Fienberg, and Wasserman in the early 1980's. The highlight of this paper is the relationship between correspondence and canonical analysis, and these dyadic independence models, which are designed specifically for relational data. The paper concludes with a discussion of this relationship, and some data analyses that illustrate the fact that correspondence analysis models can be used as approximate dyadic independence models.

AB - Correspondence analysis, a data analytic technique used to study two-way cross-classifications, is applied to social relational data. Such data are frequently termed “sociometric” or “network” data. The method allows one to model forms of relational data and types of empirical relationships not easily analyzed using either standard social network methods or common scaling or clustering techniques. In particular, correspondence analysis allows one to model: —two-mode networks (rows and columns of a sociomatrix refer to different objects) —valued relations (e.g. counts, ratings, or frequencies). In general, the technique provides scale values for row and column units, visual presentation of relationships among rows and columns, and criteria for assessing “dimensionality” or graphical complexity of the data and goodness-of-fit to particular models. Correspondence analysis has recently been the subject of research by Goodman, Haberman, and Gilula, who have termed their approach to the problem “canonical analysis” to reflect its similarity to canonical correlation analysis of continuous multivariate data. This generalization links the technique to more standard categorical data analysis models, and provides a much-needed statistical justification. We review both correspondence and canonical analysis, and present these ideas by analyzing relational data on the 1980 monetary donations from corporations to nonprofit organizations in the Minneapolis St. Paul metropolitan area. We also show how these techniques are related to dyadic independence models, first introduced by Holland, Leinhardt, Fienberg, and Wasserman in the early 1980's. The highlight of this paper is the relationship between correspondence and canonical analysis, and these dyadic independence models, which are designed specifically for relational data. The paper concludes with a discussion of this relationship, and some data analyses that illustrate the fact that correspondence analysis models can be used as approximate dyadic independence models.

UR - http://www.scopus.com/inward/record.url?scp=84929229544&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84929229544&partnerID=8YFLogxK

U2 - 10.1080/0022250X.1990.9990060

DO - 10.1080/0022250X.1990.9990060

M3 - Article

AN - SCOPUS:84929229544

VL - 15

SP - 11

EP - 64

JO - Journal of Mathematical Sociology

JF - Journal of Mathematical Sociology

SN - 0022-250X

IS - 1

ER -