Linking ontological resources using aggregatable substance identifiers to organize extracted relations

Byron Marshall, Hua Su, Daniel McDonald, Hsinchun Chen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

Systems that extract biological regulatory pathway relations from free-text sources are intended to help researchers leverage vast and growing collections of research literature. Several systems to extract such relations have been developed but little work has focused on how those relations can be usefully organized (aggregated) to support visualization systems or analysis algorithms. Ontological resources that enumerate name strings for different types of biomedical objects should play a key role in the organization process. In this paper we delineate five potentially useful levels of relational granularity and propose the use of aggregatable substance identifiers to help reduce lexical ambiguity. An aggregatable substance identifier applies to a gene and its products. We merged 4 extensive lexicons and compared the extracted strings to the text of five million MEDLINE abstracts. We report on the ambiguity within and between name strings and common English words. Our results show an 89% reduction in ambiguity for the extracted human substance name strings when using an aggregatable substance approach.

Original languageEnglish (US)
Title of host publicationProceedings of the Pacific Symposium on Biocomputing 2005, PSB 2005
Pages162-173
Number of pages12
StatePublished - 2005
Event10th Pacific Symposium on Biocomputing, PSB 2005 - Big Island of Hawaii, United States
Duration: Jan 4 2005Jan 8 2005

Other

Other10th Pacific Symposium on Biocomputing, PSB 2005
CountryUnited States
CityBig Island of Hawaii
Period1/4/051/8/05

Fingerprint

Visualization
Genes

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Biomedical Engineering

Cite this

Marshall, B., Su, H., McDonald, D., & Chen, H. (2005). Linking ontological resources using aggregatable substance identifiers to organize extracted relations. In Proceedings of the Pacific Symposium on Biocomputing 2005, PSB 2005 (pp. 162-173)

Linking ontological resources using aggregatable substance identifiers to organize extracted relations. / Marshall, Byron; Su, Hua; McDonald, Daniel; Chen, Hsinchun.

Proceedings of the Pacific Symposium on Biocomputing 2005, PSB 2005. 2005. p. 162-173.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Marshall, B, Su, H, McDonald, D & Chen, H 2005, Linking ontological resources using aggregatable substance identifiers to organize extracted relations. in Proceedings of the Pacific Symposium on Biocomputing 2005, PSB 2005. pp. 162-173, 10th Pacific Symposium on Biocomputing, PSB 2005, Big Island of Hawaii, United States, 1/4/05.
Marshall B, Su H, McDonald D, Chen H. Linking ontological resources using aggregatable substance identifiers to organize extracted relations. In Proceedings of the Pacific Symposium on Biocomputing 2005, PSB 2005. 2005. p. 162-173
Marshall, Byron ; Su, Hua ; McDonald, Daniel ; Chen, Hsinchun. / Linking ontological resources using aggregatable substance identifiers to organize extracted relations. Proceedings of the Pacific Symposium on Biocomputing 2005, PSB 2005. 2005. pp. 162-173
@inproceedings{5e7ab0b2d30b49619e0b0265c2d32e54,
title = "Linking ontological resources using aggregatable substance identifiers to organize extracted relations",
abstract = "Systems that extract biological regulatory pathway relations from free-text sources are intended to help researchers leverage vast and growing collections of research literature. Several systems to extract such relations have been developed but little work has focused on how those relations can be usefully organized (aggregated) to support visualization systems or analysis algorithms. Ontological resources that enumerate name strings for different types of biomedical objects should play a key role in the organization process. In this paper we delineate five potentially useful levels of relational granularity and propose the use of aggregatable substance identifiers to help reduce lexical ambiguity. An aggregatable substance identifier applies to a gene and its products. We merged 4 extensive lexicons and compared the extracted strings to the text of five million MEDLINE abstracts. We report on the ambiguity within and between name strings and common English words. Our results show an 89{\%} reduction in ambiguity for the extracted human substance name strings when using an aggregatable substance approach.",
author = "Byron Marshall and Hua Su and Daniel McDonald and Hsinchun Chen",
year = "2005",
language = "English (US)",
isbn = "9812560467",
pages = "162--173",
booktitle = "Proceedings of the Pacific Symposium on Biocomputing 2005, PSB 2005",

}

TY - GEN

T1 - Linking ontological resources using aggregatable substance identifiers to organize extracted relations

AU - Marshall, Byron

AU - Su, Hua

AU - McDonald, Daniel

AU - Chen, Hsinchun

PY - 2005

Y1 - 2005

N2 - Systems that extract biological regulatory pathway relations from free-text sources are intended to help researchers leverage vast and growing collections of research literature. Several systems to extract such relations have been developed but little work has focused on how those relations can be usefully organized (aggregated) to support visualization systems or analysis algorithms. Ontological resources that enumerate name strings for different types of biomedical objects should play a key role in the organization process. In this paper we delineate five potentially useful levels of relational granularity and propose the use of aggregatable substance identifiers to help reduce lexical ambiguity. An aggregatable substance identifier applies to a gene and its products. We merged 4 extensive lexicons and compared the extracted strings to the text of five million MEDLINE abstracts. We report on the ambiguity within and between name strings and common English words. Our results show an 89% reduction in ambiguity for the extracted human substance name strings when using an aggregatable substance approach.

AB - Systems that extract biological regulatory pathway relations from free-text sources are intended to help researchers leverage vast and growing collections of research literature. Several systems to extract such relations have been developed but little work has focused on how those relations can be usefully organized (aggregated) to support visualization systems or analysis algorithms. Ontological resources that enumerate name strings for different types of biomedical objects should play a key role in the organization process. In this paper we delineate five potentially useful levels of relational granularity and propose the use of aggregatable substance identifiers to help reduce lexical ambiguity. An aggregatable substance identifier applies to a gene and its products. We merged 4 extensive lexicons and compared the extracted strings to the text of five million MEDLINE abstracts. We report on the ambiguity within and between name strings and common English words. Our results show an 89% reduction in ambiguity for the extracted human substance name strings when using an aggregatable substance approach.

UR - http://www.scopus.com/inward/record.url?scp=15944413448&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=15944413448&partnerID=8YFLogxK

M3 - Conference contribution

C2 - 15759623

AN - SCOPUS:15944413448

SN - 9812560467

SN - 9789812560469

SP - 162

EP - 173

BT - Proceedings of the Pacific Symposium on Biocomputing 2005, PSB 2005

ER -