Toward semantic interoperability of heterogeneous biological data sources

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

Genomic researchers use a number of heterogeneous data sources including nucleotides, protein sequences, 3-D Protein structures, taxonomies, and research publications such as MEDLINE. This research aims to discover as much biological knowledge as possible about the properties and functions of the structures such as DNA sequences and protein structures and to explore the connections among all the data, so that the knowledge can be used to improve human lives. Currently it is very difficult to connect all of these data sources seamlessly unless all the data is transformed into a common format with an id connecting all of them. The state-of-the-art facilities for searching these data sources provide interfaces through which scientists can access multiple databases. Most of these searches are primarily text-based, requiring users to specify keywords using which the systems search through each individual data source and returns results. The user is then required to create the connections between the results from each source. This is a major problem because researchers do not always know how to create these connections. To solve this problem we propose a semantics-based mechanism for automatically linking and connecting the various data sources. Our approach is based on a model that explicitly captures the semantics of the heterogeneous data sources and makes them available for searching. In this talk I will discuss issues related to capturing the semantics of biological data and using these semantics to automate the integration of diverse heterogeneous sources.

Original languageEnglish (US)
Title of host publicationLecture Notes in Computer Science
EditorsO. Pastor, J.F. Cunha
Pages32
Number of pages1
Volume3520
StatePublished - 2005
Event17th International Conference on Advanced Information Systems Engineering, CAiSE 2005 - Porto, Portugal
Duration: Jun 13 2005Jun 17 2005

Other

Other17th International Conference on Advanced Information Systems Engineering, CAiSE 2005
CountryPortugal
CityPorto
Period6/13/056/17/05

Fingerprint

Interoperability
Semantics
Proteins
DNA sequences
Taxonomies
Nucleotides

ASJC Scopus subject areas

  • Computer Science (miscellaneous)

Cite this

Ram, S. (2005). Toward semantic interoperability of heterogeneous biological data sources. In O. Pastor, & J. F. Cunha (Eds.), Lecture Notes in Computer Science (Vol. 3520, pp. 32)

Toward semantic interoperability of heterogeneous biological data sources. / Ram, Sudha.

Lecture Notes in Computer Science. ed. / O. Pastor; J.F. Cunha. Vol. 3520 2005. p. 32.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Ram, S 2005, Toward semantic interoperability of heterogeneous biological data sources. in O Pastor & JF Cunha (eds), Lecture Notes in Computer Science. vol. 3520, pp. 32, 17th International Conference on Advanced Information Systems Engineering, CAiSE 2005, Porto, Portugal, 6/13/05.
Ram S. Toward semantic interoperability of heterogeneous biological data sources. In Pastor O, Cunha JF, editors, Lecture Notes in Computer Science. Vol. 3520. 2005. p. 32
Ram, Sudha. / Toward semantic interoperability of heterogeneous biological data sources. Lecture Notes in Computer Science. editor / O. Pastor ; J.F. Cunha. Vol. 3520 2005. pp. 32
@inproceedings{8a2e79df025b49cdba06f1b7c30f93c0,
title = "Toward semantic interoperability of heterogeneous biological data sources",
abstract = "Genomic researchers use a number of heterogeneous data sources including nucleotides, protein sequences, 3-D Protein structures, taxonomies, and research publications such as MEDLINE. This research aims to discover as much biological knowledge as possible about the properties and functions of the structures such as DNA sequences and protein structures and to explore the connections among all the data, so that the knowledge can be used to improve human lives. Currently it is very difficult to connect all of these data sources seamlessly unless all the data is transformed into a common format with an id connecting all of them. The state-of-the-art facilities for searching these data sources provide interfaces through which scientists can access multiple databases. Most of these searches are primarily text-based, requiring users to specify keywords using which the systems search through each individual data source and returns results. The user is then required to create the connections between the results from each source. This is a major problem because researchers do not always know how to create these connections. To solve this problem we propose a semantics-based mechanism for automatically linking and connecting the various data sources. Our approach is based on a model that explicitly captures the semantics of the heterogeneous data sources and makes them available for searching. In this talk I will discuss issues related to capturing the semantics of biological data and using these semantics to automate the integration of diverse heterogeneous sources.",
author = "Sudha Ram",
year = "2005",
language = "English (US)",
volume = "3520",
pages = "32",
editor = "O. Pastor and J.F. Cunha",
booktitle = "Lecture Notes in Computer Science",

}

TY - GEN

T1 - Toward semantic interoperability of heterogeneous biological data sources

AU - Ram, Sudha

PY - 2005

Y1 - 2005

N2 - Genomic researchers use a number of heterogeneous data sources including nucleotides, protein sequences, 3-D Protein structures, taxonomies, and research publications such as MEDLINE. This research aims to discover as much biological knowledge as possible about the properties and functions of the structures such as DNA sequences and protein structures and to explore the connections among all the data, so that the knowledge can be used to improve human lives. Currently it is very difficult to connect all of these data sources seamlessly unless all the data is transformed into a common format with an id connecting all of them. The state-of-the-art facilities for searching these data sources provide interfaces through which scientists can access multiple databases. Most of these searches are primarily text-based, requiring users to specify keywords using which the systems search through each individual data source and returns results. The user is then required to create the connections between the results from each source. This is a major problem because researchers do not always know how to create these connections. To solve this problem we propose a semantics-based mechanism for automatically linking and connecting the various data sources. Our approach is based on a model that explicitly captures the semantics of the heterogeneous data sources and makes them available for searching. In this talk I will discuss issues related to capturing the semantics of biological data and using these semantics to automate the integration of diverse heterogeneous sources.

AB - Genomic researchers use a number of heterogeneous data sources including nucleotides, protein sequences, 3-D Protein structures, taxonomies, and research publications such as MEDLINE. This research aims to discover as much biological knowledge as possible about the properties and functions of the structures such as DNA sequences and protein structures and to explore the connections among all the data, so that the knowledge can be used to improve human lives. Currently it is very difficult to connect all of these data sources seamlessly unless all the data is transformed into a common format with an id connecting all of them. The state-of-the-art facilities for searching these data sources provide interfaces through which scientists can access multiple databases. Most of these searches are primarily text-based, requiring users to specify keywords using which the systems search through each individual data source and returns results. The user is then required to create the connections between the results from each source. This is a major problem because researchers do not always know how to create these connections. To solve this problem we propose a semantics-based mechanism for automatically linking and connecting the various data sources. Our approach is based on a model that explicitly captures the semantics of the heterogeneous data sources and makes them available for searching. In this talk I will discuss issues related to capturing the semantics of biological data and using these semantics to automate the integration of diverse heterogeneous sources.

UR - http://www.scopus.com/inward/record.url?scp=25144448075&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=25144448075&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:25144448075

VL - 3520

SP - 32

BT - Lecture Notes in Computer Science

A2 - Pastor, O.

A2 - Cunha, J.F.

ER -