Unsupervised semantic markup of literature for biodiversity digital libraries

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Citations (Scopus)

Abstract

This paper reports the further development of machine learning techniques for semantic markup of biodiversity literature, especially morphological descriptions of living organisms such as those hosted at efloras.org and algaebase.org. Syntactic parsing and supervised machine learning techniques have been explored by earlier research. Limitations of these techniques promoted our investigation of an unsupervised learning approach that combines the strength of earlier techniques and avoids the limitations. Semantic markup at the organ and character levels is discussed. Research on semantic markup of natural heritage literature has direct impact on the development of semantic-based access in biodiversity digital libraries.

Original languageEnglish (US)
Title of host publicationProceedings of the ACM International Conference on Digital Libraries
Pages25-28
Number of pages4
DOIs
StatePublished - 2008
Event8th ACM/IEEE-CS Joint Conference on Digital Libraries 2008, JCDL'08 - Pittsburgh, PA, United States
Duration: Jun 16 2008Jun 20 2008

Other

Other8th ACM/IEEE-CS Joint Conference on Digital Libraries 2008, JCDL'08
CountryUnited States
CityPittsburgh, PA
Period6/16/086/20/08

Fingerprint

Digital libraries
Biodiversity
biodiversity
Semantics
semantics
Learning systems
learning
Unsupervised learning
Syntactics
literature

Keywords

  • Biodiversity informatics
  • Morphological description
  • Natural heritage literature
  • Semantic annotation
  • Semantic markup
  • Tagging
  • Unsupervised machine learning
  • XML

ASJC Scopus subject areas

  • Computer Science Applications
  • Software
  • Information Systems
  • Library and Information Sciences

Cite this

Cui, H. (2008). Unsupervised semantic markup of literature for biodiversity digital libraries. In Proceedings of the ACM International Conference on Digital Libraries (pp. 25-28) https://doi.org/10.1145/1378889.1378894

Unsupervised semantic markup of literature for biodiversity digital libraries. / Cui, Hong.

Proceedings of the ACM International Conference on Digital Libraries. 2008. p. 25-28.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Cui, H 2008, Unsupervised semantic markup of literature for biodiversity digital libraries. in Proceedings of the ACM International Conference on Digital Libraries. pp. 25-28, 8th ACM/IEEE-CS Joint Conference on Digital Libraries 2008, JCDL'08, Pittsburgh, PA, United States, 6/16/08. https://doi.org/10.1145/1378889.1378894
Cui H. Unsupervised semantic markup of literature for biodiversity digital libraries. In Proceedings of the ACM International Conference on Digital Libraries. 2008. p. 25-28 https://doi.org/10.1145/1378889.1378894
Cui, Hong. / Unsupervised semantic markup of literature for biodiversity digital libraries. Proceedings of the ACM International Conference on Digital Libraries. 2008. pp. 25-28
@inproceedings{8a5e66aba2f1416a847d74faa61a2c4e,
title = "Unsupervised semantic markup of literature for biodiversity digital libraries",
abstract = "This paper reports the further development of machine learning techniques for semantic markup of biodiversity literature, especially morphological descriptions of living organisms such as those hosted at efloras.org and algaebase.org. Syntactic parsing and supervised machine learning techniques have been explored by earlier research. Limitations of these techniques promoted our investigation of an unsupervised learning approach that combines the strength of earlier techniques and avoids the limitations. Semantic markup at the organ and character levels is discussed. Research on semantic markup of natural heritage literature has direct impact on the development of semantic-based access in biodiversity digital libraries.",
keywords = "Biodiversity informatics, Morphological description, Natural heritage literature, Semantic annotation, Semantic markup, Tagging, Unsupervised machine learning, XML",
author = "Hong Cui",
year = "2008",
doi = "10.1145/1378889.1378894",
language = "English (US)",
isbn = "9781595939982",
pages = "25--28",
booktitle = "Proceedings of the ACM International Conference on Digital Libraries",

}

TY - GEN

T1 - Unsupervised semantic markup of literature for biodiversity digital libraries

AU - Cui, Hong

PY - 2008

Y1 - 2008

N2 - This paper reports the further development of machine learning techniques for semantic markup of biodiversity literature, especially morphological descriptions of living organisms such as those hosted at efloras.org and algaebase.org. Syntactic parsing and supervised machine learning techniques have been explored by earlier research. Limitations of these techniques promoted our investigation of an unsupervised learning approach that combines the strength of earlier techniques and avoids the limitations. Semantic markup at the organ and character levels is discussed. Research on semantic markup of natural heritage literature has direct impact on the development of semantic-based access in biodiversity digital libraries.

AB - This paper reports the further development of machine learning techniques for semantic markup of biodiversity literature, especially morphological descriptions of living organisms such as those hosted at efloras.org and algaebase.org. Syntactic parsing and supervised machine learning techniques have been explored by earlier research. Limitations of these techniques promoted our investigation of an unsupervised learning approach that combines the strength of earlier techniques and avoids the limitations. Semantic markup at the organ and character levels is discussed. Research on semantic markup of natural heritage literature has direct impact on the development of semantic-based access in biodiversity digital libraries.

KW - Biodiversity informatics

KW - Morphological description

KW - Natural heritage literature

KW - Semantic annotation

KW - Semantic markup

KW - Tagging

KW - Unsupervised machine learning

KW - XML

UR - http://www.scopus.com/inward/record.url?scp=57749115937&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=57749115937&partnerID=8YFLogxK

U2 - 10.1145/1378889.1378894

DO - 10.1145/1378889.1378894

M3 - Conference contribution

AN - SCOPUS:57749115937

SN - 9781595939982

SP - 25

EP - 28

BT - Proceedings of the ACM International Conference on Digital Libraries

ER -