Tools for semantic annotation of taxonomic descriptions

Hong Cui, Partha Pratim Sanyal, Chunshui Yu

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

This paper reports a software application for automated semantic annotation of taxonomic descriptions, especially morphological ones. The tool is based on unsupervised machine learning methods. It is designed to annotate descriptions written in a syntax that deviates from standard English but is common in morphological descriptions. Being unsupervised, the annotation system needs no training examples to annotate text descriptions. It uses a relevant glossary available to it but aims to learn as much information as possible from the text itself. Tools such as this are needed to convert free-text or OCRed taxonomic documents into a semantically explicit format for easy and intelligent access, and to provide character data for phylogenetic research, studies of climate impact on biodiversity, and traditional biosystematics research.

Original language: English (US)
Title of host publication: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages: 506-516
Number of pages: 11
Volume: 6279 LNAI
Edition: PART 4
DOIs: https://doi.org/10.1007/978-3-642-15384-6_54
State: Published - 2010
Event: 14th International Conference on Knowledge-Based and Intelligent Information and Engineering Systems, KES 2010 - Cardiff, United Kingdom
Duration: Sep 8 2010 to Sep 10 2010

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Number: PART 4
Volume: 6279 LNAI
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Other

Other: 14th International Conference on Knowledge-Based and Intelligent Information and Engineering Systems, KES 2010
Country: United Kingdom
City: Cardiff
Period: 9/8/10 to 9/10/10

Fingerprint

Semantic Annotation
Semantics
Biodiversity
Glossaries
Application programs
Learning systems
Unsupervised Learning
Phylogenetics
Climate
Annotation
Machine Learning
Software
Text

Keywords

  • Morphological descriptions
  • Semantic annotation
  • Software applications
  • Supervised machine learning
  • Unsupervised machine learning

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Cui, H., Sanyal, P. P., & Yu, C. (2010). Tools for semantic annotation of taxonomic descriptions. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (PART 4 ed., Vol. 6279 LNAI, pp. 506-516). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6279 LNAI, No. PART 4). https://doi.org/10.1007/978-3-642-15384-6_54

@inproceedings{1cad9d0030db4a278196838920f7afde,
title = "Tools for semantic annotation of taxonomic descriptions",
abstract = "A software application for automated semantic annotation of taxonomic, especially morphological, descriptions is reported in this paper. The tool is based on unsupervised machine learning methods. It is designed to annotate descriptions in a deviated syntax that is not normal English but often used in morphological descriptions. The unsupervised annotation system does not need any training examples to annotate text descriptions. It uses a relevant glossary available to it but aims to learn as much information as possible from the text itself. Tools such as this are needed to reformat free-text or OCRed taxonomic documents to a semantic-explicit format for easy and intelligent access, providing character data for phylogenetic research, climate impact on biodiversity, and traditional biosystematics research.",
keywords = "Morphological descriptions, Semantic annotation, Software applications, Supervised machine learning, Unsupervised machine learning",
author = "Hong Cui and Sanyal, {Partha Pratim} and Chunshui Yu",
year = "2010",
doi = "10.1007/978-3-642-15384-6_54",
language = "English (US)",
isbn = "3642153836",
volume = "6279 LNAI",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
number = "PART 4",
pages = "506--516",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
edition = "PART 4",

}

TY - GEN

T1 - Tools for semantic annotation of taxonomic descriptions

AU - Cui, Hong

AU - Sanyal, Partha Pratim

AU - Yu, Chunshui

PY - 2010

Y1 - 2010

N2 - A software application for automated semantic annotation of taxonomic, especially morphological, descriptions is reported in this paper. The tool is based on unsupervised machine learning methods. It is designed to annotate descriptions in a deviated syntax that is not normal English but often used in morphological descriptions. The unsupervised annotation system does not need any training examples to annotate text descriptions. It uses a relevant glossary available to it but aims to learn as much information as possible from the text itself. Tools such as this are needed to reformat free-text or OCRed taxonomic documents to a semantic-explicit format for easy and intelligent access, providing character data for phylogenetic research, climate impact on biodiversity, and traditional biosystematics research.

KW - Morphological descriptions

KW - Semantic annotation

KW - Software applications

KW - Supervised machine learning

KW - Unsupervised machine learning

UR - http://www.scopus.com/inward/record.url?scp=78649250203&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78649250203&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-15384-6_54

DO - 10.1007/978-3-642-15384-6_54

M3 - Conference contribution

AN - SCOPUS:78649250203

SN - 3642153836

SN - 9783642153839

VL - 6279 LNAI

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 506

EP - 516

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

ER -