The identification of index terms in natural language object descriptions

Research output: Contribution to journal › Article

1 Citation (Scopus)

Abstract

"The flowering part, it looks like someone is sticking their tongue out" (a subject's description of Arethusa bulbosa, see Figure 1). The mechanisms that people use in natural settings to describe objects to one another can be used to inform the design of image retrieval and museum systems. The image retrieval problem may be recast as an object description problem where the images are of objects. This study examines the vocabulary and communication constructs that are used by novices and domain experts to describe objects in an object identification task. These human-centered devices may prove to be more understandable and easier to use than some purely computational approaches. The experimental conditions mimic a scenario where a person queries an agent (active botanical information resource) in natural language in order to identify plant images. The analysis identified the objects of discourse (objects, parts and relations), including analogies, exemplars, prototypical shapes and shape modification predicates such as "longer" and "wider." In spoken language, novices and horticulturists use descriptive mechanisms similar to those in botanical text, but at different frequencies. For example, participants rely heavily on visual analogies to objects both within and outside of the domain: "This looks like an X," where X is a plant (e.g., "daisy") or a non-plant (e.g., "butterfly" or "child's drawing of the sun"). The results suggest that indexing and retrieval systems should provide semantic-level similarity mechanisms to allow for whole-object as well as part-wise visual analogy. The systems should also provide a visual vocabulary, a set of images that represent prototypes of the verbal terms collected in this study.

Original language: English (US)
Pages (from-to): 472-481
Number of pages: 10
Journal: Proceedings of the ASIS Annual Meeting
Volume: 36
State: Published - 1999
Externally published: Yes

Fingerprint

Image retrieval
Museums
language
Sun
Semantics
Communication
vocabulary
spoken language
indexing
expert
scenario
human being
discourse

ASJC Scopus subject areas

  • Information Systems
  • Library and Information Sciences

Cite this

The identification of index terms in natural language object descriptions. / Heidorn, Patrick B.

In: Proceedings of the ASIS Annual Meeting, Vol. 36, 1999, p. 472-481.

@article{1be77b36605a46af894b70cb8899dc95,
title = "The identification of index terms in natural language object descriptions",
abstract = "{"}The flowering part, it looks like someone is sticking their tongue out{"} (a subject's description of Arethusa bulbosa, see Figure 1). The mechanisms that people use in natural settings to describe objects to one another can be used to inform the design of image retrieval and museum systems. The image retrieval problem may be recast as an object description problem where the images are of objects. This study examines the vocabulary and communication constructs that are used by novices and domain experts to describe objects in an object identification task. These human-centered devices may prove to be more understandable and easier to use than some purely computational approaches. The experimental conditions mimic a scenario where a person queries an agent (active botanical information resource) in natural language in order to identify plant images. The analysis identified the objects of discourse (objects, parts and relations), including analogies, exemplars, prototypical shapes and shape modification predicates such as {"}longer{"} and {"}wider.{"} In spoken language, novices and horticulturists use descriptive mechanisms similar to those in botanical text, but at different frequencies. For example, participants rely heavily on visual analogies to objects both within and outside of the domain: {"}This looks like an X,{"} where X is a plant (e.g., {"}daisy{"}) or a non-plant (e.g., {"}butterfly{"} or {"}child's drawing of the sun{"}). The results suggest that indexing and retrieval systems should provide semantic-level similarity mechanisms to allow for whole-object as well as part-wise visual analogy. The systems should also provide a visual vocabulary, a set of images that represent prototypes of the verbal terms collected in this study.",
author = "Heidorn, {Patrick B}",
year = "1999",
language = "English (US)",
volume = "36",
pages = "472--481",
journal = "Proceedings of the ASIST Annual Meeting",
issn = "0044-7870",
publisher = "Learned Information",

}

TY - JOUR

T1 - The identification of index terms in natural language object descriptions

AU - Heidorn, Patrick B

PY - 1999

Y1 - 1999

N2 - "The flowering part, it looks like someone is sticking their tongue out" (a subject's description of Arethusa bulbosa, see Figure 1). The mechanisms that people use in natural settings to describe objects to one another can be used to inform the design of image retrieval and museum systems. The image retrieval problem may be recast as an object description problem where the images are of objects. This study examines the vocabulary and communication constructs that are used by novices and domain experts to describe objects in an object identification task. These human-centered devices may prove to be more understandable and easier to use than some purely computational approaches. The experimental conditions mimic a scenario where a person queries an agent (active botanical information resource) in natural language in order to identify plant images. The analysis identified the objects of discourse (objects, parts and relations), including analogies, exemplars, prototypical shapes and shape modification predicates such as "longer" and "wider." In spoken language, novices and horticulturists use descriptive mechanisms similar to those in botanical text, but at different frequencies. For example, participants rely heavily on visual analogies to objects both within and outside of the domain: "This looks like an X," where X is a plant (e.g., "daisy") or a non-plant (e.g., "butterfly" or "child's drawing of the sun"). The results suggest that indexing and retrieval systems should provide semantic-level similarity mechanisms to allow for whole-object as well as part-wise visual analogy. The systems should also provide a visual vocabulary, a set of images that represent prototypes of the verbal terms collected in this study.

UR - http://www.scopus.com/inward/record.url?scp=27844441266&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=27844441266&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:27844441266

VL - 36

SP - 472

EP - 481

JO - Proceedings of the ASIST Annual Meeting

JF - Proceedings of the ASIST Annual Meeting

SN - 0044-7870

ER -