Reprocessing paper-based reference materials for the digital environment

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

One of the primary challenges for the creation of digital libraries is to enhance the value of paper-based publications by providing digital access to the materials. Simple full-text searching is just a first step in this process. Better functionality may be gained by exploiting the natural structure within text. The following paper describes the process of digital conversion and integration of encyclopedic publications, glossaries and thesauri. The Biological Information Browsing (http://www.biobrowser.org) team developed text-processing tools, and an information retrieval and visualization environment that provides greater functionality for these traditionally paper-based publications. The process includes automatic text segmentation and structuring, automated XML markup, structure-based indexing, automatic thesaurus extraction for query expansion and on-line definitions. Very few other information systems provide complete services for publishing, indexing, XML query and retrieving documents.

Original languageEnglish (US)
Title of host publicationProceedings of the ACM International Conference on Digital Libraries
EditorsG. Marchionini, W. Hersh
Pages377
Number of pages1
StatePublished - 2002
Externally publishedYes
EventProceedings of the Second ACM/IEEE-CS Joint Conference on Digital Libraries - Portland, OR, United States
Duration: Jul 14 2002Jul 18 2002

Other

OtherProceedings of the Second ACM/IEEE-CS Joint Conference on Digital Libraries
CountryUnited States
CityPortland, OR
Period7/14/027/18/02

Fingerprint

reprocessing
Thesauri
XML
thesaurus
Automatic indexing
Text processing
indexing
functionality
Digital libraries
Glossaries
Information retrieval
text processing
Information systems
Visualization
information retrieval
visualization
information system
Values

Keywords

  • Electronic publishing
  • Indexing
  • Information retrieval
  • Structured text
  • XML

ASJC Scopus subject areas

  • Computer Science(all)
  • Social Sciences(all)

Cite this

Heidorn, P. B. (2002). Reprocessing paper-based reference materials for the digital environment. In G. Marchionini, & W. Hersh (Eds.), Proceedings of the ACM International Conference on Digital Libraries (pp. 377)

Reprocessing paper-based reference materials for the digital environment. / Heidorn, Patrick B.

Proceedings of the ACM International Conference on Digital Libraries. ed. / G. Marchionini; W. Hersh. 2002. p. 377.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Heidorn, PB 2002, Reprocessing paper-based reference materials for the digital environment. in G Marchionini & W Hersh (eds), Proceedings of the ACM International Conference on Digital Libraries. pp. 377, Proceedings of the Second ACM/IEEE-CS Joint Conference on Digital Libraries, Portland, OR, United States, 7/14/02.
Heidorn PB. Reprocessing paper-based reference materials for the digital environment. In Marchionini G, Hersh W, editors, Proceedings of the ACM International Conference on Digital Libraries. 2002. p. 377
Heidorn, Patrick B. / Reprocessing paper-based reference materials for the digital environment. Proceedings of the ACM International Conference on Digital Libraries. editor / G. Marchionini ; W. Hersh. 2002. pp. 377
@inproceedings{5c8e0abccc854e359f35f2c3c8e8bb6d,
title = "Reprocessing paper-based reference materials for the digital environment",
abstract = "One of the primary challenges for the creation of digital libraries is to enhance the value of paper-based publications by providing digital access to the materials. Simple full-text searching is just a first step in this process. Better functionality may be gained by exploiting the natural structure within text. The following paper describes the process of digital conversion and integration of encyclopedic publications, glossaries and thesauri. The Biological Information Browsing (http://www.biobrowser.org) team developed text-processing tools, and an information retrieval and visualization environment that provides greater functionality for these traditionally paper-based publications. The process includes automatic text segmentation and structuring, automated XML markup, structure-based indexing, automatic thesaurus extraction for query expansion and on-line definitions. Very few other information systems provide complete services for publishing, indexing, XML query and retrieving documents.",
keywords = "Electronic publishing, Indexing, Information retrieval, Structured text, XML",
author = "Heidorn, {Patrick B}",
year = "2002",
language = "English (US)",
pages = "377",
editor = "G. Marchionini and W. Hersh",
booktitle = "Proceedings of the ACM International Conference on Digital Libraries",

}

TY - GEN

T1 - Reprocessing paper-based reference materials for the digital environment

AU - Heidorn, Patrick B

PY - 2002

Y1 - 2002

N2 - One of the primary challenges for the creation of digital libraries is to enhance the value of paper-based publications by providing digital access to the materials. Simple full-text searching is just a first step in this process. Better functionality may be gained by exploiting the natural structure within text. The following paper describes the process of digital conversion and integration of encyclopedic publications, glossaries and thesauri. The Biological Information Browsing (http://www.biobrowser.org) team developed text-processing tools, and an information retrieval and visualization environment that provides greater functionality for these traditionally paper-based publications. The process includes automatic text segmentation and structuring, automated XML markup, structure-based indexing, automatic thesaurus extraction for query expansion and on-line definitions. Very few other information systems provide complete services for publishing, indexing, XML query and retrieving documents.

AB - One of the primary challenges for the creation of digital libraries is to enhance the value of paper-based publications by providing digital access to the materials. Simple full-text searching is just a first step in this process. Better functionality may be gained by exploiting the natural structure within text. The following paper describes the process of digital conversion and integration of encyclopedic publications, glossaries and thesauri. The Biological Information Browsing (http://www.biobrowser.org) team developed text-processing tools, and an information retrieval and visualization environment that provides greater functionality for these traditionally paper-based publications. The process includes automatic text segmentation and structuring, automated XML markup, structure-based indexing, automatic thesaurus extraction for query expansion and on-line definitions. Very few other information systems provide complete services for publishing, indexing, XML query and retrieving documents.

KW - Electronic publishing

KW - Indexing

KW - Information retrieval

KW - Structured text

KW - XML

UR - http://www.scopus.com/inward/record.url?scp=0036989146&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0036989146&partnerID=8YFLogxK

M3 - Conference contribution

SP - 377

BT - Proceedings of the ACM International Conference on Digital Libraries

A2 - Marchionini, G.

A2 - Hersh, W.

ER -