Odin's Runes: A rule language for information extraction

Marco A. Valenzuela-Escárcega, Gus Hahn-Powell, Mihai Surdeanu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

13 Scopus citations

Abstract

Odin is an information extraction framework that applies cascades of finite state automata over both surface text and syntactic dependency graphs. Support for syntactic patterns allow us to concisely define relations that are otherwise difficult to express in languages such as Common Pattern Specification Language (CPSL), which are currently limited to shallow linguistic features. The interaction of lexical and syntactic automata provides robustness and flexibility when writing extraction rules. This paper describes Odin's declarative language for writing these cascaded automata.

Original languageEnglish (US)
Title of host publicationProceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016
PublisherEuropean Language Resources Association (ELRA)
Pages322-329
Number of pages8
ISBN (Electronic)9782951740891
StatePublished - Jan 1 2016
Event10th International Conference on Language Resources and Evaluation, LREC 2016 - Portoroz, Slovenia
Duration: May 23 2016May 28 2016

Other

Other10th International Conference on Language Resources and Evaluation, LREC 2016
CountrySlovenia
CityPortoroz
Period5/23/165/28/16

Keywords

  • Cascade of finite state automata
  • Information extraction
  • Rule-based

ASJC Scopus subject areas

  • Linguistics and Language
  • Library and Information Sciences
  • Language and Linguistics
  • Education

Fingerprint Dive into the research topics of 'Odin's Runes: A rule language for information extraction'. Together they form a unique fingerprint.

  • Cite this

    Valenzuela-Escárcega, M. A., Hahn-Powell, G., & Surdeanu, M. (2016). Odin's Runes: A rule language for information extraction. In Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016 (pp. 322-329). European Language Resources Association (ELRA).