Odin's Runes

A rule language for information extraction

Marco A. Valenzuela-Escárcega, Gus Hahn-Powell, Mihai Surdeanu

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

8 Citations (Scopus)

Abstract

Odin is an information extraction framework that applies cascades of finite state automata over both surface text and syntactic dependency graphs. Support for syntactic patterns allows us to concisely define relations that are otherwise difficult to express in languages such as Common Pattern Specification Language (CPSL), which are currently limited to shallow linguistic features. The interaction of lexical and syntactic automata provides robustness and flexibility when writing extraction rules. This paper describes Odin's declarative language for writing these cascaded automata.

Original language: English (US)
Title of host publication: Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016
Publisher: European Language Resources Association (ELRA)
Pages: 322-329
Number of pages: 8
ISBN (Electronic): 9782951740891
State: Published - Jan 1 2016
Event: 10th International Conference on Language Resources and Evaluation, LREC 2016 - Portoroz, Slovenia
Duration: May 23 2016 to May 28 2016

Other

Other: 10th International Conference on Language Resources and Evaluation, LREC 2016
Country: Slovenia
City: Portoroz
Period: 5/23/16 to 5/28/16

Fingerprint

language
flexibility
linguistics
interaction
Automata
Information Extraction
Language
Runes
Syntax
Syntactic Dependency
Graph
Linguistic Features
Interaction
Robustness

Keywords

  • Cascade of finite state automata
  • Information extraction
  • Rule-based

ASJC Scopus subject areas

  • Linguistics and Language
  • Library and Information Sciences
  • Language and Linguistics
  • Education

Cite this

Valenzuela-Escárcega, M. A., Hahn-Powell, G., & Surdeanu, M. (2016). Odin's Runes: A rule language for information extraction. In Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016 (pp. 322-329). European Language Resources Association (ELRA).

Odin's Runes: A rule language for information extraction. / Valenzuela-Escárcega, Marco A.; Hahn-Powell, Gus; Surdeanu, Mihai.

Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016. European Language Resources Association (ELRA), 2016. p. 322-329.

Valenzuela-Escárcega, MA, Hahn-Powell, G & Surdeanu, M 2016, Odin's Runes: A rule language for information extraction. in Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016. European Language Resources Association (ELRA), pp. 322-329, 10th International Conference on Language Resources and Evaluation, LREC 2016, Portoroz, Slovenia, 5/23/16.
Valenzuela-Escárcega MA, Hahn-Powell G, Surdeanu M. Odin's Runes: A rule language for information extraction. In Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016. European Language Resources Association (ELRA). 2016. p. 322-329
Valenzuela-Escárcega, Marco A.; Hahn-Powell, Gus; Surdeanu, Mihai. / Odin's Runes: A rule language for information extraction. Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016. European Language Resources Association (ELRA), 2016. pp. 322-329
@inproceedings{cc3e49a81ee54cd0b7dd4b6e0646abfa,
title = "Odin's Runes: A rule language for information extraction",
abstract = "Odin is an information extraction framework that applies cascades of finite state automata over both surface text and syntactic dependency graphs. Support for syntactic patterns allows us to concisely define relations that are otherwise difficult to express in languages such as Common Pattern Specification Language (CPSL), which are currently limited to shallow linguistic features. The interaction of lexical and syntactic automata provides robustness and flexibility when writing extraction rules. This paper describes Odin's declarative language for writing these cascaded automata.",
keywords = "Cascade of finite state automata, Information extraction, Rule-based",
author = "Valenzuela-Esc{\'a}rcega, {Marco A.} and Gus Hahn-Powell and Mihai Surdeanu",
year = "2016",
month = "1",
day = "1",
language = "English (US)",
pages = "322--329",
booktitle = "Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016",
publisher = "European Language Resources Association (ELRA)",

}

TY - GEN

T1 - Odin's Runes

T2 - A rule language for information extraction

AU - Valenzuela-Escárcega, Marco A.

AU - Hahn-Powell, Gus

AU - Surdeanu, Mihai

PY - 2016/1/1

Y1 - 2016/1/1

N2 - Odin is an information extraction framework that applies cascades of finite state automata over both surface text and syntactic dependency graphs. Support for syntactic patterns allows us to concisely define relations that are otherwise difficult to express in languages such as Common Pattern Specification Language (CPSL), which are currently limited to shallow linguistic features. The interaction of lexical and syntactic automata provides robustness and flexibility when writing extraction rules. This paper describes Odin's declarative language for writing these cascaded automata.

AB - Odin is an information extraction framework that applies cascades of finite state automata over both surface text and syntactic dependency graphs. Support for syntactic patterns allows us to concisely define relations that are otherwise difficult to express in languages such as Common Pattern Specification Language (CPSL), which are currently limited to shallow linguistic features. The interaction of lexical and syntactic automata provides robustness and flexibility when writing extraction rules. This paper describes Odin's declarative language for writing these cascaded automata.

KW - Cascade of finite state automata

KW - Information extraction

KW - Rule-based

UR - http://www.scopus.com/inward/record.url?scp=85016500768&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85016500768&partnerID=8YFLogxK

M3 - Conference contribution

SP - 322

EP - 329

BT - Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016

PB - European Language Resources Association (ELRA)

ER -