A domain-independent rule-based framework for event extraction

Marco A. Valenzuela-Escárcega, Gus Hahn-Powell, Thomas Hicks, Mihai Surdeanu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

19 Citations (Scopus)

Abstract

We describe the design, development, and API of ODIN (Open Domain INformer), a domainindependent, rule-based event extraction (EE) framework. The proposed EE approach is: simple (most events are captured with simple lexico-syntactic patterns), powerful (the language can capture complex constructs, such as events taking other events as arguments, and regular expressions over syntactic graphs), robust (to recover from syntactic parsing errors, syntactic patterns can be freely mixed with surface, token-based patterns), and fast (the runtime environment processes 110 sentences/ second in a real-world domain with a grammar of over 200 rules). We used this framework to develop a grammar for the biochemical domain, which approached human performance. Our EE framework is accompanied by a web-based user interface for the rapid development of event grammars and visualization of matches. The ODIN framework and the domain-specific grammars are available as open-source code.

Original languageEnglish (US)
Title of host publicationACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, Proceedings of System Demonstrations
PublisherAssociation for Computational Linguistics (ACL)
Pages127-132
Number of pages6
ISBN (Print)9781941643990
StatePublished - 2015
Event53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, ACL-IJCNLP 2015 - Beijing, China
Duration: Jul 26 2015Jul 31 2015

Other

Other53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, ACL-IJCNLP 2015
CountryChina
CityBeijing
Period7/26/157/31/15

Fingerprint

Syntactics
Application programming interfaces (API)
User interfaces
Visualization
Grammar
Syntax

ASJC Scopus subject areas

  • Language and Linguistics
  • Pollution

Cite this

Valenzuela-Escárcega, M. A., Hahn-Powell, G., Hicks, T., & Surdeanu, M. (2015). A domain-independent rule-based framework for event extraction. In ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, Proceedings of System Demonstrations (pp. 127-132). Association for Computational Linguistics (ACL).

A domain-independent rule-based framework for event extraction. / Valenzuela-Escárcega, Marco A.; Hahn-Powell, Gus; Hicks, Thomas; Surdeanu, Mihai.

ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, Proceedings of System Demonstrations. Association for Computational Linguistics (ACL), 2015. p. 127-132.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Valenzuela-Escárcega, MA, Hahn-Powell, G, Hicks, T & Surdeanu, M 2015, A domain-independent rule-based framework for event extraction. in ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, Proceedings of System Demonstrations. Association for Computational Linguistics (ACL), pp. 127-132, 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, ACL-IJCNLP 2015, Beijing, China, 7/26/15.
Valenzuela-Escárcega MA, Hahn-Powell G, Hicks T, Surdeanu M. A domain-independent rule-based framework for event extraction. In ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, Proceedings of System Demonstrations. Association for Computational Linguistics (ACL). 2015. p. 127-132
Valenzuela-Escárcega, Marco A. ; Hahn-Powell, Gus ; Hicks, Thomas ; Surdeanu, Mihai. / A domain-independent rule-based framework for event extraction. ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, Proceedings of System Demonstrations. Association for Computational Linguistics (ACL), 2015. pp. 127-132
@inproceedings{cd55f5a7209c468e95b073d0c7e2e27c,
title = "A domain-independent rule-based framework for event extraction",
abstract = "We describe the design, development, and API of ODIN (Open Domain INformer), a domainindependent, rule-based event extraction (EE) framework. The proposed EE approach is: simple (most events are captured with simple lexico-syntactic patterns), powerful (the language can capture complex constructs, such as events taking other events as arguments, and regular expressions over syntactic graphs), robust (to recover from syntactic parsing errors, syntactic patterns can be freely mixed with surface, token-based patterns), and fast (the runtime environment processes 110 sentences/ second in a real-world domain with a grammar of over 200 rules). We used this framework to develop a grammar for the biochemical domain, which approached human performance. Our EE framework is accompanied by a web-based user interface for the rapid development of event grammars and visualization of matches. The ODIN framework and the domain-specific grammars are available as open-source code.",
author = "Valenzuela-Esc{\'a}rcega, {Marco A.} and Gus Hahn-Powell and Thomas Hicks and Mihai Surdeanu",
year = "2015",
language = "English (US)",
isbn = "9781941643990",
pages = "127--132",
booktitle = "ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, Proceedings of System Demonstrations",
publisher = "Association for Computational Linguistics (ACL)",

}

TY - GEN

T1 - A domain-independent rule-based framework for event extraction

AU - Valenzuela-Escárcega, Marco A.

AU - Hahn-Powell, Gus

AU - Hicks, Thomas

AU - Surdeanu, Mihai

PY - 2015

Y1 - 2015

N2 - We describe the design, development, and API of ODIN (Open Domain INformer), a domainindependent, rule-based event extraction (EE) framework. The proposed EE approach is: simple (most events are captured with simple lexico-syntactic patterns), powerful (the language can capture complex constructs, such as events taking other events as arguments, and regular expressions over syntactic graphs), robust (to recover from syntactic parsing errors, syntactic patterns can be freely mixed with surface, token-based patterns), and fast (the runtime environment processes 110 sentences/ second in a real-world domain with a grammar of over 200 rules). We used this framework to develop a grammar for the biochemical domain, which approached human performance. Our EE framework is accompanied by a web-based user interface for the rapid development of event grammars and visualization of matches. The ODIN framework and the domain-specific grammars are available as open-source code.

AB - We describe the design, development, and API of ODIN (Open Domain INformer), a domainindependent, rule-based event extraction (EE) framework. The proposed EE approach is: simple (most events are captured with simple lexico-syntactic patterns), powerful (the language can capture complex constructs, such as events taking other events as arguments, and regular expressions over syntactic graphs), robust (to recover from syntactic parsing errors, syntactic patterns can be freely mixed with surface, token-based patterns), and fast (the runtime environment processes 110 sentences/ second in a real-world domain with a grammar of over 200 rules). We used this framework to develop a grammar for the biochemical domain, which approached human performance. Our EE framework is accompanied by a web-based user interface for the rapid development of event grammars and visualization of matches. The ODIN framework and the domain-specific grammars are available as open-source code.

UR - http://www.scopus.com/inward/record.url?scp=84944312138&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84944312138&partnerID=8YFLogxK

M3 - Conference contribution

SN - 9781941643990

SP - 127

EP - 132

BT - ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, Proceedings of System Demonstrations

PB - Association for Computational Linguistics (ACL)

ER -