Abstract
Corpus resources for Spanish have proved invaluable for a number of applications in a wide variety of fields. However, a majority of resources are based on formal, written language and/or are not built to model language variation between varieties of the Spanish language, despite the fact that most language in 'everyday' use is informal/dialogue-based and shows rich regional variation. This paper outlines the development and evaluation of the ACTIV-ES corpus, a first-step to produce a comparable, cross-dialect corpus representative of the 'everyday' language of various regions of the Spanish-speaking world.
Original language | English (US) |
---|---|
Title of host publication | Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014 |
Publisher | European Language Resources Association (ELRA) |
Pages | 1733-1737 |
Number of pages | 5 |
ISBN (Electronic) | 9782951740884 |
State | Published - Jan 1 2014 |
Event | 9th International Conference on Language Resources and Evaluation, LREC 2014 - Reykjavik, Iceland Duration: May 26 2014 → May 31 2014 |
Other
Other | 9th International Conference on Language Resources and Evaluation, LREC 2014 |
---|---|
Country | Iceland |
City | Reykjavik |
Period | 5/26/14 → 5/31/14 |
Keywords
- Corpora
- Dialects
- Spanish
ASJC Scopus subject areas
- Linguistics and Language
- Library and Information Sciences
- Education
- Language and Linguistics