ACTIV-ES: A comparable, cross-dialect corpus of 'everyday' Spanish from Argentina, Mexico, and Spain

Jerid Francom, Mans Hulden, Adam P Ussishkin, Julieta Fumagalli, Mikel Santesteban, Julio Serrano

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Scopus citations

Abstract

Corpus resources for Spanish have proved invaluable for a number of applications in a wide variety of fields. However, a majority of resources are based on formal, written language and/or are not built to model language variation between varieties of the Spanish language, despite the fact that most language in 'everyday' use is informal/dialogue-based and shows rich regional variation. This paper outlines the development and evaluation of the ACTIV-ES corpus, a first-step to produce a comparable, cross-dialect corpus representative of the 'everyday' language of various regions of the Spanish-speaking world.

Original languageEnglish (US)
Title of host publicationProceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014
PublisherEuropean Language Resources Association (ELRA)
Pages1733-1737
Number of pages5
ISBN (Electronic)9782951740884
StatePublished - Jan 1 2014
Event9th International Conference on Language Resources and Evaluation, LREC 2014 - Reykjavik, Iceland
Duration: May 26 2014May 31 2014

Other

Other9th International Conference on Language Resources and Evaluation, LREC 2014
CountryIceland
CityReykjavik
Period5/26/145/31/14

Keywords

  • Corpora
  • Dialects
  • Spanish

ASJC Scopus subject areas

  • Linguistics and Language
  • Library and Information Sciences
  • Education
  • Language and Linguistics

Fingerprint Dive into the research topics of 'ACTIV-ES: A comparable, cross-dialect corpus of 'everyday' Spanish from Argentina, Mexico, and Spain'. Together they form a unique fingerprint.

  • Cite this

    Francom, J., Hulden, M., Ussishkin, A. P., Fumagalli, J., Santesteban, M., & Serrano, J. (2014). ACTIV-ES: A comparable, cross-dialect corpus of 'everyday' Spanish from Argentina, Mexico, and Spain. In Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014 (pp. 1733-1737). European Language Resources Association (ELRA).