ACTIV-ES: A comparable, cross-dialect corpus of 'everyday' Spanish from Argentina, Mexico, and Spain

Jerid Francom, Mans Hulden, Adam Ussishkin, Julieta Fumagalli, Mikel Santesteban, Julio Serrano

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Scopus citations

Abstract

Corpus resources for Spanish have proved invaluable for a number of applications in a wide variety of fields. However, a majority of resources are based on formal, written language and/or are not built to model language variation between varieties of the Spanish language, despite the fact that most language in 'everyday' use is informal/dialogue-based and shows rich regional variation. This paper outlines the development and evaluation of the ACTIV-ES corpus, a first-step to produce a comparable, cross-dialect corpus representative of the 'everyday' language of various regions of the Spanish-speaking world.

Original languageEnglish (US)
Title of host publicationProceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014
EditorsNicoletta Calzolari, Khalid Choukri, Sara Goggi, Thierry Declerck, Joseph Mariani, Bente Maegaard, Asuncion Moreno, Jan Odijk, Helene Mazo, Stelios Piperidis, Hrafn Loftsson
PublisherEuropean Language Resources Association (ELRA)
Pages1733-1737
Number of pages5
ISBN (Electronic)9782951740884
StatePublished - 2014
Event9th International Conference on Language Resources and Evaluation, LREC 2014 - Reykjavik, Iceland
Duration: May 26 2014May 31 2014

Publication series

NameProceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014

Other

Other9th International Conference on Language Resources and Evaluation, LREC 2014
CountryIceland
CityReykjavik
Period5/26/145/31/14

Keywords

  • Corpora
  • Dialects
  • Spanish

ASJC Scopus subject areas

  • Linguistics and Language
  • Library and Information Sciences
  • Education
  • Language and Linguistics

Fingerprint Dive into the research topics of 'ACTIV-ES: A comparable, cross-dialect corpus of 'everyday' Spanish from Argentina, Mexico, and Spain'. Together they form a unique fingerprint.

Cite this