The Role of Surface, Semantic and Grammatical Features on Simplification of Spanish Medical Texts: A User Study

Partha Mukherjee, Gondy Augusta Leroy, David Kauchak, Brianda Armenta Navarrete, Damian Y. Diaz, Sonia - Colina

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

Simplifying medical texts facilitates readability and comprehension. While most simplification work focuses on English, we investigate whether features important for simplifying English text are similarly helpful for simplifying Spanish text. We conducted a user study on 15 Spanish medical texts using Amazon Mechanical Turk and measured perceived and actual difficulty. Using the median of the difficulty scores, we split the texts into easy and difficult groups and extracted 10 surface, 2 semantic and 4 grammatical features. Using t-tests, we identified those features that significantly distinguish easy text from difficult text in Spanish and compare with prior work in English. We found that easy Spanish texts use more repeated words and adverbs, less negations and more familiar words, similar to English. Also like English, difficult Spanish texts use more nouns and adjectives. However in contrast to English, easier Spanish texts contained longer sentences and used grammatical structures that were more varied.

Original languageEnglish (US)
Pages (from-to)1322-1331
Number of pages10
JournalAMIA ... Annual Symposium proceedings. AMIA Symposium
Volume2017
StatePublished - Jan 1 2017

Fingerprint

Semantics
Work Simplification

ASJC Scopus subject areas

  • Medicine(all)

Cite this

The Role of Surface, Semantic and Grammatical Features on Simplification of Spanish Medical Texts : A User Study. / Mukherjee, Partha; Leroy, Gondy Augusta; Kauchak, David; Navarrete, Brianda Armenta; Diaz, Damian Y.; Colina, Sonia -.

In: AMIA ... Annual Symposium proceedings. AMIA Symposium, Vol. 2017, 01.01.2017, p. 1322-1331.

Research output: Contribution to journalArticle

@article{6bd597f6ce474eb5ac0fa34f93d379db,
title = "The Role of Surface, Semantic and Grammatical Features on Simplification of Spanish Medical Texts: A User Study",
abstract = "Simplifying medical texts facilitates readability and comprehension. While most simplification work focuses on English, we investigate whether features important for simplifying English text are similarly helpful for simplifying Spanish text. We conducted a user study on 15 Spanish medical texts using Amazon Mechanical Turk and measured perceived and actual difficulty. Using the median of the difficulty scores, we split the texts into easy and difficult groups and extracted 10 surface, 2 semantic and 4 grammatical features. Using t-tests, we identified those features that significantly distinguish easy text from difficult text in Spanish and compare with prior work in English. We found that easy Spanish texts use more repeated words and adverbs, less negations and more familiar words, similar to English. Also like English, difficult Spanish texts use more nouns and adjectives. However in contrast to English, easier Spanish texts contained longer sentences and used grammatical structures that were more varied.",
author = "Partha Mukherjee and Leroy, {Gondy Augusta} and David Kauchak and Navarrete, {Brianda Armenta} and Diaz, {Damian Y.} and Colina, {Sonia -}",
year = "2017",
month = "1",
day = "1",
language = "English (US)",
volume = "2017",
pages = "1322--1331",
journal = "AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium",
issn = "1559-4076",
publisher = "American Medical Informatics Association",

}

TY - JOUR

T1 - The Role of Surface, Semantic and Grammatical Features on Simplification of Spanish Medical Texts

T2 - A User Study

AU - Mukherjee, Partha

AU - Leroy, Gondy Augusta

AU - Kauchak, David

AU - Navarrete, Brianda Armenta

AU - Diaz, Damian Y.

AU - Colina, Sonia -

PY - 2017/1/1

Y1 - 2017/1/1

N2 - Simplifying medical texts facilitates readability and comprehension. While most simplification work focuses on English, we investigate whether features important for simplifying English text are similarly helpful for simplifying Spanish text. We conducted a user study on 15 Spanish medical texts using Amazon Mechanical Turk and measured perceived and actual difficulty. Using the median of the difficulty scores, we split the texts into easy and difficult groups and extracted 10 surface, 2 semantic and 4 grammatical features. Using t-tests, we identified those features that significantly distinguish easy text from difficult text in Spanish and compare with prior work in English. We found that easy Spanish texts use more repeated words and adverbs, less negations and more familiar words, similar to English. Also like English, difficult Spanish texts use more nouns and adjectives. However in contrast to English, easier Spanish texts contained longer sentences and used grammatical structures that were more varied.

AB - Simplifying medical texts facilitates readability and comprehension. While most simplification work focuses on English, we investigate whether features important for simplifying English text are similarly helpful for simplifying Spanish text. We conducted a user study on 15 Spanish medical texts using Amazon Mechanical Turk and measured perceived and actual difficulty. Using the median of the difficulty scores, we split the texts into easy and difficult groups and extracted 10 surface, 2 semantic and 4 grammatical features. Using t-tests, we identified those features that significantly distinguish easy text from difficult text in Spanish and compare with prior work in English. We found that easy Spanish texts use more repeated words and adverbs, less negations and more familiar words, similar to English. Also like English, difficult Spanish texts use more nouns and adjectives. However in contrast to English, easier Spanish texts contained longer sentences and used grammatical structures that were more varied.

UR - http://www.scopus.com/inward/record.url?scp=85058769457&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85058769457&partnerID=8YFLogxK

M3 - Article

C2 - 29854201

AN - SCOPUS:85058769457

VL - 2017

SP - 1322

EP - 1331

JO - AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium

JF - AMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium

SN - 1559-4076

ER -