Chronic disease related entity extraction in online Chinese question and answer services

Yan Zhang, Yong Zhang, Yanshen Yin, Jennifer Xu, Chunxiao Xing, Hsinchun Chen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

Chinese chronic disease entity extraction aims to extract health related entities from online questions and answers (QA). Our research tackles challenges in Chinese chronic disease entity extraction from three aspects: Chinese health lexicons construction, feature development, and equivalence conjunctions tagging. We construct large scale Chinese health lexicons based on expert knowledge and the Web resources; develop a feature extraction approach that draws out character, part-of-speech, and lexical features from QA data; and improve the performance of answer entity extraction by leveraging equivalence conjunctions (punctuation marks and conjunctional words) in Chinese to capture dependencies between tags of entities. Experiments on question and answer entity extraction demonstrate that the Precision, Recall and F-1 score are improved using our proposed features, and the Precision and F-1 score can be further improved by considering equivalence conjunctions.

Original languageEnglish (US)
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PublisherSpringer Verlag
Pages55-67
Number of pages13
Volume9545
ISBN (Print)9783319291741
DOIs
StatePublished - 2016
EventInternational Conference for Smart Health, ICSH 2015 - Phoenix, United States
Duration: Nov 17 2015Nov 18 2015

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9545
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

OtherInternational Conference for Smart Health, ICSH 2015
CountryUnited States
CityPhoenix
Period11/17/1511/18/15

Fingerprint

Chronic Disease
Health
Equivalence
Feature extraction
Tagging
Feature Extraction
Resources
Experiments
Demonstrate
Experiment

Keywords

  • Entity extraction
  • Health lexicon
  • QA

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Zhang, Y., Zhang, Y., Yin, Y., Xu, J., Xing, C., & Chen, H. (2016). Chronic disease related entity extraction in online Chinese question and answer services. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9545, pp. 55-67). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9545). Springer Verlag. https://doi.org/10.1007/978-3-319-29175-8_6

Chronic disease related entity extraction in online Chinese question and answer services. / Zhang, Yan; Zhang, Yong; Yin, Yanshen; Xu, Jennifer; Xing, Chunxiao; Chen, Hsinchun.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 9545 Springer Verlag, 2016. p. 55-67 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9545).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Zhang, Y, Zhang, Y, Yin, Y, Xu, J, Xing, C & Chen, H 2016, Chronic disease related entity extraction in online Chinese question and answer services. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 9545, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 9545, Springer Verlag, pp. 55-67, International Conference for Smart Health, ICSH 2015, Phoenix, United States, 11/17/15. https://doi.org/10.1007/978-3-319-29175-8_6
Zhang Y, Zhang Y, Yin Y, Xu J, Xing C, Chen H. Chronic disease related entity extraction in online Chinese question and answer services. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 9545. Springer Verlag. 2016. p. 55-67. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-319-29175-8_6
Zhang, Yan ; Zhang, Yong ; Yin, Yanshen ; Xu, Jennifer ; Xing, Chunxiao ; Chen, Hsinchun. / Chronic disease related entity extraction in online Chinese question and answer services. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 9545 Springer Verlag, 2016. pp. 55-67 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{65451c0cc1da4faaa3238625e2f683b7,
title = "Chronic disease related entity extraction in online Chinese question and answer services",
abstract = "Chinese chronic disease entity extraction aims to extract health related entities from online questions and answers (QA). Our research tackles challenges in Chinese chronic disease entity extraction from three aspects: Chinese health lexicons construction, feature development, and equivalence conjunctions tagging. We construct large scale Chinese health lexicons based on expert knowledge and the Web resources; develop a feature extraction approach that draws out character, part-of-speech, and lexical features from QA data; and improve the performance of answer entity extraction by leveraging equivalence conjunctions (punctuation marks and conjunctional words) in Chinese to capture dependencies between tags of entities. Experiments on question and answer entity extraction demonstrate that the Precision, Recall and F-1 score are improved using our proposed features, and the Precision and F-1 score can be further improved by considering equivalence conjunctions.",
keywords = "Entity extraction, Health lexicon, QA",
author = "Yan Zhang and Yong Zhang and Yanshen Yin and Jennifer Xu and Chunxiao Xing and Hsinchun Chen",
year = "2016",
doi = "10.1007/978-3-319-29175-8_6",
language = "English (US)",
isbn = "9783319291741",
volume = "9545",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "55--67",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - Chronic disease related entity extraction in online Chinese question and answer services

AU - Zhang, Yan

AU - Zhang, Yong

AU - Yin, Yanshen

AU - Xu, Jennifer

AU - Xing, Chunxiao

AU - Chen, Hsinchun

PY - 2016

Y1 - 2016

N2 - Chinese chronic disease entity extraction aims to extract health related entities from online questions and answers (QA). Our research tackles challenges in Chinese chronic disease entity extraction from three aspects: Chinese health lexicons construction, feature development, and equivalence conjunctions tagging. We construct large scale Chinese health lexicons based on expert knowledge and the Web resources; develop a feature extraction approach that draws out character, part-of-speech, and lexical features from QA data; and improve the performance of answer entity extraction by leveraging equivalence conjunctions (punctuation marks and conjunctional words) in Chinese to capture dependencies between tags of entities. Experiments on question and answer entity extraction demonstrate that the Precision, Recall and F-1 score are improved using our proposed features, and the Precision and F-1 score can be further improved by considering equivalence conjunctions.

AB - Chinese chronic disease entity extraction aims to extract health related entities from online questions and answers (QA). Our research tackles challenges in Chinese chronic disease entity extraction from three aspects: Chinese health lexicons construction, feature development, and equivalence conjunctions tagging. We construct large scale Chinese health lexicons based on expert knowledge and the Web resources; develop a feature extraction approach that draws out character, part-of-speech, and lexical features from QA data; and improve the performance of answer entity extraction by leveraging equivalence conjunctions (punctuation marks and conjunctional words) in Chinese to capture dependencies between tags of entities. Experiments on question and answer entity extraction demonstrate that the Precision, Recall and F-1 score are improved using our proposed features, and the Precision and F-1 score can be further improved by considering equivalence conjunctions.

KW - Entity extraction

KW - Health lexicon

KW - QA

UR - http://www.scopus.com/inward/record.url?scp=84958530669&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84958530669&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-29175-8_6

DO - 10.1007/978-3-319-29175-8_6

M3 - Conference contribution

AN - SCOPUS:84958530669

SN - 9783319291741

VL - 9545

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 55

EP - 67

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

PB - Springer Verlag

ER -