Effective Crowd-Annotation of Participants, Interventions, and Outcomes in the Text of Clinical Trial Reports

Authors:

Markus Zlabinger

Reka Marta Sabou

Sebastian Hofstätter

Allan Hanbury

Type:

Proceedings contribution

Proceedings:

Findings of the Association for Computational Linguistics: EMNLP 2020

Publisher:

The Association for Computational Linguistics

Pages:

3064 - 3074

ISBN:

Year:

2020

Abstract:

The search for Participants, Interventions, and Outcomes (PIO) in clinical trial reports is a critical task in Evidence Based Medicine. For an automatic PIO extraction, high-quality corpora are needed. Obtaining such a corpus from crowdworkers, however, has been shown to be ineffective since (i) workers usually lack domain-specific expertise to conduct the task with sufficient quality, and (ii) the standard approach of annotating entire abstracts of trial reports as one task-instance (i.e. HIT) leads to an uneven distribution in task effort. In this paper, we switch from entire abstract to sentence annotation, referred to as the SenBase approach. We build upon SenBase in SenSupport, where we compensate the lack of domain-specific expertise of crowdworkers by showing for each task-instance similar sentences that are already annotated by experts. Such tailored task-instance examples are retrieved via unsupervised semantic short-text similarity (SSTS) method - and we evaluate nine methods to find an effective solution for SenSupport. We compute the Cohen´s Kappa agreement between crowd-annotations and gold standard annotations and show that (i) both sentence-based approaches outperform a Baseline approach where entire abstracts are annotated; (ii) supporting annotators with tailored task-instance examples is the best performing approach with Kappa agreements of 0.78/0.75/0.69 for P, I, and O respectively.

TU Focus:

Information and Communication Technology

Reference:

M. Zlabinger, R. Sabou, S. Hofstätter, A. Hanbury:
"Effective Crowd-Annotation of Participants, Interventions, and Outcomes in the Text of Clinical Trial Reports";
in: "Findings of the Association for Computational Linguistics: EMNLP 2020", T. Cohn, Y. He, Y. Liu (Hrg.); herausgegeben von: Association for Computational Linguistics; The Association for Computational Linguistics, 2020, S. 3064 - 3074.

Zusätzliche Informationen

PDF Link:

Last changed:

09.01.2021 03:12:23

TU Id:

294081

Accepted:

Accepted

Invited:

Department Focus:

Business Informatics

Info Link:

https://publik.tuwien.ac.at/showentry.php?ID=294081&lang=1

Abstract German:

Author List:

M. Zlabinger, R. Sabou, S. Hofstätter, A. Hanbury

Main menu

Effective Crowd-Annotation of Participants, Interventions, and Outcomes in the Text of Clinical Trial Reports

Who's online

Contact

Offenlegung gemäß § 25 Mediengesetz:

Datenschutzerklärung

In case of problems

Effective Crowd-Annotation of Participants, Interventions, and Outcomes in the Text of Clinical Trial Reports

Search form

Who's online

Contact

Offenlegung gemäß § 25 Mediengesetz:

Datenschutzerklärung

In case of problems