Optimizing feature representation for automated systematic review work prioritization release_ukdqmdokabhc3no5egm5cqnz5e

by Aaron M Cohen

Published in AMIA Annual Symposium Proceedings.

2008   p121-5

Abstract

Automated document classification can be a valuable tool for enhancing the efficiency of creating and updating systematic reviews (SRs) for evidence-based medicine. One way document classification can help is in performing work prioritization: given a set of documents, order them such that the most likely useful documents appear first. We evaluated several alternate classification feature systems including unigram, n-gram, MeSH, and natural language processing (NLP) feature sets for their usefulness on 15 SR tasks, using the area under the receiver operating curve as a measure of goodness. We also examined the impact of topic-specific training data compared to general SR inclusion data. The best feature set used a combination of n-gram and MeSH features. NLP-based features were not found to improve performance. Furthermore, topic-specific training data usually provides a significant performance gain over more general SR training.
In text/plain format

Archived Files and Locations

application/pdf  138.7 kB
file_kasy4ke5qrcrpgosjwuuyweupy
europepmc.org (repository)
web.archive.org (webarchive)
Read Archived PDF
Preserved and Accessible
Type  article-journal
Stage   published
Date   2008-11-06
Language   en ?
PubMed  18998798
PMC  PMC2656096
Proceedings Metadata
Open Access Publication
Not in DOAJ
In ISSN ROAD
In Keepers Registry
ISSN-L:  1559-4076
Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)
Catalog Record
Revision: a00d17e7-60c5-4aa4-875c-d32f2512f080
API URL: JSON