Automatic Human-like Mining and Constructing Reliable Genetic Association Database with Deep Reinforcement Learning

by Haohan Wang, Xiang Liu, Yifeng Tao, Wenting Ye, Qiao Jin, William W. Cohen, Eric P. Xing

Released as a post by Cold Spring Harbor Laboratory.

2018  

Abstract

The increasing amount of scientific literature in biological and biomedical research has made it challenging to curate the latest discoveries continuously and reliably, and automatic biomedical text mining has been one of the answers to this challenge. In this paper, we aim to further improve the reliability of biomedical text mining by training the system to directly simulate human behaviors such as querying PubMed, selecting articles from the query results, and reading the selected articles for knowledge. We combine the efficiency of biomedical text mining, the flexibility of deep reinforcement learning, and the massive amount of knowledge collected in UMLS into an integrated artificial-intelligence reader that can automatically identify authentic articles and effectively acquire the knowledge they convey. We construct a system whose current primary task is to build a database of genetic associations between genes and complex human traits. Our contributions in this paper are three-fold: 1) We propose to improve the reliability of text mining by building a system that directly simulates the behavior of a researcher, and we develop corresponding methods, such as a bi-directional LSTM for text mining and a Deep Q-Network for organizing behaviors. 2) We demonstrate the effectiveness of our system with an example of constructing a genetic association database. 3) We release our implementation as a generic framework that researchers in the community can use to conveniently construct other databases.

Archived Files and Locations

application/pdf  994.7 kB
file_zyac7r5lczbc7exr2ru3swu4me
www.biorxiv.org (web)
web.archive.org (webarchive)
application/pdf  993.8 kB
file_3dh3db63ofhuffhqrgwbb2zh3q
www.biorxiv.org (repository)
web.archive.org (webarchive)
Type  post
Stage   unknown
Date   2018-10-05