Integrating Informativeness, Representativeness and Diversity in Pool-Based Sequential Active Learning for Regression

by Ziang Liu, Dongrui Wu

Released as an article.

2020  

Abstract

In many real-world machine learning applications, unlabeled samples are easy to obtain, but it is expensive and/or time-consuming to label them. Active learning is a common approach for reducing this data labeling effort. It optimally selects the best few samples to label, so that a better machine learning model can be trained from the same number of labeled samples. This paper considers active learning for regression (ALR) problems. Three essential criteria -- informativeness, representativeness, and diversity -- have been proposed for ALR. However, very few approaches in the literature have considered all three of them simultaneously. We propose three new ALR approaches, with different strategies for integrating the three criteria. Extensive experiments on 12 datasets in various domains demonstrated their effectiveness.

Type  article
Stage   submitted
Date   2020-03-26
Version   v1
Language   en
arXiv  2003.11786v1