A scikit-based Python environment for performing multi-label classification release_buzoouag3bd5xowjivy32a5une

by Piotr Szymański, Tomasz Kajdanowicz

Released as a article .

2018  

Abstract

scikit-multilearn is a Python library for performing multi-label classification. The library is compatible with the scikit/scipy ecosystem and uses sparse matrices for all internal operations. It provides native Python implementations of popular multi-label classification methods alongside a novel framework for label space partitioning and division. It includes modern algorithm adaptation methods, network-based label space division approaches, which extracts label dependency information and multi-label embedding classifiers. It provides python wrapped access to the extensive multi-label method stack from Java libraries and makes it possible to extend deep learning single-label methods for multi-label tasks. The library allows multi-label stratification and data set management. The implementation is more efficient in problem transformation than other established libraries, has good test coverage and follows PEP8. Source code and documentation can be downloaded from http://scikit.ml and also via pip. The library follows BSD licensing scheme.
In text/plain format

Archived Files and Locations

application/pdf  378.6 kB
file_kthbnufh2bhnnbixug35plipja
arxiv.org (repository)
web.archive.org (webarchive)
Read Archived PDF
Preserved and Accessible
Type  article
Stage   submitted
Date   2018-12-07
Version   v4
Language   en ?
arXiv  1702.01460v4
Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)
Catalog Record
Revision: add0da2d-0a39-4b0e-8c44-33ca2c089510
API URL: JSON