Partitioning Clustering algorithms for handling numerical and categorical data: a review release_odu6orzxfbgavaqdagjrnt4td4

by Trupti M. Kodinariya Dr. Prashant R. Makwana

Released as a article .

2019  

Abstract

Clustering is widely used in different field such as biology, psychology, and economics. Most traditional clustering algorithms are limited to handling datasets that contain either numeric or categorical attributes. However, datasets with mixed types of attributes are common in real life data mining applications. In this paper, we review partitioning based algorithm such as K-prototype, Extension of K-prototype, K-histogram, Fuzzy approaches, genetic approaches, etc. These algorithm works on both numerical and categorical data. The approaches has been proposed to handle mixed data are based on four different perceptive: i) split data set into two part such that each part contain either numerical or categorical data, then apply separate clustering algorithm on each data set, finally combined the result of both clustering algorithm, ii) converting categorical attribute into numerical attribute and apply numerical attribute clustering algorithm; iii) discrimination of numerical attribute and apply categorical based clustering algorithm; iv) Conversion of the categorical attributes into binary ones and apply any numerical based clustering algorithm
In text/plain format

Archived Files and Locations

application/pdf  518.7 kB
file_v7za6pivtvgepk2enqyyt2pnty
arxiv.org (repository)
web.archive.org (webarchive)
Read Archived PDF
Preserved and Accessible
Type  article
Stage   submitted
Date   2019-07-02
Version   v3
Language   en ?
arXiv  1311.7219v3
Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)
Catalog Record
Revision: 91264ea9-dab9-4d4f-bc74-4278df302c38
API URL: JSON