Reducing Side Effects of Hiding Sensitive Itemsets in Privacy Preserving Data Mining release_6fclxv4kczexdptwjnpsuczsyy

by Chun-Wei Lin, Tzung-Pei Hong, Hung-Chuan Hsu

Published in The Scientific World Journal by Hindawi Limited.

2014   Volume 2014, p1-12

Abstract

Data mining is traditionally adopted to retrieve and analyze knowledge from large amounts of data. Private or confidential data may be sanitized or suppressed before it is shared or published in public. Privacy preserving data mining (PPDM) has thus become an important issue in recent years. The most general way of PPDM is to sanitize the database to hide the sensitive information. In this paper, a novel hiding-missing-artificial utility (HMAU) algorithm is proposed to hide sensitive itemsets through transaction deletion. The transaction with the maximal ratio of sensitive to nonsensitive one is thus selected to be entirely deleted. Three side effects of hiding failures, missing itemsets, and artificial itemsets are considered to evaluate whether the transactions are required to be deleted for hiding sensitive itemsets. Three weights are also assigned as the importance to three factors, which can be set according to the requirement of users. Experiments are then conducted to show the performance of the proposed algorithm in execution time, number of deleted transactions, and number of side effects.
In application/xml+jats format

Archived Files and Locations

application/pdf  798.6 kB
file_m75z3oawbzbutauhidfhy7ebka
europepmc.org (repository)
web.archive.org (webarchive)
application/pdf  1.6 MB
file_lfjdgilbyvgk7ndzjyds65dnna
downloads.hindawi.com (publisher)
web.archive.org (webarchive)
Read Archived PDF
Preserved and Accessible
Type  article-journal
Stage   published
Year   2014
Language   en ?
DOI  10.1155/2014/235837
PubMed  24982932
PMC  PMC4005146
Wikidata  Q53512627
Container Metadata
Open Access Publication
In DOAJ
In ISSN ROAD
In Keepers Registry
ISSN-L:  1537-744X
Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)
Catalog Record
Revision: bfc05a0c-2e84-49eb-bb8f-077feeea5698
API URL: JSON