dc.contributor.author: Krishnamoorthy, Srikumar
dc.date.accessioned: 2022-08-29T09:39:22Z
dc.date.available: 2022-08-29T09:39:22Z
dc.date.issued: 2022-08-17
dc.identifier.citation: Krishnamoorthy, S. (2022). A two-stage integer programming model considering transaction equivalence for privacy preservation. Computers & Operations Research, 105997. [en_US]
dc.identifier.issn: 0305-0548
dc.identifier.uri: http://hdl.handle.net/11718/25802
dc.description.abstract: Preserving privacy is one of the fundamental requirements of firms that share data with their business partners for building advanced data mining models. Firms often aim to protect the disclosure of sensitive knowledge or information discovered during the data mining process. In this study, we investigate the problem of Frequent Itemset Hiding (FIH) which aims to hide sensitive itemset relationships present in a transactional database. We propose a two-stage integer programming model that maximizes the proportion of unaltered transactions in the sanitized database and protects sensitive itemset relationships. The model exploits the concept of transactional equivalence and significantly reduces the size of the FIH problem. In addition, our model enables the identification of solutions with minimal side effects. We conduct an experimental evaluation on both real and synthetic databases to show that our approach is scalable and produces a sanitized database with maximum accuracy. The generated solution is also found to have lower side effects (itemset information loss) compared to other state-of-the-art methods. Our experiments on very large problem instances show problem size reductions of one to three orders of magnitude. The proposed approach is quite attractive and practically useful for solving large-scale FIH problem instances and preserving privacy in increasingly shared and big data-driven organizational environments. [en_US]
dc.language.iso: en [en_US]
dc.publisher: Elsevier [en_US]
dc.relation.ispartof: Computers & Operations Research [en_US]
dc.subject: Data privacy [en_US]
dc.subject: Frequent itemset hiding [en_US]
dc.subject: Integer programming [en_US]
dc.subject: Set covering problem [en_US]
dc.subject: Data mining [en_US]
dc.title: A two-stage integer programming model considering transaction equivalence for privacy preservation [en_US]
dc.type: Article [en_US]

