skip to main content
10.1109/CBMS.2014.85guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

A Benchmark of Globally-Optimal Anonymization Methods for Biomedical Data

Published: 27 May 2014 Publication History

Abstract

Collaboration and data sharing have become core elements of biomedical research. At the same time, there is a growing understanding of privacy threats related to data sharing, especially when sensitive data from distributed sources become available for linkage. Statistical disclosure control comprises well-known data anonymization techniques that allow the protection of data by introducing fuzziness. To protect datasets from different types of threats, different privacy criteria are commonly implemented. Data anonymization is an important measure, but it is computationally complex, and it can significantly reduce the expressiveness of data. To attenuate these problems, a number of algorithms has been proposed, which aim at increasing data quality or improving efficiency. Previous evaluations of such algorithms lack a systematic approach, as they focus on specific algorithms, specific privacy criteria, and specific runtime environments. Therefore, it is difficult for decision makers to decide which algorithm is best suited for their requirements. As a first step towards a comprehensive and systematic evaluation of anonymity algorithms, we report on our ongoing efforts for providing an open source benchmark. In this contribution, we focus on optimal algorithms utilizing global recoding with full-domain generalization. We present a systematic evaluation of domain-specific algorithms and generic search methods for a broad set of privacy criteria, including k-anonymity, l-diversity, t-closeness and d-presence, and their use in multiple real-world datasets. Our results show that there is no single solution fitting all needs, and that generic search methods can outperform highly specialized algorithms.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
CBMS '14: Proceedings of the 2014 IEEE 27th International Symposium on Computer-Based Medical Systems
May 2014
571 pages
ISBN:9781479944354

Publisher

IEEE Computer Society

United States

Publication History

Published: 27 May 2014

Author Tags

  1. δ-presence
  2. benchmark
  3. de-identification
  4. k-anonymity
  5. l-diversity
  6. privacy
  7. statistical disclosure control
  8. t-closeness

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2022) k-Anonymity in practiceComputers and Security10.1016/j.cose.2021.102488111:COnline publication date: 9-Apr-2022
  • (2018)LightningTransactions on Data Privacy10.5555/2993206.29932099:2(161-185)Online publication date: 13-Dec-2018
  • (2018)An Evaluation of Anonymized Models and Ensemble ClassifiersProceedings of the 2018 2nd International Conference on Big Data and Internet of Things10.1145/3289430.3289435(18-22)Online publication date: 24-Oct-2018
  • (2018)The cost of qualityJournal of Biomedical Informatics10.1016/j.jbi.2015.09.00758:C(37-48)Online publication date: 27-Dec-2018
  • (2017)Enhancing the Utility of Anonymized Data by Improving the Quality of Generalization HierarchiesTransactions on Data Privacy10.5555/3121409.312141110:1(27-59)Online publication date: 1-Apr-2017

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media