Beyond k-Anonymity Arik Friedman November 2008 Seminar in Databases (236826)

Outline
- Recap – privacy and k-anonymity
- ℓ-diversity (beyond k-anonymity)
- t-closeness (beyond k-anonymity and ℓ-diversity)
- Privacy?

Recap – k-Anonymity
Goal: use medical data without disclosing patients' identity.
The problem: an attacker can cross (link) the released data with external data.
[Diagram: Medical data (Ethnicity, Visit date, Diagnosis, Procedure, Medication, Total charge) and a Voter List (Name, Address, Date registered, Party affiliation, Date last voted) overlap on Zip, Birthdate and Gender – the quasi-identifier.]

k-Anonymity – Formal Definition
- RT – released table with attributes (A1, A2, …, An)
- QI_RT – quasi-identifier: the subset of attributes that can be used for linking
- RT[QI_RT] – projection of RT onto QI_RT
- RT satisfies k-anonymity if every combination of values that appears in RT[QI_RT] appears there at least k times
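
To make the definition concrete, here is a minimal Python sketch (not part of the original slides) that checks whether a table satisfies k-anonymity; the record layout and attribute names are illustrative assumptions.

```python
from collections import Counter

def is_k_anonymous(records, quasi_identifier, k):
    """Return True if every combination of quasi-identifier values
    appears in at least k records (i.e., no group in RT[QI_RT] is smaller than k)."""
    groups = Counter(tuple(r[a] for a in quasi_identifier) for r in records)
    return all(count >= k for count in groups.values())

# Example: the 4-anonymized table below repeats each QI combination four times.
records = (
    [{"ZIP": "130**", "Age": "<30", "Condition": c}
     for c in ["Heart Disease", "Heart Disease", "Viral Infection", "Viral Infection"]]
    + [{"ZIP": "1485*", "Age": ">=40", "Condition": c}
       for c in ["Cancer", "Heart Disease", "Viral Infection", "Viral Infection"]]
    + [{"ZIP": "130**", "Age": "3*", "Condition": c} for c in ["Cancer"] * 4]
)
print(is_k_anonymous(records, ["ZIP", "Age"], k=4))  # True
```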

Example – original data (the inpatient microdata running example from the ℓ-diversity paper)
Non-sensitive: ZIP, Age, Nationality; Sensitive: Condition

#  | ZIP   | Age | Nationality | Condition
1  | 13053 | 28  | Russian     | Heart Disease
2  | 13068 | 29  | American    | Heart Disease
3  | 13068 | 21  | Japanese    | Viral Infection
4  | 13053 | 23  | American    | Viral Infection
5  | 14853 | 50  | Indian      | Cancer
6  | 14853 | 55  | Russian     | Heart Disease
7  | 14850 | 47  | American    | Viral Infection
8  | 14850 | 49  | American    | Viral Infection
9  | 13053 | 31  | American    | Cancer
10 | 13053 | 37  | Indian      | Cancer
11 | 13068 | 36  | Japanese    | Cancer
12 | 13068 | 35  | American    | Cancer

Example – 4-anonymized Table
Non-sensitive: ZIP, Age, Nationality; Sensitive: Condition

#  | ZIP   | Age  | Nationality | Condition
1  | 130** | <30  | *           | Heart Disease
2  | 130** | <30  | *           | Heart Disease
3  | 130** | <30  | *           | Viral Infection
4  | 130** | <30  | *           | Viral Infection
5  | 1485* | ≥40  | *           | Cancer
6  | 1485* | ≥40  | *           | Heart Disease
7  | 1485* | ≥40  | *           | Viral Infection
8  | 1485* | ≥40  | *           | Viral Infection
9  | 130** | 3*   | *           | Cancer
10 | 130** | 3*   | *           | Cancer
11 | 130** | 3*   | *           | Cancer
12 | 130** | 3*   | *           | Cancer

We have 4-anonymity!!! We have privacy!!!!

Example – 4-anonymized Table: attacking it

#  | ZIP   | Age  | Nat. | Condition
1  | 130** | <30  | *    | Heart Disease
2  | 130** | <30  | *    | Heart Disease
3  | 130** | <30  | *    | Viral Infection
4  | 130** | <30  | *    | Viral Infection
5  | 1485* | ≥40  | *    | Cancer
6  | 1485* | ≥40  | *    | Heart Disease
7  | 1485* | ≥40  | *    | Viral Infection
8  | 1485* | ≥40  | *    | Viral Infection
9  | 130** | 3*   | *    | Cancer
10 | 130** | 3*   | *    | Cancer
11 | 130** | 3*   | *    | Cancer
12 | 130** | 3*   | *    | Cancer

Suppose the attacker knows the non-sensitive attributes of:

Name  | Zip   | Age | Nationality
Umeko | 13068 | 21  | Japanese
Bob   | 13053 | 31  | American

and the fact that Japanese have a very low incidence of heart disease.

Bob has cancer! (every record in his block has Cancer)
Umeko has a viral infection! (heart disease is ruled out in her block)

k-Anonymity Drawbacks
Basic reasons for the leak:
- The sensitive attribute values lack diversity → homogeneity attack
- The attacker has additional background knowledge → background-knowledge attack
Hence a new notion has been proposed in addition to k-anonymity: ℓ-diversity.

Adversary's background knowledge
- Has access to the published table T* and knows that it is a generalization of some base table T
- Instance-level background knowledge:
  - Some individuals are present in the table
  - Knowledge about sensitive attributes of specific individuals
- Demographic background knowledge:
  - Partial knowledge about the distribution of sensitive and non-sensitive attributes in the population
Diversity in the sensitive attribute values should mitigate both!

Some notation…
- T = {t1, t2, …, tn}: a table with attributes A1, A2, …, Am, a subset of some population Ω
- t[C] = (t[C1], t[C2], …, t[Cp]): projection of t onto a set of attributes C ⊆ A
- S ⊆ A – sensitive attributes
- QI ⊆ A – quasi-identifier attributes
- T* – anonymized table
- q*-block – the set of records that were generalized to the same value q* in T*

Bayes Optimal Privacy
- Ideal notion of privacy: models background knowledge as a probability distribution over attributes
- Uses Bayesian inference techniques
- Simplifying assumptions:
  - A single, multi-dimensional quasi-identifier attribute Q
  - A single sensitive attribute S
  - T is a simple random sample from Ω
  - The adversary, Alice, knows the complete joint distribution f of Q and S (worst-case assumption)

Bayes Optimal Privacy
- Assume Bob appears in the generalized table T*.
- Alice's prior belief about Bob's sensitive attribute:
  α(q,s) = P_f( t[S] = s | t[Q] = q )
- After seeing T*, Alice's belief changes to its posterior value (the observed belief):
  β(q,s,T*) = P_f( t[S] = s | t[Q] = q ∧ ∃ t* ∈ T*, t* generalizes t )
- We wouldn't want Alice to learn "much": β(q,s,T*) should not be much larger than α(q,s).

Bayes Optimal Privacy – Example
- Bob, Alice's neighbor, is a 62-year-old state employee.
- Alice's prior belief: 10% of men over 60 have cancer:
  α(age ≥ 60 ∧ ZIP = 02138, cancer) = α(age ≥ 60, cancer) = 0.1
- In the k-anonymized GIC data T*, the following rows could relate to Bob:

  Age | Zipcode | Diagnosis
  …   | …       | Cancer
  …   | …       | Cancer
  …   | …       | Healthy
  …   | …       | Pneumonia

- Alice's belief changes to its posterior value:
  β(age ≥ 60 ∧ ZIP = 02138, cancer, T*) = 0.5

Bayes Optimal Privacy
Theorem 3.1 gives the observed belief in closed form:
  β(q,s,T*) = [ n(q*,s) · f(s|q) / f(s|q*) ]  /  [ Σ_{s'∈S} n(q*,s') · f(s'|q) / f(s'|q*) ]
where n(q*,s') is the number of tuples in T* with t*[Q] = q* and t*[S] = s', and f(·|q*) is the distribution of the sensitive value conditioned on the quasi-identifier falling in the region generalized by q*.

Privacy principles
- Positive disclosure: the adversary can correctly identify the value of a sensitive attribute: ∃ q,s such that β(q,s,T*) > 1 − δ, for a given δ.
- Negative disclosure: the adversary can correctly eliminate a possible value of a sensitive attribute: β(q,s,T*) < ε for a given ε, and ∃ t ∈ T such that t[Q] = q but t[S] ≠ s.

Privacy principles
- Note: not all positive and negative disclosures are bad. If Alice already knew that Bob has cancer, there is not much one can do!
- Uninformative principle: there should not be a large difference between the prior and posterior beliefs.

Bayes Optimal Privacy – limitations in practice
- Insufficient knowledge: the data publisher is unlikely to know f.
- The publisher does not know how much the adversary actually knows:
  - The adversary may have instance-level knowledge
  - There is no way to model non-probabilistic knowledge
- Multiple adversaries may have different levels of knowledge.
Hence a practical definition is needed.

ℓ-diversity principle
Revisit Theorem 3.1: positive disclosure can occur when β(q,s,T*) ≈ 1, i.e., when for every other sensitive value s' ≠ s, either n(q*,s') ≈ 0 or f(s'|q) ≈ 0.

ℓ-diversity principle
Positive disclosure can occur due to a combination of:
- Lack of diversity (few sensitive values appear in the q*-block)
- Strong background knowledge (that eliminates the remaining values)
Mitigation: require "well-represented" sensitive values in every q*-block, so that at least ℓ − 1 damaging pieces of background knowledge are required for the attack to succeed.

ℓ-diversity principle
A q*-block is ℓ-diverse if it contains at least ℓ "well-represented" values for the sensitive attribute S. A table is ℓ-diverse if every q*-block is ℓ-diverse.
Example – distinct ℓ-diversity: there are at least ℓ distinct values for the sensitive attribute in each q*-block.
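
As an illustration, here is a small Python sketch (my own, not from the paper) of the simplest instantiation, distinct ℓ-diversity: group records by their generalized quasi-identifier and require at least ℓ distinct sensitive values per q*-block.

```python
from collections import defaultdict

def is_distinct_l_diverse(records, quasi_identifier, sensitive, l):
    """True if every q*-block contains at least l distinct sensitive values."""
    blocks = defaultdict(set)
    for r in records:
        blocks[tuple(r[a] for a in quasi_identifier)].add(r[sensitive])
    return all(len(values) >= l for values in blocks.values())

# The homogeneous {Cancer, Cancer, Cancer, Cancer} block of the 4-anonymized table
# fails even 2-diversity, although it is perfectly 4-anonymous.
block = [{"ZIP": "130**", "Age": "3*", "Condition": "Cancer"} for _ in range(4)]
print(is_distinct_l_diverse(block, ["ZIP", "Age"], "Condition", 2))  # False
```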

Example – 3-distinct-diverse Table
Non-sensitive: ZIP, Age, Nationality; Sensitive: Condition

#  | ZIP   | Age  | Nationality | Condition
1  | 1305* | ≤40  | *           | Heart Disease
2  | 1305* | ≤40  | *           | Viral Infection
3  | 1305* | ≤40  | *           | Cancer
4  | 1305* | ≤40  | *           | Cancer
5  | 1485* | ≥40  | *           | Cancer
6  | 1485* | ≥40  | *           | Heart Disease
7  | 1485* | ≥40  | *           | Viral Infection
8  | 1485* | ≥40  | *           | Viral Infection
9  | 1306* | ≤40  | *           | Heart Disease
10 | 1306* | ≤40  | *           | Viral Infection
11 | 1306* | ≤40  | *           | Cancer
12 | 1306* | ≤40  | *           | Cancer

We have 3-distinct diversity!!! We have privacy!!!!

Example – 3-distinct-diverse table: attacking it

#  | ZIP   | Age | Nat. | Condition
1  | 130** | <30 | *    | Heart Disease
2  | 130** | <30 | *    | Heart Disease
3  | 130** | <30 | *    | Viral Infection
4  | 130** | <30 | *    | Viral Infection
5  | 130** | <30 | *    | Viral Infection
6  | 130** | <30 | *    | Viral Infection
7  | 130** | <30 | *    | Viral Infection
8  | 130** | <30 | *    | Viral Infection
9  | 130** | <30 | *    | Viral Infection
10 | 130** | <30 | *    | Viral Infection
11 | 130** | <30 | *    | Viral Infection
12 | 130** | <30 | *    | Cancer

Suppose the attacker knows the non-sensitive attributes of:

Name  | Zip   | Age | Nationality
Umeko | 13068 | 21  | Japanese

and the fact that Japanese have a very low incidence of heart disease.

It is still very likely that Umeko has a viral infection! Distinct values alone are not enough – the values must also be well represented.

Entropy ℓ-diversity
A table is entropy ℓ-diverse if for every q*-block:
  −Σ_{s∈S} p(q*,s) · log p(q*,s) ≥ log ℓ
where p(q*,s) is the fraction of tuples in the q*-block with sensitive value s.
Not feasible when one sensitive value is very common: the entropy of the sensitive attribute in the entire table must then already be at least log ℓ.
[The slide's table illustrated this with two sensitive values, tabulating the entropy for different splits p(S1), p(S2).]
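
A short sketch, under the same illustrative assumptions as the earlier snippets, of the entropy ℓ-diversity test for a single q*-block; any logarithm base works as long as both sides of the inequality use the same one.

```python
import math
from collections import Counter

def is_entropy_l_diverse(sensitive_values, l):
    """True if -sum_s p(s) * log(p(s)) >= log(l) for this q*-block."""
    counts = Counter(sensitive_values)
    n = len(sensitive_values)
    entropy = -sum((c / n) * math.log(c / n) for c in counts.values())
    return entropy >= math.log(l)

print(is_entropy_l_diverse(["Cancer"] * 4, 2))                      # False: no diversity at all
print(is_entropy_l_diverse(["Cancer", "Flu", "Flu", "Cancer"], 2))  # True: entropy equals log 2
```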

Recursive (c, ℓ)-diversity
- None of the sensitive values should occur too frequently.
- Let r_i be the count of the i-th most frequent sensitive value in the q*-block.
- Given a constant c, recursive (c, ℓ)-diversity is satisfied if r_1 < c · (r_ℓ + r_{ℓ+1} + … + r_m).
For example, with 3 sensitive values (m = 3):
- (2,2)-diversity: r_1 < 2(r_2 + r_3)
- (2,3)-diversity: r_1 < 2·r_3; equivalently, even if we eliminate one sensitive value, we still have (2,2)-diversity.
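
A corresponding sketch of the recursive (c, ℓ)-diversity test for one q*-block (illustrative code, not from the paper).

```python
from collections import Counter

def is_recursive_cl_diverse(sensitive_values, c, l):
    """True if r_1 < c * (r_l + r_(l+1) + ... + r_m), with r_1 >= r_2 >= ... >= r_m."""
    freqs = sorted(Counter(sensitive_values).values(), reverse=True)
    if len(freqs) < l:          # fewer than l distinct values can never satisfy the test
        return False
    return freqs[0] < c * sum(freqs[l - 1:])

block = ["Flu"] * 3 + ["Cancer"] * 2 + ["Gastritis"]
print(is_recursive_cl_diverse(block, c=2, l=2))  # True:  3 < 2 * (2 + 1)
print(is_recursive_cl_diverse(block, c=2, l=3))  # False: 3 < 2 * 1 does not hold
```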

An algorithm for ℓ-diversity?
- Monotonicity property: if T* preserves privacy, then so does every further generalization of it.
- k-Anonymity satisfies monotonicity.
- Most k-anonymization algorithms work for any privacy measure that satisfies monotonicity, so we can reuse previous algorithms directly.
- Bayes optimal privacy is not monotonic.
- The ℓ-diversity variants are monotonic!

Mondrian
Mondrian(partition):
  if no allowable multidimensional cut for partition:
    return φ : partition → summary
  else:
    dim ← choose_dimension()
    fs ← frequency_set(partition, dim)
    splitVal ← find_median(fs)
    lhs ← {t ∈ partition : t.dim ≤ splitVal}
    rhs ← {t ∈ partition : t.dim > splitVal}
    return Mondrian(rhs) ∪ Mondrian(lhs)

[Figure: example Mondrian partitioning of the Weight × Age plane.]
Example: Mondrian with entropy ℓ-diversity, ℓ = 1.89 (for two sensitive values this is equivalent to limiting the prevalence of either value to at most 2/3, and also equivalent to recursive (2,2)-diversity).
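
Below is a compact Python sketch of the Mondrian recursion over numeric quasi-identifiers. It is an illustration of the idea, not the authors' reference implementation: the "allowable cut" test is a pluggable privacy predicate (here plain k-anonymity), the choose-dimension heuristic is "widest spread first", and the data is made up.

```python
def mondrian(partition, qi_dims, is_private):
    """Return a list of q*-blocks obtained by recursive median cuts."""
    for dim in sorted(qi_dims, key=lambda d: -spread(partition, d)):  # widest dimension first
        split = median(partition, dim)
        lhs = [t for t in partition if t[dim] <= split]
        rhs = [t for t in partition if t[dim] > split]
        # A cut is allowable only if both halves still satisfy the privacy requirement.
        if lhs and rhs and is_private(lhs) and is_private(rhs):
            return mondrian(lhs, qi_dims, is_private) + mondrian(rhs, qi_dims, is_private)
    return [partition]  # no allowable cut: output this partition as one q*-block

def spread(partition, dim):
    values = [t[dim] for t in partition]
    return max(values) - min(values)

def median(partition, dim):
    values = sorted(t[dim] for t in partition)
    return values[(len(values) - 1) // 2]

# Example: 2-anonymity as the privacy test over Age and Weight.
records = [{"Age": a, "Weight": w, "Disease": d} for a, w, d in
           [(25, 60, "Flu"), (27, 65, "Cancer"), (41, 80, "Flu"),
            (45, 85, "Gastritis"), (50, 90, "Cancer"), (52, 70, "Flu")]]
print([len(p) for p in mondrian(records, ["Age", "Weight"], lambda p: len(p) >= 2)])  # [3, 3]
```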

Experiments
- Used Incognito (a popular generalization algorithm).
- Adult dataset (census data) from the UCI machine learning repository.
[Table: description of the Adult database attributes; the experimental results refer to the designated sensitive attribute.]

Experiments – Utility
Intuitively: the "usefulness" of the ℓ-diverse and k-anonymized tables. Used k, ℓ = 2, 4, 6, 8.
- Number of generalization steps performed vs. k, ℓ
- Average size of the q*-blocks generated (similar to C_AVG) vs. k, ℓ
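
For reference, a tiny sketch of those utility figures; the normalization C_AVG = (number of records / number of q*-blocks) / k is my assumption about the metric the slide alludes to.

```python
def average_block_size(block_sizes):
    """Average number of records per q*-block."""
    return sum(block_sizes) / len(block_sizes)

def c_avg(block_sizes, k):
    """Normalized average equivalence-class size; 1.0 is the best achievable for a given k."""
    return average_block_size(block_sizes) / k

sizes = [4, 4, 4]  # the three q*-blocks of the 4-anonymized example
print(average_block_size(sizes), c_avg(sizes, k=4))  # 4.0 1.0
```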

Example – 3-diverse Table
Non-sensitive: ZIP, Age, Nationality; Sensitive: Condition

#  | ZIP   | Age  | Nationality | Condition
1  | 1305* | ≤40  | *           | Heart Disease
2  | 1305* | ≤40  | *           | Viral Infection
3  | 1305* | ≤40  | *           | Cancer
4  | 1305* | ≤40  | *           | Cancer
5  | 1485* | ≥40  | *           | Cancer
6  | 1485* | ≥40  | *           | Heart Disease
7  | 1485* | ≥40  | *           | Viral Infection
8  | 1485* | ≥40  | *           | Viral Infection
9  | 1306* | ≤40  | *           | Heart Disease
10 | 1306* | ≤40  | *           | Viral Infection
11 | 1306* | ≤40  | *           | Cancer
12 | 1306* | ≤40  | *           | Cancer

We have 3-diversity!!! We have privacy!!!!

Similarity attack
A 3-diverse patient table:

#  | Zipcode | Age | Salary | Disease
1  | 476**   | 2*  | 20K    | Gastric Ulcer
2  | 476**   | 2*  | 30K    | Gastritis
3  | 476**   | 2*  | 40K    | Stomach Cancer
4  | 4790*   | ≥40 | 50K    | Gastritis
5  | 4790*   | ≥40 | 100K   | Flu
6  | 4790*   | ≥40 | 70K    | Bronchitis
7  | 476**   | 3*  | 60K    | Bronchitis
8  | 476**   | 3*  | 80K    | Pneumonia
9  | 476**   | 3*  | 90K    | Stomach Cancer

The attacker knows Bob's Zip and Age, which place him in the first equivalence class.
Conclusion:
1. Bob's salary is in [20K, 40K], which is relatively low.
2. Bob has some stomach-related disease.
ℓ-diversity does not consider the semantic meanings of sensitive values, and is therefore insufficient to prevent attribute disclosure.

Skewness attack
Two sensitive values in the population Ω: Cancer (1%) and Healthy (99%) – the entropy-based ℓ value (2^entropy) of the overall distribution is only about 1.06.

#  | Age | Condition
1  | <30 | Cancer
2  | <30 | Cancer
3  | <30 | Healthy
4  | <30 | Healthy
5  | 3*  | Cancer
6  | 3*  | Healthy
7  | 3*  | Healthy
8  | 3*  | Healthy
9  | 3*  | Healthy
10 | ≥40 | Healthy
11 | ≥40 | Cancer
12 | ≥40 | Cancer
13 | ≥40 | Cancer
14 | ≥40 | Cancer

The first block (2 Cancer / 2 Healthy) has entropy-based ℓ value 2, yet the probability of cancer jumps from 1% to 50%. The 1 Cancer / 4 Healthy and 4 Cancer / 1 Healthy blocks both have entropy-based ℓ value 1.65 – equivalent in terms of ℓ-diversity, but very different semantically. The attacker learned a lot!
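
A short worked check of the numbers above (my own calculation): scoring each block by 2^entropy barely distinguishes them, while the attacker's probability of "Cancer" moves dramatically away from the 1% baseline.

```python
import math

def effective_l(probabilities):
    """2 ** entropy: the largest l for which a block with this distribution is entropy l-diverse."""
    return 2 ** -sum(p * math.log2(p) for p in probabilities if p > 0)

print(round(effective_l([0.01, 0.99]), 2))  # ~1.06  overall population (1% Cancer)
print(round(effective_l([0.5, 0.5]), 2))    # 2.0    block with 50% Cancer
print(round(effective_l([0.2, 0.8]), 2))    # ~1.65  blocks with 20% or 80% Cancer
```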

t-Closeness: the main idea
Rationale – consider a completely generalized table:

Age | Zipcode | … | Gender | Disease
*   | *       | … | *      | Flu
*   | *       | … | *      | Heart Disease
*   | *       | … | *      | Cancer
…   | …       | … | …      | …
*   | *       | … | *      | Gastritis

Starting from external knowledge only (belief B0), seeing this table reveals the overall distribution Q of the sensitive values and moves the adversary's belief to B1.

t-Closeness: the main idea
Rationale – now consider a released (partially generalized) table:

Age | Zipcode | … | Gender | Disease
2*  | 479**   | … | Male   | Flu
2*  | 479**   | … | Male   | Heart Disease
2*  | 479**   | … | Male   | Cancer
…   | …       | … | …      | …
≥50 | 4766*   | … | *      | Gastritis

Beyond the overall distribution Q, the adversary now also sees the distribution P_i of sensitive values in each equivalence class, moving the belief from B1 to B2.

t-Closeness: the main idea
Observations:
- Q should be treated as public knowledge.
- The knowledge gain has two parts: about the whole population (from B0 to B1) and about specific individuals (from B1 to B2).
- We bound the knowledge gain between B1 and B2 instead.
Principle: the distance between Q and each P_i should be bounded by a threshold t.

t-closeness
An equivalence class has t-closeness if the distance between the distribution of a sensitive attribute in this class and the distribution of the attribute in the whole table is no more than a threshold t. A table has t-closeness if all of its equivalence classes have t-closeness.
A distance measure called the Earth Mover's Distance (EMD) is used. It maintains monotonicity!
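
A minimal sketch (my rendering, not from the paper's code) of the ordered-distance EMD used for numerical sensitive attributes: order the m domain values, accumulate the differences between the class distribution P and the overall distribution Q, and normalize by m − 1. As a sanity check it gives a distance of about 0.167 for the 3K/5K/9K salary class of the example table that follows.

```python
def ordered_emd(class_values, all_values):
    """Earth Mover's Distance between the class and overall distributions of an ordered attribute."""
    domain = sorted(set(all_values))
    m = len(domain)
    p = [class_values.count(v) / len(class_values) for v in domain]
    q = [all_values.count(v) / len(all_values) for v in domain]
    cumulative, total = 0.0, 0.0
    for i in range(m - 1):
        cumulative += p[i] - q[i]   # running difference of the two distributions
        total += abs(cumulative)
    return total / (m - 1)

salaries = [3, 4, 5, 6, 7, 8, 9, 10, 11]           # overall Salary distribution (in K)
print(round(ordered_emd([3, 5, 9], salaries), 3))  # 0.167 for the first q*-block below
```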

Example – t-closeness
Non-sensitive: ZIP, Age; Sensitive: Salary, Condition

#  | ZIP   | Age  | Salary | Condition
1  | 4767* | ≤40  | 3K     | Gastric Ulcer
2  | 4767* | ≤40  | 5K     | Stomach Cancer
3  | 4767* | ≤40  | 9K     | Pneumonia
4  | 4790* | ≥40  | 6K     | Gastritis
5  | 4790* | ≥40  | 11K    | Flu
6  | 4790* | ≥40  | 8K     | Bronchitis
7  | 4760* | ≤40  | 4K     | Gastritis
8  | 4760* | ≤40  | 7K     | Bronchitis
9  | 4760* | ≤40  | 10K    | Stomach Cancer

We have closeness w.r.t. Salary and closeness w.r.t. Disease!!! We have privacy!!!!

Netflix privacy breach (Robust De-anonymization of Large Sparse Datasets, Narayanan and Shmatikov, 2008)
Released for the Netflix Prize contest:
- 17,770 movie titles
- 480,189 users with random customer IDs
- Ratings: 1-5
- For each movie we have the ratings: (MovieID, CustomerID, Rating, Date)
Re-arranged by CustomerID:

Movie             | CustomerID | Rating | Date
The Godfather     | …          | …      | …
Quantum of Solace | …          | …      | …
Hamlet            | …          | …      | …
The Scorpion King | …          | …      | …
The Profit        | …          | …      | …

Netflix privacy breach (Robust De-anonymization of Large Sparse Datasets, Narayanan and Shmatikov, 2008)
The Netflix data can be linked, e.g., with IMDB data, to re-identify individuals!

Netflix data:
Movie             | CustomerID | Rating | Date
The Godfather     | …          | …      | …
Quantum of Solace | …          | …      | …
Hamlet            | …          | …      | …
The Scorpion King | …          | …      | …
The Profit        | …          | …      | …

IMDB data: [screenshot of public reviews posted under a real name, e.g. "James Hitchcock", which can be matched against the anonymized ratings by movie, rating, and date]
(This example is made up. Possibly, James Hitchcock has nothing to do with Netflix.)

Epilogue
"You have zero privacy anyway. Get over it." – Scott McNealy (Sun CEO, January 1999)

HIPAA excerpt
Health Insurance Portability and Accountability Act of 1996
[The slide showed an excerpt from the act.]

Thank you!

Bibliography
- "Mondrian Multidimensional k-Anonymity", K. LeFevre, D. J. DeWitt, R. Ramakrishnan, 2006
- "ℓ-Diversity: Privacy Beyond k-Anonymity", A. Machanavajjhala, J. Gehrke, D. Kifer, M. Venkitasubramaniam, 2006
- "t-Closeness: Privacy Beyond k-Anonymity and ℓ-Diversity", Ninghui Li, Tiancheng Li, S. Venkatasubramanian, 2007
- Presentations:
  - "Privacy In Databases", B. Aditya Prakash
  - "K-Anonymity and Other Cluster-Based Methods", Ge Ruan