15CS338E - Database Security and Privacy part 2

Scene 1 (0s)

[Audio] Applications of Privacy-Preserving Data Mining … Medical Databases: The Scrub and Datafly Systems. Scrub: The Scrub system was designed for de-identification of clinical notes and letters, which typically occur as textual data containing references to patients, family members, addresses, phone numbers, or providers. Traditional techniques simply use a global search-and-replace procedure to provide privacy. However, clinical notes often contain cryptic references in the form of abbreviations that may be understood only by other providers or members of the same institution. Traditional methods can therefore identify no more than 30-60% of the identifying information in the data. The Scrub system instead uses local knowledge sources that compete with one another based on the certainty of their findings. Such a system is able to remove more than 99% of the identifying information from the data.
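
The idea of knowledge sources competing on certainty can be illustrated with a small Python sketch. This is not the actual Scrub implementation: the detector functions, the regular expressions, the certainty values, and the names `Finding`, `DETECTORS`, and `scrub` are all hypothetical, chosen only to show how the highest-certainty finding can win when several detectors claim overlapping text.

```python
import re
from dataclasses import dataclass
from typing import List


@dataclass
class Finding:
    start: int        # start offset of the suspected identifier
    end: int          # end offset (exclusive)
    label: str        # kind of identifier, e.g. "PHONE"
    certainty: float  # hypothetical confidence of the detector, 0..1


def detect_phone(text: str) -> List[Finding]:
    # Assumed pattern: US-style phone numbers such as 555-123-4567.
    pattern = r"\b\d{3}-\d{3}-\d{4}\b"
    return [Finding(m.start(), m.end(), "PHONE", 0.95)
            for m in re.finditer(pattern, text)]


def detect_titled_name(text: str) -> List[Finding]:
    # Assumed pattern: a title followed by a capitalized surname.
    pattern = r"\b(?:Dr|Mr|Mrs|Ms)\. [A-Z][a-z]+"
    return [Finding(m.start(), m.end(), "NAME", 0.90)
            for m in re.finditer(pattern, text)]


def detect_number(text: str) -> List[Finding]:
    # Low-certainty fallback: any run of digits might be an identifier.
    return [Finding(m.start(), m.end(), "NUMBER", 0.40)
            for m in re.finditer(r"\d+", text)]


# Each detector stands in for one "local knowledge source".
DETECTORS = [detect_phone, detect_titled_name, detect_number]


def overlaps(a: Finding, b: Finding) -> bool:
    return a.start < b.end and b.start < a.end


def scrub(text: str) -> str:
    # Collect findings from every detector, then let them compete:
    # the most certain finding claims its span, and any overlapping
    # finding with lower certainty is discarded.
    findings = [f for detect in DETECTORS for f in detect(text)]
    findings.sort(key=lambda f: f.certainty, reverse=True)
    accepted: List[Finding] = []
    for f in findings:
        if not any(overlaps(f, kept) for kept in accepted):
            accepted.append(f)
    # Replace from right to left so earlier offsets stay valid.
    for f in sorted(accepted, key=lambda f: f.start, reverse=True):
        text = text[:f.start] + "[" + f.label + "]" + text[f.end:]
    return text


print(scrub("Pt seen by Dr. Smith, call 555-123-4567."))
```

In this sketch the phone detector and the digit detector both claim parts of `555-123-4567`, and the phone detector wins because its assumed certainty is higher, so the number is replaced as a single `[PHONE]` token rather than three separate digit runs. A plain global search and replace has no such arbitration and only removes strings it was told about in advance.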