[Audio] Applications of Privacy Preserving Data Mining … Medical Databases: The Scrub and Datafly Systems

Scrub: The Scrub system was designed for de-identification of clinical notes and letters, which typically occur as free text containing references to patients, family members, addresses, phone numbers, or providers. Traditional techniques simply use a global search-and-replace procedure to provide privacy. However, clinical notes often contain cryptic references in the form of abbreviations that may be understood only by other providers or by members of the same institution. As a result, traditional methods identify no more than 30-60% of the identifying information in the data. The Scrub system instead uses local knowledge sources that compete with one another based on the certainty of their findings. Such a system is able to remove more than 99% of the identifying information from the data.
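The idea of knowledge sources competing on certainty can be illustrated with a small sketch. This is not the Scrub system's actual implementation: the detector functions, the `KNOWN_NAMES` list, the `Finding` type, the certainty values, and the threshold are all hypothetical, chosen only to show how several detectors might each report a candidate span with a certainty score, with the most certain finding winning when spans overlap.

```python
import re
from typing import Callable, List, NamedTuple


class Finding(NamedTuple):
    start: int        # index of the first character of the span
    end: int          # index one past the last character of the span
    label: str        # placeholder written in place of the span
    certainty: float  # detector's confidence, between 0 and 1 (made-up values)


# Hypothetical local knowledge: a tiny list of known first names.
KNOWN_NAMES = {"john", "mary", "alice"}


def detect_phone(text: str) -> List[Finding]:
    # A 3-3-4 digit pattern is treated as a near-certain phone number.
    return [Finding(m.start(), m.end(), "PHONE", 0.95)
            for m in re.finditer(r"\b\d{3}-\d{3}-\d{4}\b", text)]


def detect_known_name(text: str) -> List[Finding]:
    # Any word found in the local name list is flagged as a name.
    return [Finding(m.start(), m.end(), "NAME", 0.9)
            for m in re.finditer(r"\b[A-Za-z]+\b", text)
            if m.group().lower() in KNOWN_NAMES]


def detect_id_number(text: str) -> List[Finding]:
    # Any run of three or more digits might be an identifier (lower certainty).
    return [Finding(m.start(), m.end(), "ID", 0.6)
            for m in re.finditer(r"\d{3,}", text)]


DETECTORS: List[Callable[[str], List[Finding]]] = [
    detect_phone, detect_known_name, detect_id_number,
]


def scrub(text: str, detectors=DETECTORS, threshold: float = 0.5) -> str:
    # Collect every finding that clears the certainty threshold.
    findings = [f for d in detectors for f in d(text) if f.certainty >= threshold]
    # Let the most certain findings claim their spans first.
    findings.sort(key=lambda f: f.certainty, reverse=True)
    accepted: List[Finding] = []
    for f in findings:
        # A finding loses the competition if it overlaps an accepted span.
        if all(f.end <= a.start or f.start >= a.end for a in accepted):
            accepted.append(f)
    # Replace from right to left so earlier offsets stay valid.
    for f in sorted(accepted, key=lambda f: f.start, reverse=True):
        text = text[:f.start] + f"[{f.label}]" + text[f.end:]
    return text


if __name__ == "__main__":
    print(scrub("Pt John seen by Mary, call 555-123-4567, MRN 88321."))
```

In this sketch the phone detector and the generic digit detector both fire on the phone number, and the phone detector wins because its certainty is higher. A plain global search and replace has no such arbitration, and it also cannot recognize the institution-specific abbreviations mentioned above unless they are listed explicitly.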