Surprisingly to most observers, one of the biggest effects of the new FTC report will be in the area of de-identified data. The FTC’s new approach, highlighted by them as the top issue of interest to techies, provides a major incentive for companies to improve their data processes.
The earlier report would have applied to “consumer data that can be reasonably linked to a specific consumer, computer, or other device.” The debate has been about what it means to be “reasonably linked.” Consumer groups have correctly focused on the risks to consumers — new technology can link a vast range of data to individual consumers. Industry has correctly focused on the problems that come with an over-broad definition of “reasonably linked,” which could extend privacy rules to an almost unlimited range of data processing.
I believe the FTC has found a Goldilocks solution for the problem of de-identified data. The FTC provides what amounts to a safe harbor where: “(1) a given data set is not reasonably identifiable; (2) the company publicly commits not to re-identify it, and (3) the company requires any downstream users of the data to keep it in de-identified form.”
The FTC approach provides a major incentive for companies to comply with the de-identification safe harbor. For data in the safe harbor, all of the other privacy requirements do not apply. That reduces the scope and cost of compliance.
The FTC approach correctly recognizes that a promise not to re-identify data is key. Once a company makes that promise, it is subject to enforcement for a deceptive practice under Section 5 of the FTC Act. The company thus will have a strong reason to control its internal processes, to make sure that data that should be de-identified stays de-identified.
Similarly, the requirement of promises from the downstream users keeps data protected against the main risks. Data that can be potentially re-identified stays within a protected bubble – the companies promise not to re-identify, on pain of Section 5 enforcement.
I have long believed that technical controls are not enough to protect consumers against possible re-identification, as shown in a 2009 report by the Center for Democracy and Technology and my December talk on de-identified data. The best path is to have reasonably strong technical protections, supplemented by the sorts of enforceable promises that the FTC report supports.
In short, companies now will have an important incentive to comply with the de-identification safe harbor, so that their other databases won’t have to comply with privacy requirements. The result will be better data practices for the information that could otherwise cause the most risk to consumers.
Going forward, defining the scope of this “safe harbor” could be a good candidate for a multi-stakeholder process facilitated by the U.S. Department of Commerce. The Administration is asking for public comments on “substantive consumer data privacy issues that warrant the development of legally enforceable codes of conduct.” By defining the meaning of “reasonably identifiable” in concrete settings, companies will have a stronger incentive to put effective de-identification measures into place.
Please see slides and videos for a recap of FPF’s December 5, 2011 event “Personal Information: The Benefits and Risks of De-Identified Data.”