[e2e] was double blind, now reproduceable results
David P. Reed
dpreed at reed.com
Wed May 26 12:03:42 PDT 2004
If there is no way to verify that the data is real, why bother to publish it?
If the issue is prevarication, anonymizing protects the prevaricator.
If the issue is verifying the calculations, the anonymizing transform may
be more unreliable than the raw data.
Once transformed and released, there is neither revocation, nor logging of
use, and so if the anonymizing transform is faulty, there is no way to
monitor misuse.
Why not put the raw data in a secure place guarded by the original
investigators, where people can submit statistical queries against it? Of
course that can leak info, too, but no more than anonymizers can, and at
least the query programs actually run and the credentials of their authors
can be recorded and logged as evidence of bad intentions, and vetted to a
certain extent.
More information about the end2end-interest
mailing list