<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: Anonymous data that isn&#8217;t</title>
	<atom:link href="http://www.lightbluetouchpaper.org/2006/08/09/anonymous-data-that-isnt/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.lightbluetouchpaper.org/2006/08/09/anonymous-data-that-isnt/</link>
	<description>Security Research, Computer Laboratory, University of Cambridge</description>
	<pubDate>Thu, 28 Aug 2008 00:11:39 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6.1</generator>
		<item>
		<title>By: Anon</title>
		<link>http://www.lightbluetouchpaper.org/2006/08/09/anonymous-data-that-isnt/#comment-1072</link>
		<dc:creator>Anon</dc:creator>
		<pubDate>Tue, 15 Aug 2006 09:51:13 +0000</pubDate>
		<guid isPermaLink="false">http://www.lightbluetouchpaper.org/2006/08/09/anonymous-data-that-isnt/#comment-1072</guid>
		<description>re:NHS - Secondary Use Service,
One has to ask why they don't recruit you Ross! or someone under your tuition/supervision.

I certainly made my NHS IG colleagues aware of your work (Chapter 8, Medical records, inference etc), I think a comment from a previous IG director was what do you know Ross! (cough cough), I felt my resignation letter began there!

Anyway it's politics, delivery is king! for CFH, not what's right. That's my experience of 2 years of working there.</description>
		<content:encoded><![CDATA[<p>re:NHS - Secondary Use Service,<br />
One has to ask why they don&#8217;t recruit you Ross! or someone under your tuition/supervision.</p>
<p>I certainly made my NHS IG colleagues aware of your work (Chapter 8, Medical records, inference etc), I think a comment from a previous IG director was what do you know Ross! (cough cough), I felt my resignation letter began there!</p>
<p>Anyway it&#8217;s politics, delivery is king! for CFH, not what&#8217;s right. That&#8217;s my experience of 2 years of working there.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ian Brown</title>
		<link>http://www.lightbluetouchpaper.org/2006/08/09/anonymous-data-that-isnt/#comment-988</link>
		<dc:creator>Ian Brown</dc:creator>
		<pubDate>Fri, 11 Aug 2006 17:51:39 +0000</pubDate>
		<guid isPermaLink="false">http://www.lightbluetouchpaper.org/2006/08/09/anonymous-data-that-isnt/#comment-988</guid>
		<description>Boing Boing has a couple of posts pointing out how this AOL data can be deanonymised:

&lt;a href="http://www.boingboing.net/2006/08/10/the_search_engine_co.html" rel="nofollow"&gt;The Search Engine Confessions of AOL User 23187425&lt;/a&gt;

&lt;a href="http://www.boingboing.net/2006/08/09/aols_user_query_data.html" rel="nofollow"&gt;AOL's user query database has been splunk'd&lt;/a&gt;</description>
		<content:encoded><![CDATA[<p>Boing Boing has a couple of posts pointing out how this AOL data can be deanonymised:</p>
<p><a href="http://www.boingboing.net/2006/08/10/the_search_engine_co.html" rel="nofollow">The Search Engine Confessions of AOL User 23187425</a></p>
<p><a href="http://www.boingboing.net/2006/08/09/aols_user_query_data.html" rel="nofollow">AOL&#8217;s user query database has been splunk&#8217;d</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ambersail Infosec Roundup &#187; Blog Archive &#187; Nonanonymity</title>
		<link>http://www.lightbluetouchpaper.org/2006/08/09/anonymous-data-that-isnt/#comment-969</link>
		<dc:creator>Ambersail Infosec Roundup &#187; Blog Archive &#187; Nonanonymity</dc:creator>
		<pubDate>Wed, 09 Aug 2006 21:37:47 +0000</pubDate>
		<guid isPermaLink="false">http://www.lightbluetouchpaper.org/2006/08/09/anonymous-data-that-isnt/#comment-969</guid>
		<description>[...] I note this posting with interest. It would seem that AOL is not the only company that does not fully understand how to de-identify personal data. [...]</description>
		<content:encoded><![CDATA[<p>[...] I note this posting with interest. It would seem that AOL is not the only company that does not fully understand how to de-identify personal data. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Richard Clayton</title>
		<link>http://www.lightbluetouchpaper.org/2006/08/09/anonymous-data-that-isnt/#comment-967</link>
		<dc:creator>Richard Clayton</dc:creator>
		<pubDate>Wed, 09 Aug 2006 19:16:51 +0000</pubDate>
		<guid isPermaLink="false">http://www.lightbluetouchpaper.org/2006/08/09/anonymous-data-that-isnt/#comment-967</guid>
		<description>AOL's motivation in releasing their data was &lt;a href="http://www.techcrunch.com/2006/08/07/aol-this-was-a-screw-up/" rel="nofollow"&gt;apparently to encourage researchers&lt;/a&gt; to use their real data so as to improve the user experience for searching. Encouraging research is fine, their main error was in not properly controlling the way the data was used.

Many researchers are interested in &lt;a href="http://www.ee.ucl.ac.uk/~lsacks/acse/masts/" rel="nofollow"&gt;processing traffic data from TCP/IP networks&lt;/a&gt;. They hope to better understand the real impact of congestion algorithms, tweaks to network stacks etc. This data is usually anonymised before being handed over -- but some of it can be deanonymised because particular machines will have particular patterns of traffic that could be picked out. &lt;a href="http://www.ja.net/" rel="nofollow"&gt;UKERNA&lt;/a&gt; are tackling this by only releasing traffic data to researchers who have signed an appropriate agreement ("Provision of Traffic Data for Research Use" [not yet published]) one of the terms of which is that the researcher contracts "not make any attempt to decompile, interfere, manipulate or otherwise take any action in respect of the Data which may have the result of revealing either message content or Personal Data contained in that Data, unless expressly permitted in the Data Specification;".

Assuming of course that researchers are honest and law abiding, this permits academic research without infringing people's privacy. AOL should look around and see how other people have dealt with the issues and not conclude that releasing data for research is inherently impossible.</description>
		<content:encoded><![CDATA[<p>AOL&#8217;s motivation in releasing their data was <a href="http://www.techcrunch.com/2006/08/07/aol-this-was-a-screw-up/" rel="nofollow">apparently to encourage researchers</a> to use their real data so as to improve the user experience for searching. Encouraging research is fine, their main error was in not properly controlling the way the data was used.</p>
<p>Many researchers are interested in <a href="http://www.ee.ucl.ac.uk/~lsacks/acse/masts/" rel="nofollow">processing traffic data from TCP/IP networks</a>. They hope to better understand the real impact of congestion algorithms, tweaks to network stacks etc. This data is usually anonymised before being handed over &#8212; but some of it can be deanonymised because particular machines will have particular patterns of traffic that could be picked out. <a href="http://www.ja.net/" rel="nofollow">UKERNA</a> are tackling this by only releasing traffic data to researchers who have signed an appropriate agreement (&#8221;Provision of Traffic Data for Research Use&#8221; [not yet published]) one of the terms of which is that the researcher contracts &#8220;not make any attempt to decompile, interfere, manipulate or otherwise take any action in respect of the Data which may have the result of revealing either message content or Personal Data contained in that Data, unless expressly permitted in the Data Specification;&#8221;.</p>
<p>Assuming of course that researchers are honest and law abiding, this permits academic research without infringing people&#8217;s privacy. AOL should look around and see how other people have dealt with the issues and not conclude that releasing data for research is inherently impossible.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
