<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Michael Zimmer.org &#187; Research ethics</title>
	<atom:link href="http://michaelzimmer.org/category/ethics/research-ethics/feed/" rel="self" type="application/rss+xml" />
	<link>http://michaelzimmer.org</link>
	<description>information ethics : new media : privacy : values in design : 2.0</description>
	<lastBuildDate>Mon, 30 Aug 2010 00:52:03 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>SACHRP Presentation: Research Ethics in the 2.0 Era: Conceptual Gaps for Ethicists, Researchers, IRBs</title>
		<link>http://michaelzimmer.org/2010/07/20/presentation-research-ethics-in-the-2-0-era/</link>
		<comments>http://michaelzimmer.org/2010/07/20/presentation-research-ethics-in-the-2-0-era/#comments</comments>
		<pubDate>Tue, 20 Jul 2010 12:45:43 +0000</pubDate>
		<dc:creator>Michael Zimmer</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Privacy]]></category>
		<category><![CDATA[Research ethics]]></category>
		<category><![CDATA[Talks]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://michaelzimmer.org/?p=2333</guid>
		<description><![CDATA[On Wednesday, July 21, 2010, I will be presenting in front of the Secretary’s Advisory Committee on Human Research Protections (SACHRP), part of the Office for Human Research Protections in the United States Department of Health and Human Services (HHS). My presentation will focus on how Web 2.0 tools, environments, and experiences are creating new conceptual gaps in our understanding of privacy, anonymity/identifiability, consent, and harm.


Related posts: (automatically generated)<ol><li><a href='http://michaelzimmer.org/2009/08/28/nsf-grant-internet-research-and-ethics-2-0/' rel='bookmark' title='Permanent Link: NSF Grant: Internet Research and Ethics 2.0'>NSF Grant: Internet Research and Ethics 2.0</a></li>
<li><a href='http://michaelzimmer.org/2009/06/18/draft-paper-but-the-data-is-already-public/' rel='bookmark' title='Permanent Link: Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook'>Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook</a></li>
<li><a href='http://michaelzimmer.org/2010/02/06/revisiting-research-ethics-in-the-facebook-era-challenges-in-emerging-cscw-research/' rel='bookmark' title='Permanent Link: Revisiting Research Ethics in the Facebook Era: Challenges in Emerging CSCW Research'>Revisiting Research Ethics in the Facebook Era: Challenges in Emerging CSCW Research</a></li>
<li><a href='http://michaelzimmer.org/2007/07/17/sdp2007-presentation-the-quest-for-the-perfect-search-engine/' rel='bookmark' title='Permanent Link: SDP2007 Presentation: The Quest for the Perfect Search Engine'>SDP2007 Presentation: The Quest for the Perfect Search Engine</a></li>
<li><a href='http://michaelzimmer.org/2009/02/25/sony-provides-complete-everquest-2-server-logs-to-researchers/' rel='bookmark' title='Permanent Link: Sony Provides Complete Everquest 2 Server Logs to Researchers'>Sony Provides Complete Everquest 2 Server Logs to Researchers</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>On Wednesday, July 21, 2010, I will be presenting in front of the <a href="http://www.hhs.gov/ohrp/sachrp/" target="_blank">Secretary’s Advisory Committee on Human Research Protections (SACHRP)</a>, part of the <a href="http://www.hhs.gov/ohrp/" target="_blank">Office for Human Research Protections</a> in the United States Department of Health and Human Services (HHS). The committee is convening <a href="http://www.hhs.gov/ohrp/sachrp/mtgings/mtg07-10/mtg07-10.html" target="_blank">this week</a> to address pressing issues in human subjects protections, and I&#8217;ve been invited by the SACRHP Chair to discuss emerging ethical issues with Internet-based research. I feel privileged to sit on the panel on &#8220;The Internet in Human Subjects Research&#8221; with <a href="http://www4.uwm.edu/cipr/about/director.cfm" target="_blank">Elizabeth Buchanan</a>, <a href="http://www.bgsu.edu/departments/popc/page16741.html" target="_blank">Montana Miller</a>, and <a href="http://blogs.law.harvard.edu/palfrey/top/bio/" target="_blank">John Palfrey</a>, all preeminent experts.</p>
<p>My contribution to the discussion will focus on how Web 2.0 tools, environments, and experiences are creating new conceptual gaps in our understanding of privacy, anonymity/identifiability, consent, and harm. My presentation is titled &#8220;Research Ethics in the 2.0 Era:Conceptual Gaps for Ethicists, Researchers, IRBs&#8221;, and relies heavily on my <a href="http://michaelzimmer.org/2009/06/18/draft-paper-but-the-data-is-already-public/" target="_blank">critique of the Tastes, Ties, and Time research project</a> and subsequent data release (<em>news coming soon about publication of this critique</em>).</p>
<p>The slides are available below (updated with new version).</p>
<div id="__ss_4793965" style="width: 425px;"><strong style="display:block;margin:12px 0 4px"><a title="Research Ethics in the 2.0 Era:Conceptual Gaps for Ethicists, Researchers, IRBs" href="http://www.slideshare.net/michaelzimmer/zimmer-sachrp-slides-v1">Research Ethics in the 2.0 Era:Conceptual Gaps for Ethicists, Researchers, IRBs</a></strong><object id="__sse4793965" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="425" height="355" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="src" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=zimmersachrpslidesv1-100719234132-phpapp01&amp;stripped_title=zimmer-sachrp-slides-v1" /><param name="name" value="__sse4793965" /><param name="allowfullscreen" value="true" /><embed id="__sse4793965" type="application/x-shockwave-flash" width="425" height="355" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=zimmersachrpslidesv1-100719234132-phpapp01&amp;stripped_title=zimmer-sachrp-slides-v1" name="__sse4793965" allowscriptaccess="always" allowfullscreen="true"></embed></object>
<p> </p>
</div>


<p>Related posts: (automatically generated)<ol><li><a href='http://michaelzimmer.org/2009/08/28/nsf-grant-internet-research-and-ethics-2-0/' rel='bookmark' title='Permanent Link: NSF Grant: Internet Research and Ethics 2.0'>NSF Grant: Internet Research and Ethics 2.0</a></li>
<li><a href='http://michaelzimmer.org/2009/06/18/draft-paper-but-the-data-is-already-public/' rel='bookmark' title='Permanent Link: Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook'>Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook</a></li>
<li><a href='http://michaelzimmer.org/2010/02/06/revisiting-research-ethics-in-the-facebook-era-challenges-in-emerging-cscw-research/' rel='bookmark' title='Permanent Link: Revisiting Research Ethics in the Facebook Era: Challenges in Emerging CSCW Research'>Revisiting Research Ethics in the Facebook Era: Challenges in Emerging CSCW Research</a></li>
<li><a href='http://michaelzimmer.org/2007/07/17/sdp2007-presentation-the-quest-for-the-perfect-search-engine/' rel='bookmark' title='Permanent Link: SDP2007 Presentation: The Quest for the Perfect Search Engine'>SDP2007 Presentation: The Quest for the Perfect Search Engine</a></li>
<li><a href='http://michaelzimmer.org/2009/02/25/sony-provides-complete-everquest-2-server-logs-to-researchers/' rel='bookmark' title='Permanent Link: Sony Provides Complete Everquest 2 Server Logs to Researchers'>Sony Provides Complete Everquest 2 Server Logs to Researchers</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://michaelzimmer.org/2010/07/20/presentation-research-ethics-in-the-2-0-era/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Rethinking the Human Subjects Process</title>
		<link>http://michaelzimmer.org/2010/06/14/rethinking-the-human-subjects-process/</link>
		<comments>http://michaelzimmer.org/2010/06/14/rethinking-the-human-subjects-process/#comments</comments>
		<pubDate>Mon, 14 Jun 2010 20:17:55 +0000</pubDate>
		<dc:creator>Michael Zimmer</dc:creator>
				<category><![CDATA[Research ethics]]></category>

		<guid isPermaLink="false">http://michaelzimmer.org/?p=2204</guid>
		<description><![CDATA[Recently I&#8217;ve found myself absorbed in various issues surrounding Internet research ethics: the Tastes, Ties, and Time Facebook data release, Pete Warden&#8217;s plans to release a database of public Facebook information on 215 million users, etc. To help work through some of these issues &#8212; and assist others who are much more qualified than I to figure them out &#8212; I&#8217;ve been lucky to join Elizabeth Buchanan and Charles Ess on their NSF-funded project to launch the Internet  Research Ethics Digital Library, Resource Center, and Commons.
Complementing this new research ...


Related posts: (automatically generated)<ol><li><a href='http://michaelzimmer.org/2009/08/28/nsf-grant-internet-research-and-ethics-2-0/' rel='bookmark' title='Permanent Link: NSF Grant: Internet Research and Ethics 2.0'>NSF Grant: Internet Research and Ethics 2.0</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p><a href="http://dmlcentral.net/blog/alexander-halavais/rethinking-human-subjects-process"><img class="alignright" style="margin: 2px;" title="IRB whiteboard" src="/images/irb2.jpg" alt="" width="281" height="117" /></a>Recently I&#8217;ve found myself absorbed in various issues surrounding <a href="http://michaelzimmer.org/category/ethics/research-ethics/" target="_blank">Internet research ethics</a>: the <a href="http://michaelzimmer.org/2009/06/18/draft-paper-but-the-data-is-already-public/" target="_blank">Tastes, Ties, and Time Facebook data release</a>, <a href="http://michaelzimmer.org/2010/02/12/why-pete-warden-should-not-release-profile-data-on-215-million-facebook-users/" target="_blank">Pete Warden&#8217;s plans</a> to release a database of public Facebook information on 215 million users, etc. To help work through some of these issues &#8212; and assist others who are much more qualified than I to figure them out &#8212; I&#8217;ve been lucky to join <a href="http://www4.uwm.edu/sois/cipr/director.html" target="_blank">Elizabeth Buchanan</a> and <a href="http://www.drury.edu/ess/ess.html" target="_blank">Charles Ess</a> on their<a href="http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0924604" target="_blank"> NSF-funded</a> project to launch the <a href="http://internetresearchethics.org/" target="_blank">Internet  Research Ethics Digital Library, Resource Center, and Commons</a>.</p>
<p>Complementing this new research area of mine, I had the privilege of participating in the first of a pair of one-day <a href="http://dmlcentral.net/projects/4113">workshops  intended to discuss challenges</a> related to human subjects approval processes for to Internet-based research on children and learning. The workshop was organized by <a href="http://alex.halavais.net/">Alex Halavais</a>, and <a href="http://lawgeek.typepad.com/">Jason Schultz</a>, as part of the <a href="http://digitallearning.macfound.org/site/c.enJLKQNlFiG/b.2029199/k.94AC/Latest_News.htm" target="_blank">larger MacArthur Foundation-funded</a> project on <a href="http://dmlcentral.net/about/what-all-about" target="_blank">Digital Media and Learning</a>, and I was joined by these preeminent scholars and experts: <a href="http://www.anthro.uci.edu/faculty_bios/boellstorff/boellstorff.php">Tom  Boellstorff</a>, <a href="http://www.heatherhorst.org/">Heather Horst</a>, <a href="http://www.bgsu.edu/offices/mc/zoom/news120809.html">Montana  Miller</a>, <a href="http://blogs.ischool.berkeley.edu/dperkel/">Dan  Perkel</a>, <a href="http://www.primr.org/Content.aspx?id=3671">Ivor  Pritchard</a>, and <a href="http://history.nih.gov/about/stark.html">Laura Stark</a>.</p>
<p>Details of that first meeting have been posted <a href="http://dmlcentral.net/blog/alexander-halavais/rethinking-human-subjects-process" target="_blank">here</a>, and summarized below:</p>
<blockquote><p>We found that while there might be some fairly intractable issues, as  there are for any established institution, some of the difficulties  that IRBs and investigators encountered were a result of reinventing the  wheel locally, and a general lack of transparency in the process of  approving human subjects research. The elements required to make good  decisions on planned research tend to be obscure and unevenly  distributed across IRBs. From shared vocabularies between IRBs and  investigators, to knowledge of social computing contexts, to a clear  understanding of the regulations and empirical evidence of risk, many of  the elements that delay the approval of protocols and frustrate  researchers and IRBs could be addressed if the information necessary was  more widely accessible and easily discoverable.</p>
<p>Rather than encouraging the creation of national or other centralized  IRBs, more awareness and transparency would allow local solutions to be  shared widely. Essentially, this is a problem of networked learning:  how is it that investigators, IRB members, and administrators can come  quickly to terms with the best practices in DML research?</p></blockquote>
<p>We are meeting again in August to continue this conversation. If you have suggestions, or your own stories to tell, feel free to drop me a line or comment at the <a href="http://dmlcentral.net/blog/alexander-halavais/rethinking-human-subjects-process" target="_blank">DML Central blog</a>.</p>


<p>Related posts: (automatically generated)<ol><li><a href='http://michaelzimmer.org/2009/08/28/nsf-grant-internet-research-and-ethics-2-0/' rel='bookmark' title='Permanent Link: NSF Grant: Internet Research and Ethics 2.0'>NSF Grant: Internet Research and Ethics 2.0</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://michaelzimmer.org/2010/06/14/rethinking-the-human-subjects-process/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Why Pete Warden Should Not Release Profile Data on 215 Million Facebook Users</title>
		<link>http://michaelzimmer.org/2010/02/12/why-pete-warden-should-not-release-profile-data-on-215-million-facebook-users/</link>
		<comments>http://michaelzimmer.org/2010/02/12/why-pete-warden-should-not-release-profile-data-on-215-million-facebook-users/#comments</comments>
		<pubDate>Fri, 12 Feb 2010 19:52:05 +0000</pubDate>
		<dc:creator>Michael Zimmer</dc:creator>
				<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[Privacy]]></category>
		<category><![CDATA[Research ethics]]></category>

		<guid isPermaLink="false">http://michaelzimmer.org/?p=1832</guid>
		<description><![CDATA[Speaking of the research ethics related to automatically harvesting public social networking data, we are confronted this week with the story of Pete Warden, a former Apple engineer who has spent the last six months harvesting and analyzing data from some 215 million public Facebook profile pages.
According to Warden, he exploited a flaw in Facebook&#8217;s architecture to access public profiles without needing to be signed in to a Facebook account, effectively avoiding being bound by Facebook&#8217;s Terms of Service preventing such automated harvesting of data. As a result, he amassed ...


Related posts: (automatically generated)<ol><li><a href='http://michaelzimmer.org/2009/06/18/draft-paper-but-the-data-is-already-public/' rel='bookmark' title='Permanent Link: Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook'>Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook</a></li>
<li><a href='http://michaelzimmer.org/2007/07/20/askcom-to-allow-users-to-control-data-retention/' rel='bookmark' title='Permanent Link: Ask.com To Allow Users to Control Data Retention'>Ask.com To Allow Users to Control Data Retention</a></li>
<li><a href='http://michaelzimmer.org/2007/08/19/41-of-facebook-users-share-personal-information-with-a-frog/' rel='bookmark' title='Permanent Link: 41% of Facebook Users Share Personal Information with a Frog'>41% of Facebook Users Share Personal Information with a Frog</a></li>
<li><a href='http://michaelzimmer.org/2006/01/25/77-of-google-users-dont-know-it-records-personal-data/' rel='bookmark' title='Permanent Link: 77% of Google users don&#8217;t know it records personal data'>77% of Google users don&#8217;t know it records personal data</a></li>
<li><a href='http://michaelzimmer.org/2010/05/10/what-happens-to-your-facebook-data-when-you-leave/' rel='bookmark' title='Permanent Link: What Happens to Your Facebook Data When You Leave? (Updated)'>What Happens to Your Facebook Data When You Leave? (Updated)</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>Speaking of the <a href="http://michaelzimmer.org/2010/02/12/is-it-ethical-to-harvest-public-twitter-accounts-without-consent/" target="_blank">research ethics related to automatically harvesting public</a> social networking data, we are <a href="http://www.readwriteweb.com/archives/facebook_user_data_analysis.php" target="_blank">confronted this week with the story</a> of <a href="http://petewarden.typepad.com/">Pete Warden</a>, a former Apple engineer who has spent the last six months harvesting and analyzing data from some 215 million public Facebook profile pages.</p>
<p><a href="http://petewarden.typepad.com/searchbrowser/2010/02/how-to-harvest-facebook-profiles-from-emails-without-logging-in.html" target="_blank">According to Warden</a>, he exploited a flaw in Facebook&#8217;s architecture to access public profiles without needing to be signed in to a Facebook account, effectively avoiding being bound by Facebook&#8217;s <a href="http://www.facebook.com/#!/terms.php?ref=pf" target="_blank">Terms of Service</a> preventing such automated harvesting of data. As a result, he amassed a database of names, fan pages, and lists of friends for 215 million public Facebook accounts.</p>
<p><a href="http://michaelzimmer.org/images/Pete_Warden_Data.png"><img class="alignright" title="Pete Warden chart" src="http://michaelzimmer.org/images/Pete_Warden_Data.png" alt="" width="265" height="133" /></a>Warden has already done some <a href="http://petewarden.typepad.com/searchbrowser/2010/02/how-to-split-up-the-us.html" target="_blank">impressive analysis</a> of this data at an aggregate level, and I know researchers would love to get their hands on it. And like the <a href="http://cyber.law.harvard.edu/node/4682" target="_blank">&#8220;Tastes, Ties, and Time&#8221;</a> Facebook project, Warden wants to release the dataset to the academic community.</p>
<p>But also <a href="http://michaelzimmer.org/2009/06/18/draft-paper-but-the-data-is-already-public/" target="_blank">like the &#8220;Tastes, Ties, and Time&#8221; project</a>, Warden would be wrong to do so.</p>
<p>First, similar to our discussion of the ethics of collecting public Twitter streams, just because these Facebook users made their profiles publicly available does not mean they are fair game for scraping for research purposes. Yes, I have limited profile information viewable to the public, and I&#8217;ve authorized Facebook to make that information available for search engines to crawl. But the purpose of this public availability is to help people &#8212; humans, not bots &#8212; find me. The presumption is that my public profile data will only be found and viewed if someone actually searches for &#8220;Michael Zimmer&#8221; on Facebook or a search engine. In reality, my profile is only &#8220;public&#8221; if a human being takes specific and conscious action to find me.</p>
<p>Warden&#8217;s actions, however, violate this implicit understanding for making profiles publicly searchable. Rather than trying to find <em>me</em>, Warden is systematically sought <em>everyone</em>, letting a script to the work of seeking and harvesting my data. There is no genuine desire to find me, to friend me, and so on. He&#8217;s just collecting data. His reasons might be honest and beneficial, but that&#8217;s not what&#8217;s at issue here. The point is whether the 215 million Facebook users who now have some of their information in Warden&#8217;s database contemplated such harvesting and aggregating when they built their profile and configured their privacy settings. They almost certainly didn&#8217;t, which brings into doubt whether this data has been collected with proper consent.</p>
<p>Second, Warden&#8217;s release of this dataset &#8212; even with the best of intentions &#8212; poses a serious privacy threat to the subjects in the dataset, their friends, and perhaps unknown others. Warden claims to be sensitive to the privacy of the subjects in the database, and <a href="http://www.readwriteweb.com/archives/facebook_user_data_analysis.php" target="_blank">in response he has removed</a> the identifying URL&#8217;s's that are unique to each profile, but the dataset retains the subjects&#8217; names <em>(really!)</em>, locations, Fan page lists and partial Friends lists (I&#8217;m not sure what is meant by a &#8220;partial&#8221; list of friends).</p>
<p>So, obviously, individuals can be easily identified within the dataset. But that&#8217;s not the greatest threat with the release of this data. What is most dangerous is its potential use to help re-identify <em>other</em> datasets, ones that might contain much more sensitive or potentially damaging data. Recall the research that showed how trivial it was to <a href="http://arxivblog.com/?p=142" target="_blank">re-identify the presumed &#8220;anonymized&#8221; Netflix database</a>, or the <a href="http://www.cs.utexas.edu/~shmat/socialnetworks-faq.html" target="_blank">ease in identifying individuals within social networks</a>. These ease of re-identifying these datasets came from having ready access to other large sets of data where the subjects where already known. By overlaying social graphs and other intricate data-comparison methods, the &#8220;anonymous&#8221; datasets were quickly re-identified. (See Paul Ohm&#8217;s <a href="http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1450006" target="_blank">&#8220;Broken Promises of Privacy: Responding to the Surprising Failure of Anonymization&#8221;</a> for excellent coverage of these cases and discussion of consequences for law &amp; policy).</p>
<p>Warden&#8217;s rich dataset of 210 million Facebook users, complete with their names, locations, and social graphs, is just the ammunition needed to fuel a new wave of re-identification of presumed anonymous datasets. It is impossible to predict who might use Warden&#8217;s dataset and to what ends, but this threat is real.</p>
<p>It turns out that <a href="http://petewarden.typepad.com/searchbrowser/2010/02/the-facebook-whisperer.html#idc-container" target="_blank">Facebook has asked Warden to delay releasing</a> this data to the academic community (I&#8217;m curious as to what kind of pressure &#8212; if any &#8212; they exerted to keep him from releasing this week as originally planned). We will need to keep a close eye to see if the data is actually released, in what form, and if any steps will be taken to control and track its usage.</p>
<p>UPDATE: Under a <a href="http://petewarden.typepad.com/searchbrowser/2010/04/how-i-got-sued-by-facebook.html#comments" target="_blank">threat of a lawsuit</a> from Facebook, Warden has <a href="http://petewarden.typepad.com/searchbrowser/2010/03/facebook-data-destruction.html" target="_blank">destroyed the dataset</a>.</p>


<p>Related posts: (automatically generated)<ol><li><a href='http://michaelzimmer.org/2009/06/18/draft-paper-but-the-data-is-already-public/' rel='bookmark' title='Permanent Link: Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook'>Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook</a></li>
<li><a href='http://michaelzimmer.org/2007/07/20/askcom-to-allow-users-to-control-data-retention/' rel='bookmark' title='Permanent Link: Ask.com To Allow Users to Control Data Retention'>Ask.com To Allow Users to Control Data Retention</a></li>
<li><a href='http://michaelzimmer.org/2007/08/19/41-of-facebook-users-share-personal-information-with-a-frog/' rel='bookmark' title='Permanent Link: 41% of Facebook Users Share Personal Information with a Frog'>41% of Facebook Users Share Personal Information with a Frog</a></li>
<li><a href='http://michaelzimmer.org/2006/01/25/77-of-google-users-dont-know-it-records-personal-data/' rel='bookmark' title='Permanent Link: 77% of Google users don&#8217;t know it records personal data'>77% of Google users don&#8217;t know it records personal data</a></li>
<li><a href='http://michaelzimmer.org/2010/05/10/what-happens-to-your-facebook-data-when-you-leave/' rel='bookmark' title='Permanent Link: What Happens to Your Facebook Data When You Leave? (Updated)'>What Happens to Your Facebook Data When You Leave? (Updated)</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://michaelzimmer.org/2010/02/12/why-pete-warden-should-not-release-profile-data-on-215-million-facebook-users/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Is it Ethical to Harvest Public Twitter Accounts without Consent?</title>
		<link>http://michaelzimmer.org/2010/02/12/is-it-ethical-to-harvest-public-twitter-accounts-without-consent/</link>
		<comments>http://michaelzimmer.org/2010/02/12/is-it-ethical-to-harvest-public-twitter-accounts-without-consent/#comments</comments>
		<pubDate>Fri, 12 Feb 2010 15:09:21 +0000</pubDate>
		<dc:creator>Michael Zimmer</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Privacy]]></category>
		<category><![CDATA[Research ethics]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://michaelzimmer.org/?p=1834</guid>
		<description><![CDATA[While participating in the workshop on Revisiting   Research Ethics in the Facebook Era: Challenges in Emerging CSCW   Research, the question arose as to whether it was ethical for  researchers to follow and systematically capture public Twitter streams  without first obtaining specific, informed consent by the subjects. Many  in the room felt that consent was not necessary since the tweets are  public, a conscious choice made by the user to allow the whole world see  her activity. In short, by not restricting ...


Related posts: (automatically generated)<ol><li><a href='http://michaelzimmer.org/2010/04/14/how-your-private-tweets-might-be-included-in-the-library-of-congress-public-archive/' rel='bookmark' title='Permanent Link: How Your Private Tweets Might Be Included in the Library of Congress Public Archive'>How Your Private Tweets Might Be Included in the Library of Congress Public Archive</a></li>
<li><a href='http://michaelzimmer.org/2006/01/31/public-comments-on-rfid-passports-arepublic/' rel='bookmark' title='Permanent Link: Public Comments on RFID Passports are&#8230;Public'>Public Comments on RFID Passports are&#8230;Public</a></li>
<li><a href='http://michaelzimmer.org/2007/06/25/wsj-when-public-records-are-too-public/' rel='bookmark' title='Permanent Link: WSJ: When Public Records Are Too Public'>WSJ: When Public Records Are Too Public</a></li>
<li><a href='http://michaelzimmer.org/2006/01/18/itunes-now-asks-for-consent-before-collecting-data/' rel='bookmark' title='Permanent Link: iTunes Now Asks for Consent Before Collecting Data'>iTunes Now Asks for Consent Before Collecting Data</a></li>
<li><a href='http://michaelzimmer.org/2007/05/24/flaw-in-twitters-privacy-settings/' rel='bookmark' title='Permanent Link: Flaw in Twitter&#8217;s Privacy Settings'>Flaw in Twitter&#8217;s Privacy Settings</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/tporter2/4300503088/"><img class="alignright" title="Tweet Me" src="http://michaelzimmer.org/images/tweet_me.jpg" alt="" width="240" height="167" /></a>While participating in the workshop on <a href="../2010/02/06/revisiting-research-ethics-in-the-facebook-era-challenges-in-emerging-cscw-research/">Revisiting   Research Ethics in the Facebook Era: Challenges in Emerging CSCW   Research</a>, the question arose as to whether it was ethical for  researchers to follow and systematically capture public Twitter streams  without first obtaining specific, informed consent by the subjects. Many  in the room felt that consent was not necessary since the tweets are  public, a conscious choice made by the user to allow the whole world see  her activity. In short, by not restricting access to one&#8217;s account,  there is no expectation of privacy.</p>
<p>I argued, however, that we  cannot be so quick to presume the expectations of potential research  subjects. Yes, setting one&#8217;s Twitter stream to public does mean that  anyone can search for you, follow you, and view your activity. However,  there is a reasonable expectation that one&#8217;s tweet stream will be  &#8220;practically obscure&#8221; within the thousands (if not millions) of tweets  similarly publicly viewable. Yes, the subject has consented to making  her tweets visible to those who take the time and energy to seek her  out, those who have a genuine interest to connect and view her activity  through this social network.</p>
<p>But she did <em>not</em> automatically  consent, I argue, to having her tweet stream systematically followed,  harvested, archived, and mined by researchers (no matter the positive  intent of such research). That is not what is expected when making a  Twitter account public, and it is my opinion that researchers should  seek consent prior to capturing and using this data.</p>
<p>A healthy  debate on this issue followed, and continued in a separate  thread on  Facebook, which included the following varied positions &amp; responses (edited and condensed):</p>
<blockquote>
<ol>
<li>&#8220;&#8230;if the account holder tweets to the general public, then it&#8217;d seem like  there&#8217;s no expectation of privacy so no consent would be necessary.&#8221;</li>
<li><em>(me)</em> &#8220;But isn&#8217;t my expectation that even though my  tweets are public, they&#8217;re often lost in a sea of hundreds of tweets  among my followers, and I never anticipated someone would archive, mine,  and perform research on them?&#8221;</li>
<li>&#8220;If you&#8217;re comfortable with your anonymity  being guaranteed only by virtue of your public tweets being hidden in  plain sight among millions of others, then you&#8217;d have to realize that  some determined person could follow just yours, archive them, and  analyze them.  I like my privacy, but I don&#8217;t worry about walking around  a city or campus even though &#8230;&#8221;</li>
<li>&#8220;&#8230;depends on how data are being presented &#8211; e.g. in aggregate vs specific  &#8220;quotes&#8221; that could easily be traced.&#8221;</li>
<li>&#8220;Many IRBs would say yes [consent is needed], or at least would  require you to get a waiver&#8211;publicizing the extremes to which IRBs go&#8230;&#8221;</li>
<li>&#8220;&#8230;IRB application is  required. You could request that Informed consent be waived with the  argument that you are only analyzing tweets broadcast publicly, and that  you de-identify your data to eliminate potential risk to the individual&#8221;</li>
<li>&#8220;I would say if it is for research and you  are dealing only with publicly available documents, then no, you need no  consent.  you can run that by the irb and get a waiver, but in the end,  you are dealing with publicly available documents&#8230; not people,  subjects.  If you are dealing with subjects and not documents, then you  will need irb clearance.&#8221;</li>
<li>&#8220;Tweets are publications. I think it&#8217;s absurd  to even consider IRB review for anything dealing with things people  have published&#8221;</li>
<li>&#8220;The questions are:  1) Are you  conducting research that is intended to be published; 2) Does your  research involved human participants; 3) For these human participants,  will you gather data through intervention or interaction with the  individual; and/or will you gather identifiable private information  about them. (45 CFR 46.102(f))<br />
If these 3 conditions are met, your research must be  reviewed by IRB.  They will work with you and determine whether or not  informed consent is required.  In your case, if you are NOT interacting  with the individual publishing the tweets, and the tweets are broadcast  and searchable as public records (that is, you don’t need access to  their account to view tweets posted to a limited audience), then it  won’t fall under the definition of research with human subjects.&#8221;</li>
<li>&#8220;If i download all of Michael&#8217;s published  papers, blog posts, twitter posts and each one he publishes  thereafter&#8230;  are they the same?  or different?   I&#8217;d argue the same,  just for different audiences.&#8221;</li>
<li><em>(me)</em> &#8220;What if tomorrow, I decide to take my Tweet  stream private. And I delete my blog posts. Does my affirmative action  to purge my documents from the &#8220;live&#8221; web mean that you (researcher)  need to treat that previously archived material differently?&#8221;</li>
<li>&#8220;If the individual changes their intent regarding release of data, then  by IRB standards what might previously have been considered publicly  available information, then becomes private information, and your  collection would likely require BOTH IRB review AND informed consent,  b/c the user now has an expectation that their information is protected.&#8221;</li>
<li>&#8220;Once tweeted, a birdsong is gone forever.  No deleting or taking back  what&#8217;s been broadcast to the world.  If someone seeks privacy, they  should seek another method of communication.  If from the beginning,  there was some kind of inherent expectation that tweets were private  messages, then the situation might be different.  But the whole idea of  tweeting is to voluntarily publish or broadcast.  It&#8217;s different from,  say, e-mailing or IMing.&#8221;</li>
</ol>
</blockquote>
<p>What we see here are numerous, intelligent researchers not in complete agreement about wither consent is necessary, about whether one&#8217;s tweets are &#8220;publications&#8221; not needing IRB review, or whether Twitter-based research is dealing with &#8220;human subjects&#8221; that does require strict scrutiny. There&#8217;s also some question about how to deal with the fact that users might make information private after an initial release, something our current forms of communication allow more than in the past.</p>
<p>What do you think? If readers have had experience with related research  ethics issues, and  how their IRB dealt with is, please email me or leave  a comment.</p>
<p><em>Aside:</em> Interestingly, Adam Fish, who I&#8217;ve friended on Facebook, saw that discussion and wanted to <a href="http://savageminds.org/2010/02/12/mining-twitter-and-informed-consent/" target="_blank">repost the thread on his blog</a>. Respectful of the delicate nature of re-posting other conversations and moving them from the controlled environs of Facebook to a public blog, he contacted me to ask permission. He didn&#8217;t, apparently, contact each of the commenters to ask for <em>their</em> permission. I felt it necessary to get consent from everyone in that thread before authorizing its re-posting. When I asked each of them, all agreed (with some edits), and some took the position that the Facebook conversation was <em>de facto</em> public, even though technically only a certain set of users (friends of the participants) could in reality see the thread.</p>
<p style="text-align: right;"><em>[<a href="http://www.flickr.com/photos/tporter2/4300503088/" target="_blank">image</a> from <a title="Link to TPorter2006's photostream" rel="dc:creator cc:attributionURL" href="http://www.flickr.com/photos/tporter2/">TPorter2006</a>]</em></p>


<p>Related posts: (automatically generated)<ol><li><a href='http://michaelzimmer.org/2010/04/14/how-your-private-tweets-might-be-included-in-the-library-of-congress-public-archive/' rel='bookmark' title='Permanent Link: How Your Private Tweets Might Be Included in the Library of Congress Public Archive'>How Your Private Tweets Might Be Included in the Library of Congress Public Archive</a></li>
<li><a href='http://michaelzimmer.org/2006/01/31/public-comments-on-rfid-passports-arepublic/' rel='bookmark' title='Permanent Link: Public Comments on RFID Passports are&#8230;Public'>Public Comments on RFID Passports are&#8230;Public</a></li>
<li><a href='http://michaelzimmer.org/2007/06/25/wsj-when-public-records-are-too-public/' rel='bookmark' title='Permanent Link: WSJ: When Public Records Are Too Public'>WSJ: When Public Records Are Too Public</a></li>
<li><a href='http://michaelzimmer.org/2006/01/18/itunes-now-asks-for-consent-before-collecting-data/' rel='bookmark' title='Permanent Link: iTunes Now Asks for Consent Before Collecting Data'>iTunes Now Asks for Consent Before Collecting Data</a></li>
<li><a href='http://michaelzimmer.org/2007/05/24/flaw-in-twitters-privacy-settings/' rel='bookmark' title='Permanent Link: Flaw in Twitter&#8217;s Privacy Settings'>Flaw in Twitter&#8217;s Privacy Settings</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://michaelzimmer.org/2010/02/12/is-it-ethical-to-harvest-public-twitter-accounts-without-consent/feed/</wfw:commentRss>
		<slash:comments>17</slash:comments>
		</item>
		<item>
		<title>Revisiting Research Ethics in the Facebook Era: Challenges in Emerging CSCW Research</title>
		<link>http://michaelzimmer.org/2010/02/06/revisiting-research-ethics-in-the-facebook-era-challenges-in-emerging-cscw-research/</link>
		<comments>http://michaelzimmer.org/2010/02/06/revisiting-research-ethics-in-the-facebook-era-challenges-in-emerging-cscw-research/#comments</comments>
		<pubDate>Sat, 06 Feb 2010 06:22:28 +0000</pubDate>
		<dc:creator>Michael Zimmer</dc:creator>
				<category><![CDATA[CSCW]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Research ethics]]></category>
		<category><![CDATA[cscw2010]]></category>

		<guid isPermaLink="false">http://michaelzimmer.org/?p=1820</guid>
		<description><![CDATA[I&#8217;m currently in Savannah, GA to participate in a workshop on Revisiting  Research Ethics in the Facebook Era: Challenges in Emerging CSCW  Research at CSCW 2010.
This is my first time at CSCW, and looking at the set of papers for this workshop, it should be an excellent experience. I&#8217;ve submitted a brief analysis of the &#8220;Tastes, Ties, and Time&#8221; Facebook dataset release (my larger paper is going through its final edits for publication). You can download the short analysis here: Subject Privacy and the Release of the “Tastes, ...


Related posts: (automatically generated)<ol><li><a href='http://michaelzimmer.org/2009/06/18/draft-paper-but-the-data-is-already-public/' rel='bookmark' title='Permanent Link: Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook'>Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook</a></li>
<li><a href='http://michaelzimmer.org/2009/08/28/nsf-grant-internet-research-and-ethics-2-0/' rel='bookmark' title='Permanent Link: NSF Grant: Internet Research and Ethics 2.0'>NSF Grant: Internet Research and Ethics 2.0</a></li>
<li><a href='http://michaelzimmer.org/2008/10/03/more-on-the-anonymity-of-the-facebook-dataset-its-harvard-college/' rel='bookmark' title='Permanent Link: More On the “Anonymity” of the Facebook Dataset &#8211; It&#8217;s Harvard College (Updated)'>More On the “Anonymity” of the Facebook Dataset &#8211; It&#8217;s Harvard College (Updated)</a></li>
<li><a href='http://michaelzimmer.org/2008/09/30/on-the-anonymity-of-the-facebook-dataset/' rel='bookmark' title='Permanent Link: On the &#8220;Anonymity&#8221; of the Facebook Dataset (Updated)'>On the &#8220;Anonymity&#8221; of the Facebook Dataset (Updated)</a></li>
<li><a href='http://michaelzimmer.org/2009/02/16/on-facebook-people-own-and-control-their-information-except-when-facebook-does/' rel='bookmark' title='Permanent Link: On Facebook, People Own and Control Their Information (Except When Facebook Does)'>On Facebook, People Own and Control Their Information (Except When Facebook Does)</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m currently in Savannah, GA to participate in a workshop on <a href="http://www.cc.gatech.edu/%7Eyardi/ethics-cscw2010.htm">Revisiting  Research Ethics in the Facebook Era: Challenges in Emerging CSCW  Research</a> at <a href="http://www.cscw2010.org/">CSCW 2010</a>.</p>
<p>This is my first time at CSCW, and looking at the <a href="http://www.cc.gatech.edu/~yardi/ethics-cscw2010_files/AcceptedPapers.htm" target="_blank">set of papers</a> for this workshop, it should be an excellent experience. I&#8217;ve submitted a brief analysis of the &#8220;Tastes, Ties, and Time&#8221; Facebook dataset release (my larger paper is going through its final edits for publication). You can download the short analysis here: <a href="http://www.cc.gatech.edu/~yardi/CSCW/Zimmer_Submission.pdf" target="_blank">Subject Privacy and the Release of the “Tastes,  Ties, and Time” Dataset</a>.</p>
<p>The organizers also asked me to provide some brief comments on the ethical issues related to archiving and releasing research data. I&#8217;ve created a few slides with some provocations that will hopefully spark some discussion on these matters. You can view these slides here:<br />
<a style="margin: 12px auto 6px auto; font-family: Helvetica,Arial,Sans-serif; font-style: normal; font-variant: normal; font-weight: normal; font-size: 14px; line-height: normal; font-size-adjust: none; font-stretch: normal; -x-system-font: none; display: block; text-decoration: underline;" title="View Zimmer CSCW 2010 on Scribd" href="http://www.scribd.com/doc/26460551/Zimmer-CSCW-2010">Zimmer CSCW 2010</a> <object id="doc_640373139906250" style="outline: none;" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="100%" height="600" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="name" value="doc_640373139906250" /><param name="data" value="http://d1.scribdassets.com/ScribdViewer.swf" /><param name="wmode" value="opaque" /><param name="bgcolor" value="#ffffff" /><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="FlashVars" value="document_id=26460551&amp;access_key=key-274596rld1pc5fimckni&amp;page=1&amp;viewMode=slideshow" /><param name="src" value="http://d1.scribdassets.com/ScribdViewer.swf" /><param name="flashvars" value="document_id=26460551&amp;access_key=key-274596rld1pc5fimckni&amp;page=1&amp;viewMode=slideshow" /><param name="allowfullscreen" value="true" /><embed id="doc_640373139906250" style="outline: none;" type="application/x-shockwave-flash" width="100%" height="600" src="http://d1.scribdassets.com/ScribdViewer.swf" flashvars="document_id=26460551&amp;access_key=key-274596rld1pc5fimckni&amp;page=1&amp;viewMode=slideshow" allowscriptaccess="always" allowfullscreen="true" bgcolor="#ffffff" wmode="opaque" data="http://d1.scribdassets.com/ScribdViewer.swf" name="doc_640373139906250"></embed></object></p>


<p>Related posts: (automatically generated)<ol><li><a href='http://michaelzimmer.org/2009/06/18/draft-paper-but-the-data-is-already-public/' rel='bookmark' title='Permanent Link: Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook'>Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook</a></li>
<li><a href='http://michaelzimmer.org/2009/08/28/nsf-grant-internet-research-and-ethics-2-0/' rel='bookmark' title='Permanent Link: NSF Grant: Internet Research and Ethics 2.0'>NSF Grant: Internet Research and Ethics 2.0</a></li>
<li><a href='http://michaelzimmer.org/2008/10/03/more-on-the-anonymity-of-the-facebook-dataset-its-harvard-college/' rel='bookmark' title='Permanent Link: More On the “Anonymity” of the Facebook Dataset &#8211; It&#8217;s Harvard College (Updated)'>More On the “Anonymity” of the Facebook Dataset &#8211; It&#8217;s Harvard College (Updated)</a></li>
<li><a href='http://michaelzimmer.org/2008/09/30/on-the-anonymity-of-the-facebook-dataset/' rel='bookmark' title='Permanent Link: On the &#8220;Anonymity&#8221; of the Facebook Dataset (Updated)'>On the &#8220;Anonymity&#8221; of the Facebook Dataset (Updated)</a></li>
<li><a href='http://michaelzimmer.org/2009/02/16/on-facebook-people-own-and-control-their-information-except-when-facebook-does/' rel='bookmark' title='Permanent Link: On Facebook, People Own and Control Their Information (Except When Facebook Does)'>On Facebook, People Own and Control Their Information (Except When Facebook Does)</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://michaelzimmer.org/2010/02/06/revisiting-research-ethics-in-the-facebook-era-challenges-in-emerging-cscw-research/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>IR.10 Internet: Critical (or, why the blog has been slow lately)</title>
		<link>http://michaelzimmer.org/2009/10/06/ir-10-internet-critical/</link>
		<comments>http://michaelzimmer.org/2009/10/06/ir-10-internet-critical/#comments</comments>
		<pubDate>Wed, 07 Oct 2009 01:51:52 +0000</pubDate>
		<dc:creator>Michael Zimmer</dc:creator>
				<category><![CDATA[AOIR]]></category>
		<category><![CDATA[Conferences]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[Research ethics]]></category>
		<category><![CDATA[Technology & Society]]></category>
		<category><![CDATA[ir10]]></category>

		<guid isPermaLink="false">http://michaelzimmer.org/?p=1518</guid>
		<description><![CDATA[For the last 353 days, I&#8217;ve been part of a team planning Internet Research 10.0 – Internet: Critical, the 10th annual conference of the Association of Internet Researchers (AoIR). My life is about to get back to normal, as an interdisciplinary collection of nearly 400 scholars, researchers and graduate students interested in Internet and new media studies are descending on Milwaukee this week.
The conference program is fantastic, featuring keynote addresses by Siva Vaidhyanathan, Wendy Hui Kyong Chun, and Megan Boler. I&#8217;ll be presenting an updated version of my paper, &#8220;But ...


Related posts: (automatically generated)<ol><li><a href='http://michaelzimmer.org/2008/10/18/announcing-ir10-internet-critical/' rel='bookmark' title='Permanent Link: Announcing IR.10 Internet:Critical (Milwaukee &#8211; 2009)'>Announcing IR.10 Internet:Critical (Milwaukee &#8211; 2009)</a></li>
<li><a href='http://michaelzimmer.org/2008/11/19/cfp-ir10-internet-critical/' rel='bookmark' title='Permanent Link: CFP: IR.10 Internet: Critical (Milwaukee &#8211; 2009)'>CFP: IR.10 Internet: Critical (Milwaukee &#8211; 2009)</a></li>
<li><a href='http://michaelzimmer.org/2007/05/22/us-high-speed-internet-isslow/' rel='bookmark' title='Permanent Link: US high-speed Internet is&#8230;slow'>US high-speed Internet is&#8230;slow</a></li>
<li><a href='http://michaelzimmer.org/2010/07/10/registration-open-for-internet-research-11/' rel='bookmark' title='Permanent Link: Registration Open – and a Student Grant Opportunity – for Internet Research 11.0'>Registration Open – and a Student Grant Opportunity – for Internet Research 11.0</a></li>
<li><a href='http://michaelzimmer.org/2006/09/06/critical-perspectives-on-social-software-and-web-20/' rel='bookmark' title='Permanent Link: Critical Perspectives on Social Software and Web 2.0'>Critical Perspectives on Social Software and Web 2.0</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>For the <a href="http://michaelzimmer.org/2008/10/18/announcing-ir10-internet-critical/">last 353 days</a>, I&#8217;ve been part of a team planning <a href="http://ir10.aoir.org/" target="_blank"><strong>Internet Research 10.0 – Internet: Critical</strong></a>, the 10th annual conference of the <a href="http://www.aoir.org/">Association of Internet Researchers</a> (AoIR). My life is about to get back to normal, as an interdisciplinary collection of nearly 400 scholars, researchers and graduate students interested in Internet and new media studies are descending on Milwaukee this week.</p>
<p>The <a href="http://ir10.aoir.org/?page_id=4" target="_blank">conference program</a> is fantastic, featuring keynote addresses by Siva Vaidhyanathan, Wendy Hui Kyong Chun, and Megan Boler. I&#8217;ll be presenting an <a href="http://michaelzimmer.org/2009/06/18/draft-paper-but-the-data-is-already-public/" target="_blank">updated version</a> of my paper, <em>&#8220;But the Data is Already Public”: On the Ethics of Research in Facebook</em>, based on my critique of the &#8220;Tastes, Ties, and Time&#8221; Facebook data release.</p>
<p>I&#8217;ll also be participating in a pre-conference workshop on <a href="http://ir10.aoir.org/?page_id=191" target="_blank">Critical Issues and Perspectives in Internet Research Methods and Ethics</a>, where the focus will be on raising awareness of and sensitivity by researchers around critical methodological and ethical issues working particularly in online or Internet-mediated realms.</p>
<p>There already is a great tweet stream at <a href="http://search.twitter.com/search?q=%23IR10" target="_blank">#ir10</a>, so please follow us if you can&#8217;t make it.</p>
<p>UPDATE: By most measures, the conference was a great success. As an organizer, I was only able to sit in on a handful of sessions (including my own), but others have blogged about <a href="http://fstutzman.com/2009/10/15/aoir-wrapup/" target="_blank">here</a> and <a href="http://virtualpolitik.blogspot.com/2009/10/ethics-has-become-what-you-cant-do.html" target="_blank">here</a>.</p>
<p>There are quite a few <a href="http://www.flickr.com/photos/tags/ir10/" target="_blank">images up on Flickr</a>, and I&#8217;ve created a <a href="http://www.wordle.net/show/wrdl/1213598/IR.10_Internet%3ACritical" target="_blank">Wordle</a> from all the individual paper titles. Matt Allen <a href="http://www.netcrit.net/events/aoir-annual-general-meeting/" target="_blank">blogged the AoIR general meeting</a>, where the locations of the next two conferences were announced: Gothenburg, Sweden (2010) &amp; Seattle, Washington (2011). See you there!</p>


<p>Related posts: (automatically generated)<ol><li><a href='http://michaelzimmer.org/2008/10/18/announcing-ir10-internet-critical/' rel='bookmark' title='Permanent Link: Announcing IR.10 Internet:Critical (Milwaukee &#8211; 2009)'>Announcing IR.10 Internet:Critical (Milwaukee &#8211; 2009)</a></li>
<li><a href='http://michaelzimmer.org/2008/11/19/cfp-ir10-internet-critical/' rel='bookmark' title='Permanent Link: CFP: IR.10 Internet: Critical (Milwaukee &#8211; 2009)'>CFP: IR.10 Internet: Critical (Milwaukee &#8211; 2009)</a></li>
<li><a href='http://michaelzimmer.org/2007/05/22/us-high-speed-internet-isslow/' rel='bookmark' title='Permanent Link: US high-speed Internet is&#8230;slow'>US high-speed Internet is&#8230;slow</a></li>
<li><a href='http://michaelzimmer.org/2010/07/10/registration-open-for-internet-research-11/' rel='bookmark' title='Permanent Link: Registration Open – and a Student Grant Opportunity – for Internet Research 11.0'>Registration Open – and a Student Grant Opportunity – for Internet Research 11.0</a></li>
<li><a href='http://michaelzimmer.org/2006/09/06/critical-perspectives-on-social-software-and-web-20/' rel='bookmark' title='Permanent Link: Critical Perspectives on Social Software and Web 2.0'>Critical Perspectives on Social Software and Web 2.0</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://michaelzimmer.org/2009/10/06/ir-10-internet-critical/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>NSF Grant: Internet Research and Ethics 2.0</title>
		<link>http://michaelzimmer.org/2009/08/28/nsf-grant-internet-research-and-ethics-2-0/</link>
		<comments>http://michaelzimmer.org/2009/08/28/nsf-grant-internet-research-and-ethics-2-0/#comments</comments>
		<pubDate>Fri, 28 Aug 2009 07:57:08 +0000</pubDate>
		<dc:creator>Michael Zimmer</dc:creator>
				<category><![CDATA[Ethics]]></category>
		<category><![CDATA[Research ethics]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://michaelzimmer.org/?p=1446</guid>
		<description><![CDATA[Congratulations to Elizabeth Buchanan and Charles Ess for being awarded an NSF grant to create a much-needed repository and advisory board to address the challenges that emerging Internet and Web 2.0 platforms are placing on research ethics.
I&#8217;m thrilled to be among the senior personnel on this grant, and look forward to the collaboration and results.
[Note: The Internet Research Ethics Digital Library, Resource Center, and Commons website is now live]
Here is the summary of the research project:

Internet Research and Ethics 2.0:
The Internet Research Ethics Digital Library, Interactive Resource Center, and Online ...


Related posts: (automatically generated)<ol><li><a href='http://michaelzimmer.org/2010/02/06/revisiting-research-ethics-in-the-facebook-era-challenges-in-emerging-cscw-research/' rel='bookmark' title='Permanent Link: Revisiting Research Ethics in the Facebook Era: Challenges in Emerging CSCW Research'>Revisiting Research Ethics in the Facebook Era: Challenges in Emerging CSCW Research</a></li>
<li><a href='http://michaelzimmer.org/2009/06/18/draft-paper-but-the-data-is-already-public/' rel='bookmark' title='Permanent Link: Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook'>Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook</a></li>
<li><a href='http://michaelzimmer.org/2009/10/06/ir-10-internet-critical/' rel='bookmark' title='Permanent Link: IR.10 Internet: Critical (or, why the blog has been slow lately)'>IR.10 Internet: Critical (or, why the blog has been slow lately)</a></li>
<li><a href='http://michaelzimmer.org/2008/11/04/cfp-the-ethics-of-information-organization/' rel='bookmark' title='Permanent Link: CFP: The Ethics of Information Organization'>CFP: The Ethics of Information Organization</a></li>
<li><a href='http://michaelzimmer.org/2008/01/21/the-role-of-information-ethics-in-education/' rel='bookmark' title='Permanent Link: The Role of Information Ethics in Education'>The Role of Information Ethics in Education</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>Congratulations to <a href="http://www4.uwm.edu/sois/cipr/director.html" target="_blank">Elizabeth Buchanan</a> and <a href="http://www.drury.edu/ess/ess.html" target="_blank">Charles Ess</a> for being <a href="http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0924604" target="_blank">awarded an NSF grant</a> to create a much-needed repository and advisory board to address the challenges that emerging Internet and Web 2.0 platforms are placing on research ethics.</p>
<p>I&#8217;m thrilled to be among the senior personnel on this grant, and look forward to the collaboration and results.</p>
<p>[Note: The <a href="http://internetresearchethics.org" target="_blank">Internet Research Ethics Digital Library, Resource Center, and Commons</a> website is now live]</p>
<p>Here is the summary of the research project:</p>
<blockquote>
<p style="text-align: center;"><strong>Internet Research and Ethics 2.0:</strong><br />
The Internet Research Ethics Digital Library, Interactive Resource Center, and Online Ethics Advisory Board</p>
<p>The working knowledge, professional and disciplinary norms and practices, and body of Internet Research ethics literature, is scattered across disciplines and locales, often contributing to uncertainty in ethical and methodological decision making among researchers and ethics boards which review research in academic institutions. This occasionally prevents research from being pursued and unnecessarily lengthens the time and effort that such bodies as Institutional Review Boards put into internet-research based protocols, as Buchanan and Ess have discovered in their current NSF funded research on IRBs and IRE protocol review. This uncertainty also contributes to a growing confusion and frustration among researchers, who see an ethical rigidity imposed from extant ethical review models, as more research is conducted across an array of technological and global boundaries.</p>
<p>This project brings the IRE literature together into a comprehensive database and couples it with it an interactive resource center, thus, centralizing and simplifying the task of developing standards, best practices, and guidelines around IRE. It also develops an Online Ethics Advisory Board, which will provide professional advice and guidance for researchers, ethics boards, and any research participants. The project is an exemplar of a new concept, “research ethics 2.0,” borrowing from web 2.0 to describe the emerging interactivity, user-developed content and resources, and technologies such as social networking and hyper-blogging that are stretching traditional notions of subject-object, creator-created, owner-consumer, and ultimately, researcher-researched. This emergent model of research ethics has the potential to transform the types of research itself that is conducted across universities and beyond.</p>
<p>The intellectual merit of this project lies in its cross-disciplinary breadth and depth that will benefit a range of scholarship in understanding “research ethics 2.0,” or, “transformative research ethics,” its focus on the ever-increasing literature concerned with Internet research ethics and the various discourses in play across disciplines around such relevant issues such as privacy, consent, ownership; and, its transformative nature, contribute to the model of a traditional ethics board with an online advisory board that is highly versed in Internet research ethics problems and scenarios.</p>
<p>The broader impacts resulting from the proposed activity stem from the intellectual merits: The project will push the boundaries of traditional research ethics issues, allowing transformative models for managing Internet research. It provides sound resources, a solidified research base, and expert advice as more researchers and more IRBs/ethics boards struggle with the complexities of Internet research ethics. The greatest overall significance lies in the broad impact the project will have for Internet researchers from all disciplines and on the foundations of their research ethics systems and information ethics in general, and on the nature of research itself in an ever-connected, online environment.</p></blockquote>


<p>Related posts: (automatically generated)<ol><li><a href='http://michaelzimmer.org/2010/02/06/revisiting-research-ethics-in-the-facebook-era-challenges-in-emerging-cscw-research/' rel='bookmark' title='Permanent Link: Revisiting Research Ethics in the Facebook Era: Challenges in Emerging CSCW Research'>Revisiting Research Ethics in the Facebook Era: Challenges in Emerging CSCW Research</a></li>
<li><a href='http://michaelzimmer.org/2009/06/18/draft-paper-but-the-data-is-already-public/' rel='bookmark' title='Permanent Link: Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook'>Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook</a></li>
<li><a href='http://michaelzimmer.org/2009/10/06/ir-10-internet-critical/' rel='bookmark' title='Permanent Link: IR.10 Internet: Critical (or, why the blog has been slow lately)'>IR.10 Internet: Critical (or, why the blog has been slow lately)</a></li>
<li><a href='http://michaelzimmer.org/2008/11/04/cfp-the-ethics-of-information-organization/' rel='bookmark' title='Permanent Link: CFP: The Ethics of Information Organization'>CFP: The Ethics of Information Organization</a></li>
<li><a href='http://michaelzimmer.org/2008/01/21/the-role-of-information-ethics-in-education/' rel='bookmark' title='Permanent Link: The Role of Information Ethics in Education'>The Role of Information Ethics in Education</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://michaelzimmer.org/2009/08/28/nsf-grant-internet-research-and-ethics-2-0/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook</title>
		<link>http://michaelzimmer.org/2009/06/18/draft-paper-but-the-data-is-already-public/</link>
		<comments>http://michaelzimmer.org/2009/06/18/draft-paper-but-the-data-is-already-public/#comments</comments>
		<pubDate>Thu, 18 Jun 2009 21:02:30 +0000</pubDate>
		<dc:creator>Michael Zimmer</dc:creator>
				<category><![CDATA[CEPE]]></category>
		<category><![CDATA[Conferences]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Online Privacy]]></category>
		<category><![CDATA[Research ethics]]></category>

		<guid isPermaLink="false">http://michaelzimmer.org/?p=1329</guid>
		<description><![CDATA[Next week I will be attending the 8th International Conference of Computer Ethics: Philosophical Enquiry in Corfu, Greece, where I will be presenting an early draft of a paper based on my critique of the &#8220;Taste, Ties, and Time&#8221; Facebook data release.
Recall that last fall, a group of researchers affiliated with the Berkman Center for Internet &#38; Society at Harvard University released a dataset of Facebook profile information from an entire cohort (the class of 2009) of college students from “an anonymous, northeastern American university.” While the researchers took good ...


Related posts: (automatically generated)<ol><li><a href='http://michaelzimmer.org/2010/02/06/revisiting-research-ethics-in-the-facebook-era-challenges-in-emerging-cscw-research/' rel='bookmark' title='Permanent Link: Revisiting Research Ethics in the Facebook Era: Challenges in Emerging CSCW Research'>Revisiting Research Ethics in the Facebook Era: Challenges in Emerging CSCW Research</a></li>
<li><a href='http://michaelzimmer.org/2009/08/28/nsf-grant-internet-research-and-ethics-2-0/' rel='bookmark' title='Permanent Link: NSF Grant: Internet Research and Ethics 2.0'>NSF Grant: Internet Research and Ethics 2.0</a></li>
<li><a href='http://michaelzimmer.org/2005/10/08/paper-the-value-implications-of-the-%e2%80%9cgoogle-paradigm%e2%80%9d-for-organizing-distributing-and-accessing-information/' rel='bookmark' title='Permanent Link: Paper: The Value Implications of the “Google Paradigm” for Organizing, Distributing and Accessing Information'>Paper: The Value Implications of the “Google Paradigm” for Organizing, Distributing and Accessing Information</a></li>
<li><a href='http://michaelzimmer.org/2008/10/03/more-on-the-anonymity-of-the-facebook-dataset-its-harvard-college/' rel='bookmark' title='Permanent Link: More On the “Anonymity” of the Facebook Dataset &#8211; It&#8217;s Harvard College (Updated)'>More On the “Anonymity” of the Facebook Dataset &#8211; It&#8217;s Harvard College (Updated)</a></li>
<li><a href='http://michaelzimmer.org/2010/05/10/what-happens-to-your-facebook-data-when-you-leave/' rel='bookmark' title='Permanent Link: What Happens to Your Facebook Data When You Leave? (Updated)'>What Happens to Your Facebook Data When You Leave? (Updated)</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>Next week I will be attending the <a href="http://cepe2009.ionio.gr/" target="_blank">8th International Conference of Computer Ethics: Philosophical Enquiry</a> in Corfu, Greece, where I will be presenting an early draft of a paper based on my critique of the <a href="http://cyber.law.harvard.edu/node/4682" target="_blank">&#8220;Taste, Ties, and Time&#8221; Facebook data release</a>.</p>
<p>Recall that last fall, a group of researchers affiliated with the <a href="http://cyber.law.harvard.edu/" target="_blank">Berkman Center for Internet &amp; Society</a> at Harvard University <a href="http://cyber.law.harvard.edu/node/4682" target="_blank">released a dataset of Facebook profile information</a> from an entire cohort (the class of 2009) of college students from “an anonymous, northeastern American university.” While the researchers took good faith steps to preserve the anonymity of the source of the data (and, presumably, the privacy of the subjects), <a href="http://michaelzimmer.org/2008/09/30/on-the-anonymity-of-the-facebook-dataset/" target="_blank">I quickly narrowed it down</a> to 7 possible universities, and then with only a little more effort, <a href="http://michaelzimmer.org/2008/10/03/more-on-the-anonymity-of-the-facebook-dataset-its-harvard-college/" target="_blank">identified the source</a> (with some confidence) as Harvard College. All this without ever even downloading or looking at the actual data.</p>
<p>The researchers have since <a href="http://dvn.iq.harvard.edu/dvn/dv/t3" target="_blank">pulled the data out of circulation</a>, and plan to make it available again this month, presumably with some of the anonymity and privacy concerns addressed.</p>
<p>The draft paper I am presenting, <em>“But the Data is Already Public”: On the Ethics of Research in Facebook</em>, retells the circumstances around the T3 project and my partial re-identification of the dataset. It also describes some of the good faith efforts made by the T3 researchers to try to ensure the anonymity of the data, but exposes the limitations and errors in their procedures. Finally, it highlights the broader challenges for engaging in research on/in social networking sites that this case brings to light. These include:</p>
<ul>
<li>the nature of consent in online research</li>
<li>identifying and respecting expectations of privacy on social network sites</li>
<li>developing sufficient strategies for data anonymization prior to the public release of potentially personally-identifiable data</li>
<li>measuring the relative expertise of institutional review boards when confronted with innovative research projects based on data gleaned from social media</li>
</ul>
<p>Future versions of the paper will attempt to provide some guidelines in this regard. In the meantime, I welcome any comments on this draft. E-mail me if you would like to receive a copy.</p>
<p>The PDF of my CEPE presentation is <a href="http://michaelzimmer.org/files/Zimmer_CEPE_slides.pdf">here</a>.</p>


<p>Related posts: (automatically generated)<ol><li><a href='http://michaelzimmer.org/2010/02/06/revisiting-research-ethics-in-the-facebook-era-challenges-in-emerging-cscw-research/' rel='bookmark' title='Permanent Link: Revisiting Research Ethics in the Facebook Era: Challenges in Emerging CSCW Research'>Revisiting Research Ethics in the Facebook Era: Challenges in Emerging CSCW Research</a></li>
<li><a href='http://michaelzimmer.org/2009/08/28/nsf-grant-internet-research-and-ethics-2-0/' rel='bookmark' title='Permanent Link: NSF Grant: Internet Research and Ethics 2.0'>NSF Grant: Internet Research and Ethics 2.0</a></li>
<li><a href='http://michaelzimmer.org/2005/10/08/paper-the-value-implications-of-the-%e2%80%9cgoogle-paradigm%e2%80%9d-for-organizing-distributing-and-accessing-information/' rel='bookmark' title='Permanent Link: Paper: The Value Implications of the “Google Paradigm” for Organizing, Distributing and Accessing Information'>Paper: The Value Implications of the “Google Paradigm” for Organizing, Distributing and Accessing Information</a></li>
<li><a href='http://michaelzimmer.org/2008/10/03/more-on-the-anonymity-of-the-facebook-dataset-its-harvard-college/' rel='bookmark' title='Permanent Link: More On the “Anonymity” of the Facebook Dataset &#8211; It&#8217;s Harvard College (Updated)'>More On the “Anonymity” of the Facebook Dataset &#8211; It&#8217;s Harvard College (Updated)</a></li>
<li><a href='http://michaelzimmer.org/2010/05/10/what-happens-to-your-facebook-data-when-you-leave/' rel='bookmark' title='Permanent Link: What Happens to Your Facebook Data When You Leave? (Updated)'>What Happens to Your Facebook Data When You Leave? (Updated)</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://michaelzimmer.org/2009/06/18/draft-paper-but-the-data-is-already-public/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Sony Provides Complete Everquest 2 Server Logs to Researchers</title>
		<link>http://michaelzimmer.org/2009/02/25/sony-provides-complete-everquest-2-server-logs-to-researchers/</link>
		<comments>http://michaelzimmer.org/2009/02/25/sony-provides-complete-everquest-2-server-logs-to-researchers/#comments</comments>
		<pubDate>Wed, 25 Feb 2009 15:57:09 +0000</pubDate>
		<dc:creator>Michael Zimmer</dc:creator>
				<category><![CDATA[Gaming]]></category>
		<category><![CDATA[Research ethics]]></category>

		<guid isPermaLink="false">http://michaelzimmer.org/?p=1109</guid>
		<description><![CDATA[One of my students pointed me to this post noting that Sony Online Entertainment has provided 4 years of server log data for their online game EverQuest 2 to researchers. From the Ars Technica story:
With the cooperation of Sony, a collaborative group of academic researchers at a number of institutions have obtained the complete server logs from the company&#8217;s Everquest 2 MMORPG. &#8230;Dmitri Williams introduced the project and described how researchers have been approaching various game developers over the years. He paraphrased the conversation with Sony as:
&#8220;What do you collect?&#8221;
&#8220;Well, ...


No related posts.]]></description>
			<content:encoded><![CDATA[<p>One of my students pointed me to this <a href="http://www.massively.com/2009/02/16/sony-opens-complete-everquest-2-database-to-researchers/" target="_blank">post</a> noting that <a href="http://station.sony.com/">Sony Online Entertainment</a> has provided 4 years of server log data for their online game <a href="http://everquest2.station.sony.com/"><span style="font-style: italic;">EverQuest 2</span></a> to researchers. From the <a href="http://arstechnica.com/science/news/2009/02/aaas-60tb-of-behavioral-data-the-everquest-2-server-logs.ars" target="_blank">Ars Technica story</a>:</p>
<blockquote><p>With the cooperation of Sony, a collaborative group of academic researchers at a number of institutions have obtained the complete server logs from the company&#8217;s <em>Everquest 2</em> <a href="http://en.wikipedia.org/wiki/MMORPG" target="_blank">MMORPG</a>. &#8230;Dmitri Williams introduced the project and described how researchers have been approaching various game developers over the years. He paraphrased the conversation with Sony as:</p>
<p>&#8220;What do you collect?&#8221;<br />
&#8220;Well, everything—what do you want?&#8221;<br />
&#8220;Can we have it all?&#8221;<br />
&#8220;Sure.&#8221;</p>
<p>The end result is a log that includes four years of data for over 400,000 players that took part in the game&#8230;</p></blockquote>
<p>Sony states that the logs were &#8220;scrubbed of all PII (Personally Identifiable Information) prior to being provided to the researchers.&#8221; Presuming Sony has sufficiently anonymized the data (a <a href="http://michaelzimmer.org/2008/10/03/more-on-the-anonymity-of-the-facebook-dataset-its-harvard-college/" target="_blank">troubled presumption</a>, <a href="http://michaelzimmer.org/2006/08/07/aol-proudly-releases-massive-amounts-of-private-data/" target="_blank">for sure</a>), I still wonder  how such a data release might affect the community of users in this social sphere. Does the fact that information about one&#8217;s activities is being shared and mined change the <a href="http://michaelzimmer.org/category/privacy/contextual-integrity/" target="_blank">contextual norms of information flow</a> in an MMORPG?</p>


<p>No related posts.</p>]]></content:encoded>
			<wfw:commentRss>http://michaelzimmer.org/2009/02/25/sony-provides-complete-everquest-2-server-logs-to-researchers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>More On the “Anonymity” of the Facebook Dataset &#8211; It&#8217;s Harvard College (Updated)</title>
		<link>http://michaelzimmer.org/2008/10/03/more-on-the-anonymity-of-the-facebook-dataset-its-harvard-college/</link>
		<comments>http://michaelzimmer.org/2008/10/03/more-on-the-anonymity-of-the-facebook-dataset-its-harvard-college/#comments</comments>
		<pubDate>Sat, 04 Oct 2008 03:18:37 +0000</pubDate>
		<dc:creator>Michael Zimmer</dc:creator>
				<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Online Privacy]]></category>
		<category><![CDATA[Research ethics]]></category>
		<category><![CDATA[Social media]]></category>

		<guid isPermaLink="false">http://michaelzimmer.org/?p=864</guid>
		<description><![CDATA[(See my update at the bottom of the post, as well as Fred Stutzman&#8217;s thoughtful analysis)
As mentioned the other day, a group of researchers from the Berkman Center for Internet &#38; Society at Harvard University released a dataset of Facebook profile information from an entire cohort (the class of 2009) of college students from &#8220;an anonymous, northeastern American university.&#8221;
(I&#8217;ve been engaging with Jason Kaufman, the PI for this research, on a variety of privacy and research ethics issues in this post and the comments section &#8211; please check it out.)
Well, ...


Related posts: (automatically generated)<ol><li><a href='http://michaelzimmer.org/2008/09/30/on-the-anonymity-of-the-facebook-dataset/' rel='bookmark' title='Permanent Link: On the &#8220;Anonymity&#8221; of the Facebook Dataset (Updated)'>On the &#8220;Anonymity&#8221; of the Facebook Dataset (Updated)</a></li>
<li><a href='http://michaelzimmer.org/2010/05/10/what-happens-to-your-facebook-data-when-you-leave/' rel='bookmark' title='Permanent Link: What Happens to Your Facebook Data When You Leave? (Updated)'>What Happens to Your Facebook Data When You Leave? (Updated)</a></li>
<li><a href='http://michaelzimmer.org/2008/09/19/facebook-really-wants-you-to-use-beacon/' rel='bookmark' title='Permanent Link: Facebook *Really* Wants You to Use Beacon (Updated)'>Facebook *Really* Wants You to Use Beacon (Updated)</a></li>
<li><a href='http://michaelzimmer.org/2009/06/18/draft-paper-but-the-data-is-already-public/' rel='bookmark' title='Permanent Link: Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook'>Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook</a></li>
<li><a href='http://michaelzimmer.org/2008/10/08/how-to-adjust-your-facebook-privacy-settings/' rel='bookmark' title='Permanent Link: How to Adjust your Facebook Privacy Settings'>How to Adjust your Facebook Privacy Settings</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>(See my update at the bottom of the post, as well as <a href="http://fstutzman.com/2008/10/07/facebook-dataset-identified/" target="_blank">Fred Stutzman&#8217;s</a> thoughtful analysis)</p>
<p>As <a href="http://michaelzimmer.org/2008/09/30/on-the-anonymity-of-the-facebook-dataset/" target="_blank">mentioned</a> the other day, a group of researchers from the <a href="http://cyber.law.harvard.edu/" target="_blank">Berkman Center for Internet &amp; Society at Harvard University</a> released a <a href="http://dvn.iq.harvard.edu/dvn/dv/t3" target="_blank">dataset of Facebook profile information</a> from an entire cohort (the class of 2009) of college students from &#8220;an anonymous, northeastern American university.&#8221;</p>
<p>(I&#8217;ve been engaging with Jason Kaufman, the PI for this research, on a variety of privacy and research ethics issues in <a href="http://michaelzimmer.org/2008/09/30/on-the-anonymity-of-the-facebook-dataset/" target="_blank">this post</a> and the comments section &#8211; please check it out.)</p>
<p>Well, I&#8217;m pretty sure this &#8220;anonymous, northeastern American university&#8221; is <strong><a href="http://www.college.harvard.edu/" target="_blank">Harvard College</a>.</strong> And I didn&#8217;t even have to download the dataset to figure it out. Here&#8217;s how.</p>
<p>As I noted <a href="http://michaelzimmer.org/2008/09/30/on-the-anonymity-of-the-facebook-dataset/" target="_blank">here</a>, the <a href="http://cyber.law.harvard.edu/node/4682" target="_blank">press release</a> and the public <a href="http://dvn.iq.harvard.edu/dvn/dv/t3/faces/study/StudyPage.jsp?studyId=36598&amp;tab=files" target="_blank">codebook</a> for the dataset provided many clues to where the data came from: we know it is a northeastern US university, it is private, co-ed, and whose class of 2009 initially had 1640 students in it. A <a href="http://collegesearch.collegeboard.com/search/adv_typeofschool.jsp" target="_blank">quick search for schools</a> reveals there are <strong>only 7</strong> private, co-ed colleges in New England states (CT, ME, MA, NH, R , VT) with total undergraduate populations between 5000 and 7500 students (a likely range if there were 1640 in the 2006 freshman class): Tufts University, Suffolk University, Yale University, University of Hartford, Quinnipiac University, Brown University, and Harvard College.</p>
<p>Next, the codebook mentioned utilizing research assistants (RAs) to access and download the subjects&#8217; Facebook profile information. Specifically, the researchers noted that &#8220;that both undergraduate and graduate student RAs were employed for downloading data, and that each type of RA may have had a different level of default access based on individual students’ privacy settings.&#8221; (See my note regarding how this complicates the argument that &#8220;the subjects&#8217; Facebook profiles were already public, so there is no privacy concerns&#8221; towards <a href="http://michaelzimmer.org/2008/09/30/on-the-anonymity-of-the-facebook-dataset/" target="_blank">the bottom of this post</a>.) This leads one to believe that the RAs were from the same university as the subjects themselves, since the only relevant privacy setting for a Facebook profile would be whether users in one&#8217;s own &#8220;network&#8221; (a university, for example) can see the profile, vs. only one&#8217;s friends). It further stands to reason (for simplicity and efficiency) that the RAs employed by the researchers for this task would be from the researchers&#8217; own univeristy, Harvard.</p>
<p>Then, the fact that the researchers&#8217; institutional &#8220;Committee on the Use of Human Subjects&#8221; allowed this research without requiring explicit consent by the subjects lends me to believe thta the subjects must be affiliated with that institution. I find it unlikely an IRB would have allowed this data collection and release to be performed in such a way on students from an institution that they didn&#8217;t feel they had some kind of domain over.</p>
<p>Finally, and perhaps most convincingly, only Harvard College offers the specific variety of the subjects&#8217; majors that are listed in the codebook. While nearly all univerersities offer the common majors of &#8220;History&#8221;, &#8220;Chemistry&#8221; or &#8220;Economics&#8221;, one only needs to search for the more uniquely phrased majors to discover a shared home institution. As far as I could tell, only Harvard College (of the 7 northeast private universities we&#8217;ve already narrowed it down to) offers majors with unique titles such as these:</p>
<ul>
<li><a href="http://www.fas.harvard.edu/~nelc/" target="_blank">Near Eastern Languages and Civilizations</a></li>
<li><a href="http://www.fas.harvard.edu/~wgs/courses/courses.htm" target="_blank">Studies of Women, Gender and Sexuality</a></li>
<li><a href="http://www.espp.fas.harvard.edu/" target="_blank">Environmental Science and Public Policy</a></li>
<li><a href="http://www.oeb.harvard.edu/" target="_blank">Organismic and Evolutionary Biology</a></li>
<li><a href="http://www.fas.harvard.edu/~sanskrit/" target="_blank">Sanskrit and Indian Studies</a></li>
</ul>
<p>For these reasons, I&#8217;m confident this Facebook dataset represents the class of 2009 from Harvard College. I could be wrong, but I don&#8217;t think so.</p>
<p>Now, a couple of comments:</p>
<p>One, I didn&#8217;t even look at (and haven&#8217;t even downloaded) the actual dataset in order to make this discovery. I simply read the press release and public codebook, performed a few Google searches, and did some thinking of my own. I&#8217;m not a hacker, I&#8217;m not a statistician. This wasn&#8217;t hard, and there&#8217;s something wrong if the source of an &#8220;anonymous&#8221; dataset can be de-anonymized (at the institutional level) so easily.</p>
<p>Two, I hesitated whether to even post this revelation, as it certainly contributes to the chances that individual subject&#8217;s identities could be exposed, further eroding their privacy. There is no easy calculus to determine what to do in such a case, but I decided to post the results of my investigation as a clarion call to others who might want to release similar datasets. We must take care to ensure, to the best of our abilities, the privacy of our subjects.</p>
<p>Three, what could the researchers have done to prevent my discovery? Besides not giving away so many demographic clues as to the university, they should have coded the specific names of the majors into more generic identifiers. Even without the other &#8220;northeast, private university&#8221; clues, having the specific majors alone would have been sufficient to (eventually) figure out the identity of the school.</p>
<p>Methinks we really need to <a href="http://www.uwm.edu/Dept/SOIS/cipr/aoir_ethics.htm" target="_blank">keep working</a> on a new set of <a href="http://www.uwm.edu/Dept/SOIS/cipr/ijire/index.html" target="_blank">Internet research ethics</a> and methodologies.</p>
<hr />UPDATE: Earlier this year, Jason Kaufman (the PI on this Facebook research project) presented some of the findings of the work at the Berkman Center. <a href="http://cyber.law.harvard.edu/node/4408" target="_blank">Here is the video</a>. It provides a great insight into the motivation and results of the research (which I do find fascinating, despite my concerns about the methodology and release of the data).</p>
<p>During his talk, two more tidbits of information are provided that help me feel confident that the dataset is indeed of the class of 2009 from Harvard College.</p>
<p>First, Kaufman makes a throwaway remark (at about minute 35:10) that the dataset is from a &#8220;very idiosyncratic campus&#8230;a very elite college&#8221;. While that alone doesn&#8217;t give it away, it does help rule out some of the 7 colleges on my list above that simply aren&#8217;t considered &#8220;very elite&#8221; (sorry, Quinnipiac).</p>
<p>Second, and more telling, Kaufman notes (at about minute 32:52) that &#8220;midway through the freshman year, students have to pick between 1 and 7 best friends&#8221; that they will essentially live with for the rest of their undergraduate career. This accurately (if not precisely) describes how undergraduate housing works at Harvard. As described <a href="http://www.orl.fas.harvard.edu/icb/icb.do?keyword=k11447&amp;tabgroupid=icb.tabgroup17715" target="_blank">here</a>, all freshman who complete the fall term enter in to a lottery, where they can designate a &#8220;blocking group&#8221; of between 2 and 8 students with whom they would like be housed in close proximity.</p>
<p>So, again, the lesson learned here is how disparate pieces of seemingly benign information can be pieced together to make an otherwise presumed anonymous piece of data identifiable. I did it here by quickly analyzing the codebook, reading a press release, and watching a video presentation. The New York Times <a href="http://michaelzimmer.org/2006/08/09/aol-search-log-profiles-unmasked/" target="_blank">did it with the AOL search data release</a>, and I&#8217;m sure someone will do it with this Facebook dataset.</p>


<p>Related posts: (automatically generated)<ol><li><a href='http://michaelzimmer.org/2008/09/30/on-the-anonymity-of-the-facebook-dataset/' rel='bookmark' title='Permanent Link: On the &#8220;Anonymity&#8221; of the Facebook Dataset (Updated)'>On the &#8220;Anonymity&#8221; of the Facebook Dataset (Updated)</a></li>
<li><a href='http://michaelzimmer.org/2010/05/10/what-happens-to-your-facebook-data-when-you-leave/' rel='bookmark' title='Permanent Link: What Happens to Your Facebook Data When You Leave? (Updated)'>What Happens to Your Facebook Data When You Leave? (Updated)</a></li>
<li><a href='http://michaelzimmer.org/2008/09/19/facebook-really-wants-you-to-use-beacon/' rel='bookmark' title='Permanent Link: Facebook *Really* Wants You to Use Beacon (Updated)'>Facebook *Really* Wants You to Use Beacon (Updated)</a></li>
<li><a href='http://michaelzimmer.org/2009/06/18/draft-paper-but-the-data-is-already-public/' rel='bookmark' title='Permanent Link: Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook'>Draft Paper: “But the Data is Already Public”: On the Ethics of Research in Facebook</a></li>
<li><a href='http://michaelzimmer.org/2008/10/08/how-to-adjust-your-facebook-privacy-settings/' rel='bookmark' title='Permanent Link: How to Adjust your Facebook Privacy Settings'>How to Adjust your Facebook Privacy Settings</a></li>
</ol></p>]]></content:encoded>
			<wfw:commentRss>http://michaelzimmer.org/2008/10/03/more-on-the-anonymity-of-the-facebook-dataset-its-harvard-college/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
	</channel>
</rss>
