<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>MichaelZimmer.org &#187; Internet Research Ethics</title>
	<atom:link href="http://www.michaelzimmer.org/category/issues/internet-research-ethics/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.michaelzimmer.org</link>
	<description>information ethics // privacy // new media // values in design // 2.0</description>
	<lastBuildDate>Wed, 01 May 2013 02:47:37 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>Loyola Digital Ethics presentation: &#8220;The Ethics of Twitter Research: A Topology of Disciplines, Methods and Ethics Review Boards&#8221;</title>
		<link>http://www.michaelzimmer.org/2012/10/29/ethics-of-twitter-research-a-topology/</link>
		<comments>http://www.michaelzimmer.org/2012/10/29/ethics-of-twitter-research-a-topology/#comments</comments>
		<pubDate>Mon, 29 Oct 2012 16:17:17 +0000</pubDate>
		<dc:creator>michaelzimmer</dc:creator>
				<category><![CDATA[Events]]></category>
		<category><![CDATA[Internet Research Ethics]]></category>
		<category><![CDATA[Social Media]]></category>
		<category><![CDATA[internet research ethics]]></category>
		<category><![CDATA[Privacy]]></category>
		<category><![CDATA[Research ethics]]></category>
		<category><![CDATA[Social media]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://www.michaelzimmer.org/?p=4659</guid>
		<description><![CDATA[Today I have the great privilege of presenting the preliminary results of a research project exploring the ethics of Twitter-based research, co-authored with Nick Proferes, at the second annual International Symposium on Digital Ethics, hosted by the Center for Digital Ethics &#38; Policy at Loyola University Chicago. The abstract and slides are available below. Look [...]]]></description>
				<content:encoded><![CDATA[<p>Today I have the great privilege of presenting the preliminary results of a research project exploring the ethics of Twitter-based research, co-authored with <a href="http://www4.uwm.edu/sois/people/profiles/proferes.cfm" target="_blank">Nick Proferes</a>, at the second annual <a href="http://digitalethics.org/events/international-symposium-digital-ethics/" target="_blank">International Symposium on Digital Ethics</a>, hosted by the <a href="http://digitalethics.org" target="_blank">Center for Digital Ethics &amp; Policy</a> at Loyola University Chicago.</p>
<p>The abstract and slides are available below. Look for the full paper soon.</p>
<blockquote><p><strong>The Ethics of Twitter Research: A Topology of Disciplines, Methods and Ethics Review Boards</strong></p>
<p>In the five years since its launch, the social networking and microblogging service Twitter has quickly grown to over 300 million users, generating over 300 millions tweets each day. By providing a simple platform for users to explain “what’s happening” in 140 characters or less, Twitter has become the Internet’s de facto public forum to sharing “pretty much anything [users] wanted, be it information, relationships, entertainment, citizen journalism, and beyond” (Dybwad, 2009). This sharing of information, relationships, entertainment, journalism, and beyond has made Twitter a cultural phenomenon.</p>
<p>Beyond the utility Twitter provides its millions of users, it also has emerged as a valuable resource for tapping into the zeitgeist of the Internet and its users. There is cultural and historical value in the information that flows across Twitter’s servers, notes Dylan Casey, a Google product manager: “Tweets and other short-form updates create a history of commentary that can provide valuable insights into what’s happened and how people have reacted” (Singel, 2010). Researchers have been quick to recognize the value in studying Twitter users and activities to gain a better understand of its users, uses, and impacts on society and culture from a variety of perspectives (see, for example, http://www.danah.org/researchBibs/twitter.php). The Library of Congress recognized this importance of Twitter when it announced in 2010 that, “Every public tweet, ever, since Twitter’s inception in March 2006, will be archived digitally at the Library of Congress” (Raymond, 2010, ¶2).</p>
<p>The Library of Congress’s announcement clearly validates the research importance of Twitter, but it also prompted concerns about creating a permanent archive of tweets, and whether such a proposal was properly aligned with users’ understanding of how the platform worked and their privacy expectations. Even in broadcasting the news, the language Wired Magazine chose underscored the apparent transition from a fleeting existence for tweets to a newly instilled sense of permanence when it stated, “While the short form musings of a generation chronicled by Twitter might seem ephemeral, the Library of Congress wants to save them for posterity” (Singel, 2010).</p>
<p>In the wake of the Library of Congress announcement, increased debates over the appropriateness of archiving public Tweets for research purposes have arisen (see, for example, Vieweg, 2010; Zimmer, 2010). Particularly relevant are numerous questions regarding how academic research on Twitter has proceeded thus far, such as: What disciplines are engaging in Twitter research and what amount of scrutiny of research ethics is typical within these fields? What research questions are being investigated, what data is being gathered, and how? Are subjects notified or given the opportunity to opt-out of being studied? How are research ethics boards evaluating such projects?</p>
<p>The goal of this paper is to seek initial answers to these questions by surveying academic research that relies on the collection and use of Twitter data. The body of research articles to be surveyed includes over 200 scholarly articles, dissertations and theses from disciplines ranging from communications, political science, health sciences, economics and computer science, among others. In building this corpus, this project will create a topology of disciplinary approaches to research around Twitter, methods used to collect and analyze Twitter data, and accounts of research ethics boards’ oversight of these projects. Through this analysis, we will gain an insight into the current state of research on Twitter, providing a better understanding of the methodological and ethical challenges before us.</p>
<p>Works Cited</p>
<ul>
<li>Dybwad, B. (2009). Twitter Drops “What are You Doing?” Now Asks “What’s Happening?”. Mashable.com. Retrieved February 25, 2012, from http://mashable.com/2009/11/19/twitter-whats-happening/.</li>
<li>Raymond, M. (2010). How Tweet It Is!: Library Acquires Entire Twitter Archive. Library of Congress Blog. Retrieved February 25, 2012, from http://blogs.loc.gov/loc/2010/04/how-tweet-it-is-library-acquires-entire-twitter-archive/.</li>
<li>Singel, R. (2010). Library of Congress Archives Twitter History, While Google Searches It. Retrieved February 25, 2012, from http://www.wired.com/epicenter/2010/04/loc-google-twitter/.</li>
<li>Vieweg, S. (2010). The Ethics of Twitter Research. Paper presented at the CSCW 2010 Workshop on Revisiting Ethics in the Facebook Era.</li>
<li>Zimmer, M. (2010). Is it Ethical to Harvest Public Twitter Accounts without Consent? MichaelZimmer.org. Retrieved February 25, 2012, from http://michaelzimmer.org/2010/02/12/is-it-ethical-to-harvest-public-twitter-accounts-without-consent/.</li>
</ul>
</blockquote>
<p><iframe src="http://www.slideshare.net/slideshow/embed_code/14934824" frameborder="0" marginwidth="0" marginheight="0" scrolling="no" width="476" height="400"></iframe></p>
]]></content:encoded>
			<wfw:commentRss>http://www.michaelzimmer.org/2012/10/29/ethics-of-twitter-research-a-topology/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Thoughts on Privacy and the Use of Facebook to Recruit Research Subjects</title>
		<link>http://www.michaelzimmer.org/2012/10/19/thoughts-on-privacy-and-the-use-of-facebook-to-recruit-research-subjects/</link>
		<comments>http://www.michaelzimmer.org/2012/10/19/thoughts-on-privacy-and-the-use-of-facebook-to-recruit-research-subjects/#comments</comments>
		<pubDate>Fri, 19 Oct 2012 13:10:20 +0000</pubDate>
		<dc:creator>michaelzimmer</dc:creator>
				<category><![CDATA[Internet Research Ethics]]></category>
		<category><![CDATA[Privacy]]></category>
		<category><![CDATA[Social Media]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Research ethics]]></category>

		<guid isPermaLink="false">http://www.michaelzimmer.org/?p=4611</guid>
		<description><![CDATA[Recently, I was approached by a team of researchers concerned with the research ethics issues related to using Facebook to recruit human subjects. Specifically, the team was planning to use Facebook advertisements in order to target certain users for a research study evaluating the effectiveness of a particular educational strategy aimed at decreasing the occurrence [...]]]></description>
				<content:encoded><![CDATA[<p>Recently, I was approached by a team of researchers concerned with the <a href="http://www.michaelzimmer.org/category/issues/internet-research-ethics/" target="_blank">research ethics issues</a> related to using Facebook to recruit human subjects. Specifically, the team was planning to use <a href="http://www.facebook.com/advertising" target="_blank">Facebook advertisements</a> in order to target certain users for a research study evaluating the effectiveness of a particular educational strategy aimed at decreasing the occurrence of a particular high risk behavior. The researchers were also considering creating a <a href="https://www.facebook.com/about/pages" target="_blank">Facebook page</a> in order to manage communication with potential (and perhaps even actual) subjects in the study.</p>
<p>Here&#8217;s my initial assessment of the privacy concerns, with some information changed to keep the researchers and the project confidential.</p>
<blockquote>
<p align="center"> <span style="text-decoration: underline;">Areas of Concern</span></p>
<p>The primary concern related to researchers’ use of Facebook for recruiting and communicating with subjects (potential or real) relates to privacy and the possible collection of personally identifiable or other sensitive information.</p>
<p>The use of Facebook Ads results in limited privacy concerns in relation to what the researchers might learn about the subjects. While the research team can target a specific set of users – for example, 18-to-35-year-old women who live in Portland, OR – and would know that anyone visiting their website from that ad fit this general demographic profile, no information about individual Facebook users is shared with the research team. Thus, users would be able to view and click the advertisement, and then visit the research website, without fear of their identity being made known.<a title="" href="#_ftn1">[1]</a> Further, the act of clicking on ads does not appear on user’s timelines, so friends of that user would not be aware of the user’s action related to the advertisement.</p>
<p>A greater privacy concern related to advertisements relates to what Facebook itself collects. If an advertisement includes text related to drug use, for example, and a users clicks on that ad, Facebook retains a record of that action, and could, potentially, use this action to further develop a profile of the user (beyond what the user voluntarily shares on the platform).<a title="" href="#_ftn2">[2]</a> Facebook’s potential ability to infer information about the user based on the ads clicked is limited to the information provided in the ad itself, and potentially the URL linked to by the ad.</p>
<p>The use of Facebook Pages introduces greater privacy and confidentiality concerns. If a research team wishes to communicate to interested users or participants through the Page, users will need to “like” the page. This positive action of “liking” a page would appear in the user’s News Feed and be publicly viewable by the users friends. For example, if a user “likes” a page related to a drug use study, her friends would be informed of this action, and the page would be listed among that user’s likes on their profile page. Users can also see a list of friends who have “liked” a page by visiting that page at any time. Users do have the ability of controlling the visibility of their “likes” overall, but not of individual pages.</p>
<p>As with advertisements, user activities on Pages are tracked and potentially used by Facebook to compliment other behavioral and demographic data maintained by the company.</p>
<p>In terms of the [project], these concerns can be summarized as follows:</p>
<ul>
<li>Facebook possesses the ability to log user actions related to advertisements, thus any sensitive or controversial content within an ad’s text (such as mention of drug use) could potentially be linked to a user’s account within Facebook’s systems, and Facebook could attempt to infer something about the user based on that action to augment its internal profile of that user. This also applies to user actions on a Facebook Page.</li>
<li>Facebook possesses the ability to log the URL of external websites users click to from a link embedded in ads or pages. Thus, if a URL contains sensitive or controversial information (for example, http://university.edu/DrugUseStudy), it could potentially be linked to a user’s account within Facebook’s systems, and Facebook could attempt to infer something about the user based on the URL to augment its internal profile of that user.</li>
<li>User actions (likes, comments, etc) on Facebook pages created for research projects are typically visible on that user’s timeline and profile page, and also viewable by friends who happen to visit the project page. Thus, any sensitive or controversial information on the Page, including its name, could become associated with that user.</li>
</ul>
<p align="center"><span style="text-decoration: underline;">Preliminary Recommendations</span></p>
<p>The primary recommendation is to minimize the ability for Facebook or a user’s friends from associating interactions with project-related advertisements or pages with any sensitive or controversial information. Thus, the following actions can be taken:</p>
<ul>
<li>Advertisements should avoid any sensitive or controversial text that could be associated with a user who decides to take action. For example, instead of mentioning a study on “illegal drug use”, the ad can indicate “recreational activities” or similar. By using more generic text, any association made to the user’s account will have minimal harm.</li>
<li>URLs used in advertisements or pages should also avoid sensitive or controversial text. For example, http://university.edu/RecreationalActivityStudy should be used rather than http://university.edu/DrugUseStudy. To best protect users, the linked pages should also minimize mention of sensitive or controversial text, ideally providing more general information, and then pointing users to other pages that provide more detail.</li>
<li>Facebook Pages should also avoid sensitive or controversial text in both their name and the content provided within.</li>
<li>While Facebook Pages are a practical tool for communicating with subjects, users might avoid “liking” a Page due to privacy concerns. Thus, other means of communication must be implemented.</li>
</ul>
<hr align="left" size="1" width="33%" />
<p><a title="" href="#_ftnref1">[1]</a> The research website likely maintains its own server logs, so a user’s IP address would potentially be collected. This is not related to the use of Facebook, and applies to any visitor to the website under typical circumstance.</p>
<p><a title="" href="#_ftnref2">[2]</a> Presently, this is mentioned only as a potential concern; additional verification is needed to confirm if Facebook indeed uses “clicked ads” as a means to refine customer profiles.</p></blockquote>
<p>What do readers think of this preliminary analysis and set of recommendations?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.michaelzimmer.org/2012/10/19/thoughts-on-privacy-and-the-use-of-facebook-to-recruit-research-subjects/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Virginia IRB Consortium Presentation: &#8220;Research, the Cloud, and the IRB&#8221;</title>
		<link>http://www.michaelzimmer.org/2012/10/10/virginia-irb-research-the-cloud-and-the-irb/</link>
		<comments>http://www.michaelzimmer.org/2012/10/10/virginia-irb-research-the-cloud-and-the-irb/#comments</comments>
		<pubDate>Wed, 10 Oct 2012 13:13:05 +0000</pubDate>
		<dc:creator>michaelzimmer</dc:creator>
				<category><![CDATA[Academic]]></category>
		<category><![CDATA[Conferences]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Internet Research Ethics]]></category>
		<category><![CDATA[Cloud computing]]></category>
		<category><![CDATA[irb]]></category>
		<category><![CDATA[Research ethics]]></category>

		<guid isPermaLink="false">http://www.michaelzimmer.org/?p=4568</guid>
		<description><![CDATA[On Friday October 12, 2012, I will be delivering the keynote address at the Virginia IRB Consortium Conference at the University of Virginia. My talk is &#8220;Research, the Cloud, and the IRB&#8221; (slides are below). While the presentation shares some of the same DNA from my recent presentation at the University of Pittsburgh, this talk [...]]]></description>
				<content:encoded><![CDATA[<p>On Friday October 12, 2012, I will be delivering the keynote address at the <a href="http://www.virginia.edu/vpr/irb/sbs/events_conference.html" target="_blank">Virginia IRB Consortium Conference</a> at the University of Virginia. My talk is &#8220;Research, the Cloud, and the IRB&#8221; (slides are below). While the presentation shares some of the <a href="http://www.michaelzimmer.org/category/issues/internet-research-ethics/" target="_blank">same DNA</a> from my <a href="http://www.michaelzimmer.org/2012/10/02/research-ethics-in-the-2-0-era-new-challenges-for-researchers-and-irbs/" target="_blank">recent presentation at the University of Pittsburgh</a>, this talk will have less focus on particular cases of Internet-based research, and instead will highlight ways that cloud computing is used for research (at application, platform, and infrastructural levels), and the ethical dilemmas that begin to arise given the increased reliance on these systems.</p>
<p>(And like in Pittsburgh, I will begin my talk by invoking <a href="http://www.youtube.com/watch?v=49rcVQ1vFAY" target="_blank">this video by Neil Postman</a>, which is what convinced me to get my PhD at NYU.)<br />
<iframe style="border: 1px solid #CCC; border-width: 1px 1px 0; margin-bottom: 5px;" src="http://www.slideshare.net/slideshow/embed_code/14662115" frameborder="0" marginwidth="0" marginheight="0" scrolling="no" width="427" height="356"></iframe></p>
<div style="margin-bottom: 5px;"><strong> <a title="Research, the Cloud, and the IRB" href="http://www.slideshare.net/michaelzimmer/research-the-cloud-and-the-irb" target="_blank">Research, the Cloud, and the IRB</a> </strong> from <strong><a href="http://www.slideshare.net/michaelzimmer" target="_blank">Michael Zimmer</a></strong></div>
]]></content:encoded>
			<wfw:commentRss>http://www.michaelzimmer.org/2012/10/10/virginia-irb-research-the-cloud-and-the-irb/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Pitt IRB Presentation: &#8220;Research Ethics in the 2.0 Era: New Challenges for Researchers and IRBs&#8221;</title>
		<link>http://www.michaelzimmer.org/2012/10/02/research-ethics-in-the-2-0-era-new-challenges-for-researchers-and-irbs/</link>
		<comments>http://www.michaelzimmer.org/2012/10/02/research-ethics-in-the-2-0-era-new-challenges-for-researchers-and-irbs/#comments</comments>
		<pubDate>Tue, 02 Oct 2012 12:55:49 +0000</pubDate>
		<dc:creator>michaelzimmer</dc:creator>
				<category><![CDATA[Academic]]></category>
		<category><![CDATA[Conferences]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Internet Research Ethics]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[irb]]></category>
		<category><![CDATA[Research ethics]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://www.michaelzimmer.org/?p=4560</guid>
		<description><![CDATA[On Friday October 5, 2012, I will have the great pleasure of presenting my work on &#8220;Research Ethics in the 2.0 Era: New Challenges for Researchers and IRBs&#8221; for the University of Pittsburgh Institutional Review Board Educational Series. I&#8217;ve discussed these issues in smaller disciplinary settings, as well as much larger national meetings of IRBs, [...]]]></description>
				<content:encoded><![CDATA[<p>On Friday October 5, 2012, I will have the great pleasure of presenting my work on &#8220;Research Ethics in the 2.0 Era: New Challenges for Researchers and IRBs&#8221; for the <a href="http://www.irb.pitt.edu/event/Research-Ethics-in-the-2-0-Era" target="_blank">University of Pittsburgh Institutional Review Board Educational Series</a>.</p>
<p>I&#8217;ve discussed these <a href="http://www.michaelzimmer.org/category/issues/internet-research-ethics/" target="_blank">issues</a> in smaller <a href="http://www.michaelzimmer.org/2012/05/26/ica-2012-researching-social-media-ethical-and-methodological-challenges/" target="_blank">disciplinary settings</a>, as well as much larger <a href="http://www.michaelzimmer.org/2011/12/01/presentations-at-primr-2011-adancing-ethical-research/" target="_blank">national meetings of IRBs</a>, but this will be my first time having detailed discussions with a university&#8217;s particular IRB community.</p>
<p>I&#8217;m really looking forward to it.</p>
<p><iframe style="border: 1px solid #CCC; border-width: 1px 1px 0; margin-bottom: 5px;" src="http://www.slideshare.net/slideshow/embed_code/14555878" frameborder="0" marginwidth="0" marginheight="0" scrolling="no" width="427" height="356"></iframe></p>
<div style="margin-bottom: 5px;"><strong> <a title="Research Ethics in the 2.0 Era: New Challenges for Researchers and IRBs" href="http://www.slideshare.net/michaelzimmer/research-ethics-in-the-20-era-new-challenges-for-researchers-and-irbs" target="_blank">Research Ethics in the 2.0 Era: New Challenges for Researchers and IRBs</a> </strong> from <strong><a href="http://www.slideshare.net/michaelzimmer" target="_blank">Michael Zimmer</a></strong></div>
]]></content:encoded>
			<wfw:commentRss>http://www.michaelzimmer.org/2012/10/02/research-ethics-in-the-2-0-era-new-challenges-for-researchers-and-irbs/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>New Entry in the Stanford Encyclopedia of Philosophy: &#8220;Internet Research Ethics&#8221;</title>
		<link>http://www.michaelzimmer.org/2012/06/22/new-entry-in-the-stanford-encyclopedia-of-philosophy-internet-research-ethics/</link>
		<comments>http://www.michaelzimmer.org/2012/06/22/new-entry-in-the-stanford-encyclopedia-of-philosophy-internet-research-ethics/#comments</comments>
		<pubDate>Fri, 22 Jun 2012 12:45:55 +0000</pubDate>
		<dc:creator>michaelzimmer</dc:creator>
				<category><![CDATA[Internet Research Ethics]]></category>
		<category><![CDATA[Publications]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[Research ethics]]></category>

		<guid isPermaLink="false">http://michaelzimmer.org/test/?p=3192</guid>
		<description><![CDATA[I'm thrilled to announce that the Stanford Encyclopedia of Philosophy has published a new entry on "Internet Research Ethics", written by Elizabeth Buchanan and myself. I'm confident the existence of this entry in such a popular and prestigious publication will help increase awareness of these important issues.]]></description>
				<content:encoded><![CDATA[<p style="text-align: left;"><a href="http://michaelzimmer.org/wp-content/uploads/2012/06/informed-consent.png"><img class="alignright  wp-image-3626" style="border: 1px solid black;" title="Stanford Encylopedia of Philosophy" src="http://michaelzimmer.org/wp-content/uploads/2012/06/informed-consent.png" alt="" /></a>I&#8217;m thrilled to announce that the <a href="http://plato.stanford.edu/about.html" target="_blank"><em>Stanford Encyclopedia of Philosophy</em></a> has published a new entry on &#8220;<a href="http://plato.stanford.edu/entries/ethics-internet-research/" target="_blank">Internet Research Ethics</a>&#8220;, written by <a href="http://www.uwstout.edu/faculty/buchanane/" target="_blank">Elizabeth Buchanan</a> and myself. The introduction and table of contents are below. I&#8217;m confident the existence of this entry in such a popular and prestigious publication will help increase awareness of <a href="http://michaelzimmer.org/category/ethics/research-ethics/" target="_blank">these important issues</a>.</p>
<blockquote><p><strong><a href="http://plato.stanford.edu/entries/ethics-internet-research/" target="_blank">Internet Research Ethics</a></strong><br />
<em>First published Fri Jun 22, 2012</em></p>
<p>There is little research that is not impacted in some way on or through the Internet. The Internet, as a field, a tool, and a venue, has specific and far reaching ethical issues. Internet research ethics is a subdiscipline that fits across many disciplines, ranging from social sciences, arts and humanities, medical/biomedical, and hard sciences. Extant ethical frameworks, including <a href="http://plato.stanford.edu/entries/consequentialism/">consequentialism</a>, <a href="http://plato.stanford.edu/entries/utilitarianism-history/">utilitarianism</a>, <a href="http://plato.stanford.edu/entries/ethics-deontological/">deontology</a>, <a href="http://plato.stanford.edu/entries/ethics-virtue/">virtue ethics</a>, and <a href="http://plato.stanford.edu/entries/feminism-ethics/">feminist ethics</a> have contributed to the ways in which ethical issues in Internet research are considered and evaluated.</p>
<p>Conceptually and historically, Internet research ethics is related to computer and information ethics and includes such ethical issues as data privacy and confidentiality, integrity of data, intellectual property issues, and professional standards. Throughout the Internet&#8217;s evolution, there has been debate whether there are new ethical dilemmas emerging, or if the existing dilemmas are consistent across research or despite technological influence (Elgesem 2002; Walther 2002; Ess &amp; AoIR 2002). These debates are similar to philosophical debates in computer and information ethics. For example, many years ago, Moor asked “what is special about computers” in order to understand what is ethically unique and the same question applies to Internet research (Moor 1985; Ess &amp; AoIR 2002; King 1996).</p>
<p>Yet, as the Internet has evolved into a more social and communicative tool and venue, the ethical issues have shifted from purely data driven to more human-centered. “On-ground” or face-to face analogies may not be applicable to online research. For example, the concept of the public park has been used as a site where researchers can observe others, but online, the concepts of public and private are much more complex. Thus, some scholars suggest that the specificity of Internet research ethics calls for new regulatory and/or professional and disciplinary guidance. For these reasons, the concept of human subjects research policy and regulation informs this entry, along with disciplinary standards, which will explore the growing areas of ethical and methodological complexity, including personal identifiability, reputational risk and harm, notions of public space and public text, ownership, and longevity of data as they relate to Internet research. Specifically, the emergence of the social web raises issues around subject or participant recruitment practices, tiered informed consent models, and protection of various expectations and forms of privacy in an ever-increasing world of diffused and ubiquitous technologies; anonymity and confidentiality of data in spaces where researchers and their subjects may not fully understand the terms and conditions of those venues or tools; challenges to data integrity as research projects can be outsourced to a mechanical turk or a bot; and jurisdictional issues as more research is processed, stored, and disseminated via cloud computing or in remote server locales, presenting myriad legal complexities given jurisdictional differences in data laws.</p>
<p>As a result, researchers using the Internet as a tool or a space of research—and their research ethics boards (REBs), also known as institutional review boards (IRBs) in the United States or human research ethics committees (HRECs) in other countries such as Australia—have been confronted with a series of new ethical enquiries: What ethical obligations do researchers have to protect the privacy of subjects engaging in activities in “public” Internet spaces? How is confidentiality or anonymity assured online? How is and should informed consent be obtained online? How should research on minors be conducted, and how do you prove a subject is not a minor? Is deception (pretending to be someone you are not, withholding identifiable information, etc) online a norm or a harm? How is “harm” possible to someone existing in an online space?</p>
<p>A growing number of scholars have explored these and related questions (see, for example, Bromseth 2002; Bruckman 2006; Buchanan 2004; Buchanan &amp; Ess 2008; Gullor &amp; Ess 2003; Johns, Chen &amp; Hall 2003; Kitchin 2003, 2008; King 1996; Mann 2003; Markham &amp; Baym 2008; McKee &amp; Porter 2009; Thorseth 2003), scholarly associations have drafted ethical guidelines for Internet research (Ess &amp; Association of Internet Researchers 2002; Kraut et al. 2004), and non-profit scholarly and scientific agencies such as AAAS (Frankel &amp; Siang 1999) have begun to confront the myriad of ethical concerns that Internet research poses to researchers and research ethics boards (REBs).</p>
<ul>
<li><a href="http://plato.stanford.edu/entries/ethics-internet-research/#Def">1. Definitions</a></li>
<li><a href="http://plato.stanford.edu/entries/ethics-internet-research/#HumSubRes">2. Human Subjects Research</a></li>
<li><a href="http://plato.stanford.edu/entries/ethics-internet-research/#HisDevIREDis">3. History and Development of IRE as a Discipline</a></li>
<li><a href="http://plato.stanford.edu/entries/ethics-internet-research/#SpeEthIssIntRes">4. Specific Ethical Issues in Internet Research</a>
<ul>
<li><a href="http://plato.stanford.edu/entries/ethics-internet-research/#Pri">4.1 Privacy</a></li>
<li><a href="http://plato.stanford.edu/entries/ethics-internet-research/#Rec">4.2 Recruitment</a></li>
<li><a href="http://plato.stanford.edu/entries/ethics-internet-research/#InfCon">4.3 Informed Consent</a></li>
<li><a href="http://plato.stanford.edu/entries/ethics-internet-research/#CloComResEth">4.4 Cloud Computing and Research Ethics</a></li>
</ul>
</li>
<li><a href="http://plato.stanford.edu/entries/ethics-internet-research/#ResEthBoaGui">5. Research Ethics Boards Guidelines</a></li>
<li><a href="http://plato.stanford.edu/entries/ethics-internet-research/#Bib">Bibliography</a></li>
<li><a href="http://plato.stanford.edu/entries/ethics-internet-research/#Aca">Academic Tools</a></li>
<li><a href="http://plato.stanford.edu/entries/ethics-internet-research/#Oth">Other Internet Resources</a></li>
<li><a href="http://plato.stanford.edu/entries/ethics-internet-research/#Rel">Related Entries</a></li>
</ul>
<hr />
</blockquote>
]]></content:encoded>
			<wfw:commentRss>http://www.michaelzimmer.org/2012/06/22/new-entry-in-the-stanford-encyclopedia-of-philosophy-internet-research-ethics/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>ICA 2012: Researching Social Media: Ethical and Methodological Challenges</title>
		<link>http://www.michaelzimmer.org/2012/05/26/ica-2012-researching-social-media-ethical-and-methodological-challenges/</link>
		<comments>http://www.michaelzimmer.org/2012/05/26/ica-2012-researching-social-media-ethical-and-methodological-challenges/#comments</comments>
		<pubDate>Sat, 26 May 2012 19:27:07 +0000</pubDate>
		<dc:creator>michaelzimmer</dc:creator>
				<category><![CDATA[Events]]></category>
		<category><![CDATA[Internet Research Ethics]]></category>
		<category><![CDATA[Social Media]]></category>
		<category><![CDATA[Conferences]]></category>
		<category><![CDATA[ica12]]></category>
		<category><![CDATA[Online Privacy]]></category>
		<category><![CDATA[Research ethics]]></category>

		<guid isPermaLink="false">http://michaelzimmer.org/?p=3148</guid>
		<description><![CDATA[I&#8217;m currently in Phoenix, AZ for the annual meeting of the International Communication Association, participating on an important panel on &#8220;Researching Social Media: Ethical and Methodological Challenges&#8220;, organized by Anders Olof Larsson (Uppsala) and Hallvard Moe (Bergen). The panel is listed under the Communication and Technology division of ICA, but has implications well beyond that [...]]]></description>
				<content:encoded><![CDATA[<p>I&#8217;m currently in Phoenix, AZ for the annual meeting of the <a href="http://www.icahdq.org/" target="_blank">International Communication Association</a>, participating on an important panel on &#8220;<strong>Researching Social Media: Ethical and Methodological Challenges</strong>&#8220;, organized by <a href="http://www.andersoloflarsson.se/" target="_blank">Anders Olof Larsson</a> (Uppsala) and <a href="http://hm.uib.no/" target="_blank">Hallvard Moe</a> (Bergen). The panel is listed under the <a href="http://www.icahdq.org/about_ica/secdetinfo.asp?SecCode=DIV10" target="_blank">Communication and Technology</a> division of ICA, but has implications well beyond that SIG:</p>
<blockquote><p>One of the recent key trends in communication and technology research concerns the use so-called “big data” to study social media use. Whether its high profile projects on students’ Facebook use, comprehensive mappings of the blogosphere, or studies of Twitter use during crisis, the potential for collecting and processing large amounts of new forms of data has brought novel approaches and important insights. However, these new forms of data also require us to critically engage with some of the more delicate issues regarding how research on social media use is undertaken. The presentations will deal with how ethical challenges in these contexts can take on different forms, regarding how we as researchers deal with respondents and data providers, as well as how we communicate our ideas to institutional review boards. Bringing together scholars from Australia, Germany, Scandinavia and the US, the panel is well- suited to generate discussion among division members, as well as attendees beyond the CAT division.</p></blockquote>
<p>My contribution is &#8220;<strong>New Media, New Ethics: How Social Media-Based Research Demand New Attention to Research Ethics</strong>&#8220;, where I discuss many of the  growing conceptual gaps that are emerging in relation to Internet research ethics:</p>
<blockquote><p>Social media tools have opened up vast new means for communication, socialization, expression, and collaboration. They also have provided new avenues for researchers seeking to explore, observe, and measure human opinions, activities and interactions. While scholars, professional societies, and institutional review boards have long-established research ethics frameworks to ensure the rights and welfare of the research subjects are protected, the rapid rise of powerful social media platforms – where individuals increasingly share personal information on platforms with porous and shifting boundaries – provide new challenges to long-held ethical assumptions and guidelines. This talk will present various cases of social media-based research that expose new conceptual gaps in how we think about privacy, anonymity, consent, and harm in the 2.0 era.</p></blockquote>
<p>My slides are available below, and here are some of the other research and resources I cite in the presentation:</p>
<ul>
<li><a href="http://internetresearchethics.org/" target="_blank">Internet Research Ethics Digital Library, Resource Center and Commons</a> website</li>
<li><a href="http://aoir.org/documents/ethics-guide/" target="_blank">Ethical decision-making and Internet research: Recommendations from the AoIR ethics working committee</a></li>
<li>Buchanan, E., &amp; Ess, C. (2009). Internet research ethics and the institutional review board: current practices and issues. ACM SIGCAS <em>Computers and Society, 39</em>(3), 43-49.</li>
<li>Carpenter, K &amp; Dittrich, D. “Bridging the Distance: Removing the Technology Buffer and Seeking Consistent Ethical Analysis in Computer Security Research” 1st International Digital Ethics Symposium, Loyola University Chicago Center for Digital Ethics and Policy.</li>
<li>Moor, J. (1985). What is computer ethics? <em>Metaphilosophy, 16</em>, 266-275.</li>
<li>Nissenbaum, H. (2009). <em>Privacy in context: Technology, policy, and the integrity of social life</em>. Stanford, CA: Stanford University Press.</li>
<li>Ohm, P. (2009). Broken promises of privacy: Responding to the surprising failure of anonymization. <em>UCLA Law Review, 57</em>, 1701.</li>
<li>Soghoian, C. (2012). Enforced community standards for research on users of the Tor anonymity network. <em>Financial Cryptography and Data Security</em>, 7126, 146-153.</li>
<li>Zimmer, M. (2010). “But the data is already public”: On the ethics of research in Facebook. <em>Ethics and Information Technology, 12(</em>4), 313-325.</li>
</ul>
<p>&nbsp;</p>
<div id="__ss_13087806" style="width: 425px;"><strong style="display: block; margin: 12px 0 4px;"><a title="New Media, New Ethics - ICA 2012" href="http://www.slideshare.net/michaelzimmer/new-media-new-ethics-ica-2012">New Media, New Ethics &#8211; ICA 2012</a></strong><object id="__sse13087806" width="425" height="355" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="wmode" value="transparent" /><param name="src" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=zimmericaslides-120526140744-phpapp01&amp;stripped_title=new-media-new-ethics-ica-2012&amp;userName=michaelzimmer" /><param name="allowscriptaccess" value="always" /><param name="allowfullscreen" value="true" /><embed id="__sse13087806" width="425" height="355" type="application/x-shockwave-flash" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=zimmericaslides-120526140744-phpapp01&amp;stripped_title=new-media-new-ethics-ica-2012&amp;userName=michaelzimmer" allowFullScreen="true" allowScriptAccess="always" wmode="transparent" allowscriptaccess="always" allowfullscreen="true" /></object></p>
<div style="padding: 5px 0 12px;">View more <a href="http://www.slideshare.net/">presentations</a> from <a href="http://www.slideshare.net/michaelzimmer">Michael Zimmer</a>.</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.michaelzimmer.org/2012/05/26/ica-2012-researching-social-media-ethical-and-methodological-challenges/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Research Ethics and the Blackberry Project</title>
		<link>http://www.michaelzimmer.org/2012/04/25/research-ethics-and-the-blackberry-project/</link>
		<comments>http://www.michaelzimmer.org/2012/04/25/research-ethics-and-the-blackberry-project/#comments</comments>
		<pubDate>Wed, 25 Apr 2012 12:30:01 +0000</pubDate>
		<dc:creator>michaelzimmer</dc:creator>
				<category><![CDATA[Internet Research Ethics]]></category>
		<category><![CDATA[Mobility]]></category>
		<category><![CDATA[blackberry]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[Research ethics]]></category>
		<category><![CDATA[Youth privacy]]></category>

		<guid isPermaLink="false">http://michaelzimmer.org/?p=3095</guid>
		<description><![CDATA[Forbes privacy columnist Kashmir Hill recently published a profile of University of Texas-Dallas developmental psychology professor Marion Underwood's large-scale research project titled “The Blackberry Project.”

The Blackberry Project is an ongoing longitudinal study examining teen behavior and sociability, which first recruited its subjects in 2003. Then, in 2009, the subjects (now entering 8th grade) were provided with BlackBerry devices with unlimited text and data plans paid for by the investigators. The devices were configured so that the content of all text messages, e-mail messages, and instant messages was saved to a secure server to be mined by the researchers -- over 500,000 messages a month are being archived.

While the Blackberry Project appears to have been managed properly through the IRB rules and regulations, it highlights emerging ethical concerns with projects of this nature, including issues of consent, undue influence, and privacy &#038; anonymity.]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.michaelzimmer.org/wp-content/uploads/2012/04/Screen-Shot-2012-09-25-at-9.47.39-PM.png"><img class="alignright size-medium wp-image-4506" title="Forbes Blackberry Project" src="http://www.michaelzimmer.org/wp-content/uploads/2012/04/Screen-Shot-2012-09-25-at-9.47.39-PM-300x95.png" alt="" width="300" height="95" /></a>Forbes privacy columnist <a href="http://blogs.forbes.com/kashmirhill/">Kashmir Hill</a> recently <a href="http://www.forbes.com/sites/kashmirhill/2012/04/18/a-texas-universitys-mind-boggling-database-of-teens-daily-text-messages-emails-and-ims-over-four-years/" target="_blank">published a profile</a> of University of Texas-Dallas developmental psychology professor <a href="http://bbs.utdallas.edu/people/detail.php5?i=591" target="_blank">Marion Underwood</a>&#8216;s large-scale research project titled “The Blackberry Project.”</p>
<p><a href="http://bbs.utdallas.edu/blackberry/indexBlue.html" target="_blank">The Blackberry Project</a> (formerly known as the <a href="http://www.utdallas.edu/~undrwd/" target="_blank">Friendship Project</a>) is an ongoing longitudinal study examining teen behavior and sociability, which first recruited its subjects in 2003 (starting with 281 third and fourth graders from 13 Dallas public schools) and relied on yearly laboratory and home observation and surveys for data collection. Then, in 2009, the subjects (now entering 8th grade) were provided with BlackBerry devices with unlimited text and data plans paid for by the investigators. The devices were configured so that the content of all text messages, e-mail messages, and instant messages was saved to a secure server to be mined by the researchers &#8212; over 500,000 messages a month are being archived. Preliminary analyses have been published in <a href="http://psycnet.apa.org/journals/dev/48/2/295/" target="_blank"><em>Developmental Psychology</em></a>.</p>
<p>The result? Hill puts it best in her headline and opening thoughts:</p>
<blockquote><p><a href="http://www.forbes.com/sites/kashmirhill/2012/04/18/a-texas-universitys-mind-boggling-database-of-teens-daily-text-messages-emails-and-ims-over-four-years/" target="_blank">A Texas University&#8217;s Mind-Boggling Database Of Teens&#8217; Daily Text Messages, Emails, and IMs Over Four Years</a></p>
<p>For the past four years, the University of Texas-Dallas developmental psychology professor has essentially wire-tapped 175 Texas teens,  capturing every text message, email, photo, and IM sent on Blackberries that she provided to them, creating a rich database that now contains millions of funny, explicit, sexual, and inane messages for academic study. Half a million new messages pour into the database every month. This summer, she’s adding Facebook content to the mix as well. The teens sacrificed their privacy for science… and a free smartphone, data plan and unlimited text messaging.</p></blockquote>
<p>Dr. Underwood&#8217;s study has been approved by UT-Dallas&#8217;s <a href="http://www.utdallas.edu/research/compliance/irb/index.html" target="_blank">Institutional Review Board</a>, and she&#8217;s also received a <a href="http://grants.nih.gov/grants/policy/coc/" target="_blank">Certificate of Confidentiality</a> from the NIH, which are only granted after considerable scrutiny. Each participant is given a unique identification number so that all information that is collected is, according to the <a href="http://bbs.utdallas.edu/blackberry/indexRed.html" target="_blank">project website</a>, &#8220;de-personalized&#8221;. The research data is stored securely with the help of <a href="http://ceryx.com/" target="_blank">Ceryx</a> and <a href="http://www.globalrelay.com/" target="_blank">Global Relay</a>, data security providers who typically work together to store and archive electronic communication data for financial institutions. The archive is password protected and can only be accessed by a small group of selected researchers.</p>
<p>In short, this large-scale and long-term project has undergone considerable review, and appears to be taking privacy and security quite seriously. That said, there remain certain <a href="http://michaelzimmer.org/category/ethics/research-ethics/" target="_blank">ethical concerns about the research</a> worth discussing.<span id="more-3095"></span></p>
<p>(Note: my discussion is based on what I can glean from available reports and documents about the study; I&#8217;m trying to gather additional information through various channels.)</p>
<p><strong>Consent</strong></p>
<p>Since the Blackberry Project (and its predecessor) focus on studying the activity of minors, gaining <a href="http://www.utdallas.edu/research/compliance/irb/faq.html#8" target="_blank">informed consent</a> is of particular importance. Participants and parents were required to sign detailed consent forms annual that clearly stated that all electronic communication were be recorded and monitored. (While the consent forms for the earlier Friendship Project are <a href="http://www.utdallas.edu/~undrwd/forms/" target="_blank">available online</a>, I haven&#8217;t been able to locate the consent documents for the Blackberry Project. I&#8217;ll request them from Dr. Underwood.) It appears this consent process was repeated annually, which is particularly important as subjects grow and develop, and the content of their text and email messages might change over time (for example, 10th graders might start texting about dangerous or legal activity, which might not have been contemplated when original consent was provided years earlier).</p>
<p><a href="http://answers.hhs.gov/ohrp/questions/7198" target="_blank">Parental consent</a> for minor subjects is standard procedure. However, I wonder how well a parent actually <a href="http://pewresearch.org/pubs/1572/teens-cell-phones-text-messages" target="_blank">understands the extent</a> to which adolescents make use of mobile texting, and whether a parent really is equipped to represent (and waive) the privacy interests of their adolescent kids if they fail to recognize both the scale and types of information contained within those text messages. Is parental consent really sufficient when we&#8217;re dealing with teenager&#8217;s use of social media and personal technology? This is something I&#8217;ll need to think about more&#8230;.</p>
<p>Further, any consent granted only involves the participants themselves and their outgoing messages. But those sending messages <em>to</em> the participants have not consented to having their messages stored and subjected to analysis. Underwood recognizes this problem, but <a href="http://psycnet.apa.org/journals/dev/48/2/295/" target="_blank">argues</a> it away:</p>
<blockquote><p>Pioneering researchers studying online communication have argued that electronic communication can be observed without permission in some contexts because the information need not be uniquely identifiable, unless individuals have chosen to make their online user name their actual name (see Subrahmanyam et al., 2006; Whitlock, Powers, &amp; Eckenrode, 2006). In our study, although we did have access to participants’ phone contacts and could see how they labeled individuals there, these were rarely uniquely identifiable, because most adolescents chose to label contacts with first names only or with nicknames.</p></blockquote>
<p>However, I find this argument a bit thin. Just because some &#8220;pioneering researchers&#8221; claim it is acceptable to study online messages observed without permission &#8220;in some contexts&#8221; doesn&#8217;t make it necessarily ethical here. Hopefully the IRB pressed hard on this issue.</p>
<p><strong>Undue Influence</strong></p>
<p>Consent is only valid if it doesn&#8217;t involve <a href="http://answers.hhs.gov/ohrp/questions/7250" target="_blank">coercion or undue influence</a>. While paying research subjects is commonplace and generally acceptable, the fact that subjects in the Blackberry Project received a free smartphone with fully paid data and texting plans (and a generous 300 minute voice plan) might quality as undue influence. The Office of Human Research Protections <a href="http://answers.hhs.gov/ohrp/questions/7250" target="_blank">defines</a> undue influence when researchers offer an &#8220;excessive or inappropriate reward or other overture in order to obtain compliance.&#8221; OHRP <a href="http://answers.hhs.gov/ohrp/questions/7251" target="_blank">also notes</a> that &#8220;The level of remuneration should not be so high as to cause a prospective subject to accept risks that he or she would not accept in the absence of the remuneration.&#8221;</p>
<p>This is where the free Blackberries and service plans might be problematic. Since 11% of the participating families had incomes under $25,000, and 29% under $50,000, the allure of a free, &#8220;<a href="http://psycnet.apa.org/journals/dev/48/2/295/" target="_blank">highly attractive</a>&#8221; smartphone, complete with a free and unlimited data plan, might have persuaded some lower-income families to participate who otherwise might have considered the project too risky. If you&#8217;re on a tight budget, and your kids keep pestering you for a smartphone, the Blackberry Project might have been a lifesaver, regardless of the risks.</p>
<p>Determining undue influence is a grey area, and, again, I hope that UT-Dallas&#8217;s IRB considered this matter with vigor.</p>
<p><strong>Privacy and Anonymity<br />
</strong></p>
<p>Underwood has taken great lengths to protect subject privacy, including the use of secure, off-campus data storage platforms and replacing account names with ID numbers within the archive. Yet, considerable privacy concerns remain. There are <a href="http://michaelzimmer.org/2006/08/09/aol-search-log-profiles-unmasked/" target="_blank">plenty</a> of <a href="http://michaelzimmer.org/2008/09/30/on-the-anonymity-of-the-facebook-dataset/" target="_blank">cases</a> where simply replacing names with ID numbers fails to provide sufficient anonymity, and the content of the messages themselves might reveal various personal details of the participants and their friends. The researchers indicate they use the participants address books to help &#8220;replace phone numbers with whatever the participants used to label their contacts&#8221; when compiling transcripts. While some of these labels might be un-identifiable, others might effectively &#8220;out&#8221; particular people within the dataset.</p>
<p>The <a href="http://www.forbes.com/sites/kashmirhill/2012/04/18/a-texas-universitys-mind-boggling-database-of-teens-daily-text-messages-emails-and-ims-over-four-years/" target="_blank">Forbes article</a> also notes:</p>
<blockquote><p>Underwood has gotten calls from investigators around the country who would love access to her database, but she says she doesn’t want to hand over the data unless she can de-identify it or anonymize it. I’m imagining many a privacy scholar shaking his or her head in dismay given how difficult true anonymization is.</p></blockquote>
<p>Indeed. I&#8217;m curious to know what steps toward deidentification or anonymization Underwood intends before sharing the data.</p>
<p>The Forbes piece presses Underwood further about the issue of privacy:</p>
<blockquote><p>When I asked Underwood if any of the kids (or their parents) had ever expressed concern about the privacy of their communications, and the discomfort they might feel about every single thing they send being archived indefinitely for study, she said it had been a “non-issue.”</p>
<p>“We haven’t really directly asked about it. We don’t do anything to draw attention to our monitoring,” says Underwood. She prefers that teenagers act naturally. Asking them too strongly about how they feel about their privacy might negatively affect the “observing them in the wild” aspect of her study.</p></blockquote>
<p>This troubles me. Here, a researcher collecting millions of personal messages sent between teens admits to not wanting to directly address privacy with the subjects because it might negatively affect the study. If you bring up the privacy concern, Underwood seems to say, it will just cause them to self-censor. Of course, if her hypothesis is true, <em>that validates the privacy concern itself</em> &#8212; the participants might actually care about their privacy, once reminded about it. (Note to researchers: if you find yourself wanting to minimize disclosure of privacy concerns, then you have significant privacy concerns that need to be addressed.)</p>
<p>In sum, the Blackberry Project appears to have been managed properly through the IRB rules and regulations. These open issues speak more to the nature of this kind of research generally, versus about this project specifically. I&#8217;m very curious as to how the researchers and the IRB discussed and deliberated these issues, and will provide any updates if I&#8217;m able to gain access to more details.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.michaelzimmer.org/2012/04/25/research-ethics-and-the-blackberry-project/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Presentations at PRIM&amp;R 2011 &quot;Adancing Ethical Research&quot;</title>
		<link>http://www.michaelzimmer.org/2011/12/01/presentations-at-primr-2011-adancing-ethical-research/</link>
		<comments>http://www.michaelzimmer.org/2011/12/01/presentations-at-primr-2011-adancing-ethical-research/#comments</comments>
		<pubDate>Thu, 01 Dec 2011 13:41:35 +0000</pubDate>
		<dc:creator>michaelzimmer</dc:creator>
				<category><![CDATA[Events]]></category>
		<category><![CDATA[Internet Research Ethics]]></category>
		<category><![CDATA[Conferences]]></category>
		<category><![CDATA[PRIMR]]></category>
		<category><![CDATA[PRIMR_AER11]]></category>
		<category><![CDATA[Research ethics]]></category>

		<guid isPermaLink="false">http://michaelzimmer.org/?p=2997</guid>
		<description><![CDATA[This weekend I have the great privilege of sharing my research and perspectives on Internet research ethics at the 2011 &#8220;Advancing Ethical Research&#8221; conference held by Public Responsibility in Medicine and Research (PRIM&#38;R). On the opening morning of the conference, I will join John Palfrey and Lydia Shrier for a plenary panel on &#8220;Would Margaret [...]]]></description>
				<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-3000" title="Screen Shot 2011-11-30 at 8.58.09 PM" src="http://michaelzimmer.org/wp-content/uploads/2011/12/Screen-Shot-2011-11-30-at-8.58.09-PM.png" alt="" width="114" height="114" />This weekend I have the great privilege of sharing my research and perspectives on <a href="http://michaelzimmer.org/category/ethics/research-ethics/" target="_blank">Internet research ethics</a> at the 2011 <a href="http://www.primr.org/Conferences.aspx?id=11065" target="_blank">&#8220;Advancing Ethical Research&#8221;</a> conference held by Public Responsibility in Medicine and Research (<a href="http://www.primr.org/AboutUs.aspx?id=32" target="_blank">PRIM&amp;R</a>).</p>
<p>On the opening morning of the conference, I will join <a href="http://blogs.law.harvard.edu/palfrey/" target="_blank">John Palfrey</a> and <a href="http://www.childrenshospital.org/cfapps/research/data_admin/Site278/mainpageS278P0.html" target="_blank">Lydia Shrier</a> for a plenary panel on &#8220;Would Margaret Mead Have Blogged? How Social Media has Changed Research&#8221;. My slides for that presentation are provided below. I will also be leading a workshop on &#8220;Research, the Cloud, and the IRB&#8221;, and giving a presentation to the <a href="http://humansubjects.energy.gov/doe-resources/hswg.htm" target="_blank">U.S. Department of Energy Human Subjects Working Group</a>.</p>
<p>Internet research ethics is well represented at this year&#8217;s AER conference, with a strong track focusing on &#8220;Research Involving the Internet &amp; Social Networking&#8221;. Among these presentations and workshops are several led by <a href="http://www.uwstout.edu/faculty/buchanane/" target="_blank">Elizabeth Buchanan</a>. Buchanan will be co-facilitating a pre-conference workshop entitled &#8220;What a Tangled Web We Weave: Ethical, Regulatory, and technical Aspects of Internet Research&#8221;. She is also participating in the following workshops:  &#8220;Research, the Internet, and the IRB: Ethical and Regulatory Issues,&#8221; &#8220;Research, Social Media and the IRB,&#8221; and &#8220;Ethical Implications of Pre and Post Enrollment uses of Social Media in Clinical Trials&#8221;.</p>
<div id="__ss_10392318" style="width: 425px;"><strong style="display:block;margin:12px 0 4px"><a title="Research Ethics in the 2.0 Era" href="http://www.slideshare.net/michaelzimmer/research-ethics-in-the-20-era" target="_blank">Research Ethics in the 2.0 Era</a></strong> <object id="__sse10392318" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="425" height="355" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="wmode" value="transparent" /><param name="src" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=zimmerprimrtalkv2-111130000734-phpapp01&amp;stripped_title=research-ethics-in-the-20-era&amp;userName=michaelzimmer" /><param name="name" value="__sse10392318" /><param name="allowfullscreen" value="true" /><embed height="355" width="425" name="__sse10392318" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=zimmerprimrtalkv2-111130000734-phpapp01&amp;stripped_title=research-ethics-in-the-20-era&amp;userName=michaelzimmer" wmode="transparent" allowscriptaccess="always" allowfullscreen="true" id="__sse10392318" type="application/x-shockwave-flash"></embed></object><br /> 
<div style="padding:5px 0 12px">View more <a href="http://www.slideshare.net/" target="_blank">presentations</a> from <a href="http://www.slideshare.net/michaelzimmer" target="_blank">Michael Zimmer</a></div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.michaelzimmer.org/2011/12/01/presentations-at-primr-2011-adancing-ethical-research/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>My Research in The Chronicle of Higher Education: &quot;Harvard&#039;s Privacy Meltdown&quot;; some annotations</title>
		<link>http://www.michaelzimmer.org/2011/07/11/harvards-privacy-meltdown/</link>
		<comments>http://www.michaelzimmer.org/2011/07/11/harvards-privacy-meltdown/#comments</comments>
		<pubDate>Mon, 11 Jul 2011 12:39:44 +0000</pubDate>
		<dc:creator>michaelzimmer</dc:creator>
				<category><![CDATA[Internet Research Ethics]]></category>
		<category><![CDATA[Privacy]]></category>
		<category><![CDATA[Publications]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Online Privacy]]></category>
		<category><![CDATA[Research ethics]]></category>

		<guid isPermaLink="false">http://michaelzimmer.org/?p=2882</guid>
		<description><![CDATA[The Chronicle of Higher Education has published an article featuring my critique of the privacy protections and research methods related to the “Taste, Ties, and Time” (T3) Facebook research study conducted by a set of Harvard sociologists. Written by Marc Parry, the article is not-so-subtly teased as &#8220;Harvard&#8217;s Privacy Meltdown&#8221; on the Chronicle&#8217;s front page, [...]]]></description>
				<content:encoded><![CDATA[<p><a href="http://chronicle.com/article/Harvards-Privacy-Meltdown/128166/"><img class="alignright" title="Zimmer, Chronicle of Higher Education" src="/images/Zimmer_Chronicle.png" alt="" width="193" height="218" /></a>The <a href="http://chronicle.com" target="_blank">Chronicle of Higher Education</a> has published an article featuring <a href="http://michaelzimmer.org/2009/06/18/draft-paper-but-the-data-is-already-public/" target="_blank">my critique</a> of the privacy protections and research methods related to the <a href="http://cyber.law.harvard.edu/node/4682" target="_blank">“Taste, Ties, and Time”</a> (T3) Facebook research study conducted by a set of Harvard sociologists. Written by Marc Parry, the article is not-so-subtly teased as &#8220;<a href="http://chronicle.com/article/Harvards-Privacy-Meltdown/128166/" target="_blank">Harvard&#8217;s Privacy Meltdown</a>&#8221; on the Chronicle&#8217;s front page, and carries the title &#8220;Harvard Researchers Accused of Breaching Students&#8217; Privacy: Social-network project shows promise and peril of doing social science online&#8221; within the link.</p>
<p>It is a well-written article, quite balanced, and features myself, the T3 principle researcher Jason Kaufman, and fellow Internet research experts Alex Halavais, Fred Stutzman, and Elizabeth Buchanan (I am friends with the latter three, for disclosure). The Chronicle also tracked down a Harvard student presumably within the dataset.</p>
<p>For those looking, my initial blog posts (from 2008) regarding the T3 dataset are <a href="http://michaelzimmer.org/2008/09/30/on-the-anonymity-of-the-facebook-dataset/" target="_blank">here</a> and <a href="http://michaelzimmer.org/2008/10/03/more-on-the-anonymity-of-the-facebook-dataset-its-harvard-college/" target="_blank">here</a>, and my full treatment of the dataset release was published here:</p>
<ul>
<li>Zimmer, M. (2010). &#8220;<a href="http://www.springerlink.com/content/q1v7731u26210682/">&#8216;But the data is already public&#8217;: on the ethics of research in Facebook</a>,&#8221; <em>Ethics &amp; Information Technology</em>, 12(4), 313-325</li>
</ul>
<p>I don&#8217;t want to rehash the entire article or episode, but would like to provide a few annotations:</p>
<hr />
<p>The article does a nice job pointing out the dual challenges of &#8220;Researchers [who] must navigate the shifting privacy standards of social networks and their users&#8221;, as well as the &#8220;the committees set up to protect research subjects—institutional review boards, or IRB&#8217;s—[who] lack experience with Web-based research.&#8221;</p>
<p>These are critical revelations that we cannot take lightly. There is much work to be done to ensure researchers of all disciplines and levels recognize and respond to the complexities of engaging in this kind of research online, and that IRBs are sufficiently trained to recognize issues related to Internet research ethics.</p>
<p>To these ends, the <a href="http://aoir.org/" target="_blank">Association of Internet Researchers</a> (AoIR) has published an <a href="http://aoir.org/documents/ethics-guide/" target="_blank">ethics guide</a> (now undergoing revisions) as &#8220;as at least a starting point for their inquiries and reflection&#8221;, and we&#8217;ve held <a href="http://michaelzimmer.org/2010/10/26/debrief-internet-research-11-0-conference/" target="_blank">various</a> <a href="http://michaelzimmer.org/2009/10/06/ir-10-internet-critical/" target="_blank">workshops</a> on the subject. <a href="http://www4.uwm.edu/cipr/about/director.cfm" target="_blank">Elizabeth Buchanan</a> and <a href="http://www.drury.edu/ess/ess.html" target="_blank">Charles Ess</a> have spearheaded important research on the IRBs&#8217; awareness of Internet-related concerns, and have launched the <a href="http://internetresearchethics.org/" target="_blank">Internet Research Ethics Digital Library, Resource Center and Commons</a> website as a valuable resource.</p>
<p>And, specific to the article&#8217;s mention that I have &#8220;pointed to the Harvard case in urging the federal government to do more to educate IRB&#8217;s about Web research&#8221;, I was <a href="http://michaelzimmer.org/2010/07/20/presentation-research-ethics-in-the-2-0-era/" target="_blank">privileged to present before</a> the <a href="http://www.hhs.gov/ohrp/sachrp/" target="_blank">Secretary’s Advisory Committee on Human Research Protections (SACHRP)</a>, part of the <a href="http://www.hhs.gov/ohrp/" target="_blank">Office for Human Research Protections</a> in the United States Department of Health and Human Services (HHS). Joined by <a href="http://www4.uwm.edu/cipr/about/director.cfm" target="_blank">Elizabeth Buchanan</a>, <a href="http://www.bgsu.edu/departments/popc/page16741.html" target="_blank">Montana Miller</a>, and <a href="http://blogs.law.harvard.edu/palfrey/top/bio/" target="_blank">John Palfrey</a> (of Harvard&#8217;s Berkman Center, by the way), we discussed emerging ethical issues with Internet-based research and urged the committee to take steps to ensure IRBs and researchers were suitably trained to recognize and address these important ethical issues.</p>
<hr />
<p>In the context of this entire debate (and some of the original comments left on my blog posts), this passage from the article is quite telling:</p>
<blockquote><p>But Mr. Kaufman talks openly about another controversial piece of his data gathering: Students were not informed of it. He discussed this with the institutional review board. Alerting students risked &#8220;frightening people unnecessarily,&#8221; he says.</p>
<p>&#8220;We all agreed that it was not necessary, either legally or ethically,&#8221; Mr. Kaufman says.</p></blockquote>
<p>Frankly, I&#8217;m troubled by this statement. I will leave it to legal experts to determine if the research violated the consent requirements of the Federal Regulations for the Protection of Human Subjects (<a href="http://ohsr.od.nih.gov/guidelines/45cfr46.html" target="_blank">45 CFR 46</a>), but from an ethical standpoint, I argue the researchers <em>did</em> have an obligation to respect the intentions of those students who might have restricted their Facebook profiles to only be visible to members of the Harvard community. The researcher&#8217;s own codebook acknowledged that the assistants used to access the profile data might have had preferential access to a profile, and that &#8220;a given student’s information should not be considered objectively &#8216;public&#8217; or &#8216;private&#8217;&#8221;. This realization should have triggered an ethical concern over whether each students truly intended to have their profile data publicly visible and accessible for downloading.</p>
<p>This is the crux of the issue, and my earlier attempts to learn if and how this apparent waiver of the consent requirement was deliberated by Harvard&#8217;s IRB were unsuccessful. Perhaps now we can gain a bit more understanding of why it was deemed that consent wasn&#8217;t necessary (and I hope it was a more nuanced decision than simply avoiding &#8220;frightening people unnecessarily&#8221;).</p>
<hr />
<p>I agree with the article&#8217;s conclusion that the &#8220;biggest victim&#8221; in this episode is academic scholarship.</p>
<p>The uniqueness of this dataset is of obvious value for sociologists and Internet researchers, and it wasn&#8217;t my goal to shut down this research project. It is unfortunate the researchers <a href="http://dvn.iq.harvard.edu/dvn/dv/t3" target="_blank">haven&#8217;t been able to find</a> a suitable means of re-releasing the data, but just like the AOL search data release <a href="http://michaelzimmer.org/2006/08/08/will-aol-flap-help-privacy-awareness/" target="_blank">forced us to rethink methods</a> of anonymization before again releasing large datasets of transaction logs, I&#8217;m hopeful that this episode can prompt meaningful consideration and debate of our understandings of privacy, anonymity/identifiability, consent, and harm when it comes to Internet-based research.</p>
<hr />
<p>Finally, I wanted to provide a brief response to the implicit accusation made in the article that I&#8217;m a part of some kind of &#8220;academic paparazzi&#8221;.</p>
<p>I&#8217;m not even sure what this means. Perhaps someone thinks I spend my time trolling through other people&#8217;s research hoping to find a place where they slip up so I can have a &#8220;gotcha&#8221; moment? Hardly. I had never written on research ethics until I came across this particular case. I saw a passing mention of the data release on another scholar&#8217;s blog, and the ensuing discussion there about how the presumed anonymity of the dataset should be questioned due to its unique data variables. So I started to explore, and my discoveries followed. I&#8217;m not out to get anyone, but rather have taken quite a number of proactive steps to help researchers (both the T3 team and more broadly) address these complexities.</p>
<p>The complexities of research ethics and methodology in today&#8217;s Internet-based environment is complex, and <a href="http://michaelzimmer.org/research/#IRE" target="_blank">I&#8217;m just starting to scratch the surface</a>. But I don&#8217;t take this lightly; I&#8217;m a scholar, not a paparazzo.</p>
<p>As I conclude in my full article:</p>
<blockquote><p>The purpose of this critical analysis of the T3 project is not to place blame or single out these researchers for condemnation, but to use it as a case study to help expose the emerging challenges of engaging in research within online social network settings. &#8230;The T3 research project might very well be ushering in ‘‘a new way of doing social science’’, but it is our responsibility scholars to ensure our research methods and processes remain rooted in long- standing ethical practices. Concerns over consent, privacy and anonymity do not disappear simply because subjects participate in online social networks; rather, they become even more important.</p></blockquote>
<p>I hope that&#8217;s the takeaway from all this.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.michaelzimmer.org/2011/07/11/harvards-privacy-meltdown/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Facebook Data of 1.2 Million Users from 2005 Released: Limited Exposure, but Very Problematic</title>
		<link>http://www.michaelzimmer.org/2011/02/15/facebook-data-of-1-2-million-users-from-2005-released/</link>
		<comments>http://www.michaelzimmer.org/2011/02/15/facebook-data-of-1-2-million-users-from-2005-released/#comments</comments>
		<pubDate>Tue, 15 Feb 2011 13:30:24 +0000</pubDate>
		<dc:creator>michaelzimmer</dc:creator>
				<category><![CDATA[Internet Research Ethics]]></category>
		<category><![CDATA[Privacy]]></category>
		<category><![CDATA[Social Media]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Research ethics]]></category>

		<guid isPermaLink="false">http://michaelzimmer.org/?p=2589</guid>
		<description><![CDATA[Recently, a Facebook dataset was released consisting of the complete set of users from the Facebook networks at 100 American institutions, and all of the in-network “friendship” links between those users as they existed at a single moment of time in September 2005. Surprisingly, it initially included each users unique Facebook ID, meaning the presumed "anonymous" dataset could be easily re-identified, potentially putting the personal information of 1.2 million Facebook users at risk.]]></description>
				<content:encoded><![CDATA[<p>Last week, a Facebook dataset was <a href="http://lists.ufl.edu/cgi-bin/wa?A2=ind1102&amp;L=SOCNET&amp;T=0&amp;P=22841" target="_blank">released</a> by a group of researchers (Amanda L. Traud,  Peter J. Mucha,  Mason A. Porter) in connection with their <a href="http://arxiv.org/abs/1102.2166" target="_blank">paper</a> studying the role of user attributes &#8211; gender, class year, major, high school, and residence &#8211; on social network formations at various colleges and universities. The dataset &#8212; referred to by the researchers as the &#8220;Facebook 100&#8243; &#8212; consists of the complete set of users from the Facebook networks at 100 American schools, and all of the in-network “friendship” links between those users as they existed at a single moment of time in September 2005.</p>
<p>The research paper indicates that the Facebook data was provided to the researchers &#8220;in anonymized form by <a href="The data that we use was sent directly to us in anonymized form by Adam D’Angelo of Facebook.?PHPSESSID=697e106875e826d159b329a2506ff349" target="_blank">Adam D’Angelo</a> of Facebook.&#8221; (D&#8217;Angelo was then Facebook&#8217;s CTO, and <a href="http://kara.allthingsd.com/20080511/facebooks-cto-dangelo-to-leave/" target="_blank">left Facebook</a> in 2008.) Curious as to what precisely was included in the data release, and what steps towards anonymization were taken, I downloaded the data (<a href="http://people.maths.ox.ac.uk/~porterm/data/facebook100.zip" target="_blank">200 MB zip file</a>) on the morning of February 11.</p>
<p>The data files are separated by institution, and in total include, by my estimation, about 1.2 million user accounts. The content of each institution&#8217;s file is described as containing the following:</p>
<blockquote><p>Each of the school .mat files has an A matrix (sparse) and a &#8220;local_info&#8221; variable, one row per node: ID, a student/faculty status flag, gender, major, second major/minor (if applicable), dorm/house, year, and high school.</p></blockquote>
<p>Thus, the datasets include limited demographic information that was posted by users on their individual Facebook pages. The identity of users&#8217; dorm and high schools were obscured by numerical identifiers, but to my surprise, <em>the dataset included each user&#8217;s unique Facebook ID number.</em> As a result, while user names and extended profile information were kept out of the data release, a simple query against Facebook&#8217;s databases would yield considerable identifiable information for each record. In short, the suggestion that the data has been &#8220;anonymized&#8221; is seriously flawed.</p>
<p>The consequences of this ease of re-identifying the dataset are numerous.</p>
<p>First, while only limited profile information is within the dataset, there is no indication that any consideration was given to users&#8217; particular privacy settings. Based on the article, all user accounts from each of the 100 networks were provided to the researchers, and as long as the user provided the data to Facebook, it was turned over to the researchers. [Clarification: when I say "all user accounts" we provided, I do not mean full profile information was given to the researchers, just the particular data fields as described above]</p>
<p>Yet, in 2005, users had the ability to restrict access and visibility of their Facebook profile, their demographic data, and their lists of friends (much of this control was <a href="http://michaelzimmer.org/2009/12/10/facebooks-privacy-upgrade-is-a-downgrade-for-user-privacy/" target="_blank">taken away in 2009</a>). So, a user might have restricted access to certain information to only people within her network or just her friends, and Facebook&#8217;s own <a href="http://web.archive.org/web/20051231035715/http://www.facebook.com/policy.php" target="_blank">privacy policy at the time</a> promised that: <em>&#8220;No personal information that you submit to Facebook will be available to  any user of the Web Site who does not belong to at least one of the  groups specified by you in your privacy settings.&#8221;</em> This data release, and the ease by which users could be identified and linked to their data, potentially negates actions taken by users to control access to the data within the files, and seemingly contradicts Facebook&#8217;s own privacy policy.</p>
<p>Second, even though the specific data exposure within the dataset is limited, the fact that users can be identified and linked to their in-network social map fosters additional threats to privacy. Previous research (<a href="http://arxiv.org/abs/cs/0610105" target="_blank">here</a> and <a href="http://randomwalker.info/social-networks/" target="_blank">here</a>, for example) has shown how &#8220;anonymous&#8221; datasets can be largely re-identified when there is access to other large sets of data where the subjects are already  known. The &#8220;Facebook 100&#8243; data, with the Facebook IDs intact to guide identification of users, might be useful in similar efforts.</p>
<p>To recap, the suggestion that the &#8220;Facebook 100&#8243; data has been &#8220;anonymized&#8221; is seriously flawed, and its release might be putting the information of 1.2 million Facebook users at risk.</p>
<p>Interestingly, a few hours after the <a href="http://lists.ufl.edu/cgi-bin/wa?A2=ind1102&amp;L=SOCNET&amp;T=0&amp;P=22841" target="_blank">initial release</a> of the &#8220;Facebook 100&#8243; dataset, <span style="text-decoration: line-through;">the researchers</span> Mason Porter <a href="http://lists.ufl.edu/cgi-bin/wa?A2=ind1102&amp;L=SOCNET&amp;T=0&amp;P=26009" target="_blank">announced</a> they were pulling the data due to an unspecified &#8220;bug&#8221;. Later that evening, the data was again made available with a <a href="http://lists.ufl.edu/cgi-bin/wa?A2=ind1102&amp;L=SOCNET&amp;T=0&amp;P=28725" target="_blank">message</a> indicating that the data files were now fixed.</p>
<p>Again, I was curious, so I downloaded and examined the new dataset. The only change I could see was that now the Facebook ID was removed entirely from the data files, and the order of the records in each file was randomized.</p>
<p>Thus, the &#8220;bug&#8221; must&#8217;ve been that the data was easily re-identifiable, and the &#8220;fix&#8221; was to take additional steps to anonymize the records. Somone <a href="http://lists.ufl.edu/cgi-bin/wa?A2=ind1102&amp;L=SOCNET&amp;T=0&amp;P=28256" target="_blank">joked</a> on the announcement email list that the &#8220;bug&#8221; must have something to do with Facebook attorneys, but <span style="text-decoration: line-through;">the</span> Porter&#8217;s <a href="http://lists.ufl.edu/cgi-bin/wa?A2=ind1102&amp;L=SOCNET&amp;T=0&amp;P=28725" target="_blank">message</a> re-releasing the data jokes that no lawyers were involved, and that they &#8220;really were fixing the data files!&#8221;</p>
<p>To me, however, the language used in these explanations was disingenuous. The data, as far as I could tell, had no bugs that prevented its usefulness for social network analysis. No, the problem with the data was that it contained each user&#8217;s unique Facebook ID, thus allowing easy identification. <span style="text-decoration: line-through;">The researchers</span> Porter should have been open and honest about why the data was pulled and what they did to correct the situation.</p>
<p>That said, there are still a number of open questions regarding this particular dataset:</p>
<p>To Facebook:</p>
<ul>
<li>What kind of internal processes, if any, did D&#8217;Angelo follow when releasing the data to these researchers? Was he authorized to do so?</li>
<li>Was this kind of large data release routine? How many other similar releases have taken place?</li>
<li>Does Facebook consider releasing this information, with Facebook IDs, in compliance with the privacy policy in effect in 2005? If so, how?</li>
</ul>
<p>To the research team:</p>
<ul>
<li>Was the data received by Facebook already obscured with numerical identifiers replacing student majors, minors, and high schools, or did you add those?
<ul>
<li><em>UPDATE</em>: I have received word from one of the researchers, Mason Porter, that the data sent to them by Facebook was indeed already obscured with numerical identifiers in the place of actual student major, minor, and high school information.</li>
</ul>
</li>
<li>Did your IRB review the data used for the research, and approve the subsequent data release?</li>
<li>Was there any &#8220;bug&#8221; in the data, or was the attempt to gain greater anonymization of the data the sole reason to pull it from public access?</li>
</ul>
<p>Obtaining answers to these questions can help us better understand the uniqueness of this situation, and to put better processes and protections in place to prevent similar data releases that falsely believe data is sufficiently anonymized and respecting of users&#8217; privacy expectations.</p>
<p>I hope Facebook and the researchers are willing to engage in a discussion, and I&#8217;ll report back on any communication, as allowed.</p>
<p>UPDATE (Feb 15, 6:00pm): I have been in contact with one of the researchers, Mason Porter, who confirmed that the data sent to them by Facebook was indeed already obscured with numerical identifiers in the place of actual student major, minor, and high school information. I&#8217;ve inserted this reply into the question above. I have also made a few minor changes to the main text, clarifying that the email messages reporting the &#8220;bug&#8221; in the data came from Mason alone, and should not be attributed to the entire research team.</p>
<p>UPDATE 2 (Feb 15, 6:10pm): The link to the full, revised dataset (<a href="http://people.maths.ox.ac.uk/~porterm/data/facebook100.zip" target="_blank">http://people.maths.ox.ac.uk/~porterm/data/facebook100.zip</a>) is no longer active.</p>
<p>UDPATE 3 (Feb 16, 9am): Added a clarification that when I say &#8220;all user accounts&#8221; were provided to the researchers, I do not  mean full profile information was given, just the particular data fields as described above.</p>
<p>UDPATE 4 (Feb 16, 11am): Mason Porter, one of the authors, has posted an <a href="http://masonporter.blogspot.com/2011/02/facebook100-data-set.html" target="_blank">explanatory note on his blog</a> indicating that he&#8217;s been in contact with the Facebook Data Team, and per their request, &#8220;I have taken down the  data, and I will be working with them to eventually post a version of  the data set with which both they and I are happy.&#8221;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.michaelzimmer.org/2011/02/15/facebook-data-of-1-2-million-users-from-2005-released/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
	</channel>
</rss>
