<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Ready-to-hand &#187; data collection</title>
	<atom:link href="http://www.deaneckles.com/blog/category/data-collection/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.deaneckles.com/blog</link>
	<description>Dean Eckles on people, technology &#38; inference</description>
	<lastBuildDate>Wed, 11 Jan 2012 01:51:42 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>A deluge of experiments</title>
		<link>http://www.deaneckles.com/blog/632_a-deluge-of-experiments/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=a-deluge-of-experiments</link>
		<comments>http://www.deaneckles.com/blog/632_a-deluge-of-experiments/#comments</comments>
		<pubDate>Thu, 24 Nov 2011 07:43:51 +0000</pubDate>
		<dc:creator>Dean Eckles</dc:creator>
				<category><![CDATA[causal inference]]></category>
		<category><![CDATA[data collection]]></category>
		<category><![CDATA[econometrics]]></category>
		<category><![CDATA[experiments]]></category>
		<category><![CDATA[HCI]]></category>
		<category><![CDATA[research methods]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://www.deaneckles.com/blog/?p=632</guid>
		<description><![CDATA[The Atlantic reports on the data deluge and its value for innovation.1 I particularly liked how Erik Brynjolfsson and Andrew McAfee, who wrote the Atlantic piece, highlight the value of experimentation for addressing causal questions &#8212; and that many of the questions we care about are causal.2 In writing about experimentation, they report that Hal [...]]]></description>
			<content:encoded><![CDATA[<p><em>The Atlantic</em> <a href="http://www.theatlantic.com/business/archive/2011/11/the-big-data-boom-is-the-innovation-story-of-our-time/248215/">reports on the data deluge and its value for innovation</a>.<sup><a href="http://www.deaneckles.com/blog/632_a-deluge-of-experiments/#footnote_0_632" id="identifier_0_632" class="footnote-link footnote-identifier-link" title="I don&amp;#8217;t know that I would call much of it &amp;#8216;innovation&amp;#8217;. There is some outright innovation, but a lot of that is in the general strategies for using the data. There is much more gained in minor tweaking and optimization of products and services.">1</a></sup> I particularly liked how Erik Brynjolfsson and Andrew McAfee, who wrote the <em>Atlantic</em> piece, highlight the value of experimentation for addressing causal questions &#8212; and that many of the questions we care about are causal.<sup><a href="http://www.deaneckles.com/blog/632_a-deluge-of-experiments/#footnote_1_632" id="identifier_1_632" class="footnote-link footnote-identifier-link" title="Perhaps they even overstate the power of simple experiments. For example, they do not mention the fact that many times the results these kinds of experiments often change over time, so that what you learned 2 months ago is no longer true.">2</a></sup></p>
<p>In writing about experimentation, they report that Hal Varian, Google&#8217;s Chief Economist, estimates that Google runs &#8220;100-200 experiments on any given day&#8221;. This struck me as incredibly low! I would have guessed more like 10,000 or maybe more like 100,000. </p>
<p>The trick of course is how one individuates experiments. Say Google has an automatic procedure whereby each ad has a (small) random set of users who are prevented from seeing it and are shown the next best ad instead. Is this one giant experiment? Or one experiment for each ad?</p>
<p>This is a bit of a silly question.<sup><a href="http://www.deaneckles.com/blog/632_a-deluge-of-experiments/#footnote_2_632" id="identifier_2_632" class="footnote-link footnote-identifier-link" title="Note that two single-factor experiments over the same population with independent random assignment can be regarded as a single experiment with two factors.">3</a></sup> </p>
<p>But when most people &#8212; even statisticians and scientists &#8212; think of an experiment in this context, they think of something like Google or Amazon making a particular button bigger. (Maybe somebody thought making <em>that</em> button bigger would improve a particular metric.) They likely don&#8217;t think of automatically generating an experiment for every button, such that a random sample see that particular button slightly bigger. It&#8217;s these latter kinds of procedures that lead to thinking about tens of thousands of experiments. </p>
<p>That&#8217;s the real deluge of experiments.</p>
<ol class="footnotes"><li id="footnote_0_632" class="footnote">I don&#8217;t know that I would call much of it &#8216;innovation&#8217;. There is some outright innovation, but a lot of that is in the general strategies for using the data. There is much more gained in minor tweaking and optimization of products and services.</li><li id="footnote_1_632" class="footnote">Perhaps they even overstate the power of simple experiments. For example, they do not mention the fact that many times the results these kinds of experiments often change over time, so that what you learned 2 months ago is no longer true.</li><li id="footnote_2_632" class="footnote">Note that two single-factor experiments over the same population with independent random assignment can be regarded as a single experiment with two factors.</li></ol>]]></content:encoded>
			<wfw:commentRss>http://www.deaneckles.com/blog/632_a-deluge-of-experiments/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Traits, adaptive systems &amp; dimensionality reduction</title>
		<link>http://www.deaneckles.com/blog/495_traits-adaptive-systems-dimensionality-reduction/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=traits-adaptive-systems-dimensionality-reduction</link>
		<comments>http://www.deaneckles.com/blog/495_traits-adaptive-systems-dimensionality-reduction/#comments</comments>
		<pubDate>Fri, 22 Apr 2011 03:07:58 +0000</pubDate>
		<dc:creator>Dean Eckles</dc:creator>
				<category><![CDATA[data collection]]></category>
		<category><![CDATA[HCI]]></category>
		<category><![CDATA[influence]]></category>
		<category><![CDATA[persuasion profiling]]></category>
		<category><![CDATA[persuasive technology]]></category>
		<category><![CDATA[psychology]]></category>
		<category><![CDATA[research methods]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://www.deaneckles.com/blog/?p=495</guid>
		<description><![CDATA[Psychologists have posited numerous psychological traits and described causal roles they ought to play in determining human behavior. Most often, the canonical measure of a trait is a questionnaire. Investigators obtain this measure for some people and analyze how their scores predict some outcomes of interest. For example, many people have been interested in how [...]]]></description>
			<content:encoded><![CDATA[<p>Psychologists have posited numerous psychological traits and described causal roles they ought to play in determining human behavior. Most often, the canonical measure of a trait is a questionnaire. Investigators obtain this measure for some people and analyze how their scores predict some outcomes of interest. For example, many people have been interested in how psychological traits affect persuasion processes. Traits like need for cognition (NFC) have been posited and questionnaire items developed to measure them. Among other things, NFC affects how people respond to messages with arguments for varying quality.</p>
<p><strong>How useful are these traits for explanation, prediction, and adaptive interaction?</strong> I can&#8217;t address all of this here, but I want to sketch an argument for their irrelevance to adaptive interaction &#8212; and then offer a tentative rejoinder.</p>
<p>Interactive technologies can tailor their messages to the tastes and susceptibilities of the people interacting with and through them. It might seem that these traits should figure in the statistical models used to make these adaptive selections. After all, some of the possible messages fit for, e.g., coaching a person to meet their exercise goals are more likely to be effective for low NFC people than high NFC people, and vice versa. However, the standard questionnaire measures of NFC cannot often be obtained for most users &#8212; certainly not in commerce settings, and even people signing up for a mobile coaching service likely don&#8217;t want to answer pages of questions. On the other hand, some Internet and mobile services have other abundant data available about their users, which could perhaps be used to construct an alternative measure of these traits. <strong>The trait-based-adaptation recipe is</strong>: </p>
<ol>
<li>obtain the questionnaire measure of the trait for a sample, </li>
<li>predict this measure with data available for many individuals (e.g., log data), </li>
<li>use this model to construct a measure for out-of-sample individuals. </li>
</ol>
<p>This new measure could then be used to personalize the interactive experience based on this trait, such that if a version performs well (or poorly) for people with a particular score on the trait, then use (or don&#8217;t use) that version for people with similar scores.</p>
<p><strong>But why involve the trait at all?</strong> Why not just personalize the interactive experience based on the responses of similar others? Since the new measure of the trait is just based on the available behavioral, demographic, and other logged data, one could simply predict responses based on those measure. Put in geometric terms, if the goal is to project the effects of different message onto available log data, why should one project the questionnaire measure of the trait onto the available log data and then project the effects onto this projection? This seems especially unappealing if one doesn&#8217;t fully trust the questionnaire measure to be accurate or one can&#8217;t be sure about which the set of all the traits that make a (substantial) difference.</p>
<p>I find this argument quite intuitively appealing, and it seems to resonate with others.<sup><a href="http://www.deaneckles.com/blog/495_traits-adaptive-systems-dimensionality-reduction/#footnote_0_495" id="identifier_0_495" class="footnote-link footnote-identifier-link" title="I owe some clarity on this to some conversations with Mike Nowak, Maurits Kaptein, and others.">1</a></sup> But I think there are some reasons the recipe above could still be appealing.</p>
<p>One way to think about this recipe is as dimensionality reduction guided by theory about psychological traits. Available log data can often be used to construct countless predictors (or &#8220;features&#8221;, as the machine learning people call them). So one can very quickly get into a situation where the effective number of parameters for a full model predicting the effects of different messages is very large and will make for poor predictions. Nothing &#8212; no, not penalized regression, not even a support vector machine &#8212; makes this problem go away. Instead, one has to rely on the domain knowledge of the person constructing the predictors (i.e., doing the &#8220;feature engineering&#8221;) to pick some good ones.</p>
<p>So the tentative rejoinder is this: established psychological traits might often make good dimensions to predict effects of different version of a message, intervention, or experience with. And they may &#8220;come with&#8221; suggestions about what kinds of log data might serve as measures of them. They would be expected to be reusable across settings. Thus, I think this recipe is nonetheless deserves serious attention.</p>
<ol class="footnotes"><li id="footnote_0_495" class="footnote">I owe some clarity on this to some conversations with Mike Nowak, Maurits Kaptein, and others.</li></ol>]]></content:encoded>
			<wfw:commentRss>http://www.deaneckles.com/blog/495_traits-adaptive-systems-dimensionality-reduction/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>Aardvark&#8217;s use of Wizard of Oz prototyping to design their social interfaces</title>
		<link>http://www.deaneckles.com/blog/305_aardvarks-use-of-wizard-of-oz-prototyping-to-design-their-social-interfaces/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=aardvarks-use-of-wizard-of-oz-prototyping-to-design-their-social-interfaces</link>
		<comments>http://www.deaneckles.com/blog/305_aardvarks-use-of-wizard-of-oz-prototyping-to-design-their-social-interfaces/#comments</comments>
		<pubDate>Tue, 27 Apr 2010 02:25:41 +0000</pubDate>
		<dc:creator>Dean Eckles</dc:creator>
				<category><![CDATA[communication]]></category>
		<category><![CDATA[data collection]]></category>
		<category><![CDATA[design]]></category>
		<category><![CDATA[HCI]]></category>
		<category><![CDATA[information needs]]></category>
		<category><![CDATA[markets]]></category>
		<category><![CDATA[Mechanical Turk]]></category>
		<category><![CDATA[needfinding]]></category>
		<category><![CDATA[prototyping]]></category>
		<category><![CDATA[research methods]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[social responses to communication technologies]]></category>
		<category><![CDATA[social software]]></category>
		<category><![CDATA[source orientation]]></category>
		<category><![CDATA[usability]]></category>
		<category><![CDATA[Wizard of Oz]]></category>

		<guid isPermaLink="false">http://www.deaneckles.com/blog/?p=305</guid>
		<description><![CDATA[The Wall Street Journal&#8217;s Venture Capital Dispatch reports on how Aardvark, the social question asking and answering service recently acquired by Google, used a Wizard of Oz prototype to learn about how their service concept would work without building all the tech before knowing if it was any good. Aardvark employees would get the questions [...]]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://blogs.wsj.com/venturecapital/2010/04/24/how-a-start-up-grew-by-paying-attention-to-whats-behind-the-curtain/">Wall Street Journal&#8217;s Venture Capital Dispatch reports</a> on how <a href="http://blogs.wsj.com/venturecapital/2010/04/24/how-a-start-up-grew-by-paying-attention-to-whats-behind-the-curtain/">Aardvark</a>, the social question asking and answering service recently acquired by Google, used a <a href="http://www.usabilitynet.org/tools/wizard.htm">Wizard of Oz prototype</a> to learn about how their service concept would work without building all the tech before knowing if it was any good.</p>
<blockquote><p>Aardvark employees would get the questions from beta test users and route them to users who were online and would have the answer to the question. This was done to test out the concept before the company spent the time and money to build it, said Damon Horowitz, co-founder of Aardvark, who spoke at Startup Lessons Learned, a conference in San Francisco on Friday.</p>
<p>“If people like this in super crappy form, then this is worth building, because they’ll like it even more,” Horowitz said of their initial idea.</p>
<p>At the same time it was testing a “fake” product powered by humans, the company started building the automated product to replace humans. While it used humans “behind the curtain,” it gained the benefit of learning from all the questions, including how to route the questions and the entire process with users.</p></blockquote>
<p>This is a really good idea, as I&#8217;ve argued before <a href="http://www.deaneckles.com/blog/16_using-a-wizard-of-oz-technique-in-mobile-service-design-probing-with-realistic-motivations/">on this blog</a> and in <a href="http://www.amazon.com/dp/0979502543/">a chapter for developers of mobile health interventions</a>. What better way to (a) learn about how people will use and experience your service and (b) get training data for your machine learning system than to have humans-in-the-loop run the service?</p>
<p>My friend <a href="http://www.chrisstreeter.com/">Chris Streeter</a> wondered whether this was all done by Aardvark employees or whether workers on Amazon Mechanical Turk may have also been involved, especially in identifying the expertise of the early users of the service so that the employees could route the questions to the right place. I think this highlights how different parts of a service can draw on human and non-human intelligence in a variety of ways &#8212; via a micro-labor market, using skilled employees who will gain hands-on experience with customers, etc.</p>
<p>I also wonder what UIs the humans-in-the-loop used to accomplish this. It&#8217;d be great to get a peak. I&#8217;d expect that these were certainly rough around the edges, as was the Aardvark customer-facing UI.</p>
<p>Aardvark does a good job of being a quite sociable agent (e.g., when using it via instant messaging) that also gets out of the way of the human&#8211;human interaction between question askers and answers. I wonder how the language used by humans to coordinate and hand-off questions may have played into creating a positive para-social interaction with vark.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.deaneckles.com/blog/305_aardvarks-use-of-wizard-of-oz-prototyping-to-design-their-social-interfaces/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Public once, public always? Privacy, egosurfing, and the availability heuristic</title>
		<link>http://www.deaneckles.com/blog/291_public-once-public-always-privacy-egosurfing-and-the-availability-heuristic/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=public-once-public-always-privacy-egosurfing-and-the-availability-heuristic</link>
		<comments>http://www.deaneckles.com/blog/291_public-once-public-always-privacy-egosurfing-and-the-availability-heuristic/#comments</comments>
		<pubDate>Mon, 19 Apr 2010 01:36:40 +0000</pubDate>
		<dc:creator>Dean Eckles</dc:creator>
				<category><![CDATA[activity streams]]></category>
		<category><![CDATA[automaticity]]></category>
		<category><![CDATA[communication]]></category>
		<category><![CDATA[consumption]]></category>
		<category><![CDATA[culture]]></category>
		<category><![CDATA[data collection]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[HCI]]></category>
		<category><![CDATA[heuristics]]></category>
		<category><![CDATA[participatory media]]></category>
		<category><![CDATA[priming]]></category>
		<category><![CDATA[psychology]]></category>

		<guid isPermaLink="false">http://www.deaneckles.com/blog/?p=291</guid>
		<description><![CDATA[The Library of Congress has announced that it will be archiving all Twitter posts (tweets). You can find positive reaction on Twitter. But some have also wondered about privacy concerns. Fred Stutzman, for example, points out how even assuming that only unprotected accounts are being archived this can still be problematic.1 While some people have [...]]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://blogs.loc.gov/loc/2010/04/how-tweet-it-is-library-acquires-entire-twitter-archive/">Library of Congress has announced</a> that it will be archiving all Twitter posts (tweets). You can find positive reaction on Twitter. But some have also wondered about privacy concerns. Fred Stutzman, for example, <a href="http://fstutzman.com/2010/04/14/twitter-and-the-library-of-congress/">points out</a> how even assuming that only unprotected accounts are being archived this can still be problematic.<sup><a href="http://www.deaneckles.com/blog/291_public-once-public-always-privacy-egosurfing-and-the-availability-heuristic/#footnote_0_291" id="identifier_0_291" class="footnote-link footnote-identifier-link" title="This might not be the case, see Michael Zimmer and this New York Times article.">1</a></sup> While some people have Twitter usernames that easily identify their owners and many allow themselves to be found based on an email address that is publicly associated with their identity, there are also many that do not. If at a future time, this account becomes associated with their identity for a larger audience than they desire, they can make their whole account viewable only by approved followers<sup><a href="http://www.deaneckles.com/blog/291_public-once-public-always-privacy-egosurfing-and-the-availability-heuristic/#footnote_1_291" id="identifier_1_291" class="footnote-link footnote-identifier-link" title="Why don&amp;#8217;t people do this in the first place? Many may not be aware of the feature, but even if they are, there are reasons not to use it. For example, it makes any participation in topical conversations (e.g., around a hashtag) difficult or impossible.">2</a></sup>, delete the account, or delete some of the tweets. Of course, this information may remain elsewhere on the Internet for a short or long time. But in contrast, the Library of Congress archive will be much more enduring and likely outside of individual users&#8217; control.<sup><a href="http://www.deaneckles.com/blog/291_public-once-public-always-privacy-egosurfing-and-the-availability-heuristic/#footnote_2_291" id="identifier_2_291" class="footnote-link footnote-identifier-link" title="Or at least this control would have to be via Twitter, likely before archiving: &amp;#8220;We asked them [Twitter] to deal with the users; the library doesn&amp;#8217;t want to mediate that.&amp;#8221;">3</a></sup> While I think it is worth examining the strategies that people adopt to cope with inflexible or difficult to use privacy controls in software, I don&#8217;t intend to do that here.</p>
<p>Instead, I want to relate this discussion to my continued interest in how activity streams and other information consumption interfaces affect their users&#8217; beliefs and behaviors through the availability heuristic. In response to some comments on <a href="http://fstutzman.com/2010/04/14/twitter-and-the-library-of-congress/">his first post</a>, <a href="http://fstutzman.com/2010/04/16/is-it-time-to-cancel-your-twitter-account/">Stutzman argues</a> that people overestimate the degree to which content once public on the Internet is public forever:</p>
<blockquote><p>So why is it that we all assume that the content we share publicly will be around forever?  I think this is a classic case of selection on the dependent variable.  When we Google ourselves, we are confronted with <em>what’s there</em> as opposed to what’s not there.  The stuff that goes away gets forgotten, and we concentrate on things that we see or remember (like a persistent page about us that we don’t like).  In reality, our online identities decay, decay being a stochastic process.  The internet is actually quite bad at remembering.</p></blockquote>
<p>This unconsidered &#8220;selection on the dependent variable&#8221; is one way of thinking about some cases of how the availability heuristic (and use of ease-of-retrievel information more generally). But I actually think the latter is more general and more useful for describing the psychological processes involved. For example, it highlights both that there are many occurrences or interventions can can influence which cases are available to mind and that even if people have thought about cases where their content disappeared at some point, this may not be easily retrieved when making particular privacy decisions or offering opinions on others&#8217; actions.</p>
<p>Stutzman&#8217;s example is but one way that the combination of the availability heuristic and existing Internet services combine to affect privacy decisions. For example, consider how activity streams like Facebook News Feed influence how people perceive their audience. News Feed shows items drawn from an individual&#8217;s friends&#8217; activities, and they often have some reciprocal access. However, the items in the activity stream are likely unrepresentative of this potential and likely audience. &#8220;Lurkers&#8221; &#8212; people who consume but do not produce &#8212; are not as available to mind, and proliﬁc producers are too available to mind for how often they are in the actual audience for some new shared content. This can, for example, lead to making self-disclosures that are not appropriate for the actual audience.</p>
<ol class="footnotes"><li id="footnote_0_291" class="footnote">This might not be the case, see <a href="http://michaelzimmer.org/2010/04/14/how-your-private-tweets-might-be-included-in-the-library-of-congress-public-archive/">Michael Zimmer</a> and <a href="http://www.nytimes.com/2010/04/15/technology/15twitter.html">this New York Times article</a>.</li><li id="footnote_1_291" class="footnote">Why don&#8217;t people do this in the first place? Many may not be aware of the feature, but even if they are, there are reasons not to use it. For example, it makes any participation in topical conversations (e.g., around a hashtag) difficult or impossible.</li><li id="footnote_2_291" class="footnote">Or at least this control would have to be via Twitter, likely before archiving: <a href="http://www.prospect.org/cs/articles?article=the_library_of_congress_is_now_following_you_on_twitter">&#8220;We asked them [Twitter] to deal with the users; the library doesn&#8217;t want to mediate that.&#8221;</a></li></ol>]]></content:encoded>
			<wfw:commentRss>http://www.deaneckles.com/blog/291_public-once-public-always-privacy-egosurfing-and-the-availability-heuristic/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Not just predicting the present, but the future: Twitter and upcoming movies</title>
		<link>http://www.deaneckles.com/blog/266_not-just-predicting-the-present-but-the-future-twitter-and-upcoming-movies/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=not-just-predicting-the-present-but-the-future-twitter-and-upcoming-movies</link>
		<comments>http://www.deaneckles.com/blog/266_not-just-predicting-the-present-but-the-future-twitter-and-upcoming-movies/#comments</comments>
		<pubDate>Fri, 02 Apr 2010 19:02:01 +0000</pubDate>
		<dc:creator>Dean Eckles</dc:creator>
				<category><![CDATA[activity streams]]></category>
		<category><![CDATA[blogging]]></category>
		<category><![CDATA[communication]]></category>
		<category><![CDATA[consumption]]></category>
		<category><![CDATA[culture]]></category>
		<category><![CDATA[data collection]]></category>
		<category><![CDATA[markets]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[surveillance]]></category>

		<guid isPermaLink="false">http://www.deaneckles.com/blog/?p=266</guid>
		<description><![CDATA[Search queries have been used recently to &#8220;predict the present&#8220;, as Hal Varian has called it. Now some initial use of Twitter chatter to predict the future: The chatter in Twitter can accurately predict the box-office revenues of upcoming movies weeks before they are released. In fact, Tweets can predict the performance of films better [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.deaneckles.com/blog/233_search-terms-and-the-flu-preferring-complex-models/">Search queries have been used recently</a> to &#8220;<a href="http://googleresearch.blogspot.com/2009/04/predicting-present-with-google-trends.html">predict the present</a>&#8220;, as Hal Varian has called it. Now some initial use of Twitter chatter to predict the future:</p>
<blockquote><p>The chatter in Twitter can accurately predict the box-office revenues of upcoming movies weeks before they are released. In fact, Tweets can predict the performance of films better than market-based predictions, such as <a href="http://www.hsx.com/">Hollywood Stock Exchange</a>, which have been the best predictors to date. (<a href="http://www.kk.org/thetechnium/archives/2010/04/twitter_predict.php">Kevin Kelley</a>)</p></blockquote>
<p><a href="http://arxiv.org/abs/1003.5699">Here is the paper by Asur and Huberman from HP Labs</a>. Also see <a href="http://sloanreview.mit.edu/improvisations/2009/05/12/using-online-discussions-to-predict-sales/">a similar use of online discussion forums</a>.</p>
<p>But the obvious question from my previous post is, <a href="http://www.deaneckles.com/blog/233_search-terms-and-the-flu-preferring-complex-models/">how much improvement do you get by adding more inputs to the model?</a> That is, how does the combined Hollywood Stock Exchange and Twitter chatter model perform? The authors report adding the number of theaters the movie opens in to both models, but not combining them directly.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.deaneckles.com/blog/266_not-just-predicting-the-present-but-the-future-twitter-and-upcoming-movies/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Persuasion profiling and genres: Fogg in 2006</title>
		<link>http://www.deaneckles.com/blog/256_persuasion-profiling-and-genres-fogg-in-2006/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=persuasion-profiling-and-genres-fogg-in-2006</link>
		<comments>http://www.deaneckles.com/blog/256_persuasion-profiling-and-genres-fogg-in-2006/#comments</comments>
		<pubDate>Thu, 01 Apr 2010 00:58:49 +0000</pubDate>
		<dc:creator>Dean Eckles</dc:creator>
				<category><![CDATA[data collection]]></category>
		<category><![CDATA[HCI]]></category>
		<category><![CDATA[influence]]></category>
		<category><![CDATA[persuasion profiling]]></category>
		<category><![CDATA[persuasive technology]]></category>
		<category><![CDATA[privacy]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://www.deaneckles.com/blog/?p=256</guid>
		<description><![CDATA[Maurits Kaptein and I have recently been thinking a lot about persuasion profiling &#8212; estimating and adapting to individual differences in responses to influence strategies based on past behavior and other information. With help from students, we&#8217;ve been running experiments and building statistical models that implement persuasion profiling. My thinking on persuasion profiling is very [...]]]></description>
			<content:encoded><![CDATA[<p>Maurits Kaptein and I have recently been thinking a lot about <em>persuasion profiling</em> &#8212; estimating and adapting to individual differences in responses to influence strategies based on past behavior and other information. With help from students, we&#8217;ve been running experiments and building statistical models that implement persuasion profiling.</p>
<p>My thinking on persuasion profiling is very much in BJ Fogg&#8217;s footsteps, since he has been talking about persuasion profiling in courses, lab meetings, and personal discussions since 2004 or earlier.</p>
<p>Just yesterday, I came across <a href="http://www.ftc.gov/bcp/workshops/techade/pdfs/transcript_061107.pdf">this transcript</a> of BJ&#8217;s presentation for an <a href="http://www.ftc.gov/bcp/workshops/techade/">FTC hearing in 2006</a>. I was struck at how much it anticipates some of what Maurits and I have written recently (more on this later). I&#8217;m sure I watched <a href="http://www.youtube.com/watch?v=X_Pyy6NsP5s#t=6m04s">the draft video of the presentation</a> back then and it&#8217;s influenced me, even if I forgot some of the details.</p>
<p>Here is the relevant excerpt from BJ&#8217;s comments for the FTC:</p>
<blockquote><p>Persuasion profiling means that each one of us has a different set of persuasion strategies that affect us.  Just like we like different types of food or are vulnerable to giving in to different types of food on a diet, we are vulnerable to different types of persuasion strategies.</p>
<p>On the food example, I love old-fashioned popcorn, and if I go to a party and somebody has old-fashioned popcorn, I will probably break down and eat it.   On the persuasion side of things, I know I&#8217;m vulnerable to trying new things, to challenges and to anything that gets measured.  If that&#8217;s proposed to me, I&#8217;m going to be vulnerable and I&#8217;m going to give it a shot.</p>
<p><em>Whenever we go to a Web site and use an interactive system, it is likely they will be capturing what persuasion strategies work on us and will be using those when we use the service again.  The mapping out of what makes me tick, what motivates me can also be bought or sold, just like a credit report. </em></p>
<p><em>So imagine I&#8217;m going in to buy a new car and the person selling me the car downloads my credit report but also buys my persuasion profile.  I may or may not know about this.  Imagine if persuasion profiles are available on political campaigns so that when I visit a Web site, the system knows it is B.J. Fogg, and it changes [its] approach based on my vulnerabilities when it comes to persuasion. </em></p>
<p>Persuasive technology will touch our lives anywhere that we access digital products or services, in the car, in our living room, on the Web, through our mobile phones and so on.  Persuasive technology will be all around us, and unlike other media types, where you have 30-second commercial or a magazine ad, you have genres you can understand, when it comes to computer-based persuasion, it is so flexible that it won&#8217;t have genre boundaries.   It will come to us in the ordinary course of our lives, as we are working on a Web site, as we are editing a document, as we are driving a car. There won&#8217;t be clear markers about when you are being persuaded and when you are not.</p></blockquote>
<p>This last paragraph is about the &#8220;genrelessness&#8221; of many persuasive technologies. This isn&#8217;t directly on the topic of persuasion profiling, but I see it as critically relevant. Persuasion profiling is likely to be most effective when invisible and undisclosed to users. From this and the lack of genre-based flags for persuasive technology it follows that we will frequently be &#8220;persuasion profiled&#8221; without knowing it.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.deaneckles.com/blog/256_persuasion-profiling-and-genres-fogg-in-2006/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Search terms and the flu: preferring complex models</title>
		<link>http://www.deaneckles.com/blog/233_search-terms-and-the-flu-preferring-complex-models/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=search-terms-and-the-flu-preferring-complex-models</link>
		<comments>http://www.deaneckles.com/blog/233_search-terms-and-the-flu-preferring-complex-models/#comments</comments>
		<pubDate>Tue, 01 Dec 2009 07:54:22 +0000</pubDate>
		<dc:creator>Dean Eckles</dc:creator>
				<category><![CDATA[communication]]></category>
		<category><![CDATA[consumption]]></category>
		<category><![CDATA[culture]]></category>
		<category><![CDATA[data collection]]></category>
		<category><![CDATA[health]]></category>
		<category><![CDATA[research methods]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[sociology]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[surveillance]]></category>

		<guid isPermaLink="false">http://www.deaneckles.com/blog/?p=233</guid>
		<description><![CDATA[Simplicity has its draws. A simple model of some phenomena can be quick to understand and test. But with the resources we have today for theory building and prediction, it is worth recognizing that many phenomena of interest (e.g., in social sciences, epidemiology) are very, very complex. Using a more complex model can help. It&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<p>Simplicity has its draws. A simple model of some phenomena can be quick to understand and test. But with the resources we have today for theory building and prediction, it is worth recognizing that many phenomena of interest (e.g., in social sciences, epidemiology) are very, very complex. Using a more complex model can help. <a href="http://www.stat.columbia.edu/~cook/movabletype/archives/2009/07/that_modeling_f.html">It&#8217;s great to try many simple models along the way &#8212; as scaffolding &#8212; but if you have a large enough N in an observational study, a larger model will likely be an improvement.</a></p>
<p>One obvious way a model gets more complex is by adding predictors. There has recently been a good deal of attention on using the frequency of search terms to predict important goings-on &#8212; like flu trends. Sharad Goel et al. (<a href="http://messymatters.com/2009/11/30/what-can-search-predict/">blog post</a>, <a href="http://www.cam.cornell.edu/~sharad/papers/searchpreds.pdf">paper</a>) temper the excitement a bit by demonstrating that simple models using other, existing public data sets outperform the search data. In some cases (music popularity, in particular), adding the search data to the model improves predictions: the more complex combined model can &#8220;explain&#8221; some of the variance not handled by the more basic non-search-data models.</p>
<p><a href="http://messymatters.com/2009/11/30/what-can-search-predict/"><img class="alignleft" title="Model comparisons" src="http://messymatters.com/wp-content/uploads/2009/11/searchpreds.jpg" alt="" width="425" height="250" /></a></p>
<p>This echos one big takeaway from the Netflix Prize competition: committees win. The top competitors were all large teams formed from smaller teams and their models were tuned combinations of several models. That is, the strategy is, <em>take a bunch of complex models and combine them. </em></p>
<p>One way of doing this is just taking a weighted average of the predictions of several simpler models. <a href="http://lingpipe-blog.com/2009/09/29/convexity-of-root-mean-square-error-or-why-committees-won-the-netflix-prize/">This works quite well when your measure of the value of your model is root mean squared error (RMSE), since RMSE is convex.</a></p>
<p>While often the larger model &#8220;explains&#8221; more of the variance, what &#8220;explains&#8221; means here is just that the R-squared is larger: less of the variance is error. More complex models can be difficult to understand, just like the phenomena they model. We will continue to need better tools to understand, visualize, and evaluate our models as their complexity increases. I think the committee metaphor will be an interesting and practical one to apply in the many cases where the best we can do is use a weighted average of several simpler, pretty good models.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.deaneckles.com/blog/233_search-terms-and-the-flu-preferring-complex-models/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Reprioritizing human intelligence tasks for low latency and high throughput on Mechanical Turk</title>
		<link>http://www.deaneckles.com/blog/19_reprioritizing-human-intelligence-tasks-for-low-latency-and-high-throughput-on-mechanical-turk/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=reprioritizing-human-intelligence-tasks-for-low-latency-and-high-throughput-on-mechanical-turk</link>
		<comments>http://www.deaneckles.com/blog/19_reprioritizing-human-intelligence-tasks-for-low-latency-and-high-throughput-on-mechanical-turk/#comments</comments>
		<pubDate>Fri, 25 Jul 2008 04:07:25 +0000</pubDate>
		<dc:creator>Dean Eckles</dc:creator>
				<category><![CDATA[api]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[data collection]]></category>
		<category><![CDATA[HCI]]></category>
		<category><![CDATA[human performance modeling]]></category>
		<category><![CDATA[markets]]></category>
		<category><![CDATA[Mechanical Turk]]></category>
		<category><![CDATA[prototyping]]></category>
		<category><![CDATA[research methods]]></category>

		<guid isPermaLink="false">http://www.deaneckles.com/blog/19_reprioritizing-human-intelligence-tasks-for-low-latency-and-high-throughput-on-mechanical-turk/</guid>
		<description><![CDATA[Amazon Mechanical Turk is a platform and market for human intelligence tasks (HITs) that are submitted by requesters and completed by workers (or &#8220;turkers&#8221;).  Each HIT is associated with a payment, often a few cents. This post covers some basics of Mechanical Turk and shows its lack of designed-in support for dynamic reprioritization is problematic [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.mturk.com/">Amazon Mechanical Turk</a> is a platform and market for human intelligence tasks (<a href="http://docs.amazonwebservices.com/AWSMechanicalTurkRequester/2008-04-01/Concepts_HITsArticle.html">HITs</a>) that are submitted by <em>requesters </em>and completed by <em>workers </em>(or &#8220;turkers&#8221;).  Each HIT is associated with a payment, often a few cents. This post covers some basics of Mechanical Turk and shows its lack of designed-in support for dynamic reprioritization is problematic for some uses. I also mention some other factors that influence latency and throughput.</p>
<p>With mTurk one can create a HIT that asks someone to <a href="http://blog.doloreslabs.com/2008/04/search-engine-relevance-an-empirical-test/">rate some search results for a query</a>, <a href="http://asc-parc.blogspot.com/2008/03/how-to-reduce-cost-of-doing-user.html">evaluate the credibility of a Wikipedia article</a>, <a href="http://www.thesheepmarket.com/">draw a sheep facing left</a>, <a href="http://blog.doloreslabs.com/2008/03/where-does-blue-end-and-red-begin/">enter names for a provided color</a>, <a href="http://visionpc.cs.uiuc.edu/~largescale/index.html">annotate a photo of a person with pose information</a>, or <a href="http://hci.stanford.edu/mkrieger/research.html">create a storyboard illustrating a new product idea</a>. So Mechanical Turk can be used in many ways for basic research, building a training set for machine learning, or actually enabling a (perhaps prototype) service in use through a kind of <a href="http://www.deaneckles.com/blog/16_using-a-wizard-of-oz-technique-in-mobile-service-design-probing-with-realistic-motivations/">Wizard-of-Oz approach</a>. Additionally, I&#8217;ve used mTurk to code images captured by participants in a lab experiment (more on this in another post or article).</p>
<p>When creating HITs, a requester can specify a <a href="http://docs.amazonwebservices.com/AWSMechanicalTurkRequester/2008-04-01/ApiReference_QuestionFormDataStructureArticle.html">QuestionForm</a> (QF) (e.g., via command line tools or an SDK) that is then presented to the worker by Amazon. This can include images, free text answers, multiple choice, etc. Additionally one can embed Flash or Java objects in it. But the easiest way of creating HITs is to use a QF and not create a Java or Flash application of one&#8217;s own. This is especially true for HITs that are handled well by the basic question form. The other option is to create an <a href="http://docs.amazonwebservices.com/AWSMechanicalTurkRequester/2008-04-01/ApiReference_ExternalQuestionArticle.html">ExternalQuestion</a> (EQ), which is hosted on one&#8217;s own server and is displayed in an iFrame. This provides greater freedom but requires additional development and it is you that must host the page (though you can do so through Amazon&#8217;s S3). QF HITs (without embeds) also offer a familiar interface to workers (though it is possible to create a more efficient, custom interface by, e.g., making all the targets larger). So when possible, it is often preferable to use a QF rather than an EQ.</p>
<p>For some of the uses of mTurk for powering a service, it can be important to minimize latency for specific HITs<sup><a href="http://www.deaneckles.com/blog/19_reprioritizing-human-intelligence-tasks-for-low-latency-and-high-throughput-on-mechanical-turk/#footnote_0_19" id="identifier_0_19" class="footnote-link footnote-identifier-link" title="I use the term HIT somewhat loosely in this article. There are at least three uses that each differ in their identity conditions. (1) There are HITs considered as human intelligence tasks, and thus divided as we divide tasks; this means that a HIT in another sense can be composed of multiple HITs in this sense (tasks or sub-tasks). (2) There are HITs in Amazon&amp;#8217;s technical sense of the term: a HIT is something that has the same HIT ID and therefore has the same specification. In QF HITs without embeds, this means all instances (assignments) of a HIT are the same in content; but in EQ HIT this is not necessarily true, since the content can be determined when assigned. (3) Finally, there is what Amazon calls assignments, specific instances of a HITs that are only completed once.">1</a></sup>, including prioritizing particular new HITs over previously created HITs. For example, after some HIT has not been completed for a specific period after creation, it may still be important to complete it, but when it is completed may become less important. This can happen easily if the value of a HIT being completed has a sharp drop off after some time.</p>
<p>This should be done while maintaining high throughput; that is, you don&#8217;t want to reduce the rate at which your HITs are completed. When there are more HITs of the same type, workers can check a box to immediately start the next HIT of the same type when they submit the current one (see screenshot). Workers will often complete many HITs of the same type in a row. So throughput can drop substantially if any workers run out of HITs of the same type at any point: they may switch to another HIT type, or if they do your HITs once more appear, then there will be a delay. As we&#8217;ll see, these two requirements don&#8217;t seem to be well met by the platform &#8212; or at least certain uses of it.</p>
<p>Mechanical Turk does not provide a mechanism for prioritizing HITs of the same type, so without deleting all but particular high-priority HITs of that type, there is not a way to ensure that some particular HIT gets done before the rest. And deleting the other HITs would hurt throughput and increase latency for any new high-priority HITs added in the near future (since workers won&#8217;t simply start these once they finish their previous HITs).</p>
<p>EQ HITs allow one to avoid this problem. Unlike with QF HITs (without Flash and Java embeds), one does not have to specify the full content of the HIT in advance. When a worker accepts an EQ HIT, you can dynamically serve up the HIT you want to depending on changing priorities. But this means that you can&#8217;t take advantage of, e.g., the simplicity of creating and managing data from QF HITs. So though there are ways of coping, adding dynamic reprioritization to Mechanical Turk would be a boon for time-sensitive uses.</p>
<p>There are, of course, other factors that influence latency and throughput on mTurk when (EQ) HITs are reprioritized. Here are a few:</p>
<ul>
<li><strong>HIT and sub-tasks duration</strong>. How long does it take for workers to complete a HIT, which may be composed of multiple sub-tasks? A worker cannot be assigned a new HIT until they complete (or reject) the previous one. This can be somewhat avoided by creating longer HITs that are subdivided into dynamically selected sub-tasks. This can be done with an EQ HIT or an embedded Flash or Java application in a QF HIT. But the sub-task duration is always a limiting factor, unless one is willing to force abortion of the current sub-task, replacing it will still in progress (with an EQ, Flash, or Java).</li>
<li><strong>Available workers</strong>. How many workers are logged into mTurk and completing task? How many are currently switching HIT types? This can vary with the time of day.</li>
<li><strong>Appeal of your HITs</strong>. How much do workers like your HITs &#8212; are they fun? How much do you pay for how much you ask? How many of their completed assignments do you approve?</li>
<li><strong>Reliability</strong>. How accurate or precise must your results be? How many workers do you need to complete a HIT before you have reliable results? Do other workers need to complete meta-HITs before the data can be used?</li>
</ul>
<ol class="footnotes"><li id="footnote_0_19" class="footnote">I use the term HIT somewhat loosely in this article. There are at least three uses that each differ in their identity conditions. (1) There are HITs considered as human intelligence tasks, and thus divided as we divide tasks; this means that a HIT in another sense can be composed of multiple HITs in this sense (tasks or sub-tasks). (2) There are HITs in Amazon&#8217;s technical sense of the term: a HIT is something that has the same HIT ID and therefore has the same specification. In QF HITs without embeds, this means all instances (assignments) of a HIT are the same in content; but in EQ HIT this is not necessarily true, since the content can be determined when assigned. (3) Finally, there is what Amazon calls <em>assignments</em>, specific instances of a HITs that are only completed once.</li></ol>]]></content:encoded>
			<wfw:commentRss>http://www.deaneckles.com/blog/19_reprioritizing-human-intelligence-tasks-for-low-latency-and-high-throughput-on-mechanical-turk/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Using a Wizard of Oz technique in mobile service design: probing with realistic motivations</title>
		<link>http://www.deaneckles.com/blog/16_using-a-wizard-of-oz-technique-in-mobile-service-design-probing-with-realistic-motivations/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=using-a-wizard-of-oz-technique-in-mobile-service-design-probing-with-realistic-motivations</link>
		<comments>http://www.deaneckles.com/blog/16_using-a-wizard-of-oz-technique-in-mobile-service-design-probing-with-realistic-motivations/#comments</comments>
		<pubDate>Tue, 06 May 2008 04:53:26 +0000</pubDate>
		<dc:creator>Dean Eckles</dc:creator>
				<category><![CDATA[consumption]]></category>
		<category><![CDATA[context]]></category>
		<category><![CDATA[data collection]]></category>
		<category><![CDATA[design]]></category>
		<category><![CDATA[diary methods]]></category>
		<category><![CDATA[HCI]]></category>
		<category><![CDATA[mobile]]></category>
		<category><![CDATA[mobile messaging]]></category>
		<category><![CDATA[needfinding]]></category>
		<category><![CDATA[participatory media]]></category>
		<category><![CDATA[photography]]></category>
		<category><![CDATA[research methods]]></category>
		<category><![CDATA[Wizard of Oz]]></category>

		<guid isPermaLink="false">http://www.deaneckles.com/blog/16_using-a-wizard-of-oz-technique-in-mobile-service-design-probing-with-realistic-motivations/</guid>
		<description><![CDATA[As I&#8217;ve blogged before, I spoke at the Texting 4 Health conference on the topic of research methods for mobile messaging. One method I covered was an interesting use of Wizard of Oz techniques for designing mobile services. I&#8217;ve since started getting some of this material in writing for the Texting 4 Health book. Here [...]]]></description>
			<content:encoded><![CDATA[<p>As I&#8217;ve <a href="http://www.deaneckles.com/blog/13_texting-4-health-conference-in-review/">blogged </a>before, I spoke at the <a href="http://www.texting4health.org/">Texting 4 Health conference</a> on the topic of research methods for mobile messaging. One method I covered was an interesting use of Wizard of Oz techniques for designing mobile services. I&#8217;ve since started getting some of this material in writing for the Texting 4 Health book. Here is a taste of that material, minus the health-specific focus and examples.<br />
&#8212;&#8212;-<br />
Just like the famous Wizard of Oz, one can simulate something impressive with a just a humble person behind the curtain &#8212; and use this simulation to inform design decisions. When using a <a href="http://www.usabilitynet.org/tools/wizard.htm">Wizard of Oz technique</a> to study a prototype, a human “wizard” carries out functions that, in a deployed application or service, would be handled by a computer. This can allow evaluating a design without fully building what can be expensive back-end parts of the system (Kelley 1984). The technique is often used in recognition-based interfaces, but it also has traditional applications to identifying usability problems and carrying out experiments in which the interaction is systematically manipulated.</p>
<p>Wizard of Oz techniques are well suited to prototyping mobile services, especially those using mobile messaging (SMS, MMS, voice messaging). When participants send a request, a wizard reads or listens to it and chooses the appropriate response, or just creates it on-the-fly. Since all user actions in mobile messaging are discrete messages and (depending on the application) the user can often tolerate a short delay, a few part-time wizards, such as you and a colleague, can manage a short field trial. As you&#8217;ll see, <strong>this can be used for purposes beyond many traditional uses of a Wizard of Oz.</strong></p>
<p><strong>Probing photo consumption needs with realistic motivations</strong><br />
One use for this technique in designing a mobile messaging service is a bit like a diary study. In designing an online and mobile photography service, we wanted to better understand what photos people wanted to view and what prompted these desires.<sup><a href="http://www.deaneckles.com/blog/16_using-a-wizard-of-oz-technique-in-mobile-service-design-probing-with-realistic-motivations/#footnote_0_16" id="identifier_0_16" class="footnote-link footnote-identifier-link" title="This study was designed and executed at Yahoo! Research Berkeley by Shane Ahern, Nathan Good, Simon King, Mor Naaman, Rahul Nair, and myself.">1</a></sup>  Instead of just making diary entries, participants actually made voice requests to the system for photos – and received a mobile message with photos fitting the request in return. We didn’t need to first build a robust system that could do this; a few of us served as wizards, listening to the request, doing a couple manual searches, and choosing which photos to return on demand. Though this can be done with a normal voice call, we used a mobile client application that also recorded contextual information not available via a normal voice call (e.g. location), so that participants could make context-aware requests as they saw fit (e.g. &#8220;I want too see photos of this park&#8221;)</p>
<p>In this case, we didn’t plan to (specifically) create a voice-based photo search system; instead, like a diary study, this technique served as a probe to understand what we should build. As a probe it provided realistic motivations for submitting requests, as the request would actually be fulfilled. This design research, in additional to other interviews and a usability study, informed our creation of <a href="http://zurfer.research.yahoo.com">Zurfer</a>, a mobile application that supports exploring and conversing around personalized, location-aware channels of photos.<br />
It is great if the Wizard of Oz prototype is quite similar to what you later build, but it can yield valuable insights even if not. Sometimes it is precisely these insights that can lead you to substantially change your design.</p>
<p>This study design can apply in designing many mobile services. As in our photos study, participants can be interviewed about the trigger for the requests (why did they want that media or information) and how satisfied they were with the (human-created) responses.<sup><a href="http://www.deaneckles.com/blog/16_using-a-wizard-of-oz-technique-in-mobile-service-design-probing-with-realistic-motivations/#footnote_1_16" id="identifier_1_16" class="footnote-link footnote-identifier-link" title="Participants were informed that their requests would be seen by our research staff. Anonymization and strict limits of who the wizards are is necessary to protect participants&rsquo; privacy. Even if participants are not informed that a wizard is creating the responses until they are debriefed after the experiment, participants can nonetheless be notified that their responses are being reviewed by the research team.">2</a></sup></p>
<div class="references">
Kelley, J.F. (1984). An iterative design methodology for user-friendly natural language office information applications. In <em>ACM Trans. Inf. Syst.</em>,  vol. 2, pp. 26-41.
<div>
<ol class="footnotes"><li id="footnote_0_16" class="footnote">This study was designed and executed at Yahoo! Research Berkeley by Shane Ahern, Nathan Good, Simon King, Mor Naaman, Rahul Nair, and myself.</li><li id="footnote_1_16" class="footnote">Participants were informed that their requests would be seen by our research staff. Anonymization and strict limits of who the wizards are is necessary to protect participants’ privacy. Even if participants are not informed that a wizard is creating the responses until they are debriefed after the experiment, participants can nonetheless be notified that their responses are being reviewed by the research team.</li></ol>]]></content:encoded>
			<wfw:commentRss>http://www.deaneckles.com/blog/16_using-a-wizard-of-oz-technique-in-mobile-service-design-probing-with-realistic-motivations/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Texting 4 Health conference in review</title>
		<link>http://www.deaneckles.com/blog/13_texting-4-health-conference-in-review/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=texting-4-health-conference-in-review</link>
		<comments>http://www.deaneckles.com/blog/13_texting-4-health-conference-in-review/#comments</comments>
		<pubDate>Sat, 08 Mar 2008 04:14:49 +0000</pubDate>
		<dc:creator>Dean Eckles</dc:creator>
				<category><![CDATA[data collection]]></category>
		<category><![CDATA[diary methods]]></category>
		<category><![CDATA[events]]></category>
		<category><![CDATA[HCI]]></category>
		<category><![CDATA[health]]></category>
		<category><![CDATA[mobile]]></category>
		<category><![CDATA[mobile messaging]]></category>
		<category><![CDATA[mobile persuasion]]></category>
		<category><![CDATA[persuasive technology]]></category>
		<category><![CDATA[research methods]]></category>

		<guid isPermaLink="false">http://www.deaneckles.com/blog/13_texting-4-health-conference-in-review/</guid>
		<description><![CDATA[As I blogged already, I attended and spoke at the first Texting 4 Health conference at Stanford University last week. You can see my presentation slides at SlideShare here, and the program, with links to the slides for most speakers is here. The conference was very interesting, and there was quite the mix of participants [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.deaneckles.com/blog/11_texting-4-health/">As I blogged </a>already, I attended and spoke at the first <a href="http://www.texting4health.org/">Texting 4 Health</a> conference at Stanford University last week.  You can see my presentation slides at SlideShare <a href="http://www.slideshare.net/deaneckles/research-methods-for-mobile-messaging">here</a>, and the program, with links to the slides for most speakers is <a href="http://www.texting4health.org/page2/page2.html">here</a>.</p>
<p>The conference was very interesting, and there was quite the mix of participants &#8212; both speakers and others.  There were medical school faculty, business people, people from NGOs and foundations, technologists, representatives of government agencies and centers, futurists, and social scientists.  Everyone had something to learn &#8212; I know I did.  This also made it somewhat difficult as a speaker because it is hard to know how best to reach, inform, and hold the interest of such a diverse audience: what is common ground with some is entirely new territory with others.</p>
<p>I think my favorite session was &#8220;Changing Health Behavior via SMS&#8221;.  The methods used by the panelists to evaluate their interventions were both interesting to reflect on and good tools for persuading me of the importance and effectiveness of their work.  One of my reflections was about what factors to vary in doing experiments on health interventions: there is (reasonable) focus on having a no-SMS control condition, and there are very few studies with manipulations of dimensions more fine-grained.  Of course, the field is young and I understand how important true controls are in medical domains, but I think that real progress in understanding mobile messaging and designing effective interventions will require looking at more subtle and theoretically valuable manipulations.</p>
<p>You can see other posts about the conference <a href="http://trusted.md/blog/hippocrates/2008/03/05/texting_4_health_a_fascinating_look_into_sms_health_applications">here</a> and <a href="http://blog.aids.gov/2008/03/texting-4-healt.html">here</a>.  And the <a href="http://www.texting4health.org/">conference Web site</a> is also starting a blog to watch in the future.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.deaneckles.com/blog/13_texting-4-health-conference-in-review/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

