<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Ready-to-hand &#187; psychology</title>
	<atom:link href="http://www.deaneckles.com/blog/category/psychology/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.deaneckles.com/blog</link>
	<description>Dean Eckles on people, technology &#38; inference</description>
	<lastBuildDate>Wed, 11 Jan 2012 01:51:42 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Against between-subjects experiments</title>
		<link>http://www.deaneckles.com/blog/577_against-between-subjects-experiments/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=against-between-subjects-experiments</link>
		<comments>http://www.deaneckles.com/blog/577_against-between-subjects-experiments/#comments</comments>
		<pubDate>Thu, 09 Jun 2011 06:11:23 +0000</pubDate>
		<dc:creator>Dean Eckles</dc:creator>
				<category><![CDATA[average treatment effects]]></category>
		<category><![CDATA[individual differences]]></category>
		<category><![CDATA[psychology]]></category>
		<category><![CDATA[research methods]]></category>

		<guid isPermaLink="false">http://www.deaneckles.com/blog/?p=577</guid>
		<description><![CDATA[A less widely known reason for using within-subjects experimental designs in psychological science. In a within-subjects experiment, each participant experiences multiple conditions (say, multiple persuasive messages), while in a between-subjects experiment, each participant experiences only one condition. If you ask a random social psychologist, &#8220;Why would you run a within-subjects experiment instead of a between-subjects [...]]]></description>
			<content:encoded><![CDATA[<p><em>A less widely known reason for using within-subjects experimental designs in psychological science. In a </em>within-subjects<em> experiment, each participant experiences multiple conditions (say, multiple persuasive messages), while in a </em>between-subjects<em> experiment, each participant experiences only one condition.</em></p>
<p>If you ask a random social psychologist, &#8220;Why would you run a within-subjects experiment instead of a between-subjects experiments?&#8221;, the most likely answer is &#8220;power&#8221; &#8212; within-subjects experiments provide more power. That is, with the same number of participants, within-subjects experiments allow investigators to more easily tell that observed differences between conditions are not due to chance.<sup><a href="http://www.deaneckles.com/blog/577_against-between-subjects-experiments/#footnote_0_577" id="identifier_0_577" class="footnote-link footnote-identifier-link" title="And to more precisely estimate these differences. Though social psychologist often don&amp;#8217;t care about estimation, since many social psychological theories are only directional.">1</a></sup></p>
<p>Why do within-subjects experiments increase power? Because responses by the same individual are generally dependent; more specifically, they are often positively correlated. Say an experiment involves evaluating products, people, or policy proposals under different conditions, such as the presence of different persuasive cues or following different primes. It is often the case that participants who rate an item high on a scale under one condition will rate other items high on that scale under other condition. Or participants with short response times for one task will have relatively short response times for another task. Et cetera. This positive association might be due to stable characteristics of people or transient differences such as mood. Thus, the increase in power is due to heterogeneity in how individuals respond to the stimuli. </p>
<p>However, this advantage of within-subjects designs is frequently overridden in social psychology by the appeal of between-subjects designs. The latter are widely regarded as &#8220;cleaner&#8221; as they avoid carryover effects &#8212; in which one condition may effect responses to subsequent conditions experienced by the same participant. They can also be difficult to design when studies involve deception &#8212; even just deception about the purpose of the study &#8212; and one-shot encounters. Because of this, between-subjects designs are much more common in social psychology than within-subjects designs: investigators don&#8217;t regard the complexity of conducting within-subjects designs as worth it for the gain in power, which they regard as the primary advantage of within-subjects designs.</p>
<p>I want to point out another &#8212; but related &#8212; reason for using within-subjects designs: between-subjects experiments often do not allow consistent estimation of the parameters of interest. Now, between-subjects designs are great for estimating average treatment effects (ATEs), and ATEs can certainly be of great interest. For example, if one is interested how a design change to a web site will effect sales, an ATE estimated from an A-B test with the very same population will be useful. But this isn&#8217;t enough for psychological science for two reasons. First, social psychology experiments are usually very different from the circumstances of potential application: the participants are undergraduate students in psychology and the manipulations and situations are not realistic. So the ATE from a psychology experiment might not say much about the ATE for a real intervention. Second, social psychologists regard themselves as building and testing theories about psychological processes. By their nature, psychological processes occur within individuals. So an ATE won&#8217;t do &#8212; in fact, it can be a substantially biased estimate of the psychological parameter of interest. </p>
<p>To illustrate this problem, consider an example where the outcome of an experiment is whether the participant says that a job candidate should be hired. For simplicity, let&#8217;s say this is a binary outcome: either they say to hire them or not. Their judgements might depend on some discrete scalar X. Different participants may have different thresholds for hiring the applicant, but otherwise be effected by X in the same way. In a logistic model, that is, each participant has their own intercept but all the slopes are the same. This is depicted with the grey curves below.<sup><a href="http://www.deaneckles.com/blog/577_against-between-subjects-experiments/#footnote_1_577" id="identifier_1_577" class="footnote-link footnote-identifier-link" title="This example is very directly inspired by Alan Agresti&amp;#8217;s Categorical Data Analysis, p. 500.">2</a></sup><br />
<div id="attachment_583" class="wp-caption alignnone" style="width: 452px"><a href="http://www.deaneckles.com/blog/wp-content/uploads/2011/06/marginal_conditional_logit.png"><img src="http://www.deaneckles.com/blog/wp-content/uploads/2011/06/marginal_conditional_logit.png" alt="Comparison of marginal and conditional logit functions" title="Comparison of marginal and conditional logit functions" width="442" height="233" class="size-full wp-image-583" /></a><p class="wp-caption-text">Marginal (blue) and conditional (grey) expectation functions</p></div></p>
<p>These grey curves can be estimated if one has multiple observations per participant at different values of X. However, in a between-subjects experiment, this is not the case. As an estimate of a parameter of the psychological process common to all the participants, the estimated slope from a between-subjects experiment will be biased. This is clear in the figure above: the blue curve (the marginal expectation function) is shallower than any of the individual curves.</p>
<p>More generally, between-subjects experiments are good for estimating ATEs and making striking demonstrations. But they are often insufficient for investigating psychological processes since any heterogeneity &#8212; even only in intercepts &#8212; produces biased estimates of the parameters of psychological processes, including parameters that are universal in the population.</p>
<p>I see this as a strong motivation for doing more within-subjects experiments in social psychology. Unlike the power motivation for within-subjects designs, this isn&#8217;t solved by getting a larger sample of individuals. Instead, investigators need to think carefully about whether their experiments estimate any quantity of interest when there is substantial heterogeneity &#8212; as there generally is.<sup><a href="http://www.deaneckles.com/blog/577_against-between-subjects-experiments/#footnote_2_577" id="identifier_2_577" class="footnote-link footnote-identifier-link" title="The situation is made a bit &amp;#8220;better&amp;#8221; by the fact that social psychologists are often only concerned with determining the direction of effects, so maybe aren&amp;#8217;t worried that their estimates of parameters are biased. Of course, this is a problem in itself if the direction of the effect varies by individual. Here I have only treated the simpler case of universal function subject to a random shift.">3</a></sup></p>
<ol class="footnotes"><li id="footnote_0_577" class="footnote">And to more precisely estimate these differences. Though social psychologist often don&#8217;t care about estimation, since many social psychological theories are only directional.</li><li id="footnote_1_577" class="footnote">This example is very directly inspired by Alan Agresti&#8217;s <em>Categorical Data Analysis</em>, p. 500.</li><li id="footnote_2_577" class="footnote">The situation is made a bit &#8220;better&#8221; by the fact that social psychologists are often only concerned with determining the direction of effects, so maybe aren&#8217;t worried that their estimates of parameters are biased. Of course, this is a problem in itself if the direction of the effect varies by individual. Here I have only treated the simpler case of universal function subject to a random shift.</li></ol>]]></content:encoded>
			<wfw:commentRss>http://www.deaneckles.com/blog/577_against-between-subjects-experiments/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Marginal evidence for psychological processes</title>
		<link>http://www.deaneckles.com/blog/555_marginal-evidence-for-psychological-processes/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=marginal-evidence-for-psychological-processes</link>
		<comments>http://www.deaneckles.com/blog/555_marginal-evidence-for-psychological-processes/#comments</comments>
		<pubDate>Mon, 02 May 2011 07:38:27 +0000</pubDate>
		<dc:creator>Dean Eckles</dc:creator>
				<category><![CDATA[academia]]></category>
		<category><![CDATA[causal inference]]></category>
		<category><![CDATA[psychology]]></category>
		<category><![CDATA[research methods]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://www.deaneckles.com/blog/?p=555</guid>
		<description><![CDATA[Some comments on problems with investigating psychological processes using estimates of average (i.e. marginal) effects. Hence the play on words in the title. Social psychology makes a lot of being theoretical. This generally means not just demonstrating an effect, but providing evidence about the psychological processes that produce it. Psychological processes are, it is agreed, [...]]]></description>
			<content:encoded><![CDATA[<p><em>Some comments on problems with investigating psychological processes using estimates of average (i.e. marginal) effects. Hence the play on words in the title.</em></p>
<p>Social psychology makes a lot of being theoretical. This generally means not just demonstrating an effect, but providing evidence about the psychological processes that produce it. Psychological processes are, it is agreed, <em>intra-individual processes</em>. To tell a story about a psychological process is to posit something going on &#8220;inside&#8221; people. It is quite reasonable that this is how social psychology should work &#8212; and it makes it consistent with much of cognitive psychology as well.</p>
<p>But the evidence that social psychology uses to support these theories about these intra-individual processes is largely evidence about effects of experimental conditions (or, worse, non-manipulated measures) <em>averaged across many participants</em>. That is, it is using estimates of marginal effects as evidence of conditional effects. This is intuitively problematic. Now, there is no problem when using experiments to study effects and processes that are homogenous in the population. But, of course, they aren&#8217;t: heterogeneity abounds. There is variation in how factors affect different people. This is why the causal inference literature has emphasized the differences among the average treatment effect, (average) treatment effect on the treated, local average treatment effect, etc.</p>
<p>Not only is this disconnect between marginal evidence and conditional theory trouble in the abstract, we know it has already produced many problems in the social psychology literature.<sup><a href="http://www.deaneckles.com/blog/555_marginal-evidence-for-psychological-processes/#footnote_0_555" id="identifier_0_555" class="footnote-link footnote-identifier-link" title="The situation is bad enough that I (and some colleagues) certainly don&amp;#8217;t even take many results in social psych as more than providing a possibly interesting vocabulary.">1</a></sup> Baron and Kenny (1986) is the most cited paper published in the <em>Journal of Personality and Social Psychology</em>, the leading journal in the field. It paints an rosy picture of what it is like to investigate psychological processes. The methods of analysis it proposes for investigating processes are almost ubiquitous in social psych.<sup><a href="http://www.deaneckles.com/blog/555_marginal-evidence-for-psychological-processes/#footnote_1_555" id="identifier_1_555" class="footnote-link footnote-identifier-link" title="Luckily, my sense is that they are waning a bit, partially because of illustrations of the method&amp;#8217;s bias.">2</a></sup> The trouble is that this approach is severely biased in the face of heterogeneity in the processes under study. This is usually described as problem of correlated error terms, omitted-variables bias, or adjusting for post-treatment variables. This is all true. But, in the most common uses, it is perhaps more natural to think of it as a problem of mixing up marginal (i.e. average) and conditional effects.<sup><a href="http://www.deaneckles.com/blog/555_marginal-evidence-for-psychological-processes/#footnote_2_555" id="identifier_2_555" class="footnote-link footnote-identifier-link" title="To translate to the terms used before, note that we want to condition on unobserved (latent) heterogeneity. If one doesn&amp;#8217;t, then there is omitted variable bias. This can be done with models designed for this purpose, such as random effects models.">3</a></sup></p>
<p>What&#8217;s the solution? First, it is worth saying that average effects are worth investigating! Especially if you are evaluating a intervention or drug that might really be used &#8212; or if you are working at another level of analysis than psychology. But if psychological processes are your thing, you must do better. </p>
<p>Social psychologists sometimes do condition on individual characteristics, but often this is a measure of a single trait (e.g., need for cognition) that cannot plausibly exhaust all (or even much) of the heterogeneity in the effects under study. Without much larger studies, they cannot condition on more characteristics because of estimation problems (too many parameters for their N). So there is bound to be substantial heterogeneity.</p>
<p>Beyond this, I think social psychology could benefit from a lot more within-subjects experiments. Modern statistical computing (e.g., tools for fitting mixed-effects or multilevel models) makes it possible &#8212; even easy &#8212; to use such data to estimate effects of the manipulated factors for each participant. If they want to make credible claims about processes, then within-subjects designs &#8212; likely with many measurements of each person &#8212; are a good direction to more thoroughly explore.</p>
<ol class="footnotes"><li id="footnote_0_555" class="footnote">The situation is bad enough that I (and some colleagues) certainly don&#8217;t even take many results in social psych as more than providing a possibly interesting vocabulary.</li><li id="footnote_1_555" class="footnote">Luckily, my sense is that they are waning a bit, partially because of illustrations of the method&#8217;s bias.</li><li id="footnote_2_555" class="footnote">To translate to the terms used before, note that we want to condition on unobserved (latent) heterogeneity. If one doesn&#8217;t, then there is omitted variable bias. This can be done with models designed for this purpose, such as random effects models.</li></ol>]]></content:encoded>
			<wfw:commentRss>http://www.deaneckles.com/blog/555_marginal-evidence-for-psychological-processes/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Traits, adaptive systems &amp; dimensionality reduction</title>
		<link>http://www.deaneckles.com/blog/495_traits-adaptive-systems-dimensionality-reduction/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=traits-adaptive-systems-dimensionality-reduction</link>
		<comments>http://www.deaneckles.com/blog/495_traits-adaptive-systems-dimensionality-reduction/#comments</comments>
		<pubDate>Fri, 22 Apr 2011 03:07:58 +0000</pubDate>
		<dc:creator>Dean Eckles</dc:creator>
				<category><![CDATA[data collection]]></category>
		<category><![CDATA[HCI]]></category>
		<category><![CDATA[influence]]></category>
		<category><![CDATA[persuasion profiling]]></category>
		<category><![CDATA[persuasive technology]]></category>
		<category><![CDATA[psychology]]></category>
		<category><![CDATA[research methods]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://www.deaneckles.com/blog/?p=495</guid>
		<description><![CDATA[Psychologists have posited numerous psychological traits and described causal roles they ought to play in determining human behavior. Most often, the canonical measure of a trait is a questionnaire. Investigators obtain this measure for some people and analyze how their scores predict some outcomes of interest. For example, many people have been interested in how [...]]]></description>
			<content:encoded><![CDATA[<p>Psychologists have posited numerous psychological traits and described causal roles they ought to play in determining human behavior. Most often, the canonical measure of a trait is a questionnaire. Investigators obtain this measure for some people and analyze how their scores predict some outcomes of interest. For example, many people have been interested in how psychological traits affect persuasion processes. Traits like need for cognition (NFC) have been posited and questionnaire items developed to measure them. Among other things, NFC affects how people respond to messages with arguments for varying quality.</p>
<p><strong>How useful are these traits for explanation, prediction, and adaptive interaction?</strong> I can&#8217;t address all of this here, but I want to sketch an argument for their irrelevance to adaptive interaction &#8212; and then offer a tentative rejoinder.</p>
<p>Interactive technologies can tailor their messages to the tastes and susceptibilities of the people interacting with and through them. It might seem that these traits should figure in the statistical models used to make these adaptive selections. After all, some of the possible messages fit for, e.g., coaching a person to meet their exercise goals are more likely to be effective for low NFC people than high NFC people, and vice versa. However, the standard questionnaire measures of NFC cannot often be obtained for most users &#8212; certainly not in commerce settings, and even people signing up for a mobile coaching service likely don&#8217;t want to answer pages of questions. On the other hand, some Internet and mobile services have other abundant data available about their users, which could perhaps be used to construct an alternative measure of these traits. <strong>The trait-based-adaptation recipe is</strong>: </p>
<ol>
<li>obtain the questionnaire measure of the trait for a sample, </li>
<li>predict this measure with data available for many individuals (e.g., log data), </li>
<li>use this model to construct a measure for out-of-sample individuals. </li>
</ol>
<p>This new measure could then be used to personalize the interactive experience based on this trait, such that if a version performs well (or poorly) for people with a particular score on the trait, then use (or don&#8217;t use) that version for people with similar scores.</p>
<p><strong>But why involve the trait at all?</strong> Why not just personalize the interactive experience based on the responses of similar others? Since the new measure of the trait is just based on the available behavioral, demographic, and other logged data, one could simply predict responses based on those measure. Put in geometric terms, if the goal is to project the effects of different message onto available log data, why should one project the questionnaire measure of the trait onto the available log data and then project the effects onto this projection? This seems especially unappealing if one doesn&#8217;t fully trust the questionnaire measure to be accurate or one can&#8217;t be sure about which the set of all the traits that make a (substantial) difference.</p>
<p>I find this argument quite intuitively appealing, and it seems to resonate with others.<sup><a href="http://www.deaneckles.com/blog/495_traits-adaptive-systems-dimensionality-reduction/#footnote_0_495" id="identifier_0_495" class="footnote-link footnote-identifier-link" title="I owe some clarity on this to some conversations with Mike Nowak, Maurits Kaptein, and others.">1</a></sup> But I think there are some reasons the recipe above could still be appealing.</p>
<p>One way to think about this recipe is as dimensionality reduction guided by theory about psychological traits. Available log data can often be used to construct countless predictors (or &#8220;features&#8221;, as the machine learning people call them). So one can very quickly get into a situation where the effective number of parameters for a full model predicting the effects of different messages is very large and will make for poor predictions. Nothing &#8212; no, not penalized regression, not even a support vector machine &#8212; makes this problem go away. Instead, one has to rely on the domain knowledge of the person constructing the predictors (i.e., doing the &#8220;feature engineering&#8221;) to pick some good ones.</p>
<p>So the tentative rejoinder is this: established psychological traits might often make good dimensions to predict effects of different version of a message, intervention, or experience with. And they may &#8220;come with&#8221; suggestions about what kinds of log data might serve as measures of them. They would be expected to be reusable across settings. Thus, I think this recipe is nonetheless deserves serious attention.</p>
<ol class="footnotes"><li id="footnote_0_495" class="footnote">I owe some clarity on this to some conversations with Mike Nowak, Maurits Kaptein, and others.</li></ol>]]></content:encoded>
			<wfw:commentRss>http://www.deaneckles.com/blog/495_traits-adaptive-systems-dimensionality-reduction/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>Applying social psychology</title>
		<link>http://www.deaneckles.com/blog/333_applying-social-psychology/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=applying-social-psychology</link>
		<comments>http://www.deaneckles.com/blog/333_applying-social-psychology/#comments</comments>
		<pubDate>Wed, 20 Apr 2011 22:00:59 +0000</pubDate>
		<dc:creator>Dean Eckles</dc:creator>
				<category><![CDATA[causal inference]]></category>
		<category><![CDATA[econometrics]]></category>
		<category><![CDATA[psychology]]></category>
		<category><![CDATA[research methods]]></category>
		<category><![CDATA[science studies]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://www.deaneckles.com/blog/?p=333</guid>
		<description><![CDATA[Some reflections on how &#8220;quantitative&#8221; social psychology is and how this matters for its application to design and decision-making &#8212; especially in industries touched by the Internet. In many ways, contemporary social psychology is dogmatically quantitative. Investigators run experiments, measure quantitative outcomes (even coding free responses to make them amenable to analysis), and use statistics [...]]]></description>
			<content:encoded><![CDATA[<p><em>Some reflections on how &#8220;quantitative&#8221; social psychology is and how this matters for its application to design and decision-making &#8212; especially in industries touched by the Internet.</em></p>
<p>In many ways, contemporary social psychology is dogmatically quantitative. Investigators run experiments, measure quantitative outcomes (even coding free responses to make them amenable to analysis), and use statistics to characterize the collected data. On the other hand, social psychology&#8217;s processes of stating and integrating its conclusions remain largely qualitative. Many hypotheses in social psychology state that some factor affects a process or outcome in one direction (i.e., &#8220;call&#8221; either beta > 0 or beta < 0). Reviews of research in social psychology often start with a simple effect and then note how many other variables moderate this effect. This is all quite fitting with the dominance of null-hypothesis significance testing (NHST) in much of psychology: rather than producing point estimates or confidence intervals for causal effects, it is enough to simply see how likely the observed data is given there there is no effect.<sup><a href="http://www.deaneckles.com/blog/333_applying-social-psychology/#footnote_0_333" id="identifier_0_333" class="footnote-link footnote-identifier-link" title="To parrot Andrew Gelman, in social phenomena, everything affects everything else. There are no betas that are exactly zero.">1</a></sup> Of course, there have been many efforts to change this. Many journals require reporting effect sizes. This is a good thing, but these effect sizes are rarely predicted by social psychological theory. Rather, they are reported to aid judgments of whether a finding is not only statistically significant but substantively or practically significant, and the theory predicts the direction of the effect.</p>
<p>Not only is this process of reporting and combining results not quantitative in many ways, but it requires substantial inference from the particular settings of conducted experiments to the present settings. This actually helps to make sense of the practices described above: many social psychology experiments are conducted in conditions and with populations that are so different from those in which people would like to apply the resulting theories, that expecting consistency of effect sizes is implausible.<sup><a href="http://www.deaneckles.com/blog/333_applying-social-psychology/#footnote_1_333" id="identifier_1_333" class="footnote-link footnote-identifier-link" title="It&#039;s also often implausible that the direction of the effect must be preserved.">2</a></sup> This is not to say that these studies cannot tell us a good deal about how people will behave in many circumstances. It's just that figuring out what they predict and whether these predictions are reliable is a very messy, qualitative process.</p>
<p>Thus, when it comes to making decisions -- about a policy, intervention, or service -- based on social-psychological research, this process is largely qualitative. Decision-makers can ask, <a href="http://brenocon.com/blog/2008/12/statistics-vs-machine-learning-fight/#comment-908">which effects are in play?</a> What is their direction? With interventions and measurement that are very likely different from the present case, how large were the effects?<sup><a href="http://www.deaneckles.com/blog/333_applying-social-psychology/#footnote_2_333" id="identifier_2_333" class="footnote-link footnote-identifier-link" title="Major figures in social psychology, such as Lee Ross, have worked on trying to better anticipate the effects of social interventions from theory. It isn&amp;#8217;t easy.">3</a></sup></p>
<p>Sometimes this is the best that social science can provide. And such answers can be quite useful in design. The results of psychology experiments can often be very effective when used generatively. For example, designers can use taxonomies of persuasive strategies to dream up some ways of producing desired behavior change.</p>
<p>Nonetheless, I think all this can be contrasted with some alternative practices that are both more quantitative and require less of this uneasy generalization. First, social scientists can give much more attention to point estimates of parameters. While not without its (other) flaws, the economics literature on financial returns to education has aimed to provide, criticize, and refine estimates of just how much wages increase (on average) with more education.<sup><a href="http://www.deaneckles.com/blog/333_applying-social-psychology/#footnote_3_333" id="identifier_3_333" class="footnote-link footnote-identifier-link" title="The diversity of the manipulations used by social psychologists ostensibly studying the same thing can make this more difficult.">4</a></sup></p>
<p>Second, researchers can avoid much of the messiest kinds of generalization altogether. Within the Internet industry, product optimization experiments are ubiquitous. Google, Yahoo, Facebook, Microsoft, and many others are running hundreds to thousands of simultaneous experiments with parts of their services. This greatly simplifies generalization: the exact intervention under consideration has just been tried with a random sample from the very population it will be applied to. If someone wants to tweak the intervention, just try it again before launching. This process still involves human judgment about how to react to these results.<sup><a href="http://www.deaneckles.com/blog/333_applying-social-psychology/#footnote_4_333" id="identifier_4_333" class="footnote-link footnote-identifier-link" title="Generalization is not avoided. In particular, decision-makers often have to consider what would happen if an intervention tested with 1% of the population is launched for the whole population. There are all kinds of issues relating to peer influence, network effects, congestion, etc., here that don&amp;#8217;t allow for simple extrapolation from the treatment effects identified by the experiment. Nonetheless, these challenges obviously apply to most research that aims to predict the effects of causes.">5</a></sup> An even more extreme alternative is when machine learning is used to fine-tune, e.g., recommendations without direct involvement (or understanding) by humans.</p>
<p>So am I saying that <strong>social psychology &#8212; at least as an enterprise that is useful to designers and decision-makers &#8212; is going to be replaced by simple &#8220;bake-off&#8221; experiments and machine learning</strong>? Not quite. Unlike product managers at Google, many decision-makers don&#8217;t have the ability to cheaply test a proposed intervention on their population of interest.<sup><a href="http://www.deaneckles.com/blog/333_applying-social-psychology/#footnote_5_333" id="identifier_5_333" class="footnote-link footnote-identifier-link" title="However, Internet services play a more and more central role in many parts of our life, so this doesn&amp;#8217;t just have to be limited to the Internet industry itself.">6</a></sup> Even at Google, many changes (or new products) under consideration are too difficult to build to them all: one has to decide among an overabundance of options before the most directly applicable data could be available. This is consistent with my note above that social-psychological findings can make excellent inspiration during idea generation and early evaluation. </p>
<ol class="footnotes"><li id="footnote_0_333" class="footnote">To parrot Andrew Gelman, in social phenomena, everything affects everything else. There are no betas that are exactly zero.</li><li id="footnote_1_333" class="footnote">It's also often implausible that the direction of the effect must be preserved.</li><li id="footnote_2_333" class="footnote">Major figures in social psychology, such as Lee Ross, have worked on trying to better anticipate the effects of social interventions from theory. It isn&#8217;t easy.</li><li id="footnote_3_333" class="footnote">The diversity of the manipulations used by social psychologists ostensibly studying the same thing can make this more difficult.</li><li id="footnote_4_333" class="footnote">Generalization is not avoided. In particular, decision-makers often have to consider what would happen if an intervention tested with 1% of the population is launched for the whole population. There are all kinds of issues relating to peer influence, network effects, congestion, etc., here that don&#8217;t allow for simple extrapolation from the treatment effects identified by the experiment. Nonetheless, these challenges obviously apply to most research that aims to predict the effects of causes.</li><li id="footnote_5_333" class="footnote">However, Internet services play a more and more central role in many parts of our life, so this doesn&#8217;t just have to be limited to the Internet industry itself.</li></ol>]]></content:encoded>
			<wfw:commentRss>http://www.deaneckles.com/blog/333_applying-social-psychology/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Will the desire for other perspectives trump the &#8220;friendly world syndrome&#8221;?</title>
		<link>http://www.deaneckles.com/blog/454_will-the-desire-for-other-perspectives-trump-the-friendly-world-syndrome/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=will-the-desire-for-other-perspectives-trump-the-friendly-world-syndrome</link>
		<comments>http://www.deaneckles.com/blog/454_will-the-desire-for-other-perspectives-trump-the-friendly-world-syndrome/#comments</comments>
		<pubDate>Fri, 04 Feb 2011 08:02:46 +0000</pubDate>
		<dc:creator>Dean Eckles</dc:creator>
				<category><![CDATA[activity streams]]></category>
		<category><![CDATA[availability heuristic]]></category>
		<category><![CDATA[communication]]></category>
		<category><![CDATA[consumption]]></category>
		<category><![CDATA[culture]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[friendly world syndrome]]></category>
		<category><![CDATA[HCI]]></category>
		<category><![CDATA[heuristics]]></category>
		<category><![CDATA[influence]]></category>
		<category><![CDATA[participatory media]]></category>
		<category><![CDATA[psychology]]></category>
		<category><![CDATA[social networks]]></category>
		<category><![CDATA[social software]]></category>
		<category><![CDATA[sociology]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://www.deaneckles.com/blog/?p=454</guid>
		<description><![CDATA[Some recent journalism at NPR and The New York Times has addressed some aspects of the &#8220;friendly world syndrome&#8221; created by personalized media. A theme common to both pieces is that people want to encounter different perspectives and will use available resources to do so. I&#8217;m a bit more skeptical. Here&#8217;s Natasha Singer at The [...]]]></description>
			<content:encoded><![CDATA[<p>Some recent journalism at NPR and The New York Times has addressed some aspects of the <a href="http://www.deaneckles.com/blog/386_the-friendly-world-syndrome-induced-by-simple-filtering-rules/">&#8220;friendly world syndrome&#8221; created by personalized media</a>. A theme common to both pieces is that people want to encounter different perspectives and will use available resources to do so. I&#8217;m a bit more skeptical.</p>
<p>Here&#8217;s <a href="http://www.nytimes.com/2011/02/06/business/06stream.html">Natasha Singer at The New York Times on cascades of memes, idioms, and links through online social networks (e.g., Twitter)</a>:</p>
<blockquote><p>If we keep seeing the same links and catchphrases ricocheting around our social networks, it might mean we are being exposed only to what we want to hear, says Damon Centola, an assistant professor of economic sociology at the Massachusetts Institute of Technology.</p>
<p>“You might say to yourself: ‘I am in a group where I am not getting any views other than the ones I agree with. I’m curious to know what else is out there,’” Professor Centola says.</p>
<p>Consider a new hashtag: diversity. </p></blockquote>
<p>This is how Singer ends this article in which the central example is &#8220;icantdateyou&#8221; leading Egypt-related idioms as a trending topic on Twitter. The suggestion here, by Centola and Singer, is that people will notice they are getting a biased perspective of how many people agree with them and what topics people care about &#8212; and then will take action to get other perspectives. </p>
<p>Why am I skeptical? </p>
<p>First, I doubt that we really realize the extent to which media &#8212; and personalized social media in particular &#8212; bias their perception of the frequency of beliefs and events. Even though people know that fiction TV programs (e.g., cop shows) don&#8217;t aim to represent reality, heavy TV watchers (on average) substantially overestimate the percent of adult men employed in law enforcement.<sup><a href="http://www.deaneckles.com/blog/454_will-the-desire-for-other-perspectives-trump-the-friendly-world-syndrome/#footnote_0_454" id="identifier_0_454" class="footnote-link footnote-identifier-link" title="Gerbner, G., Gross, L., Morgan, M., &amp;#038; Signorielli, N. (1980). The &ldquo;Mainstreaming&rdquo; of America: Violence Profile No. 11. Journal of Communication, 30(3), 10-29.">1</a></sup> That is, the processes that produce the &#8220;friendly world syndrome&#8221; function without conscious awareness and, perhaps, even despite it. So people can&#8217;t consciously choose to seek out diverse perspectives if they don&#8217;t know they are increasingly missing them.</p>
<p>Second, I doubt that people actually want diversity of perspectives all that much. Even if I realize divergent views are missing from my media experience, why would I seek them out? This might be desirable for some people (but not all), and even for those, the desire to encounter people who radically disagree has its limits.</p>
<p>Similar ideas pop up in a NPR <em>All Things Considered</em> segment by Laura Sydell. This short piece (<a href="   http://www.npr.org/2011/02/03/133469245/anti-social-networks-were-just-as-cliquey-online">audio</a>, <a href="http://www.npr.org/templates/transcript/transcript.php?storyId=133469245">transcript</a>) is part of NPR&#8217;s &#8220;Cultural Fragmentation&#8221; series. The segment begins with the worry that offline bubbles are replicated online and quotes me describing how attempts to filter for personal relevance also heighten the bias towards agreement in personalized media. </p>
<p>But much of the piece has actually focuses on how one person &#8212; Kyra Gaunt, a professor and musician &#8212; is using Twitter to connect and converse with new and different people. Gaunt describes her experience on Twitter as featuring debate, engagement, and &#8220;learning about black people even if you&#8217;ve never seen one before&#8221;. Sydell&#8217;s commentary identifies the public nature of Twitter as an important factor in facilitating experiencing diverse perspectives:</p>
<blockquote><p>
But, even though there is a lot of conversation going on among African Americans on Twitter, Professor Gaunt says it&#8217;s very different from the closed nature of Facebook because tweets are public.
</p></blockquote>
<p>I think this is true to some degree: much of the content produced by Facebook users is indeed public, but Facebook does not make it as easily searchable or discoverable (e.g., through trending topics). But more importantly, Facebook and Twitter differ in their affordances for conversation. Facebook ties responses to the original post, which means both that the original poster controls who can reply and that everyone who replies is part of the same conversation. Twitter supports replies through the @reply mechanism, so that anyone can reply but the conversation is fragmented, as repliers and consumers often do not see all replies. So, <a href="http://www.deaneckles.com/blog/386_the-friendly-world-syndrome-induced-by-simple-filtering-rules/">as I&#8217;ve described</a>, even if you follow a few people you disagree with on Twitter, you&#8217;ll most likely see replies from the other people you follow, who &#8212; more often than not &#8212; you agree with.</p>
<p>Gaunt&#8217;s experience with Twitter is certainly not typical. <a href="http://twitter.com/kyraocity">She has over 3,300 followers and follows over 2,400</a>, so many of her posts will generate <a href="http://twitter.com/#!/search/%40kyraocity">replies</a> from people she doesn&#8217;t know well but whose replies will appear in her main feed. And &#8212; if she looks beyond her main feed to the <a href="http://twitter.com/#!/mentions">@Mentions page</a> &#8212; she will see the replies from even those she does not follow herself. On the other hand, her followers will likely only see her posts and replies from others they follow.<sup><a href="http://www.deaneckles.com/blog/454_will-the-desire-for-other-perspectives-trump-the-friendly-world-syndrome/#footnote_1_454" id="identifier_1_454" class="footnote-link footnote-identifier-link" title="One nice feature in &amp;#8220;new Twitter&amp;#8221; &amp;#8212; the recently refresh of the Twitter user interface &amp;#8212; is that clicking on a tweet will show some of the replies to it in the right column. This may offer an easier way for followers to discover diverse replies to the people they follow. But it is also not particularly usable, as it is often difficult to even trace what a reply is a reply to.">2</a></sup></p>
<p>Nonetheless, Gaunt&#8217;s case is worth considering further, as does Sydell:</p>
<blockquote><p>
SYDELL: Gaunt says she&#8217;s made new friends through Twitter.</p>
<p>GAUNT: I&#8217;m meeting strangers. I met with two people I had engaged with through Twitter in the past 10 days who I&#8217;d never met in real time, in what we say in IRL, in real life. And I met them, and I felt like <em>this is my tribe</em>.</p>
<p>SYDELL: And Gaunt says they weren&#8217;t black. <em>But the key word for some observers is tribe. Although there are people like Gaunt who are using social media to reach out, some observers are concerned that she is the exception to the rule, that most of us will be content to stay within our race, class, ethnicity, family or political party.</em>
</p></blockquote>
<p>So Professor Gaunt is likely making connections with people she would not have otherwise. But &#8212; it is at least tempting to conclude from &#8220;this is my tribe&#8221; &#8212; they are not people with radically different beliefs and values, even if they have arrived at those beliefs and values from a membership in a different race or class.</p>
<ol class="footnotes"><li id="footnote_0_454" class="footnote">Gerbner, G., Gross, L., Morgan, M., &#038; Signorielli, N. (1980). The “Mainstreaming” of America: Violence Profile No. 11. Journal of Communication, 30(3), 10-29.</li><li id="footnote_1_454" class="footnote">One nice feature in &#8220;new Twitter&#8221; &#8212; the recently refresh of the Twitter user interface &#8212; is that clicking on a tweet will show some of the replies to it in the right column. This may offer an easier way for followers to discover diverse replies to the people they follow. But it is also not particularly usable, as it is often difficult to even trace what a reply is a reply to.</li></ol>]]></content:encoded>
			<wfw:commentRss>http://www.deaneckles.com/blog/454_will-the-desire-for-other-perspectives-trump-the-friendly-world-syndrome/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>The &#8220;friendly world syndrome&#8221; induced by simple filtering rules</title>
		<link>http://www.deaneckles.com/blog/386_the-friendly-world-syndrome-induced-by-simple-filtering-rules/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=the-friendly-world-syndrome-induced-by-simple-filtering-rules</link>
		<comments>http://www.deaneckles.com/blog/386_the-friendly-world-syndrome-induced-by-simple-filtering-rules/#comments</comments>
		<pubDate>Wed, 10 Nov 2010 08:27:22 +0000</pubDate>
		<dc:creator>Dean Eckles</dc:creator>
				<category><![CDATA[activity streams]]></category>
		<category><![CDATA[availability heuristic]]></category>
		<category><![CDATA[communication]]></category>
		<category><![CDATA[consumption]]></category>
		<category><![CDATA[friendly world syndrome]]></category>
		<category><![CDATA[heuristics]]></category>
		<category><![CDATA[mental models]]></category>
		<category><![CDATA[participatory media]]></category>
		<category><![CDATA[persuasive technology]]></category>
		<category><![CDATA[priming]]></category>
		<category><![CDATA[psychology]]></category>
		<category><![CDATA[social networks]]></category>
		<category><![CDATA[social software]]></category>

		<guid isPermaLink="false">http://www.deaneckles.com/blog/?p=386</guid>
		<description><![CDATA[I&#8217;ve written previously about how filtered activity streams can lead to biased views of behaviors in our social neighborhoods. Recent conversations with two people writing popular-press books on related topics have helped me clarify these ideas. Here I reprise previous comments on filtered activity streams, aiming to highlight how they apply even in the case [...]]]></description>
			<content:encoded><![CDATA[<p><em>I&#8217;ve <a href="http://www.deaneckles.com/blog/category/availability-heuristic/">written previously</a> about how filtered activity streams can lead to biased views of behaviors in our social neighborhoods. Recent conversations with two people writing popular-press books on related topics have helped me clarify these ideas. Here I reprise previous comments on filtered activity streams, aiming to highlight how they apply even in the case of simple and transparent personalization rules, such as those used by Twitter.</em><br />
&#8212;</p>
<p>Birds of a feather flock together. Once flying together, a flock is also subject to the same causes (e.g., storms, pests, prey). Our friends, family, neighbors, and colleagues are more similar to us for similar reasons (and others). So we should have no illusions that the behaviors, attitudes, outcomes, and beliefs of our social neighborhood are good indicators of those of other populations &#8212; like U.S. adults, Internet users, or <em>homo sapiens</em> of the past, present, or future. The apocryphal Pauline Kael quote &#8220;How could Nixon win? No one I know voted for him&#8221; suggests both the ease and error of this kind of inference. I take it as a given that people&#8217;s estimates of larger populations&#8217; behaviors and beliefs are often biased in the direction of the behaviors and beliefs in their social neighborhoods. This is the case with and without &#8220;social media&#8221; and filtered activity streams &#8212; and even mediated communication in general. </p>
<p>That is, even without media, our personal experiences are not &#8220;representative&#8221; of the American experience, human experience, etc., but we do (and must) rely on it anyway. One simple cognitive tool here is using &#8220;ease of retrieval&#8221; to estimate how common or likely some event is: we can estimate how common something is based on how easy it is to think of. So if something prompts someone to consider how common a type of event is, they will (on average) estimate the event as more common if it is more easy to think of an example of the event, imagine the event, etc. And our personal experiences provide these examples and determine how easy they are to bring to mind. Both prompts and immediately prior experience can thus affect these frequency judgments via ease of retrieval effects. </p>
<p>Now this is not to say that we should think as ease of retrieval heuristics as biases per se. Large classes and frequent occurrences are often more available to mind than those that are smaller or less frequent. It is just that this is also often not the case, especially when there is great diversity in frequency among physical and social neighborhoods. But certainly we can see some cases where these heuristics fail.</p>
<p>Media are powerful sources of experiences that can make availability and actual frequency diverge, whether by increasing the biases in the direction of projecting our social neighborhoods onto larger population or in other, perhaps unexpected directions. In a classic and controversial line of research in the 1970s and 80s, Gerbner and colleagues argued that increased television-watching produces a &#8220;mean world syndrome&#8221; such that watching more TV causes people to increasingly overestimate, e.g., the fraction of adult U.S. men employed in law enforcement and the probability of being a victim of violent crime. Their work did not focus on investigating heuristics producing these effects, but others have suggested the availability heuristic (and related ease of retrieval effects) as at work. So even if my social neighborhood has <em>fewer</em> cops or victims of violent crime than the national average, media consumption and the availability heuristic can lead me to <em>over</em>estimate both. </p>
<p>Personalized and filtered activity streams certainly also affect us through some of the same psychological processes, leading to biases in users&#8217; estimates of population-wide frequencies. They can aIso bias inference about our own social neighborhoods. If I try to estimate how likely a Facebook status update by a friend is to receive a comment, this estimate will be affected by the status updates I have seen recently. And if content with comments is more likely to be shown to me in my personalized filtered activity stream (a simple rule for selecting more interesting content, when there is too much for me to consume it all), then it will be easier for me to think of cases in which status updates by my friends do receive comments.</p>
<p>In my previous posts on these ideas, I have mainly focused on effects on beliefs about my social neighborhood and specifically behaviors and outcomes specific to the service providing the activity stream (e.g., receiving comments). But similar effects apply for beliefs about other behaviors, opinions, and outcomes. In particular, filtered activity streams can increase the sense that my social neighborhood (and perhaps the world) agrees with me. Say that content produced by my Facebook friends with comments and interaction from mutual friends is more likely to be shown in my filtered activity streams. Also assume that people are more likely to express their agreement in such a way than substantial disagreement. As long as I am likely to agree with most of my friends, then this simple rule for filtering produces an activity stream with content I agree with more than an unfiltered stream would. Thus, even if I have a substantial minority of friends with whom I disagree on politics, this filtering rule would likely make me see less of their content, since it is less likely to receive (approving) comments from mutual friends. </p>
<p>I&#8217;ve been casually calling this larger family of effects this the &#8220;friendly world syndrome&#8221; induced by filtered activity streams. Like the mean world syndrome of the television cultivation research described above, this picks out a family of unintentional effects of media. Unlike the mean world syndrome, the friendly world syndrome includes such results as overestimating how many friends I have in common with my friends, how much positive and accomplishment-reporting content my friends produce, and (as described) how much I agree with my friends.<sup><a href="http://www.deaneckles.com/blog/386_the-friendly-world-syndrome-induced-by-simple-filtering-rules/#footnote_0_386" id="identifier_0_386" class="footnote-link footnote-identifier-link" title="This might suggest that some of the false consensus effects observed in recent work using data collected about Facebook friends could be endogenous to Facebook. See Goel, S., Mason, W., &amp;#038; Watts, D. J. (2010). Real and perceived attitude agreement in social networks. Journal of Personality and Social Psychology, 99(4), 611-621. doi:10.1037/a0020697">1</a></sup></p>
<p>Even though the filtering rules I&#8217;ve described so far are quite simple and appealing, they still are more consistent with versions of activity streams that are filtered by fancy relevance models, which are often quite opaque to users. Facebook News Feed &#8212; and &#8220;Top News&#8221; in particular &#8212; is the standard example here. On the other hand, one might think that these arguments do not apply to Twitter, which does not apply any kind of machine learning model estimating relevance to filtering users&#8217; streams. But Twitter actually does implement a filtering rule with important similarities to the &#8220;comments from mutual friends&#8221; rule described above. Twitter only shows &#8220;<a href="http://blog.twitter.com/2008/05/how-replies-work-on-twitter-and-how.html">@replies</a>&#8221; to a user on their home page when that user is following both the poster of the reply and the person being replied to.<sup><a href="http://www.deaneckles.com/blog/386_the-friendly-world-syndrome-induced-by-simple-filtering-rules/#footnote_1_386" id="identifier_1_386" class="footnote-link footnote-identifier-link" title="Twitter offers the option to see all @replies written by people one is following, but 98% of users use the default option. Some users were unhappy with an earlier temporary removal of this feature. My sense is that the biggest complaint was that removing this feature removed a valuable means for discovering new people to follow.">2</a></sup> This rule makes a lot of sense, as a reply is often quite difficult to understand without the original tweet. Thus, I am much more likely to see people I follow replying to people I follow than to others (since the latter replies are encountered only from browsing away from the home page.  I think this illustrates how even a straightforward, transparent rule for filtering content can magnify false consensus effects.</p>
<p>One aim in writing this is to clarify that a move from filtering activity streams using opaque machine learning models of relevance to filtering them with simple, transparent, user-configurable rules will likely be insufficient to prevent the friendly world syndrome. This change might have many positive effects and even reduce some of these effects by making people mindful of the filtering.<sup><a href="http://www.deaneckles.com/blog/386_the-friendly-world-syndrome-induced-by-simple-filtering-rules/#footnote_2_386" id="identifier_2_386" class="footnote-link footnote-identifier-link" title="We are investigating this in ongoing experimental research. Also see Schwarz, N., Bless, H., Strack, F., Klumpp, G., Rittenauer-Schatka, H., &amp;#038; Simons, A. (1991). Ease of retrieval as information: Another look at the availability heuristic. Journal of Personality and Social Psychology, 61(2), 195-202. doi:10.1037/0022-3514.61.2.195">3</a></sup> But I don&#8217;t think these effects are so easily avoided in any media environment that includes sensible personalization for increased relevance and engagement.</p>
<ol class="footnotes"><li id="footnote_0_386" class="footnote">This might suggest that some of the false consensus effects observed in recent work using data collected about Facebook friends could be endogenous to Facebook. See Goel, S., Mason, W., &#038; Watts, D. J. (2010). Real and perceived attitude agreement in social networks. <em>Journal of Personality and Social Psychology, 99</em>(4), 611-621. doi:10.1037/a0020697</li><li id="footnote_1_386" class="footnote">Twitter offers the option to see all @replies written by people one is following, but 98% of users use the default option. Some users were unhappy with an earlier temporary removal of this feature. My sense is that the biggest complaint was that removing this feature removed a valuable means for discovering new people to follow.</li><li id="footnote_2_386" class="footnote">We are investigating this in ongoing experimental research. Also see Schwarz, N., Bless, H., Strack, F., Klumpp, G., Rittenauer-Schatka, H., &#038; Simons, A. (1991). Ease of retrieval as information: Another look at the availability heuristic. <em>Journal of Personality and Social Psychology, 61</em>(2), 195-202. doi:10.1037/0022-3514.61.2.195</li></ol>]]></content:encoded>
			<wfw:commentRss>http://www.deaneckles.com/blog/386_the-friendly-world-syndrome-induced-by-simple-filtering-rules/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Search queries in referrer headers: Technical knowledge, privacy, and the status quo</title>
		<link>http://www.deaneckles.com/blog/351_search-queries-in-referrer-headers-technical-knowledge-privacy-and-the-status-quo/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=search-queries-in-referrer-headers-technical-knowledge-privacy-and-the-status-quo</link>
		<comments>http://www.deaneckles.com/blog/351_search-queries-in-referrer-headers-technical-knowledge-privacy-and-the-status-quo/#comments</comments>
		<pubDate>Tue, 12 Oct 2010 06:05:57 +0000</pubDate>
		<dc:creator>Dean Eckles</dc:creator>
				<category><![CDATA[automaticity]]></category>
		<category><![CDATA[mental models]]></category>
		<category><![CDATA[privacy]]></category>
		<category><![CDATA[psychology]]></category>
		<category><![CDATA[risk]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://www.deaneckles.com/blog/?p=351</guid>
		<description><![CDATA[I have been fascinated by Christopher Soghoian&#8216;s complaint to the FTC about Google&#8217;s practices of including search query information in the HTTP referrer header. In summary, Google has taken proactive efforts to ensure that Web site owners that get visitors from Google search receive the search terms entered by Google&#8217;s users. Meanwhile, Google has agreed [...]]]></description>
			<content:encoded><![CDATA[<p>I have been fascinated by <a href="http://www.dubfire.net/">Christopher Soghoian</a>&#8216;s <a href="http://online.wsj.com/public/resources/documents/FTCcomplaint100710.pdf">complaint</a> to the FTC about Google&#8217;s practices of including search query information in the HTTP referrer header. </p>
<p>In summary, Google has taken proactive efforts to ensure that Web site owners that get visitors from Google search receive the search terms entered by Google&#8217;s users. Meanwhile, Google has agreed that search query data is personally sensitive information and that it does not disclosure this information, except under specific, limited circumstances; this is reflected in its privacy policy. Note that Google has not just let the URL do the work, but has specifically worked to make the referrer header include search terms (and additional information) when it has adopted techniques that would otherwise prevent these disclosures from being made. (For a fuller summary, see <a href="http://paranoia.dubfire.net/2010/10/my-ftc-complaint-about-googles-private.html">his blog post</a> and <a href="http://blogs.wsj.com/digits/2010/10/07/former-ftc-employee-files-complaint-over-google-privacy/">this WSJ article</a>. Or <a href="http://searchengineland.com/privacy-the-creepy-line-and-beyond-its-not-just-about-google-52563">this article at Search Engine Land</a>.)</p>
<p>I am not going to discuss the ethics and legal issues in this particular case. Instead, I just want to draw attention to how this issue reveals the importance of technical knowledge in thinking about privacy issues.</p>
<p>A common response from people working in the Internet industry is that Soghoian is a non-techie that has suddenly &#8220;discovered&#8221; referrer headers. For example, Danny Sullivan <a href="http://twitter.com/#!/dannysullivan/status/26693206178">writes</a> &#8220;former FTC employee discovers browsers sends referrer strings, turns it into google conspiracy&#8221;. (Of course, Soghoian is actually technically savvy, as reading the complaint to the FTC makes clear.)</p>
<p>What&#8217;s going on here? Folks with technical knowledge perceive search query disclosure as the status quo (though I bet most don&#8217;t often think about the consequences of clicking on a link after a sensitive search).</p>
<p>But how would most Internet users be aware of this? Certainly not through Google&#8217;s statements, or through warnings from Web browsers. One of the few ways I think users might realize this is happening is through query-highlighting &#8212; on forums, mailing list archives, and spammy pages. So a super-rational user who cares to think about how that works, might guess something like this is going on. But I doubt most users would actively work out the mechanisms involved. Futhermore, their observations likely radically underdetermine the mechanism anyway, since it is quite reasonable that a Web browser could do this kind of highlighting directly, especially for formulaic sites, like forums. Even casual use of Web analytics software (such as Google Analytics) may not make it clear that this per-user information is being provided, since aggregated data could reasonably be used to present summaries of top search queries leading to a Web site.<sup><a href="http://www.deaneckles.com/blog/351_search-queries-in-referrer-headers-technical-knowledge-privacy-and-the-status-quo/#footnote_0_351" id="identifier_0_351" class="footnote-link footnote-identifier-link" title="Google does separately provide aggregated query data to Web site owners.">1</a></sup></p>
<p>This should be a reminder why empirical studies of privacy attitudes and behaviors are useful: us techie folks often have severe blind spots. I don&#8217;t know that this is just a matter of differences in expectations, but rather involves differences in preferences. Over time, these expectations change our sense of the status quo, from which we can calibrate our preferences and intentions.</p>
<p>Google has worked to ensure that referrer headers continue to include search query information &#8212; even as it adopts techniques that would make this not happen simply by the standard inclusion of the URL there.<sup><a href="http://www.deaneckles.com/blog/351_search-queries-in-referrer-headers-technical-knowledge-privacy-and-the-status-quo/#footnote_1_351" id="identifier_1_351" class="footnote-link footnote-identifier-link" title="See Danny Sullivan&amp;#8217;s post following some changes by Google that could have ended including search queries in referrer headers.">2</a></sup> A difference in beliefs about the status quo puts these actions by Google in a different context. For us techies, that is just maintaining the status quo (which may seem more desirable, since we know it&#8217;s the industry-wide standard). For others, it might seem more like Google putting advertisers and Web site owners above its promises to its users about their sensitive data.</p>
<ol class="footnotes"><li id="footnote_0_351" class="footnote">Google does separately provide aggregated query data to Web site owners.</li><li id="footnote_1_351" class="footnote">See Danny Sullivan&#8217;s <a href="http://searchengineland.com/the-death-of-web-analytics-an-ode-to-the-referrer-42875">post</a> following some changes by Google that could have ended including search queries in referrer headers.</li></ol>]]></content:encoded>
			<wfw:commentRss>http://www.deaneckles.com/blog/351_search-queries-in-referrer-headers-technical-knowledge-privacy-and-the-status-quo/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Economic imperialism and causal inference</title>
		<link>http://www.deaneckles.com/blog/324_economic-imperialism-and-causal-inference/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=economic-imperialism-and-causal-inference</link>
		<comments>http://www.deaneckles.com/blog/324_economic-imperialism-and-causal-inference/#comments</comments>
		<pubDate>Tue, 05 Oct 2010 07:05:58 +0000</pubDate>
		<dc:creator>Dean Eckles</dc:creator>
				<category><![CDATA[academia]]></category>
		<category><![CDATA[causal inference]]></category>
		<category><![CDATA[psychology]]></category>
		<category><![CDATA[research methods]]></category>
		<category><![CDATA[sociology]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://www.deaneckles.com/blog/?p=324</guid>
		<description><![CDATA[And I, for one, welcome our new economist overlords&#8230; Readers not in academic social science may take the title of this post as indicating I&#8217;m writing about the use of economic might to imperialist ends.1 Rather, economic imperialism is a practice of economists (and acolytes) in which they invade research territories that traditionally &#8220;belong&#8221; to [...]]]></description>
			<content:encoded><![CDATA[<p><em>And I, for one, welcome our new economist overlords&#8230;</em></p>
<p>Readers not in academic social science may take the title of this post as indicating I&#8217;m writing about the use of economic might to imperialist ends.<sup><a href="http://www.deaneckles.com/blog/324_economic-imperialism-and-causal-inference/#footnote_0_324" id="identifier_0_324" class="footnote-link footnote-identifier-link" title="Well, if economists have better funding sources, this might apply in some sense.">1</a></sup> Rather, <em>economic imperialism</em> is a practice of economists (and acolytes) in which they invade research territories that traditionally &#8220;belong&#8221; to other social scientific disciplines.<sup><a href="http://www.deaneckles.com/blog/324_economic-imperialism-and-causal-inference/#footnote_1_324" id="identifier_1_324" class="footnote-link footnote-identifier-link" title="For arguments in favor of economic imperialism, see Lazear, E.P. (1999). Economic imperialism. NBER Working Paper No. 7300.">2</a></sup> See <a href="http://www.gocomics.com/bliss/2010/10/02/">this comic</a> for one way you can react to this.<sup><a href="http://www.deaneckles.com/blog/324_economic-imperialism-and-causal-inference/#footnote_2_324" id="identifier_2_324" class="footnote-link footnote-identifier-link" title="Or see this comic for imperialism by physicists.">3</a></sup></p>
<p>Economists bring their theoretical, statistical, and research-funding resources to bear on problems that might not be considered economics. For example, freakonomists like Levitt study sumo wrestlers and the effects of the legalization of abortion on crime. But, hey, if the <a href="http://www.fff.org/freedom/0895g.asp">Commerce Clause means that Congress can legislate everything</a>, then, for the same reasons, economists can &#8212; no, must &#8212; study everything.</p>
<p>I am not an economist by training, but I have recently had reason to read quite a bit in econometrics. Overall, I&#8217;m impressed.<sup><a href="http://www.deaneckles.com/blog/324_economic-imperialism-and-causal-inference/#footnote_3_324" id="identifier_3_324" class="footnote-link footnote-identifier-link" title="At least by the contemporary literature on what I&amp;#8217;ve been reading on &amp;#8212; IVs, encouragement designs, endogenous interactions, matching estimators. But it is true that in some of these areas econometrics has been able to fruitfully borrow from work on potential outcomes in statistics and epidemiology.">4</a></sup> Economists have recently taken causal inference &#8212; learning about cause and effect relationships, often from observational data &#8212; quite seriously. In the eyes of some, this has precipitated a &#8220;credibility revolution&#8221; in economics. Certainly, papers in economics and (especially) econometrics journals consider threats to the validity of causal inference at length.</p>
<p>On the other hand, causal inference in the rest of the social sciences is <em>simultaneously over-inhibited and under-inhibited</em>. As Judea Pearl observes in his book <em>Causality</em>, lack of clarity about statistical models (that social scientists often don&#8217;t understand) and causality has induced confusion about distinctions between statistical and causal issues (i.e., between estimation methods and identification).<sup><a href="http://www.deaneckles.com/blog/324_economic-imperialism-and-causal-inference/#footnote_4_324" id="identifier_4_324" class="footnote-link footnote-identifier-link" title="Econometricians have made similar observations.">5</a></sup></p>
<p>So, on the one had, <a href=" http://www.blog.sethroberts.net/2010/09/23/why-psychologists-dont-imitate-economists/">many psychologists stick to experiments</a>. Randomized experiments are, generally, the gold standard for investigating cause&#8211;effect relationships, so this can and often does go well. However, social psychologists have recently been obsessed with using &#8220;mediation analysis&#8221; to investigate the mechanisms by which causes they can manipulate produce effects of interest. Investigators often manipulate some factors experimentally and then measure one or more variables they believe fully or partially mediate the effect of those factors on their outcome. Then, under the standard Baron &#038; Kenny approach, psychologists fit a few regression models, including regressing the outcome on both the experimentally manipulated variables and the simply measured (mediating) variables. The assumptions required for this analysis to identify any effects of interest are rarely satisfied (e.g., effects on individuals are homogenous).<sup><a href="http://www.deaneckles.com/blog/324_economic-imperialism-and-causal-inference/#footnote_5_324" id="identifier_5_324" class="footnote-link footnote-identifier-link" title="For a bit on this topic, see the discussion and links to papers here.">6</a></sup> So psychologists are often over-inhibited (experiments only please!) and under-inhibited (mediation analysis).</p>
<p>Likewise, in more observational studies (in psychology, sociology, education, etc.), investigators are sometimes wary of making explicit causal claims. So instead of carefully stating the causal assumptions that would justify different causal conclusions, readers are left with phrases like &#8220;suggests&#8221; and &#8220;is consistent with&#8221; followed by causal claims. Authors then recommend that further research be conducted to better support these causal conclusions. With these kinds of recommendations awaiting, no wonder that economists find the territory ready for taking: they can just show up with econometrics tools and get to work on hard-won questions the rightly belong to others!</p>
<ol class="footnotes"><li id="footnote_0_324" class="footnote">Well, if economists have better funding sources, this might apply in some sense.</li><li id="footnote_1_324" class="footnote">For arguments in favor of economic imperialism, see Lazear, E.P. (1999). <a href="http://www.nber.org/papers/w7300">Economic imperialism</a>. NBER Working Paper No. 7300.</li><li id="footnote_2_324" class="footnote">Or see <a href="http://xkcd.com/793/">this comic</a> for imperialism by physicists.</li><li id="footnote_3_324" class="footnote">At least by the contemporary literature on what I&#8217;ve been reading on &#8212; IVs, encouragement designs, endogenous interactions, matching estimators. But it is true that in some of these areas econometrics has been able to fruitfully borrow from work on potential outcomes in statistics and epidemiology.</li><li id="footnote_4_324" class="footnote">Econometricians have made similar observations.</li><li id="footnote_5_324" class="footnote">For a bit on this topic, see the discussion and links to papers <a href="http://www.stat.columbia.edu/~cook/movabletype/archives/2010/03/criticizing_sta.html">here</a>.</li></ol>]]></content:encoded>
			<wfw:commentRss>http://www.deaneckles.com/blog/324_economic-imperialism-and-causal-inference/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Public once, public always? Privacy, egosurfing, and the availability heuristic</title>
		<link>http://www.deaneckles.com/blog/291_public-once-public-always-privacy-egosurfing-and-the-availability-heuristic/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=public-once-public-always-privacy-egosurfing-and-the-availability-heuristic</link>
		<comments>http://www.deaneckles.com/blog/291_public-once-public-always-privacy-egosurfing-and-the-availability-heuristic/#comments</comments>
		<pubDate>Mon, 19 Apr 2010 01:36:40 +0000</pubDate>
		<dc:creator>Dean Eckles</dc:creator>
				<category><![CDATA[activity streams]]></category>
		<category><![CDATA[automaticity]]></category>
		<category><![CDATA[communication]]></category>
		<category><![CDATA[consumption]]></category>
		<category><![CDATA[culture]]></category>
		<category><![CDATA[data collection]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[HCI]]></category>
		<category><![CDATA[heuristics]]></category>
		<category><![CDATA[participatory media]]></category>
		<category><![CDATA[priming]]></category>
		<category><![CDATA[psychology]]></category>

		<guid isPermaLink="false">http://www.deaneckles.com/blog/?p=291</guid>
		<description><![CDATA[The Library of Congress has announced that it will be archiving all Twitter posts (tweets). You can find positive reaction on Twitter. But some have also wondered about privacy concerns. Fred Stutzman, for example, points out how even assuming that only unprotected accounts are being archived this can still be problematic.1 While some people have [...]]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://blogs.loc.gov/loc/2010/04/how-tweet-it-is-library-acquires-entire-twitter-archive/">Library of Congress has announced</a> that it will be archiving all Twitter posts (tweets). You can find positive reaction on Twitter. But some have also wondered about privacy concerns. Fred Stutzman, for example, <a href="http://fstutzman.com/2010/04/14/twitter-and-the-library-of-congress/">points out</a> how even assuming that only unprotected accounts are being archived this can still be problematic.<sup><a href="http://www.deaneckles.com/blog/291_public-once-public-always-privacy-egosurfing-and-the-availability-heuristic/#footnote_0_291" id="identifier_0_291" class="footnote-link footnote-identifier-link" title="This might not be the case, see Michael Zimmer and this New York Times article.">1</a></sup> While some people have Twitter usernames that easily identify their owners and many allow themselves to be found based on an email address that is publicly associated with their identity, there are also many that do not. If at a future time, this account becomes associated with their identity for a larger audience than they desire, they can make their whole account viewable only by approved followers<sup><a href="http://www.deaneckles.com/blog/291_public-once-public-always-privacy-egosurfing-and-the-availability-heuristic/#footnote_1_291" id="identifier_1_291" class="footnote-link footnote-identifier-link" title="Why don&amp;#8217;t people do this in the first place? Many may not be aware of the feature, but even if they are, there are reasons not to use it. For example, it makes any participation in topical conversations (e.g., around a hashtag) difficult or impossible.">2</a></sup>, delete the account, or delete some of the tweets. Of course, this information may remain elsewhere on the Internet for a short or long time. But in contrast, the Library of Congress archive will be much more enduring and likely outside of individual users&#8217; control.<sup><a href="http://www.deaneckles.com/blog/291_public-once-public-always-privacy-egosurfing-and-the-availability-heuristic/#footnote_2_291" id="identifier_2_291" class="footnote-link footnote-identifier-link" title="Or at least this control would have to be via Twitter, likely before archiving: &amp;#8220;We asked them [Twitter] to deal with the users; the library doesn&amp;#8217;t want to mediate that.&amp;#8221;">3</a></sup> While I think it is worth examining the strategies that people adopt to cope with inflexible or difficult to use privacy controls in software, I don&#8217;t intend to do that here.</p>
<p>Instead, I want to relate this discussion to my continued interest in how activity streams and other information consumption interfaces affect their users&#8217; beliefs and behaviors through the availability heuristic. In response to some comments on <a href="http://fstutzman.com/2010/04/14/twitter-and-the-library-of-congress/">his first post</a>, <a href="http://fstutzman.com/2010/04/16/is-it-time-to-cancel-your-twitter-account/">Stutzman argues</a> that people overestimate the degree to which content once public on the Internet is public forever:</p>
<blockquote><p>So why is it that we all assume that the content we share publicly will be around forever?  I think this is a classic case of selection on the dependent variable.  When we Google ourselves, we are confronted with <em>what’s there</em> as opposed to what’s not there.  The stuff that goes away gets forgotten, and we concentrate on things that we see or remember (like a persistent page about us that we don’t like).  In reality, our online identities decay, decay being a stochastic process.  The internet is actually quite bad at remembering.</p></blockquote>
<p>This unconsidered &#8220;selection on the dependent variable&#8221; is one way of thinking about some cases of how the availability heuristic (and use of ease-of-retrievel information more generally). But I actually think the latter is more general and more useful for describing the psychological processes involved. For example, it highlights both that there are many occurrences or interventions can can influence which cases are available to mind and that even if people have thought about cases where their content disappeared at some point, this may not be easily retrieved when making particular privacy decisions or offering opinions on others&#8217; actions.</p>
<p>Stutzman&#8217;s example is but one way that the combination of the availability heuristic and existing Internet services combine to affect privacy decisions. For example, consider how activity streams like Facebook News Feed influence how people perceive their audience. News Feed shows items drawn from an individual&#8217;s friends&#8217; activities, and they often have some reciprocal access. However, the items in the activity stream are likely unrepresentative of this potential and likely audience. &#8220;Lurkers&#8221; &#8212; people who consume but do not produce &#8212; are not as available to mind, and proliﬁc producers are too available to mind for how often they are in the actual audience for some new shared content. This can, for example, lead to making self-disclosures that are not appropriate for the actual audience.</p>
<ol class="footnotes"><li id="footnote_0_291" class="footnote">This might not be the case, see <a href="http://michaelzimmer.org/2010/04/14/how-your-private-tweets-might-be-included-in-the-library-of-congress-public-archive/">Michael Zimmer</a> and <a href="http://www.nytimes.com/2010/04/15/technology/15twitter.html">this New York Times article</a>.</li><li id="footnote_1_291" class="footnote">Why don&#8217;t people do this in the first place? Many may not be aware of the feature, but even if they are, there are reasons not to use it. For example, it makes any participation in topical conversations (e.g., around a hashtag) difficult or impossible.</li><li id="footnote_2_291" class="footnote">Or at least this control would have to be via Twitter, likely before archiving: <a href="http://www.prospect.org/cs/articles?article=the_library_of_congress_is_now_following_you_on_twitter">&#8220;We asked them [Twitter] to deal with the users; the library doesn&#8217;t want to mediate that.&#8221;</a></li></ol>]]></content:encoded>
			<wfw:commentRss>http://www.deaneckles.com/blog/291_public-once-public-always-privacy-egosurfing-and-the-availability-heuristic/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Keyword searching papers citing a highly-cited paper with Google Scholar</title>
		<link>http://www.deaneckles.com/blog/227_keyword-searching-papers-citing-a-highly-cited-paper-with-google-scholar/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=keyword-searching-papers-citing-a-highly-cited-paper-with-google-scholar</link>
		<comments>http://www.deaneckles.com/blog/227_keyword-searching-papers-citing-a-highly-cited-paper-with-google-scholar/#comments</comments>
		<pubDate>Tue, 27 Oct 2009 00:01:27 +0000</pubDate>
		<dc:creator>Dean Eckles</dc:creator>
				<category><![CDATA[academia]]></category>
		<category><![CDATA[HCI]]></category>
		<category><![CDATA[psychology]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://www.deaneckles.com/blog/?p=227</guid>
		<description><![CDATA[[Update: Google Scholar now directly supports this feature, check the box right below the search box after clicking "Cited by...".] In finding relevant research, once one has found something interesting, it can be really useful to do &#8220;reverse citation&#8221; searches. Google Scholar is often my first stop when finding research literature (and for general search), [...]]]></description>
			<content:encoded><![CDATA[<p>[Update: Google Scholar now directly supports this feature, check the box right below the search box after clicking "Cited by...".]</p>
<p>In finding relevant research, once one has found something interesting, it can be really useful to do &#8220;reverse citation&#8221; searches.</p>
<p>Google Scholar is often my first stop when finding research literature (<a href="http://www.flickr.com/photos/deaneckles/3998516185/">and for general search</a>), and it has this feature &#8212; just click &#8220;Cited by 394&#8243;. But it is not very useful when your starting point is highly cited. What I often want to do is to do a keyword search of the papers that cite my highly-cited starting point.</p>
<p>While there is no GUI for this search within these resultsin Google Scholar, you can actually do it by hacking the URL. Just add the keyword query to the URL.</p>
<p>This is the URL one gets for all resources Google has as citing Allport&#8217;s &#8220;Attitudes&#8221; (1935):</p>
<p style="padding-left: 30px;"><a href="http://scholar.google.com/scholar?cites=9150707851480450787&amp;hl=en">http://scholar.google.com/scholar?cites=9150707851480450787&amp;hl=en</a></p>
<p>And this URL searches within those for &#8220;indispensable concept&#8221;:</p>
<p style="padding-left: 30px;"><a href="http://scholar.google.com/scholar?hl=en&amp;cites=9150707851480450787&amp;q=indispensable+concept">http://scholar.google.com/scholar?hl=en&amp;cites=9150707851480450787<span style="color: #ff0000;">&amp;q=indispensable+concept</span></a></p>
<p>In this particular case, this gives us many examples of authors citing Allport&#8217;s comment that <a href="http://www.deaneckles.com/blog/213_no-entity-without-identity-individuating-attitudes-in-social-psychology/">the attitude is the most distinctive and indispensable concept in social psychology</a>. This example highlights that this can even just help get more useful &#8220;snippets&#8221; in the search results, even if it doesn&#8217;t narrow down the results much.</p>
<p>I find this useful in many cases. Maybe you will also.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.deaneckles.com/blog/227_keyword-searching-papers-citing-a-highly-cited-paper-with-google-scholar/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

