Phase transition to two-peaks phase in an information cascade voting ...

Report 2 Downloads 27 Views
Phase transition to two-peaks phase in an information cascade voting experiment

arXiv:1112.2816v4 [physics.soc-ph] 12 Nov 2012

Shintaro Mori∗ Department of Physics, Kitasato University Kitasato 1-15-1, Sagamihara, Kanagawa 252-0373, JAPAN Masato Hisakado Standard and Poor’s Marunouchi 1-6-5, Chiyoda-ku, Tokyo 100-0005, JAPAN Taiki Takahashi Department of Behavioral Science, Hokkaido University N.10,W.07, Kita-ku, Sapporo, Hokkaido 060-0810, JAPAN (Dated: November 13, 2012)

1

Abstract Observational learning is an important information aggregation mechanism. However, it occasionally leads to a state in which an entire population chooses a sub-optimal option. When this occurs and whether it is a phase transition remain unanswered. To address these questions, we performed a voting experiment in which subjects answered a two-choice quiz sequentially with and without information about the prior subjects’ choices. The subjects who could copy others are called herders. We obtained a microscopic rule regarding how herders copy others. Varying the ratio of herders led to qualitative changes in the macroscopic behavior of about 50 subjects in the experiment. If the ratio is small, the sequence of choices rapidly converges to the correct one. As the ratio approaches 100%, convergence becomes extremely slow and information aggregation almost terminates. A simulation study of a stochastic model for 106 subjects based on the herder’s microscopic rule showed a phase transition to the two-peaks phase, where the convergence completely terminates as the ratio exceeds some critical value. PACS numbers: 05.70.Fh,89.65.-s,89.65.G



[email protected]

2

I.

INTRODUCTION

The tendency to imitate others is one of the basic instinct of human. People effectively and inadvertently act as filters to provide the information that is most useful for an observer. Imitation and copying is a highly adaptive means of gaining knowledge [1] . It presumably results from an evolutionary adaptation that promoted survival over thousands of generations. It allows individuals to exploit the hard-won information of others [1, 2]. However, imitating or copying others has disadvantages. The acquired information might be outdated or misleading [2, 3]. Copying wrong information might lead to herding, where an entire population makes a wrong decision. This is referred to as information cascade or rational herding [4–7]. Unfortunately, because imitation is a basic instinct and because it is economically rational to copy others, humans might not be able to evade such a catastrophic situation [4]. Social influences have many forms, including imitation, conformity and obedience [8]. Recent studies in cognitive neuroscience suggest that some type of imitation occurs automatically via the actions of “mirror neuron” systems [9]. In the field of social psychology, many studies have focused on how humans use social information at the microscopic level [2, 10]. In the field of finance and economics, it is now widely believed that investors are influenced by the decisions of others and that this influence is a first-order effect [4, 5]. We now have a number of interesting models of rational herding based on simple, straightforward, and convincing intuition [4, 5, 11–17]. Empirical financial research has focused on macroscopic data primarily because such data is easily available. Micro-macro aspects of information cascade have been studied in [18]. It was concluded that the information cascade is fragile and self-correcting. Even if the population makes a wrong decision at a point in the choice sequence, it will eventually turn to a correct choice. The analysis was based on a stochastic model, and the asymptotic behavior of the empirical choice sequences was not studied in detail. In order to study the nature of information cascade and, furthermore the possibility of phase transition, it is necessary to connect the micro-macro aspects without depending on a model assumption as far as possible. However, thus far, no empirical work has directly connected the microscopic and macroscopic aspects of information cascade and herding. Two types of phase transitions have been predicted in a two-choice voting model depending on the strength of conformity of people [19, 20]. We set two types of individuals: herders 3

and independents. The voting of independents is based on their fundamental values, while the voting of herders is based on the number of previous votes. If the herders are analog herders and they vote for each choice with probabilities that are proportional to the choices’ votes, there occurs a transition between the super and normal diffusion phases [19]. If the independents are the majority of voters, the voting rate converges at the same rate as in a binomial distribution, which is called the normal diffusion phase. As the proportion of herders is over 50%, the voting rate converges more slowwly than in a binomial distribution: this is called the super-diffusion phase. However, the presence of herders does not affect the accuracy of the majority’s choice. If the independents vote for the correct choice rather than for the wrong one, the majority of voters always choose the correct choice. The probability distribution of the voting rate has only one peak, and these two phases are collectively referred to as the one-peak phase. In the digital herder case, where herders always choose the choice with the majority of previous votes, the majority’s choice does not necessarily coincide with the correct choice, even if the independents vote for the correct choice rather than for the wrong one. When the fraction of herders increases, there occurs a phase transition, beyond which a state where most voters choose the correct choice coexists with one where most of them choose the wrong one [20]. If the fraction of herders is below the threshold value, most voters choose the correct choice and the system is in the one-peak phase. If the fraction is above the threshold value, the distribution of the voting rate has two peaks corresponding to the two coexisting states: this phase is called the two-peaks phase. We call the phase transition between the one-peak and two-peaks phases the information cascade transition. In this paper, we have adopted an experimental and theoretical approach to the study of the phase transitions in information cascade . The organization of the paper is as follows. We explain the experimental design and procedure in section II. Section III is devoted to the analysis of the experimental data. We show that varying the ratio of herders led to qualitative changes in the asymptotic behavior of the convergence of the voting rate. In section IV, we introduce a stochastic model which simulates the system. We obtain a microscopic rule regarding how herders copy others. We performed a simulation study of a stochastic model for 106 subjects based on the herder’s microscopic rule. The model showed the information cascade transition to the two-peaks phase, where the convergence completely terminates as the ratio exceeds some critical value. Section V is devoted to the conclusions. In the 4

appendices, we explain the experimental setup in detail.

II.

EXPERIMENTAL DESIGN AND PROCEDURE

TABLE I. Experimental design.

Experiment 2010A 2010B 2011A 2011B

T 31 31 52 52

{r} M {0, 1, 2, 3, 5, 7, 9, ∞} 100 {0, 1, 2, 3, 5, 7, 9, ∞} 100 {0, 1, 5, 11, 21, ∞} 120 {0, 1, 5, 11, 21, ∞} 120

Subject pool System Kitasato Univ. Face-to-Face Kitasato Univ. Face-to-Face Hokkaido Univ. Web Hokkaido Univ. Web

The experiments reported here were conducted at the Information Science Laboratory at Kitasato University in October 2010 and at the Group Experiment Laboratory of the Center for Experimental Research in Social Sciences at Hokkaido University between June 2011 and July 2011. The subjects included students from the two universities. We call the former experiment EXP2010 and the latter EXP2011. In EXP2010 (EXP2011), the number of individuals T was 31 (52). We prepared two groups of subjects, group A and group B. In Total, 62 (104) subjects participated in EXP2010 (EXP2011). There were two sequences of subjects, and we denote the order of each subject by t ∈ {1, 2, · · · , T }. The number of questions M in the two-choice quiz is 100 (120) in EXP2010 (EXP2011). Interaction between subjects in each group was permitted only through the social information given by the experimenter (Face-to-Face) in EXP2010 or by the experiment server (Web) in EXP2011. Table I summarizes the design. The subjects answered the quiz individually with and without information about the previous subjects’ choices. This information, called social information, is given as the summary statistics of the previous r subjects {C0 (r, t), C1 (r, t)}. We denote the t + 1-th subject’s answer for case r by X(r, t + 1), which takes the value 1 (0) if the choice is true (false). {C0 (r, t), C1 (r, t)} are the numbers of subjects who choose each choice among the prior r Pt ′ subjects as C1 (r, t) = t′ =t−r+1 X(r, t ) and C0 (r, t) = r − C1 (r, t). The choice of r is r ∈ {r} = {0, 1, 2, 3, 5, 7, 9, ∞}({0, 1, 5, 11, 21, ∞}) in EXP2010 (EXP2011). Here, r = 0 means that the subjects receive no information and must answer with their knowledge only. In the case r = ∞, the summary statistic is calculated from all previous subjects’ choices. The subjects answered the quiz with their knowledge only (r = 0) initially. Next, they 5

answered with {C0 (r, t), C1 (r, t)} from r = 1 to r = ∞ in increasing order of r in the set {r}. Any differences between the choices in r = 0 and r ≥ 1 can be attributed to the social information.

FIG. 1. Snapshot of the screen for r = 0, 1, and ∞ in EXP2011. The summary statistics {C0 (∞, t), C1 (∞, t)} are given in the second row in the box in cases r = 1 and ∞.

Fig. 1 shows the experience of the subjects in EXP2011 more concretely. The subjects entered the laboratory and sat in the partitioned spaces. After listening to a brief explanation about the experiment and the reward, they logged into the experiment web site using their IDs and started to answer the questions. A question was chosen by the experiment server and displayed on the monitor. First, subjects answered the question using their own knowledge only (r = 0). Later, subjects received social information and answered the same question. 6

Fig. 1 shows the cases r = 1 and r = ∞. Subjects could then use or ignore the social information when making decisions.

A.

Experimental procedure

We here explain in detail the procedure and the experience of the subjects in the experiment [21]. A subject answered a question with no public information (r = 0) initially. The answer was denoted as X(0, t) for the t-th subject in the subjects’ sequence. If the subject was first (t = 1), he answered only in the case r = 0. X(0, 1) was copied to X(r, 1) as X(r, 1) = X(0, 1) for r ≥ 1 for later convenience. If t > 1, the experimenter (in EXP2010) or the server (in EXP2011) calculated the social information {C0 (r, t), C1 (r, t)} and gave it to the subject. If t = 2, the subject answered the question in case r = ∞ with {C0 (∞, 1), C1(∞, 1)} and the answer was denoted as X(∞, 2). Here, C1 (∞, 1) is X(0, 1). By the convention X(r, 1) = X(0, 1) for r ≥ 1, we can write C1 (∞, 1) = X(∞, 1). As C1 (1, 1) = C1 (∞, 1), we copy X(∞, 2) to X(1, 2) as X(1, 2) = X(∞, 2). As in t = 1, X(1, 2) = X(∞, 2) is copied to X(r, 2) as X(r, 2) = X(1, 2) for r > 1. If t = 3, the subject answered the question in case r = 1 with {C0 (1, 2), C1(1, 2)}, and the answer was denoted as X(1, 3). The social information {C0 (1, 2), C1(1, 2)} was calculated with the answer X(1, 2) as C1 (1, 2) = X(1, 2) by the copy convention X(1, 2) = X(∞, 2). Then, the subject answered in case r = ∞ and the answer was denoted as X(∞, 3). C1 (∞, 2) is C1 (∞, 2) = X(0, 1) + X(1, 2), which can be written as C1 (∞, 2) = X(∞, 1) + X(∞, 2). For r > 1, X(∞, 3) is copied to X(r, 3). By the copy convention, the social information in case r can be expressed with the answers {X(r, t)} in case r only. If t = 4, the subject answered the question in case r = 0, which is written as X(0, 4), and then in case r = 1, written as X(1, 4). Next, in EXP2011, the subject answered in case r = ∞, written as X(∞, 4). The social information is C1 (1, 3) = X(1, 3) and C1 (∞, 3) = X(∞, 1) + X(∞, 2) + X(∞, 3). Fig. 2 gives the pictorial explanation of the procedure. In general, if the order of the subject is t, there is no public information for r > t − 1. There is the maximum value rmax in the set {r} that satisfies rmax < t − 1. The subject answered from case r = 0 to case rmax in the set {r} in increasing order of r. Then, the subject was given the social information from all priors (r = ∞) and answered in case r = ∞. He did not answer cases r > rmax in the set {r} and the answer in case 7

FIG. 2. Pictorial explanation of experimental procedure. A subject answers the quiz questions in increasing order of r from r = 0 to r = ∞. The t + 1-th subject answers in case r with social information {C0 (r, t), C1 (r, t)} and the answer is denoted as X(r, t + 1). The first subject (t = 1) answers only in the case r = 0, which is written as X(0, 1). The second subject (t = 2) answers in the cases r = 0 and r = ∞, which are written as X(0, 2) and X(∞, 2), respectively. The third subject answers in case r = 0, then in case r = 1, and finally in case r = ∞; these are written as X(0, 3), X(1, 3), and X(∞, 3), respectively. The number in the circle indicates the order of the answer for each subject. The number in the arrow indicates the memory length of the social information. For r = 1, it is one. For r = ∞, the t-th subject receives t − 1 previous subjects’ information, and the length is t − 1.

r = ∞ was copied to the unanswered cases as X(r, t) = X(∞, t) for r > rmax in set {r}. The answer in case r started from the r + 2-th subject in the sequence. For t < r + 2, X(r, t) = X(∞, t) by the copy convention, and {C0 (r, t), C1 (r, t)} can be written using only Pt ′ {X(r, t)} as C1 (r, t) = t′ =t−r+1 X(r, t ) and C0 (r, t) = r − C1 (r, t). We use the same conventions when we prepare a sequence of choices in case r. All sequences of choices {X(r, t)} start from t = 1. The percentage of correct answers up to the t-th subject is P defined as Z(r, t) = tt′ =1 X(r, t′ )/t, and the final value is Z(r, T ).

B.

Quiz selection

We explain the choice of the questions in the quiz. In the experiment, it was necessary to control the difficulty of the questions. If a question is too easy, all subjects know the answer. If the question is too difficult, no subjects know the answer. In order to study social influence by varying the ratio of people who do not know the answer, it is necessary to choose moderately difficult questions. In EXP2010, we selected 100 questions for which only one among the five experimenters knew the answer. This choice means that the ratio 8

TABLE II. Five typical questions from the two-choice quiz in EXP2011. q is the label of the questions in the quiz and q ∈ {0, 1, 2 · · · , 119}.

q Question Choice0 Choice1 Answer 0 Which insect’s wings flap more in one minute? mosquito honeybee 0 1 During which period did the Tyrannosaurus Rex live? Jurassic Cretaceous 1 3 Which animal has a horn at birth? rhinoceros giraffe 1 7 Which is forbidden during TV programs in Korea? commercials kissing scenes 0 8 Which instrument is in the same group as the marimba ? vibraphone xylophone 1

is estimated to be around 80% for the subjects. After EXP2010, we calculated Z(0, T ) for each question. In general, Z(0, T ) ≥ 0.5 for two-choice questions. A too-small value of Z(0, T ) indicates some bias in the given choices of the question. We excluded questions with too-small values of Z(0, T ) and prepared a new quiz with 120 questions in EXP2011. Table II shows five typical questions from EXP2011.

III.

DATA ANALYSIS

In the analysis, the subjects are classified into two categories, independent and herder, for each question. If a subject knows the answer to a question with 100% confidence and the answer is not affected by others’ choices, he/she is categorized as independent. If the subject does not know the answer and if he/she may be affected by others’ choices, he/she is categorized as herder [15, 19, 20]. We assume that the probability of a correct choice for independent and herder subjects to be 100% and 50%, respectively. For a group with p herders and 1 − p independent subjects, the expectation value of Z(0, T ) is E(Z(0, T )) = 1−p/2. For each question in each group, we estimate p by p = 2(1−Z(0, T )) as the maximal likelihood estimate. The assumption of the random guess (50%) by the herder might be too simple. As Z(0, T ) approaches 0.5 and almost all subjects do not know the answer to the question, p approaches 100% and the estimate works well. Our experimental and analytical design has three advantages over both theoretical models and observational studies. (i) We control the amount of social information that the subjects receive by the change in r. This enables us to derive a microscopic rule for human decisions under social information [22]. (ii) Based on the answers in the absence of information (r = 0), we can estimate the ratio of herders p, which will enable us to extract the herder’s decision rule from the results in (i). (iii) Our analysis focuses on the asymptotic behavior of 9

the convergence of Z(r, t) with fixed herder’s ratio p. We clearly see the collective behavior of humans and the qualitative change when p is varied. In particular, we can study the possibility of the information cascade transition [7, 14, 19, 20, 23]. We include a note about the controllability of p in the experiment. In the experiment, after all T subjects answered, we calculated Z(0, T ) and estimated the herder’s ratio p as p = 2(1 − Z(0, T )). It may seem impossible to control p in the experiment, but this is not so. We think p is an inherent property of two-choice questions. If we can estimate p for a large number of subjects T >> 1, we can apply the same value to the experiments with other groups. We have compared the two values of Z(0, T ) of group A and B for the same question in EXP2011. Pearson’s correlation coefficient ρ is 0.82, and there is a strong correlation. The system size T in EXP2011 is very limited (T ≃ 50) and there remains some fluctuation in the estimation of p, but it will disappear for a large enough T . We can know p in advance and control it in voting experiments.

A.

Distribution of Z(r, T )

There are 2×120(100) samples of sequences of choices for each r in EXP2011 (EXP2010). We divide these samples into 11 (9) bins according to the size of Z(r, T ), as shown in Table IIIA (IIIB). The samples in each bin share almost the same value of p. For example, in the sample in the No. 6 bin (0.45 < Z(0, T ) ≤ 0.55) in EXP2011, there are almost only herders in the subjects’ sequence and p ≃ 100%. On the other hand, in the sample in No. 11 bin (Z(0, T ) > 0.95), almost all subjects know the answer to the question and are independent (p ≃ 0%). An extremely small value of Z(0, T ) indicates some bias in the question. We omit data with Z(0, T ) < 0.45(13/31) in the analysis of the system and we are left with 180 (166) samples in EXP2011 (EXP2010). The samples with Z(0, T ) < 0.5 in No. 6 bin in EXP2011 (No. 5 bin in EXP2010) have p values larger than 100%. These values are errors √ of the estimation p = 2(1 − Z(0, T )). The standard deviation of Z(0, T ) is p/2 T for fixed √ p. In the estimation of p, there is a fluctuation with the magnitude of order ≃ p/ T . If p takes value larger than 100%, we take it to be 100%. Table III shows the number of data samples in each bin for r = 0, 5 and r = ∞ as N(0), N(5), and N(∞). Social information causes remarkable changes in the subjects’ choices. For r = 0 (r = 5), there is one peak at No. 7 (No. 9), and for r = ∞, there are peaks at No. 3 and No. 11 in EXP2011. 10

TABLE III. Effect of social information on subjects’ decisions in (A) EXP2011 and (B) EXP2010. We divide the samples according to the size of Z(r, T ). N (r) denotes the number of samples in each bin. pavg is estimated as the average value of p = 2(1 − Z(0, T )) in each bin. In the last column, the sub-optimal ratios are shown.

No. 1 2 3 4 5 6 7 8 9 10 11 Total No. 1 2 3 4 5 6 7 8 9 Total

Z(r, T )[%] 1/2. If p > pc , the distribution of Z(∞, T ) has two peaks at Z(∞, T ) < 1/2 and Z(∞, T ) > 1/2. In the thermodynamic limit T → ∞, the probability that Z(∞, T ) < 1/2 becomes a function of p, which is non-analytic at p = pc and takes a positive value for p > pc . There are two candidates for the order parameter of the phase transition. One is the sub-optimal ratio Z(∞, T ) < 1/2. The other is the variance of Z(∞, T ), Var(Z(∞, T )). Both candidates are zero for p < pc and positive for p > pc in the thermodynamic limit T → ∞.

Ratio(Z(∞,T) 0) in the limit t → ∞, the system is in the one-peak phase. If the slope is zero (γ ≤ 0) in the limit t → ∞, the system is in the two-peaks phase. We see that the convergence becomes

very slow as p increases. The exponent γ is estimated by fitting with ∝ t−γ for t ≥ 20. It decreases from 1.43 (1.53) to 0.09 with the increase in p in EXP2011 (EXP2010). For the cases with p = 96.7%(97.6%)in EXP2011 (EXP2010), the system can be in the two-peaks phase.

IV.

STOCHASTIC MODEL AND SIMULATION STUDY

The asymptotic analysis of the convergence of Z(∞, t) shows the possibility of the twopeaks phase in the cases p = 96.7% and 97.6%. The negative slope γ is remarkably small (∼ 0.09) in both experiments. However, the system sizes are limited and far from the 14

0.1

t-0.09 t-0.28

Var(Z(∞,t))

A

t-0.90 0.01

No.9:41.9% No.8:62.7% No.7:79.3% No.6:96.7%

t-1.43

10 t B

t-0.09

0.1

Var(Z(∞,t))

t-0.40

t-1.04 0.01 No.8:26.3% No.7:49.1% No.6:74.6% No.5:97.6%

t-1.53

10 t

FIG. 5. Convergent behavior for (A) EXP2011 and (B) EXP2010. The convergence is given by the double logarithmic plot of Var(Z(∞, t)) vs t using the samples in the four bins for r = 0 in Table IIIA (No.6(◦),7(△),8(⋄) and 9(×)) and in Table IIIB (No.5(◦),6(△),7(⋄) and 8(×)). The dotted lines are fitted results with ∝ t−γ for t ≥ 20.

thermodynamic limit T → ∞. In this section, we derive a microscopic rule as to how the herders copy others’ information. Based on the herder’s microscopic rule, we introduce an ad hoc stochastic model. A simulation study of the model for 106 subjects showed the information cascade transition.

A.

Microscopic behavior of herders

We determine how a herder’s decision depends on social information. For this purpose, we need to subtract the independent subjects’ contribution from X(r, t+1). The probability of being independent is 1 − p and such a subject always chooses 1 ; the herder’s decision is 15

then simply estimated as [25] (X(r, t + 1) − (1 − p) · 1)/p. The expectation value of this under C1 (r, t) = n1 indicates the probability that a herder chooses an option under the influence of the prior n1 subjects among r choosing it. We denote it by qh (r, n1 ), and it is defined for t ≥ r as qh (r, n1 ) ≡ E((X(r, t + 1) − (1 − p))/p|C1(r, t) = n1 ).

(1)

The conditional expectation value in Eq.(1) is estimated using the samples that satisfy C1 (r, t) = n1 from EXP2011. From the symmetry between 1 ↔ 0, we assume qh (r, n1 ) = 1 − qh (r, r − n1 ). For r = ∞, the t + 1-th subject obtains information from the previous t subjects and r is considered to be t. r = ∞ case is averaging several values of r. For r = 21 and ∞, we study the dependence of qh (r, n1 ) on n1 /r and round n1 /r to the nearest values in {k/11|k ∈ {0, 1, 2, · · · , 11}}. In addition, we estimate qh (∞, n1 ) for t ≥ 22 to understand the herder’s decision under the largest amount of social information. The second reason is that the subjects receives much information for t ≥ 22 and we can assume that the dependence

qh(r,n1)

of qh (∞, n1 ) on (t, n1) is replaced with the dependence on n1/t. 0.95 0.9 0.85 0.8 0.75 0.7 0.65 0.6 0.55 0.5

r=1 r=5 r=11 r=21 r=∞

a=0.761

λ∞=3.80 λ21=3.11 λ11=1.82 λ5=1.06 λ1=0.38

0.5

0.6

0.7

0.8

0.9

1

n1/r

FIG. 6. Microscopic rule of herder’s decision in EXP2011. It shows the probability qh (r, n1 ) that a herder chooses an option under the influence of the prior n1 subjects among r choosing it for cases r = 1(×), 5(∗), 11(⋄) and 21(△). For r = ∞(◦), r is considered to be t. The thin solid curves are fitted results with Eq.(2) for r = 1, 5, 11, and 21 from the bottom. The top thick solid line corresponds to r = ∞.

16

Figure 6 shows the plot of qh (r, n1 ) vs n1 /r. It is clear that qh (r, n1 ) is an almost monotonic increasing function of n1 . As r increases, it shows stronger dependence on n1 /r, and the herder’s decision is affected more greatly by the prior subjects’ choices. We fit the plot by the following functional form: qh (r, n1 ) =

1 (a tanh(λr (n1 /r − 1/2)) + 1) . 2

(2)

The parameters a and λr indicate the strength of the conformity of the subjects. Social psychology studies suggest that people’s likelihood to use social information depends on their mood [10]. a denotes the net ratio of herders who react positively to the priors subjects’ choices. λr denotes the strength of the dependence on social information. By the least squares fit, we obtain a = 0.761 and λ∞ = 3.80 for r = ∞. The fitted result is also shown in Fig. 6. The values of a and λ∞ depend on the experimental situation and on the system size T [26]. Using the same a, we fit the data for other r using Eq.(2). The results are also given in Fig. 6. As the amount r of social information increases, the strength of dependence λr increases.

B.

Information cascade transition of voting model

To understand the behavior of the system in the thermodynamic limit T → ∞, we simulate the system for large T by a stochastic model based on Eq.(2). We introduce a stochastic process {X(t)}, t ∈ {1, 2, 3, · · · , T }. X(t + 1) ∈ {0, 1} is a Bernoulli random

variable and its probabilistic rule depends on all the previous {X(t′ )}, t′ ∈ {1, · · · , t} through P C1 (t) = tt′ =1 X(t′ ). The probability that X(t + 1) is 1 for C1 (t) = n1 , which is denoted as q(t, n1 ), is given as q(t, n1 ) = (1 − p) + p ·

1 (a tanh λt (n1 /t − 1/2) + 1) . 2

(3)

We set a as a = 0.761 and λt for t ∈ {1, 5, 11, 21} as λt = λr . Here, λr takes the values given in Fig. 6. For other values of t and t < 21, we use the linearly extrapolated value. For t > 21, we set λt = λ∞ . We denote the probability function Pr(

Pt

t′ =1

17

X(t′ ) = n) as P (t, n). The master equation

0.5

A

EXP2011: r=∞ T=50 3 T=106 T=10

Ratio(Z(∞,T) pc = 93.4%.The vertical chain line at p = pc shows the critical point of the phase transition between the one-peak phase (p < pc ) and the two-peaks phase (p > pc ).

for P (t, n) is P (t + 1, n) = q(t, n − 1) · P (t, n − 1) + (1 − q(t, n)) · P (t, n).

(4)

We solved the master equation recursively and obtain P (t, n) for t ≤ 106 . Fig. 7 shows the results of the model. We plot (A) the sub-optimal ratio Z(∞, T ) < 1/2 and (B) Var(Z(∞, T )) vs p. For comparison, we plot the experimental results (EXP2011) using the symbol ◦. The model with T = 50 well describes the experimental results quan-

titatively. In the figure, we also plot the results for T = 103 and 106 . As T increases, the 18

non-zero regions of the order parameters move rightward. In particular, we see that there is a crossing point in the curves of Var(Z(∞, T )) at p = 93.4%, which we denote as pc . If p < pc , Var(Z(∞, T )) goes to zero as T increases. On the other hand, if p > pc , the variance seems to remain in the limit T → ∞. This shows the phase transition between the one-peak and the two-peaks phases. For p > pc , the system is in the two-peaks phase (γ = 0). For p < pc , the system is in the one-peak phase (0 < γ ≤ 1). In order to see the convergence rate (the exponent γ), it is necessary to study the asymptotic behavior of Var(Z(∞, t)) for large values of t. 2.5

3

Data T=50

2

T=10 6 T=10 pc=0.934

γ

1.5 1

Normal Diff.,γ=1

0.5 Super Diff.,0 pc ) [27]. In the one-peak phase, information aggregation works and the majority’s choice is always correct when the independents choose the correct choice rather than the incorrect one. In the two-peaks phase, the majority’s choice is not necessarily correct. The coexistence of the optimal (Z(∞, T ) > 1/2) and the sub-optimal (Z(∞, T ) < 1/2) states occurs there. It has been thought that information cascade was fragile [5, 6] or self-correcting where the sub-optimal state disappears and switches to the optimal state [18]. Our study indicates that the system is in the two-peaks phase and that the sub-optimal state is stable against small perturbations for p > pc if the subjects are given the summary statistics {C0 (∞, t), C1(∞, t)}. The conclusion might appear to be contradictory to the previous one, but this is not so. In the previous works, each subject has his own information and it is not necessary to follow the majority if one can trust his own information [18]. In addition, the social information is the time series of the previous choices {X(∞, t′)}1≤t′ ≤t , and it contains much more information than the summary statistics {C0 (∞, t), C1(∞, t)}. In our experiment, the herder does not have information and it is necessary to follow the majority if he wants to choose a correct answer. However, the system size in our experiment is very limited and it is difficult to infer the state of the system in the thermodynamic limit based only on experimental data. Our conclusion that there occurs a information cascade transition relies heavily on the results of the simulation study of the stochastic model. In addition, the experiments were performed with students at universities, and the scope of the subjects is thus very restricted. The robustness of the conclusion should be established by further experiments. For this purpose, a web-based experiment in artificial laboratories is promising [28]. There, we can approach the thermodynamic limit T → ∞ more easily than in physical laboratories and study the micro-macro feature of information cascade. 20

ACKNOWLEDGMENTS

We thank Yosuke Irie for preparing the quiz used in the experiment and Fumihiko Nakamura and Ruokang Han for their assistance in recruiting the subjects. This work was supported by Grant-in-Aid for Challenging Exploratory Research 21654054.

[1] L. Rendell, R. Boyd, D. Cownden, M. Enquist, K. Eriksson, M. W. Feldman, L. Fogarty, S. Ghirlanda, T. Lillicrap, and K. N. Laland, Science 328, 208 (2010) [2] L. Rendell, L. Fogarty, W. Hoppitt, T. Morgan, M. Webster, and K. Laland, Trends Cogn. Sci. 15, 68 (2011) [3] F. S. J. Lorenz, H. Rauhut and D. Helbing, Proc. Natl. Acad. Sci. (USA) 108, 9020 (2011) [4] S. Bikhchandani, D. Hirshleifer, and I. Welch, Scot. J. Polit. Econ. 100, 992 (1992) [5] A. Devenow and I. Welch, Euro. Econ. Rev. 40, 603 (1996) [6] L. R. Anderson and C. A. Holt, Am. Econ. Rev. 87, 847 (1997) [7] I. H. Lee, J. Econ. Theory 61, 395 (1993) [8] H. Kelman, J. Conflict Resolut. 1, 51 (1958) [9] C. Heyes, Psychol. Bull. 137, 463 (2011) [10] V. Griskevicius, N. J. Goldstein, C. R. Mortensen, R. B. Cialdini, and D. T. Kenrick, J. Pers. Soc. Psychol. 91, 281 (2006) [11] A. Kirman, Q. J. Econ. 108, 137 (1993) [12] T. Lux, Econ. J. 105, 881 (1995) [13] R. Cont and J. Bouchaud, Macroecon. Dynam. 4, 170 (2000) [14] P. Curty and M. Marsili, J. Stat. Mech. 2006, P03013 (2006) [15] S.Mori and M. Hisakado, J. Phys. Soc. Jpn. 79, 034001 (2010) [16] S. Galam, Int. J. Mod. Phys. C 19, 409 (2008) [17] J. Gonz´ alez-Avella, V. Egu´ıluz, M. Marssili, F. Vega-Redondo, and M. S. Miguel, PLoS One 6, e20207 (2011) [18] J. Goeree, T. R. Palfrey, B. W. Rogers, and R. D. McKelvey, Rev. Econ. Stud. 74, 733 (2007) [19] M. Hisakado and S. Mori, J. Phys. A 43, 315207 (2010) [20] M. Hisakado and S. Mori, J. Phys. A 44, 275204 (2011)

21

[21] For more details of the experiments, please refer to Appendix A [22] B. Latan´e, Am. Psychol. 36, 343 (1981) [23] D. J. Watts, Proc. Natl. Acad. Sci. (USA) 99, 5766 (2002) [24] S. Hod and U. Keshet, Phys. Rev. E 70, 015104 (2004) [25] More precisely, the probability that t-th subject is independent depends on the choice X(0, t). If X(0, t) = 0, he is not independent and the probability is zero. On the other hand, if X(0, t) = 1, the probability is (1 − p)/(1 − 0.5p). The subtraction should take into account the value of X(0, t). Here, we adopt the simple procedure described in the text. [26] The microscopic behavior of the herders in EXP2010 is given in Appendix B. [27] M. Hisakado and S. Mori, e-print arXiv:physics/1203.3274 (3 2012) [28] M. J. Salganik, P. S. Dodds, and D. Watts, Science 311, 854 (2006)

Appendix A: Experimental Setup 1.

EXP2010

In EXP2010, the 62 subjects who participated in the experiment were recruited from the School of Science of Kitasato University. The subjects were randomly assigned to either group A or group B; each group had 31 subjects. In each session, one subject from each group entered the laboratory, a total of two subjects. We explained that we were studying how their choices were affected by the choices of others. After explaining the details of the experimental procedure and payment, each subject sat in front of an experimenter and had no contact with the other subject in the laboratory. The experiments on groups A and B were performed independently. Interaction between subjects in each group was permitted only through the social information given by the experimenter in front of each subject. The subjects were asked to answer the 100 questions in the two-choice quiz. The subjects answered the quiz with at most eight social influence conditions, including the case r = 0. Each session lasted about one hour. In order to obtain data from all T = 31 subjects in both groups, we performed the session 11 times on October 9 and and 10 times on October 16 and 23 in 2010. Subjects were paid upon being released from the session. There was a 2000 yen (about $24) participation fee and an additional 1000-yen reward (about $12) for the top ten subjects. The ranking of the subjects was calculated based on the ratio of correct 22

answers to all questions and r.

2.

EXP2011

In EXP2011, the 104 subjects who participated were recruited from the Literature Department of Hokkaido University. The subjects were randomly assigned to either group A or group B; each group had 52 subjects. In each session, between one and six subjects from each group entered the laboratory, for a total of between two and eleven subjects. The experiment was performed in the Group Experiment Laboratory of the Center for Experimental Research in Social Sciences of Hokkaido University. There were fifteen desks furnished with partitions and PCs. Each subject sat behind a partition and communication among them was prohibited. The experiments on groups A and B were performed independently. In order to study the effect of social information on the choices of the subjects, it was necessary to control the transmission of information from others in the same group. We developed a web-based voting system by which multiple subjects could simultaneously participate in the experiment. The subjects used a web browser to access the web voting server in the intranet. They could obtain information about the others’ choices from the summary statistics {C0 (r, t), C1 (r, t)} shown on the monitor. Subjects did not know which questions the other subjects were answering or what their choices were. With slides, we showed subjects how the experiment would proceed. We explained that we were studying how their choices were affected by the choices of others. In particular, we emphasized that the social information was realistic information calculated from the choices of previous subjects. The reason for this is that in contrast to EXP2010, the social information was given on the monitor and seemed less credible to the subjects. Through the slides, we also explained the payment. After the explanation, the experiment started. The subjects answered the 120-question quiz with at most six social influence conditions within about one hour. A total of 15 sessions were held on June 15 and 16 and from July 12 and 13 in 2011. Subjects were paid in cash upon being released from the session. There was a 600 yen (about $7) participation fee and additional rewards that were proportional to the number of correct answers. One correct choice was worth one point, and this was worth one yen (about one and a third cents). Some subjects could not answer all the questions within the alloted time, so the number 23

qh(r,n1)

0.95 0.9 0.85 0.8 0.75 0.7 0.65 0.6 0.55 0.5

r=1 r=3 r=5 r=9 r=∞

a=0.97

λ∞=2.12 λ9=1.75 λ5=1.18 λ3=0.77 λ1=0.11

0.5

0.6

0.7

0.8

0.9

1

n1/r

FIG. 9. Microscopic rule of herders’ decisions in EXP2010. It shows the probability qh (r, n1 ) that a herder chooses an option under the influence of the prior n1 subjects among r choosing it for cases r = 1(×), 3(∗), 5(⋄) and 9(△). For r = ∞(◦), the t + 1-th subject’s decision depends on previous t subjects’ decisions and r is considered to be t. We estimate the dependences of qh (∞, n1 ) on n1 /t for t ≥ 10, where n1 /t is rounded to the nearest value in {k/9|k ∈ {0, 1, 2, · · · , 9}}. The solid curves are fitted results with qh (r, n1 ) = (a tanh(λr (n1 /r − 1/2)) + 1)/2. For r = ∞ (the thick solid line), we set r = t. From bottom to top, the thin solid lines correspond to r = 1, 3, 5, and 9.

T of subjects who answered a question varied. The distribution of T is #{T = 52} = 143, #{T = 51} = 5, #{T = 50} = 2, #{T = 49} = 85, #{T = 48} = 4, #{T = 47} = 1. The average value of T was 50.8 and the standard deviation σT was 1.48. The average length of the sequence was about 50. The assignment of quiz questions in EXP2011 was as follows. Each question has a label q in q ∈ {0, 1, 2, · · · , 119} and each subject has an ID number i. If the ID number i of the subject was odd, the subject started with the question with the smallest q in the pool not chosen by the server for another subject in the same group. If i was even, the subject started with the question with the largest q not chosen by the server for another subject. If another subject was answering a question, that question was never chosen by the server. It was strictly prohibited for multiple subjects to answer the same question simultaneously.

Appendix B: Microscopic behavior of herders in EXP2010

We determine how a herder’s decision depends on social information in EXP2010. We follow the same procedure written in the main text. Fig. 9 plots the results. The fitted results with Eq.(2) are also shown. Compared with Fig. 6, the convergence to Eq.(2) is not 24

so good. The system size T = 31 is not enough to derive the microscopic rule.

Appendix C: Estimate of γ and its error bar

In our study, we focus on the asymptotic behavior of Var(Z(∞, t)). In particular, we are interested in the power law behavior as Var(Z(∞, t)) ∝ t−γ . The negative slope −γ of the double logarithmic plot of Var(Z(∞, t) vs t gives the exponent γ in the limit t → ∞. In our

analysis, we estimate γ by the least squares fit with the functional form a · t−γ in the range 20 ≤ t ≤ T . We denote the estimate as γexp .

For the error bar of the exponent γexp, we adopted the voting model to simulate the system and apply the parametric bootstrapping method based on it. First, we solved the model recursively up to T = 50 − 30 and T = 50 and obtained the probability functions P (T, n) for both time horizons T . We defined Z(T ) as Z(T ) =

n T

and estimated γ by the

relation γ = log

Var(Z(T − ∆T )) T / log Var(Z(T )) T − ∆T

(C1)

using the probability function P (T, n). Here, γ was estimated in the range T − ∆T ≤ t ≤ T . For T = 50, we take ∆T = 30 to match the analysis of the experimental data in EXP2011. The estimate is exact, and we denote it as γexact. In EXP2011, the number of samples in each bin ranges from 33 to 46, and is very limited. We studied the stochastic model using the Monte Carlo method with the same sample size as the experimental data in each bin and estimated γ for the samples. We repeated this 104 times to obtain the samples of γ. Using the samples of γ, we estimated the 95% confidence interval, which is denoted as [γ− , γ+ ]. The approximately estimated γ was distributed around γexact . The upper (lower) deviation was calculated as ∆γ+ ≡ γ+ − γexact (∆γ− ≡ γexact − γ− ). We estimated the 95% confidence interval of γexp as [γexp − ∆γ− , γexp + ∆γ+ ]. Using this procedure, we estimated the error bars for each γ in Fig. 8.

25

1 0.9

A vs B,ρ=0.82

0.8 Z(0,T)

0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 Z(0,T)

1