Social cognition in complex team networks

15
Social Cognition in Complex Team Networks Samuel M. D. Seaver , R. Dean Malmgren , Andr´ e A. Moreira , Marta Sales-Pardo , Daniel Diermeier , Lu´ ıs A. Nunes Amaral Department of Chemical and Biological Engineering Northwestern University, Evanston, IL 60208, USA Departamento de F´ ısica UniversidadeFederal doCear´a, Cear´a, Brazil Managerial Economics and Decision Sciences, Kellogg School of Management Northwestern University, Evanston, IL 60208, USA To whom correspondence should be addressed; E-mail: [email protected]. March 14, 2006 1

Transcript of Social cognition in complex team networks

Social Cognition in Complex Team Networks

Samuel M. D. Seaver�, R. Dean Malmgren

�, Andre A. Moreira

�,

Marta Sales-Pardo�, Daniel Diermeier

�, Luıs A. Nunes Amaral

�����

�Department of Chemical and Biological Engineering

Northwestern University, Evanston, IL 60208, USA�Departamento de F ısica

Universidade Federal do Cear a, Cear a, BrazilManagerial Economics and Decision Sciences, Kellogg School of Management

Northwestern University, Evanston, IL 60208, USATo whom correspondence should be addressed; E-mail: [email protected].

March 14, 2006

1

Abstract

We use the density classification task to model social cognition in complex networks.Agents are embedded in a team network (1) and use simple social learning heuristics toadjust their current state. We investige various agent-specific biases and system perfor-mance measures. We show that the performance of team networks perform less well thansmall-world networks.

2

1 Knowledge Networks

Innovation and problem solving is almost always the consequence of collaboration in teams

or large groups. Here we model creative interaction in organizations as a complex network

(e.g. (1–3)) and systematically investigate whether and how the structure of networks affects an

organization’s ability to solve problems and find innovative solutions.

Recent research has begun to uncover the factors that lead to successful team networks.

An important example is Uzzi and Spiro’s (2005) empirical study of the Broadway industry.

To conduct their analysis Uzzi and Spiro use a newly created data set on creative networks in

Broadway musicals (all 2258 Broadway productions from 1877-1990). This industry is a good

choice to investigate this question for two reasons. First, there are no firms. Teams consist of

producers, directors, lyricists, composers, etc. and form on a project-by-project basis. Dense

social clusters form if the same people repeatedly collaborate, new links form as new-comers

join the system or as incumbents collaborate on a project for the first time. The second reason

is that success in the Broadway industry can easily be measured by box-office revenues, critical

acclaim or the length of runs.

Uzzi and Spiro show that the connectedness of the network (measured by clustering coef-

ficients) has a strong effect on the average yearly success of the industry, even if controlled

for various other independent variables such as market characteristics etc. Surprisingly, the

relationship is curvi-linear. Uzzi and Spiro suggest that if networks are too dense, too much im-

itation may lead to the dominance of conventional ideas. On the other hand, if networks are not

sufficiently connected, fresh ideas may emerge, but they remain isolated. From a managerial

point of view this means that there is a “sweet-spot” in the structure of the networks. An orga-

nization could potentially measure the connectivity of their internal network and then attempt

to increase or decrease its connectivity.

3

Influenced by this line of work, Guimera et al. (2005) study a model of team formation

characterized by the propensities that incumbents continue to collaborate on a new project or

are matched with other incumbents or with newcomers. They show that different matching

probabilities lead to different network topologies and therefore different expected performance.

Surprisingly, the model is able to replicate the structure of various collaboration networks from

different scientific communities.

What is missing from these papers is a behavioral model of collective problem solving that

then can be used to examine the effects of network topologies on system performance in more

detail.

2 A Model of Collective Problem Solving

In a recent series of papers (2,5) we demonstrated how a complex network topology, combined

with noise can alter the efficiency with which a system of decentralized units performs a global

cognition task. The idea is to consider networks of agents who can be in one of two states

denoted here ��������� . Moreover, to capture the effects of misunderstandings and other forms

of miscommunication there is a probability of miscommunication. That is, with probability agents perceive the state of any connected agent to be randomly ��� . Collective problem solving

is modeled as a density classification task, a widely used measure of coordination and global

information processing (4). For a system comprised of units whose state is a binary variable,

the density classification task is completed successfully if all units converge to the same state

and the coordinated state is identical to the majority state in the initial configuration. The initial

state of the system can be interpreted as receiving a partial signal about the correct solution.

Agent may use various social learning heuristics. Specifically, we consider a population

comprised of independent agents and assign to each agent a bias ���� ������������������������ with �!����"�"�"#�$ . If �%�&�'�����(��� , then the agent is “conservative.” A conservative agent requires

4

a “qualified” majority of her neighbors to convince her to change her state. If � � � ��� , the

agent is ”partisan” towards ��� . A partisan agent will “stick” to his preferred state until con-

vinced otherwise by a qualified majority of his neighbors. The update rule for an arbitrary agent

implementing a qualified majority heuristic is:

�����(� � � � ���� �� � � � ��������� � � �$���������� � ������� � � � �$�� � � ��������� � � �$� � (1)

where � ���(��� is the average noisy “signal” an agent receives from his ��� neighbors

We further assume that agents can have different levels of bias strength � � �� � ��� , which

measures the fraction of an agents’ neighbors that must be in the state opposing the state pre-

ferred by the agent in order to “force” him to change state. If #� ��� the agent uses a majority

rule, whereas if �� � � the agent is frozen and will never change his state.

The efficiency ��� ��� �� �$ � of an updating rule � , is a function of noise intensity , rewiring

probability � , and system size . Specifically, we have:

� �(����� �� �(��� � � �(��� � (2)

where �� is the number of agents that are in state “� � ” and !� is the number of agents that

are in state “ � � ”. For each realization, we let the system evolve for "� timesteps. We define

the efficiency � of a single realization as

� #%$'& �)(

�)( � (+*-, � �����/.�� (3)

The efficiency ��� for each set of parameter values is the average of � over 1000 realizations.

Moreira et al. (2004) consider small world networks (6) with � �10 neighbors, rewired with

a probability � (where a small world topology occurs at �32 � ) and a probability of miscom-

munication. They show that if agents use the majority heuristic, that is, set � �4� , the system

5

exhibits rapid and robust convergence to the correct state, provided the interaction structure is

characterized by moderate noise and constitutes a small world-network. Importantly, both con-

ditions are necessary for effective problem solving. Moreover, more complicated decision-rules

that work well in the case of � � and �&� � , for example the Gacs-Kurdyumov-Levin (GKL)

rule (4) fail to function when communications is noisy or interaction occurs in an asynchronous

fashion. Intuitively, this implies that random connections to other members of the network not

only are important for problem solving, but that if they exist even extremely simple decision

heuristics can be successful. Surprisingly, for � � 0 , the case for which the GKL rule was opti-

mized, the efficiency of the majority rule reaches a value of 0.85, which is greater than what is

obtained with the GKL rule under the idealized conditions &��� and � ��� . Strikingly, unlike

the GKL rule, the majority rule yields efficient coordination even for asynchronous updating.

One can thus understand the role of each of the components in this condition. The noise enables

the system to escape conformations with multiple domains. The rewiring of the connections al-

lows fast access to information from across the system—i.e., the long-range connections make

the system a small-world. Importantly, the small-world phenomena is not enough to ensure the

convergence to the correct classification. Only with the combined effect of the noise and the

small-world topology can the system reach a consensus within the permitted evolution time.

Moreira et al. originally used a value of � � � " � , however with the use of agents that have a

preferred state, we need to ensure that the majority state, chosen to be “+1”, is always the same.

To find the ’best’ value for � we explore, for a variety of system sizes, the resulting efficiency

with which a system reaches consensus within "� time steps, as a function of � (Fig. 1A). We

choose a combination of a system size that is computationally inexpensive ( � $ � � ) and the

lowest value of � that will lead to an average efficiency 2 � "�� ��� of the time ( � � � " ��� ). The

resulting efficiency calculated by the original and new methods is compared as a function of

noise in a homogenous population and is found to be similar (Fig. 1B).

6

Seaver et al (2006) consider more general interaction structures with � � � . In this case

agents may exhibit various forms decision biases, as discussed above. Adding partisans to

the model (even if they are distributed evenly between partisans for� � or ��� ) dramatically

decreases the performance of the system. In the case of conservative agents the model yields

a surprising conclusion. For moderate levels of bias ( #� � � # � ) the system not only shows

remarkable levels of efficiency, but system performance actually increases as the fraction of

conservatives increases, provided the noise level is sufficiently high. However, there is an im-

portant trade-off with the speed which a solution is reached. That is, the time to reach consensus

grows in the fraction of conservations. If the fraction is larger than 30%, consensus cannot be

reached within "� time steps.

To estimate the “typical” number of time steps it takes for a system to reach the steady

state we use the following method: We generate 5000 time series of � time steps with different

initial conditions using the same parameters. (Fig. 2). We compute the average time series,� �(��� � ���������� ���������� � � � �(��� . Then for each time step � we compute the average efficiency for the

remaining time steps ��� �(��� � �� ����� � �(��� � �� � . We fit the resulting curve (Fig. 2B) to a stretched

exponential

� ��� � �����%� ����� ���! "$#�% � (4)

We estimate the convergence time � as the time step at which the efficiency variation equals

95% of the total variation of �'& ������ �)( � �+*�, � " � �� ��- (5)

We explore a system with �4$ � ��� ��4� "�$ �-� �4� " � � � �. �4� " $ , and � $ # � . We obtain

the convergence time � for the system at � � "�$ and an increasing�'.

. It can be seen from Fig.

3A that the time needed for a system to reach a steady state increases with�/.

. The steady state

7

is dominated by agents in the state “+1” meaning that a system is highly efficient at reaching a

consensus state of “+1” provided that it is given enough time to evolve.

Finally, from Moreira et al. we know that a larger � will make a system more robust to

noise (2). Our results have shown that the presence of conservatives will also make a system

more robust to noise. Therefore, to observe whether conservatives would make a system more

robust to noise independently of � we explore the efficiency for � � $ , � $ � � , � � � " � � ,���" # � , and ��� # � (Fig. 4). Though these results are not directly comparable to the results

for � � 0 , the same trend can be seen with � " # � where the presence of conservatives still

makes the system more robust to noise.

3 Team Networks

In addition to considering a small-world topology, we will also consider more realistic social

networks generated according to the model of Guimera et al. (1) The model is built on the

concept that collaboration usually occurs in teams. A network is built using teams of a fixed

number of agents ( � ) as building blocks. For each agent in a new team there is a probability �

that it will be an incumbent, that is, an agent already present in the network, and a probability

� ��� that the agent will be a newcomer. If there are more than one incumbents in a team, there

is a probability � that an extra incumbent will be a previous collaborator of the incumbents

already in the team. Otherwise, with a probability � ��� , the extra incumbent is any incumbent

in the network. Finally, agents that do not participate in any team for a long time are considered

to have ’retired’ and are removed from the network, allowing the size of the network to reach a

steady state.

Almost every network generated using this model is fragmented and thus for a more reliable

comparison to idealized small-world networks we use the giant components of the networks.

For one set of giant components we use � ��� "�� 0 $ and a retirement rate (���" � � to reach both

8

a mean network size �� � � and an average giant component size that is �

� � � of the network.

For the second set of giant components we use � � � " � and ( � � $ � to reach a mean network

size of �4$ $ � and an average giant component size that is � � � � of the network. For both set

of giant components, the average size is � $ � � agents. Fig. 5 contain examples of the giant

components we use displaying the differences in the network organisation. In order to quantify

the differences in sparsity we measure the average degree of every node in these two sets—in the

giant components that only make up 50% of their networks, their average degree is � $ " � 0 and

in the giant components that make up � � � of their networks, the average degree is � $ " $ � . We

explore the efficiency of these networks at reaching consensus as a function of the fraction of

conservatives and of the noise intensity. The results are present in Fig. 6. The average degree of

each agent in the small-world networks is 0 simply because of the number of neighbors �&� 0 ,but this measure does not seem to be the dominant reason that the overall efficiency in the team

networks is much smaller. Compare these results with the results for small-world networks

where � � $ (Fig. 4). The dominant factor seems to be that these networks are not small-world

networks.

4 Conclusion

We provide a model of social cognition in complex networks. We show that very simple rules

can perform very well in problem solving tasks such as the density classification system pro-

vided the networks satisfy the small-worls property and interaction between agents is subjecto

moderate noise. In the case where interaction between agents is given by a self-organizing team

network estimated from the data, the system performs moderately well, but not as well as in the

case of small-world networks. This suggests the room for managerial intervention to improve

sytem performance to increase the connectedness of the network.

9

References and Notes

1. Guimera, R, Uzzi, B, Spiro, J & Amaral, L.A.N (2005) Science 308 697.

2. Moreira A. A, Mathur, A, Diermeier, D & Amaral, L.A.N (2004) Proc. Nat. Acad. Sci 101

12083.

3. Uzzi, B & Spiro, J (2005) Am. J. Sociol. 111 447.

4. Crutchfield, J. P & Mitchell, M. (1995) Proc. Natl. Acad. Sci. U.S.A. 92, 10742.

5. Seaver, S. M. D, Malmgren, R. D, Moreira A. A, Sales-Pardo, M, Diermeier, D & Amaral,

L.A.N (2006) Science under review.

6. Watts, D. J.& Strogatz, S. H. (1998) Nature. 393 440.

10

0.5 0.52 0.54 0.56 0.58 0.6

Probability that εj(t=0)=+1 , c

0.4

0.5

0.6

0.7

0.8

0.9

1.0

E

1012014018011601

0.0 0.1 0.2 0.3 0.4 0.5 0.6

Noise

0.0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1.0

E

B

A

Figure 1: Exploration of initial distribution of state “+1”—The probability that an agents’ statewould initially be “+1” ( � ) in Moreira et al. (2) is � " � . The initial majority was always tiny, andthe final consensus was either 1 or -1. In order to force a consistent consensus in the contextof which we could explore biased agents � was chosen to be � " ��� . Fig. 1A highlights why� � � " ��� and � $ � � was chosen. Fig. 1A shows the response of systems to the increase inthe starting fraction of agents with a positive state (� �1� " � � and � � " " ). A system size of 401was chosen as a compromise due to the performance time and thus � " ��� was used as the startingfraction where each realisation would, on average, achieve an efficiency 2 � " � � . By using theparameter in this manner the majority will always be “+1” and will almost always become thefinal consensus of the system, and the dynamics that resist the consensus will be more obviousand easier to understand. Fig 1b shows a comparison between the original (’old’) method ofdetermining efficiency (2), and the new method where �1$ � � . In addition � �1� " � for the oldmethod and � � � " ��� for the new method.

11

0 2000 4000 6000 8000 10000Time

0

0.2

0.4

0.6

0.8

1

ε(t)

,

0

0.2

0.4

0.6

0.8

1

ε(t)

0 2000 4000 6000 8000 10000Time

0.8600.8620.8640.8660.8680.8700.8720.8740.8760.8780.880

ε av(t

)

A B

Figure 2: Method used to estimate ��� —(A) Examples of one of� � � � time series generated

from a system where � $ � � , � � "�$ , � � � " � � ,�. � � " $ , and � $ # � and the average

time series � �(��� (B) The resulting curve when, for each time step � in � ����� we compute theaverage efficiency for the remaining time steps � �� ����� � � � �� � ��� � � �� � . This curve is fitted to a

stretched exponential� ��� � � ��� � � ��� ��� " # % � . � �)( � �+*�, ��� ���� � - which is an estimation of the

time it takes for the efficiency to vary by � ��� of the total variation at � ��� .

0 0.2 0.4 0.6 0.8 1Fraction of Conservatives

101

102

103

104

t*

0 0.2 0.4 0.6 0.8 1Fraction of Conservatives

0.6

0.7

0.8

0.9

1

Eas

,E

BA

Figure 3: (A) Time � for a population of agents to reach the stationary state. (B) Comparisonof the asymptotic efficiency ( � �� ) and the efficiency � at � � "� as a function of the fractionof conservatives. Note that for

�. � � "�� , the system cannot reach the stationary state in the "�timesteps used in the simulations.

12

0.0 0.2 0.4 0.60.0

0.2

0.4

0.6

0.0 0.2 0.4 0.6 0.2 0.4 0.6−1.0−0.8−0.6−0.4−0.2 0.0 0.2 0.4 0.6 0.8 1.0

Co

nse

rvat

ives

Fra

ctio

n o

f

s = 2/5 s = 3/5

Noise

Bias Strength

Efficien

cy

Figure 4: Efficiency of the system at reaching consensus as a function of the fraction of con-servatives and of the noise intensity. The parameters are � $ � � , � � � " ��� , � � � " � � , and�&� $ . Note that the change in � means a change in the range of . It is clear that the presenceof conservatives will increase the robustness of the system.

13

AA

B

Figure 5: Examples of initial configurations and final states of “Teams” networks—The net-works here are built according to the model of Guimera et al, being giant components thatconsist of (A)

� � � or (B) � � � of the number of nodes. For each set, two different seedswere used to create the different distribution of agent types and states. Note that the final local-ized regions of blue (“-1”) nodes are clustered around conservatives of the same state, which isindicative of the short-range influence that conservatives have.

14

0.0 0.2 0.4 0.6

0.0 0.2 0.4 0.60.0

0.2

0.4

0.6

0.0 0.2 0.4 0.6 0.0 0.2 0.4 0.6

0.0 0.2 0.4 0.6 0.0 0.2 0.4 0.6

0.2 0.4 0.6−1.0−0.8−0.6−0.4−0.2 0.0 0.2 0.4 0.6 0.8 1.0

Fra

ctio

n o

f C

on

serv

ativ

es

Noise

s = 2/7 s = 4/7 s = 6/7Bias Strength

Efficien

cy

A

B

Figure 6: Efficiency of the ’teams’ model—Efficiency of the system, built according to the’teams’ model of Guimera et al., at reaching consensus as a function of the fraction of conser-vatives and of the noise intensity. These results are for systems with 57% of the agents beinginitially in state +1 and (A) a mean number of 398 agents, � � � "�� 0 $ , � � � " � , ( � " � � (B)a mean number of 363 agents, � � � " � , � � � " � , ( � ��$ � . The value used for � in A meansthat the average fraction of the total number of nodes that are in the giant component used isaround $ � � and the average degree is $ " � 0 . In B they are

� � � and $ " $ � respectively. The otherparameter value used is � � � " ��� .

15