Meaning Games

42
1 Meaning Games LENLS 2007 2007-06-18 HASIDA Kôiti ITRI, AIST

Transcript of Meaning Games

1

Meaning GamesLENLS 20072007-06-18

HASIDA KôitiITRI, AIST

2

Introduction to Game Theory

3

Rational Behavior of Individual action selection as optimization

problem maxx u(x): maximize utility u(x) by action x

optimization process inference perhaps largely precompiled

rather than real-time selection: evolution, imitation, economic

competition, etc. contextual factors: free will, etc.

4

Issues of Social Interaction autonomous optimization of entire society?

(invisible hand) Maximization of individual benefits entails

maximization of the social benefit as their total. rational expectation hypothesis Chicago school ... anti-Keynesian

reactionary conservatisms rejecting reformation and supporting imperial colonization, racial discrimination, eugenics, etc. Social Darwinism

In reality, the benefit of the entire society is often impaired!

5

Game Theory mathematical theory about interaction

among multiple players (autonomous agents)

John Von Neumann & Oskar Morgenstern (1944) Minimax principle in zero-sum games cooperative games

John F. Nash (1951) equilibria in n-person non-cooperative games

6

Prisoner’s Dilemma The prosecutor separately asks two fellow

prisoners (A and B) to confess. Do they cooperate (keep silence) or betray (confess)?

other examples: arms race, global warming, etc.

cooperate betray

cooperate 2\2 0\3

betray 3\0 1\1

Nash equilibrium

priso

ner B

prisoner A

7

Non-Cooperative Gameand Nash equilibrium A (mixed) strategy is a probability

distribution over possible actions. A Nash equilibrium is a state (combination of

players’ strategies) where each player’s strategy is optimal as to the others’ strategies. maxxi ui(x1,…, xi-1, xi, xi+1,…, xn): Each player i

maximizes her payoff ui(x1, …, xn) by action xi. Nobody wants to change her strategy in an

equilibrum. A Nash equilibrium exists in every n-person

non-cooperative game. (Nash, 1951)

8

Individual Optimizationvs. Social Optimization Nash equilibria may not be socially

optimal. Prisoners’ Dilemma

Individuals may optimize without optimizing the society, and vice versa. If one person takes a long way round,

then the others’ travel times may become much shorter, reducing the total travel time.

9

Communication and Game

10

Signaling GameSender S of type t T sends message m M to

receiver R, who does action a A.

TM

A

job market(Spence, 1973)

mate selection(Zahavi, 1975)

S job seeker male deer

R employer female deer

T competence vitality

M education antler size

A salary mating

11

Handicap Principle Individual animals communicate their

greater quality through costly morphology or behavior (Zahavi, 1977).

12

Communication and Cooperation Even enemies cooperate in

communication. nonnatural meaning (Grice, 1957) i ≡ S’s intention to convey c to R

S’s intention for R to recognize c and i R wants to recognize c and i even if she

doubts S’s honesty.Both S and R want i to hold.

... cooperation!(S: sender, R: receiver, c: content)

13

Efficiency of Language One linguistic expression may have

multiple meanings in multiple contexts. (Barwise & Perry, 1983)

Efficiency presupposes disambiguation. Collaboration for disambiguation?

14

Collaborative Disambiguationcf. Parikh (1987, 1990, 1992)

Possible correspondences between pieces of content and messages

c1 c2

m1 m2

Optimal correspondencec1 c2

m1 m2

… content

… messages

15

Natural Language Anaphorau1: Max scolded Fred.u2: He (Max) was angry with the man (Fred).u2': ??He (Fred) was angry with the man (Max).

Max Fred

he the manu2 u2'

Max Fred

he the man

Max Fred

he the man

16

Centering Theory (1/3)Centering (Joshi & Weinstein, 1981; Brennan, Friedman,

& Pollard 1987; Walker, Iida, Cote, 1994; Joshi & Weinstein, 1995)

ui: i-th utterance unit (clause) Cf(ui): list of forward-looking centers (entities

referenced in ui) ordered in saliencesalience in English:

subject>direct object >indirect object >other complements >adjuncts

preferred center Cp(ui): first entity of Cf(ui) backward-looking center Cb(ui): most salient entity

in Cf(ui-1) referenced in ui

17

Centering Theory (2/3)Rule I: If an entity E in Cf(ui) is referenced by a

pronoun in ui+1, then Cb(ui+1) must also be referenced by a pronoun in ui+1 (E may be Cb(ui+1)).

(Generalization: More salient entities are referenced by simpler expressions.)

ui+1: He was angry with the man.(He = Max, the man = Fred)Cb(ui+1): MaxCf(ui+1): [Max, Fred]

ui+1: He was angry with the man.(He = Max, the man = Fred)Cb(ui+1): MaxCf(ui+1): [Max, Fred]

ui+1: He was angry with the man.*(He = Fred, the man = Max)Cb(ui+1): Max (violation of Rule I)Cf(ui+1): [Fred, Max]

ui+1: He was angry with the man.*(He = Fred, the man = Max)Cb(ui+1): Max (violation of Rule I)Cf(ui+1): [Fred, Max]

ui: Max scolded Fred.Cb(ui): ?Cf(ui): [Max, Fred]

ui: Max scolded Fred.Cb(ui): ?Cf(ui): [Max, Fred]

18

Centering Theory (3/3)Rule II: CONTINUE > RETAIN >

SMOOTH-SHIFT > ROUGH-SHIFT (Walker, Iida & Cote, 1994).

Cb(ui)=Cb(ui-1) Cb(ui)≠Cb(ui-1)

Cb(ui)=Cp(ui) CONTINUE SMOOTH-SHIFT

Cb(ui)≠Cp(ui) RETAIN ROUGH-SHIFT

19

ui+2: He invited him to dinner.(He = Max, him = Fred)Cb(ui+2): MaxCf(ui+2): [Max, Fred]

ui+2: He invited him to dinner.(He = Max, him = Fred)Cb(ui+2): MaxCf(ui+2): [Max, Fred]

ui+2: He invited him to dinner.*(He = Fred, him = Max)Cb(ui+2): MaxCf(ui+2): [Fred, Max]

ui+2: He invited him to dinner.*(He = Fred, him = Max)Cb(ui+2): MaxCf(ui+2): [Fred, Max]

ui+1: He is waiting for Fred.(He = Max)Cb(ui+1): MaxCf(ui+1): [Max, Fred]

ui+1: He is waiting for Fred.(He = Max)Cb(ui+1): MaxCf(ui+1): [Max, Fred]

CONTINUE or SMOOTH SHIFT

CONTINUE RETAIN

ui: Who is Max waiting for?Cb(ui): ?Cf(ui): [Max]

ui: Who is Max waiting for?Cb(ui): ?Cf(ui): [Max]

20

Drawback of Centering Theory no general principles many different versions no quantitative predictions

Let’s reduce CT to decision-theoretic principles while generalizing it by quantitative (statistical) terms.

21

Meaning Game (1/2)(Hasida, et al., 1995; Hasida, 1996)a turn: [cs, m, cr] T Sender S sends message m M to

convey content cs C. Receiver R interprets m as meaning

cr C.

Unlike Prisoner’s Dilemma, meaning games entail cooperation due to nonnatural meaning.

c1 c2

m1 m2

22

Meaning Game (2/2) An effective solution of a meaning

game should be a Pareto-optimal ESS(evolutionarily stable strategy).

An equilibrium is Pareto-optimal when no other equilibrium is better for all players.

23

Evolutionary Game Theorynarrow sense: game-theoretic analysis of

biological evolution behavioral ecology John Maynard Smith

wide sense: game-theoretic analysis of evolution of social interaction in general

evolutionarily stable strategy (ESS)

24

Evolutionarily Stable Strategy (ESS) a strategy x preventing others When most players' strategy is x, any

other strategy y is disadvantageous.u(x, (1 –ε) x +εy) > u(y, (1 –ε) x +εy)

When ε= 0, u(x, x) u(y, x).→ An ESS is a Nash equilibrium.

25

Stability of Equilibria (Example) Agreement is more beneficial when the

agreement ratio is high, and opposition is more beneficial when the agreement ratio is low.

Nash equilibria are agreement ratio 0, 1, and a.

0 and 1 are ESS, but a is not.

0 1a

benefit of oppositionbenefit of opposition

benefit of agreementbenefit of agreement

agreement ratio

26

Rule I of Centering Theory If an entity E in Cf(ui) is referenced by

a pronoun in ui+1, then Cb(ui+1) must also be referenced by a pronoun in ui+1 (E may be Cb(ui+1)).

ui+1: He was angry with the man.(He = Max, the man = Fred)Cb(ui+1): MaxCf(ui+1): [Max, Fred]

ui+1: He was angry with the man.(He = Max, the man = Fred)Cb(ui+1): MaxCf(ui+1): [Max, Fred]

ui+1: He was angry with the man.*(He = Fred, the man = Max)Cb(ui+1): Max (violation of Rule I)Cf(ui+1): [Fred, Max]

ui+1: He was angry with the man.*(He = Fred, the man = Max)Cb(ui+1): Max (violation of Rule I)Cf(ui+1): [Fred, Max]

ui: Max scolded Fred.Cb(ui): ?Cf(ui): [Max, Fred]

ui: Max scolded Fred.Cb(ui): ?Cf(ui): [Max, Fred]

27

Derivation of Centering Theory (Rule I)(Hasida, et al., 1995)

Max Fred

he the man

probability: P1 > P2

utility: U1 > U2

comparison of expected utilityMax Fred

he the manE1=P1U1+P2U2

Max Fred

he the manE2=P1U2+P2U1

E1-E2=(P1-P2)(U1-U2) > 0Pareto Pareto optimal

common belief

28

Generalization of Rule I(Shiramatsu, et al., 2007)

More salient entities are referenced by simpler (higher-utility) expressions.

corpus-based verification estimation of salience (reference

probability) of entity (potential referent) 1/(1 + exp(-(b0+b1dist+b2gram+b3chain)))

estimation of utility of referring expression Const. + log(occurrence probability)

distance from last reference

salience of grammatical

function of last occurrence

length of successive

coreference chain

29

Annotated Corpora Mainichi Newspaper GDA Corpus Japanese newspaper 1,356 (out of 3,000) articles, 63,562 clauses,

16,728 anaphoric NPs, etc. GDA (Global Document Annotation) tags for

syntax, semantic role, andcoreference/anaphora.

Wall Street Journal GDA Corpus English newspaper 2,412 articles, 135,278 clauses, 95,677

anaphoric NPs, etc. GDA tags translated from Penn TreeBank, plus

coreference/anaphora.

30

Estimated Salience (Reference Probability) of Grammatical Function

(Shiramatsu, et al., 2007)

gram. func. #samples #successivereferences prob. gram. func. #samples #successive

references prob.topicsubj.

objectindirect obj.

subj.

indirect obj.

object

comp.

other postpositionno gram. func.

other prepositionno gram. func.

Mainichi Newspaper Wall Street Journal

31

Whether Pairs of Referential NPs Respect the Generalized Rule I (Mainichi Newspaper) (Shiramatsu, et al., 2007)

E1-E2

#pa

irsnegative pairspositive pairs

ratio

of p

ositi

ve p

airs

32

Whether Pairs of Referential NPs Respect the Generalized Rule I(Wall Street Journal) (Shiramatsu, et al., 2007)

E1-E2

#pa

irs

ratio

of p

ositi

ve p

airs

negative pairspositive pairs

33

Rule II of Centering Theory CONTINUE > RETAIN > SMOOTH-

SHIFT > ROUGH-SHIFT Cb(ui)=Cb(ui-1) > Cb(ui)≠Cb(ui-1)

V Cb(ui)=Cp(ui) > Cb(ui)≠Cp(ui)

Cb(ui)=Cb(ui-1) Cb(ui)≠Cb(ui-1)

Cb(ui)=Cp(ui) CONTINUE SMOOTH-SHIFT

Cb(ui)≠Cp(ui) RETAIN ROUGH-SHIFT

34

Derivation of Rule II (Hasida, et al., 1995) Cb(ui) = Cb(ui-1) raises the reference

probability of Cb(ui) in ui, and hence the utility of ui.

assumption: Successive references raise salience (reference probability or attention).

Cb(ui) = Cp(ui) raises the reference probability of Cb(ui) in ui+1, and hence the expected utility of ui+1.

assumption: Factors contributing to salience are somewhat additive.

Preference for Cb(ui) = Cp(ui) is based on prediction of next utterance, and hence weaker than that for Cb(ui) = Cb(ui-1).

35

Statistical Verification of Rule II(Shiramatsu, et al., 2007)

transition type

transition type

#samples

#samples

avg. expected utilities of transition types Mainichi Newspaper

Wall Street Journal

avg. exp. utility

avg. exp. utility

95% confidence intvl.

95% confidence intvl.

36

More Complex Issues

37

Compound Meaning Game(Hasida, 1996)

sentence-level meaning game, besides NP-level one

Fred insulted Max.He (Max) was angry with the man (Fred).

Angry(Fred,Max) Angry(Max,Fred)probability: P1 < P2

utility: U1 U2

He was angry with the man.

P1U1 < P2U2

38

Partial Sharing of Context (1/2)(Hasida, et al., 1996)

probability: P1 P2

John1 John2

`John Black' `John' `John White'utility: U1 < U0 > U2

two Pareto-optimal equilibria:John1 John2

`John Black' `John'

John1 John2

`John' `John White'

39

Partial Sharing of Context (2/2) actual solution

John1 John2

`John Black' `John White'

The solution of a natural-language meaning game should better be strictly Pareto-optimal in the maximum set of equilibria strictly Pareto-comparable to each other??

40

A Similar GameTwo people are asked to write a number from 1 to 10 without communication. They know that they'll get some money if they happen to write the same number, and that the amount of money is $10 each if the number is from 1 to 9, and $9 each if the number is 10. The Pareto-optimal solutions are those in which they write the same number from 1 to 9, but in reality they'll write 10.

41

Concluding Remarks There are no language-specific rules on

reference resolution. Centering Theory is reduced to meaning game

and general cognitive principles of attention. Language use consists of Pareto-optimal

ESSs of potentially very complex meaning games.

Much to do to address this complexity. Game theory introduces statistical/analytic

methods into semantics/pragmatics.

42

Selected ReferencesKôiti Hasida, Katashi Nagao, and Takashi Miyata (1995) A Game-

Theoretic Account of Collaboration in Communication. Proceedings of the First International Conference on Multi-Agent Systems, 140-147, San Francisco.

Kôiti Hasida (1996) Issues in Communication Game. Proceedings of the 16th International Conference on Computational Linguistics, 531-536, Copenhagen.

Kôiti Hasida, Jerry R. Hobbs, and Megumi Kameyama (1996) Optimality in Communication Games. Proceedings of the Second International Conference on Multi-Agent Systems, Kyoto.

白松 俊・駒谷 和範・橋田 浩一・尾形 哲也・奥乃 博 (2007予定) ゲーム理論に基づく参照結束性のモデル化と日本語・英語の大規模コーパスを用いた統計的検証. 自然言語処理. (Shun Shiramatsu, Kazunori Komatani, Kôiti Hasida, Tetsuya Ogata, and Hiroshi Okuno (2007, to appear) A Game-Theoretic Model of Referential Coherence and Its Statistical Verification based on Large Japanese and English Corpora. Natural Language Processing.)