Crossing the associative/inferential divide: ad hoc concepts and the inferential power of schemata

NOTICE: This is the author’s version of a work that wasaccepted for publication in <Review of Philosophy and Psychology>.Changes resulting from the publishing process, such as peer review,editing, corrections, structural formatting, and other quality controlmechanisms, may not be reflected in this document. Changesmay have been made to this work since it was submitted forpublication. A definitive version is now published inREVIEW OF PHILOSOPHY AND PSYCHOLOGY:http://link.springer.com/article/10.1007/s13164-014-0201-8?sa_campaign=email/event/articleAuthor/onlineFirst

Crossing the associative/inferential divide:

ad hoc concepts and the inferential power of schemata*

Abstract: How do we construct ad hoc concepts, especially those characterised by

emergent properties? A reasonable hypothesis, suggested both in psychology and in

pragmatics (Relevance Theory), is that some sort of inferential processing must be

involved. I argue that this inferential processing can be accounted for in associative

terms. My argument is based on the notion of inference as associative pattern

completion based on schemata, with schemata being conceived in turn as patterns

of concepts and their relationships. The possible role of conscious attention in

inferential processes of this sort is also addressed.

Keywords: concept; ad hoc concept; conceptual combination; inference;

associative process; working memory; Relevance Theory

* A previous version of this paper was presented at the workshop “Ad hoc concepts in Relevance Theory” organized by Agustin Vicente at the University of Basque Country in Vitoria (September 2013). I have to thank Agustin, and all the participant to the workshop (Nicholas Allott, Robyn Carston, Alison Hall, Mark Jary, Elisabetta Lalumera, Fernando Martinez Manrique, Tim Pritchard, Mark Textor) for the precious discussion we had. I also have to thank James Hampton, who in a day-long conversation has generously helped me in focussing some crucial issues of this paper, and also for encouraging comments to a second version. Finally, many thanks are due to the insightful comments of three anonymous reviewers, which have importantly improved the final version.

http://link.springer.com/article/10.1007/s13164-014-0201-8?sa_campaign=email/event/articleAuthor/onlineFirst

1. Introduction

Ad hoc concepts can been defined as temporary concepts constructed spontaneously in order

to achieve a goal that is relevant in the current situation (Barsalou 2010). The notion has been

introduced in psychology by Barsalou (1983) in the context of a discussion of the dynamic nature of

concepts and it has recently been the focus of some research in cognitive pragmatics, under the

assumption that understanding utterances requires that concepts are not simply activated but rather

modulated and adjusted in order to fit specific linguistic and non-linguistic contexts. One of the

most interesting view of such modulation of concepts during language comprehension has been

provided by Relevance Theory (from now on, RT), whose account assigns a crucial role to

inferences. My purpose in the present paper is to elaborate on this idea, specifically by arguing that

the inferences involved in the construction of ad hoc concepts, even in difficult cases characterised

by emergent properties, can be performed by associative processing.

This claim is far from obvious, since psychologists mostly think of associative processing as

a specific kind of learning mechanism (and its counterpart for the recovery of information), which

should be distinguished from other mechanisms of knowledge acquisition and exploitation usually

referred to as inferential (see Shanks 2010 for a review). In most cases, a distinctive feature is held

to consist in the latter – but not the former – processes being accompanied by conscious attention.

More generally, the crucial point is that associative processing is conceived of as insufficient to

provide motivated transitions from one mental content to another. In slightly different words,

associative processing is thought to allow at most the recognition of previously encoded perceptual

patterns, but not any form of inferential reasoning. Examples of this restricted view of associative

processing are legions. For instance, Wood et al. (2007) have conducted a study on nonhuman

primates' ability to perceive actions as goal-directed. Specifically, they have showed that different

kinds of nonhuman primates understand a pointing gesture produced with the elbow as goal-

directed only in case the experimenter's hands are not free to move. Their conclusion is that the

ability of those animals to understand goal-directed actions “extends beyond […] associative

mechanisms, drawing upon inferences about an agent's goals in the context of particular

environmental constraints” (Wood et al. 2007: 1405).

Even in RT, as I will show, there is some reason to suspect that associative mechanisms and

inferences are put in opposition to each other. As a matter of fact, relevance theorists explicitly

differentiate their inferential account from associative ones such as that proposed by Recanati

(2004), according to which “'dumb' processes of activation and association may well mimic the

smart, inferential processes posited by Relevance Theory” (Recanati 2007: 52). It should also be

noted that, in spite of his defence of associative processing as apt to deliver inferential effects,

Recanati himself draws a distinction between associative, inferential-like pragmatic processes and

genuinely inferential pragmatic processes. In recent works (Mazzone 2011; 2013; 2014) I have tried

to expand Recanati's associative approach by arguing that it can be applied to any kind of pragmatic

processing. Once we admit that inferences can be licensed by associative processing, I suggest,

there is no room left for a non-associative notion of inferential processing in pragmatics.

A general demonstration that inferential reasoning can be implemented by associative

processing is largely beyond the scope of this paper. My far more modest aim is to suggest that this

might be the case in the construction of ad hoc concepts. To that effect, I will firstly analyse how the

associative/inferential issue has been framed in pragmatics (section 2) and, specifically, I will

consider an argument against associative accounts of inferences that is indicative of the general

attitude towards this issue. The point of the argument is that associative relations are too poor to

license motivated transitions between concepts. On the contrary, I will show (in section 3) that

simple associative relationships are maintained to allow inferences by otherwise very different

approaches to concepts such as semantic network models and Barsalou's theory. What is common to

them is the idea that concepts are connected to each other by different kinds of associative patterns

which allow, thanks to a simple mechanism of pattern completion, different kinds of inferences. In

section 4, I will frame this idea of associative patterns in terms of the notion of schema, that is, a

structure of representation constituted by a certain number of components and their relationship(s),

and I will argue – following Barsalou – that also sensorimotor representations have such a

schematic organization. In section 5 I will consider the possibility that inferential processing in

general and ad hoc concept construction in particular require conscious attention. I will argue

instead that consciousness and working memory just add stability to the associative construction of

ad hoc concepts, which can occur automatically as well. Finally (section 6), I will illustrate how

difficult cases of ad hoc concepts – that is, cases involving emergent properties – can be accounted

for by an associative account.

2. The associative/inferential issue in pragmatics and the argument of Wilson and Carston (2007)

According to Relevance Theory, ad hoc concept construction would not require any special

explanation. It is rather claimed to be a product of the very same pragmatic process by which both

explicit and implicit content of utterances are determined. This process is supposed to consist in an

inferential derivation thanks to which the implicit content (and other contextual conclusions) are

drawn from explicit content and a number of contextual assumptions. Consider, for example, the

following exchange:

(1) Peter: Will Sally look after the children if we get ill?

Mary: Sally is an angel.

Apparently the implicit content conveyed by Mary's utterance is an affirmative answer to the

question raised by Peter – something like SALLY WILL LOOK AFTER THE CHILDREN IF WE

GET ILL. This content is supposed to be the conclusion of an inference having as its premises the

explicit content of Mary's utterance and possibly other contextual assumptions. As to the explicit

content, however, the concept that the word “angel” contributes to it cannot be the encoded concept

ANGEL which has as its logical property SUPERNATURAL BEING OF A CERTAIN KIND. It is

thought instead to be a different concept – let us call it ANGEL* – obtained by modulating the

encoded concept in accordance with the context.

One might be tempted to think that the construction of such modulated concepts, insofar as

they are components of explicit content, precedes the inferential process proper: ad hoc concepts

such as ANGEL* might be activated by a simple associative dynamic – based on the rank of

accessibility of their properties – and then fed into the explicit content which provides one of the

premises for the inferential process. However, this is not the way in which relevance theorists

conceive of ad hoc concept construction. In their view, the inferential pragmatic process does not

proceed linearly from premises to conclusions; there is, instead, a mutual adjustment of explicit

content, contextual assumptions and contextual conclusions due to both forward and backward

inferences. In practice, not only are conclusions drawn forwards from premises, but also premises

may be adjusted backwards as a result of contextual expectations about the conclusions to be

inferred. But then, backward adjustment may affect explicit content and thus the ad hoc concepts

which constitute it. In our example, Peter's question, insofar as it requires a yes/no answer, can be

thought to raise the expectation that Mary intends to claim either SALLY WILL LOOK AFTER

THE CHILDREN IF WE GET ILL or its negation, and this expectation in turn licences a backward

inference towards the explicit content, which has to be coherent with either the affirmative or the

negative claim. Thus, the concept ANGEL has to be modulated until the explicit content provides a

premise which has either the affirmative or the negative claim as its conclusion.

In sum, according to RT ad hoc concepts may be provisionally activated as a result of a

simple associative dynamic of accessibility, but it is the inferential pragmatic process which

eventually determines the explicit content and the ad hoc concepts that are accepted by the hearer as

part of the contextually appropriate interpretation. That this view implies an opposition between

associative and inferential processing is suggested by a number of papers (Carston 2007; Wilson

and Carston 2006; 2007; Sperber and Wilson 2008) in which relevance theorists argue for their

inferential account in the face of associative alternatives, the most explicit of which has been

proposed by Recanati (2004). It is important to note, however, that Recanati's associative account is

limited to what he calls “primary pragmatic processes”, which include the mechanisms by which

word meanings are contextually modulated so as to provide the explicit content of utterances. In

line with the Gricean approach, Recanati maintains that the further transition from the explicit

content to what is implicitly conveyed by it in a context has to be conceived instead in terms of

genuinely inferential (as opposed to merely associative) processes.

In practice, Recanati conceives of primary pragmatic processes as local associative

processes, based on the spreading of activation within conceptual networks and the consequent

degree of activation of concepts. A concept would be contributed to the explicit content of the

utterance insofar as that concept is the most accessible (i.e. the most activated) for the system given

the situation. There is thus a competition between concepts, which is also affected by what Recanati

calls “accessibility shifts”: the context may enhance (or reduce) the activations induced by isolated

linguistic items, thus subverting the initial patterns of accessibility. For one example, even if the

most accessible meaning for the word “bank” (for an addressee) were FINANCIAL

INSTITUTION, the context might provide other sources of activation which enhance the

accessibility of the alternative concept RIVER SIDE.

An important role is played here by the notion of schema, which can be defined as a representation

tying two or more concepts together into a structured whole. In practice, any concept may activate

schemata which in turn spread the activation to their other components. Schemata are appealed to

by Recanati in order to explain how accessibility shifts may promote the search for coherence in

primary pragmatic processes: the initial advantage of a given interpretation may be overridden due

to the fact that a less accessible concept has a better fit with – and therefore receives further

activation by – some contextual piece of information via schemata of which they are both

components. In this way, concepts that are more coherent with other pieces of information in the

context come to be preferred over initially more accessible but less coherent ones.

In recent papers (especially Mazzone 2011; 2013; 2014) a sort of mediation between RT and

Recanati's approach has been proposed. Recanati (2007: 52) has claimed that, thanks to the role

played by schemata in ensuring motivated transitions between contents, associative processes can

“mimic the smart, inferential processes posited by Relevance Theory”. But then, one is tempted to

extend his approach beyond the explanation of explicit content and apply it to implicit content as

well. This would dissolve Recanati's distinction between primary (i.e., associative and inferential-

like) pragmatic processes and secondary (i.e., genuinely inferential) pragmatic processes, in line

with RT's claim that a single inferential process is responsible for the comprehension of both the

explicit and the implicit content of utterances. There would be, though, a price to pay for relevance

theorists as well: they should accept that not only is associative processing responsible for

delivering the interpretative hypotheses which are then involved in forward and backward

inferences, but also those inferences are in fact performed by associative processes. This would

make no difference for the above explanation of how the ad hoc concept ANGEL* is recovered in

the example (1), that is, by means of a combination of forward and backward inferences. What is

put in doubt is the independent assumption that such inferential mechanism cannot be implemented

by associative processes.

As a matter of fact – although there are interesting points of convergence between RT and

associative accounts – relevance theorists have explicitly contrasted associative explanations with

their own approach in a number of papers (Carston 2007; Wilson and Carston 2006; 2007; Sperber

and Wilson 2008).1 The most extensive argument against the associative account is provided by

Wilson and Carston (2007), and it is interesting to analyse it as indicative of a more general attitude

towards this issue. Wilson and Carston (2007: 243) claim that statistical associations “provide no

basis for drawing warranted conclusions”. The argument is based on considerations of statistical

associations among lexical items in a corpus, as they are represented in the connectionist model

discussed by Kintsch (2000; 2001). Wilson and Carston presume that there is a general cognitive

lesson to be learned from the discussion of this model, and it is that although all inferential

1 It might also be argued that an associative account of inferences is not compatible with RT's modularism. This point, however, is beyond the scope of this paper.

relationships are also associations, not all associations are inferential:

In the minds of many speakers of English, for instance, “shark” is non-inferentially associated with “diver”, “salt” with “pepper”, “love” with “hate”, and so on. (Wilson and Carston 2007: 244)

As a consequence, we are told that associative processes “will vastly overgenerate, and some

method of filtering out unwanted associations will be required’’ (Wilson and Carston 2007: 252). In

this sense, associative processes are unconstrained with respect to drawing warranted conclusions.

I can see two related problems in this argument.

In the first place, by saying that “shark” is non-inferentially associated with “diver”, the

authors apparently mean that only a specific class of associative relationships can properly be

considered inferential. For instance, they observe that, based on co-occurrences in the corpus, the

word “shark” can be expected to have the words “fins”, “dolphin”, “diver” and “fish” as its close

associates, but while “x is a shark” entails “x is a fish” it is not the case that it also entails “x is a

diver” or “x is a dolphin”. They conclude therefore that not all associations are inferential.

However, it is disputable that the domain of inferences should be delimited in such way.

Specifically, it can be doubted that inferences have to be equated to logical entailments based on

category inclusion. In pragmatics as well as in other cognitive domains a variety of probabilistic

inferences can be drawn on the basis of regularities of different sorts. For one example, on the basis

of regularities concerning sharks, from “x is a shark” we can infer “x has fins”.

Here is where the second problem comes in. In order for a cognitive system to license in

different circumstances either the inference from “x is a shark” to “x is a fish” or the inference from

“x is a shark” to “x has fins”, that system has to encode more information than the simple fact that

“fish” and “fins” are both associated to “shark”. If fact, insofar as the nature of the relationships is

not represented, we can only have an indiscriminate activation of all the associates without any

chance to pick up the one which is required by the current cognitive task (either “fish” or “fins”)

and, relatedly, without any chance to grasp the reason why that one is required by the current task

(based on the task, either the “is” or the “has” relationship might be relevant). But then, relevance

theorists' conclusion depends on their specific choice to consider associative representations which

convey very poor or no information. Such a choice – it should be added – is in line with the

widespread (but disputable) assumption that associative relationships may capture at most simple

perceptual configurations or even mere co-occurrences, without any other information than the

probability of their co-occurring.

As I will show in the next section, this assumption is at odds with a claim which is

commonplace in the psychological literature on concepts (for instance, see Murphy 2002), that is,

the claim that our conceptual system is held together by an information-rich network of associative

relations – which also encode information on the relationships among concepts – and that, as a

consequence, those associations license inferences. It might be objected (some referees actually

raised this point) that the notions of association respectively employed in arguments against

associative accounts and in the literature on concepts are different, and therefore that my dispute

with RT is purely verbal. I partially agree on the premise, but not on the conclusion. In many cases,

when inferential processes are distinguished by associative ones the focus is on the kind of

cognitive abilities which appear to be involved at the behavioural level. From this perspective, one

can legitimately claim that certain processes are inferential, not associative, at the behavioural level,

even if it turned out that, at the implementation level, the inferences involved are in fact performed

by associative processing. The point is that, just as it seems to be the case for RT's position, the two

different conceptions of “associative” may be mixed up so that from the fact that inferential (i.e.,

non-associative in the behavioural sense) abilities are at play the conclusion is drawn that those

inferences cannot be processed associatively (in the “implementation” sense).

There are two reasons why I incline to think that this is what happens in RT: first, because

its proponents explicitly reject Recanati's proposal that inferential-like effects can be obtained by

associative processes and, second, because Wilson and Carston's (2007) argument appears to focus

on the implementation level, not on the kind of ability at play. The clarification that I propose is

useful, in my opinion, in order to avoid precisely that sort of conceptual confusions. Similarly, I

have nothing to object to Wood et al.'s (2007) view – see section 1 – that nonhuman primates might

exhibit abilities that are better thought of as inferential rather than associative (behavioural level), to

the extent that no conclusion is drawn as to the possibility that these abilities are implemented by

associative processes.

3. Inference as statistical pattern completion

In the previous section I observed that in pragmatics as well as in other cognitive domains a

variety of probabilistic inferences can be drawn on the basis of regularities of different sorts.

Cognitive science is therefore interested in a less restricted notion of inference than mere logical

entailment based on category inclusion. Moreover, inferences in this wide sense might be performed

by associative processing, provided that sufficient information about the nature of associative

relationships is encoded. In practice, any such association licences inferences of some kind, and

precisely of the kind that the nature of its represented relationship allows it to licence. For instance,

spatial relationships allow spatial inferences, and so on and so forth.

That information-rich associative relationships allow inferences has been claimed by

otherwise very different psychological approaches to concepts such as semantic network models

and Barsalou's theory.

To start with, one of the key intuitions underlying the development of semantic network

models in the 70's was precisely that concepts can be connected by different types of relationships

licensing different types of inferences. To the extent that the node SHARK is connected to the node

FISH by means of a ISA (a category inclusion) link, the semantic network licenses the inference

from SHARK to (ISA) FISH. But with equal right, provided that the node SHARK is connected to

the node FINS by means of a HAS link, the semantic network licenses the inference from SHARK

to (HAS) FINS and so on and so forth. As previously noted, there is no reason why one should only

consider statistical associations between lexical items in texts when posing the question of whether

inferences can be performed by associative processing: conceptual associations of the above sort

should also be considered.

The connection between associative relationships and inferences is also recognised by

Barsalou, in a paper where semantic network models are nevertheless criticised for conceiving

concepts as amodal symbols and as decontextualised entities (Barsalou 2005: 221-222).

Specifically, the inferences enabled by the conceptual system are described by Barsalou (2005) in

terms of statistical pattern completion.

In his view, concepts are not isolated and context-independent entities, detached from

information on settings in which objects appear. Against this view, he proposes the notion of

situated conceptualization according to which concepts are encoded as components of specific

settings to which they are specifically tuned. Situated conceptualizations are maintained to play a

key role in drawing inferences:

The situated conceptualization that becomes active constitutes a rich source of inference. The conceptualization is essentially a pattern, namely, a complex configuration of multimodal components that represent the situation. When a component of this pattern matched the situation, the larger pattern became active in memory. The remaining pattern components-not yet observed-constitute inferences, that is, educated guesses about what might occur next. Because the remaining components co-occurred frequently with the perceived components in previous situations, inferring the remaining components is justified.

Let me focus on a couple of points touched in this quotation.

First, patterns are suggested here to play the same role that Recanati (2004) assigns to

schemata. According to Recanati, a schema receives activation from any of its components and,

once activated, it raises in turn the accessibility of its other components. In short, a schema enables

inferences – i.e., schematically motivated transitions – from any of its components to the other

components. This is also the kind of mechanism described in semantic network models. For

instance, thanks to the labelled link HAS the nodes SHARK and FINS enter into a pattern (a

schema) such that whenever SHARK is activated FINS can be inferred via that pattern. I find it

important to note that this mechanism has the structure of modus ponens: given the pattern IF A

THEN B, and given A we are justified in concluding B. In practice, the mechanism we are

describing is a generalised form of modus ponens, in various senses: because inferences are

probabilistic, because As and Bs can be connected by a number of different patterns besides

material implication (IF-THEN), and because more than two entities can be connected within a

given pattern.

Second, the above quotation not only establishes a direct link between inferences and

frequency of co-occurrence in experience, thus emphasizing the statistical character of inferences 2,

but it even suggests that frequency of co-occurrence is the reason why one is justified to infer the

remaining components of patterns. In a sense, this is an appeal to induction as a form of

justification. For instance, it is the frequency with which sharks and having fins co-occur in our

experience that justifies the inference form SHARK to (HAS) FINS. It should be kept in mind,

however, that in another sense once a schema is encoded in our conceptual system, it is this schema

that justifies the transition from one to another of its components. In this sense, what properly

justifies the inference from SHARK to FINS is the pattern SHARK HAS FINS.3 An interesting

2 “Everything about the production of inferences via pattern completion has a statistical character” (Barsalou 2005: 629).

3 One of the referees has argued that, since I am concerned with a descriptive enterprise, questions of justification had better be left aside. His/her suspicion is that “justifies” here is ultimately just a placeholder for “causes”. To be sure, I am not concerned here with normative epistemological issues. However, there is a cognitive reason for speaking of justification and not of cause: while associations in a restrictive sense can only be causes of concept activation, schematic associations in my sense can be thought to justify concept activation, to the extent that they can act as premises in rules of inference having certain concepts as their conclusions. I have described these rules of inference in terms of generalised modus ponens. (Thus, to be precise, it is the use of schemata in these rules of inference, not schemata in themselves, that justifies the conclusion.) If this is a special, cognitive use of “justification”, so be it.

The referee has two further objections to these considerations. One is that only sentential entities can act as inferential premises and conclusions. However, section 4 is precisely devoted to explaining why my generalised notion of modus ponens is thought to apply to non-sentential entities as well. The general idea is that sensorimotor representations have structure and can be combined productively in a way that is largely isomorphic to linguistic productivity. This is sufficient for sensorimotor representations to be subject to normative considerations, although this is not my concern here. The other objection is that my non-normative use of “justified” is scarcely clear. Does it mean “seems justified”? “This seems implausible – the referee says – since it would presuppose conscious awareness of the transition”. I agree that conscious awareness cannot be a general requirement for the kind of associative processes described here. As a matter of fact, in section 5 I propose that conscious awareness adds stability to the processes at issue, but the inferential structure is already provided by the schematic organization of associative knowledge. I insist that there is a clear distinction between the claim that associations are mere representations of co-occurrences and the claim that associations have schematic organization: in the latter case, but not in the former, activations can provide motivated transitions, that is, transitions from one content (p) to another (q) by means of a schema (if p then q), even when these transitions are not accompanied by conscious awareness.

application – that I will address below – of this distinction between inductive and schematic

justification is that the transition from one mental content to another can be justified even in the

absence of direct inductive evidence of the correlation, provided that there are schemata justifying

that transition.

To sum up, cognitive inferences can be conceived as a form of statistical pattern completion

and, in a sense, as a generalised form of modus ponens.

4. Schemata as sensorimotor representations

Another point raised by Barsalou is that concepts are neither amodal symbols nor

decontextualised entities. This view implies that concepts are richer in information than they are

presumed to be by semantic network models. Let me focus on the first point, the criticism of

concepts as amodal symbols.

Barsalou has famously insisted on the thesis that concepts are based on sensorimotor

representations. This is not intended to mean that our conceptual system is “a collection of holistic

images like those in a camera, video recorder, or audio recorder” (Barsalou 2005: 621). On the

contrary, in Barsalou's view situations and objects are analysed in components and features whose

sensorimotor representations are organised hierarchically in the brain, so that those components

“can be combined productively to produce infinite conceptual combinations” (Barsalou 2005: 625).

In other words, the idea is that structural (or schematic) organization and sensorimotor format of

representations can coexist. I now propose to analyse further this idea by connecting it to the notion

of schema.

Schemata or patterns, as I have described them above, are structures of representation

constituted by a certain number of components and their relationship(s). Recanati (2004) has

adopted a notation for schemata an example of which is STEAL (x) → IS ARRESTED (x). This

notation, although very abstract and simplistic from a cognitive point of view, can be a useful

approximation for certain purposes. One of its merits is that a schema is represented as a

relationship between concepts, and concepts are represented in turn as predicates with slots for

variables. This notation is reminiscent of Frege's (1952a; b) theory according to which concepts are

functions which take individuals as arguments and return truth values. For instance, the concept

DOG (x) is a function that gives the truth value TRUE by substituting for “x” the name of any

individual which is an actual dog.

My suggestion is that schemata have this general structure, that is, they are constituted by

relationships between nodes which have slots that must be filled by selecting a single item from a

range of possible arguments. However, I also propose that a cognitively plausible notion of schema

has to be richer than the above notation suggests, in two senses.

In the first place, schemata must specify the nature of the relationship(s) between their

components. As we saw above, this is required in order for schemata to enable inferences, but it is

also what one expects from all we know about learning. We do not apparently encode concepts such

as SHARK, FISH, FINS, DIVER without encoding at the same time information about how these

concepts are connected with each other – and presumably with other concepts as well. For example,

that sharks have fins – and markedly, that between sharks and fins there is a part-whole relationship

– is something that we come to represent as a simple result of their experiential co-occurrence. In

fact, one could go even further and observe that not only do we encode SHARK and FINS as

involved in a part-whole relationship, but we also represent this as a specific spatial relationship

such that fins are located on sharks in certain positions. It is not difficult to see that this allows

much more specific inferences than that from SHARK to (HAS) FINS alone.

In the second place, the “(x)” in Frege's notation is a very poor instrument for representing

the variability within concepts. It just stands for possible arguments without providing any specific

representation of the range of values within which the arguments may lie.4 However, in the history

of the notion of schema stemming from Bartlett (1932) (and including Mandler 1979; Rumelhart

and Ortony 1977; Schank and Abelson 1977; Smith 1989), there is a different approach which

4 Incidentally, that notation may be used for co-indexing of arguments (as Recanati 2004 has shown), but for our purposes we can leave this issue aside.

allows for a more concrete representation of variability. In practice, psychology and artificial

intelligence have described schemata as containing specification of the possible values for any of

their slots. This specification may include defaults for the most typical values but also other

possible values, whose relative probability can be itself specified. To be sure, these values, just as

any other component of schemata, have been traditionally represented by means of amodal symbols

(propositional labels). However, one can legitimately presume that sensorimotor representations

might also account for variability within schemata. For example, since sharks differ from one

another to some extent, in order to apply their concept to actual instances of the species, this

representation must specify not one single shape but instead a range of variability for possible

shapes. And the same holds for fins and other parts. Thus, the schema SHARK HAS FINS can be

thought to contain a sensorimotor representation of sharks and a sensorimotor representation of fins

with their respective ranges of variability.

To sum up, structural (or schematic) organization and sensorimotor format of representation

can coexist, and the schematic structure appears to consist in relationships between concepts which

have slots to be filled by selecting a single sensorimotor representation from a range of possible

ones.

5. Meaning construction

Up to this point we have been analysing the notions of inference and schema, in order to

assess whether it is the case that associative relationships between concepts are too unconstrained to

play a role in the explanation of inferential processing. Our considerations suggest instead that

associative schemata are information-rich structures of representation enabling inferences

conceived in terms of statistical pattern completion.

However, even assuming that the notion of inference as pattern completion is accepted, and

therefore that contextually modulated concepts can be arrived at justifiedly as the result of

activation spreading through schemata, the issue remains as to whether simple associative activation

can wholly account for what one can call the constructive aspect of pragmatic (and conceptual)

processes. In this vein, Carston (2007) observes that associations might suffice insofar as what is at

issue is activation and deactivation of already encoded concepts (or their parts), but they are not

sufficient in order to understand genuinely constructive processes, that is, processes in which new

patterns of features – including properties that are not directly accessible from the initial concepts –

are constructed on purpose, for instance as a result of fresh metaphor comprehension or cases of

conceptual combination. In other words, we have to deal here with the most difficult and interesting

cases of ad hoc concept construction, that is, those involving emergent properties. For instance,

Carston observes that the utterance

(2) Mary is a bulldozer

requires the construction of the ad hoc concept BULLDOZER* which can be paraphrased as being

“obstinate, insensitive, uninterested in other people's opinions and feelings”, a concept which is not

going to be found in the encyclopaedic entry or the schemata attached to the linguistically encoded

concept BULLDOZER5. Therefore, she concludes, “some process or mechanism other than, or

additional to, the activation (and deactivation) of existing concepts or parts of schemas is required”

(Carston 2007: 29).

In the next section I will show how the account based on associative schemata might explain

cases of ad hoc concepts with emergent properties. The general idea is the following: it is not

necessary a direct connection with the appropriate ad hoc concept, insofar as there is a chain of

schemata providing an indirect route to it. Before turning to that, however, there is another issue

that is worth discussing briefly. The insistence on the constructive nature of these cases might be

motivated by the intuition that concepts can not only be activated one after the other, but they can

5 At least if we suppose that the metaphor is fresh and BULLDOZER* is not lexicalised. Otherwise, one should substitute an actual fresh metaphor for this example.

also combine with each other so as to form temporarily stable configurations. This is interesting

because, as we saw in section 1, some scholars assume that conscious attention and working

memory are distinctive features of inferential (versus associative) processing.

As a matter of fact, relevance theorists have recently explored the possible contribution of

attentional processes to fresh metaphor comprehension. With reference to experimental data on

metaphor comprehension, Rubio Fernandez (2007) analyses a pattern of late suppression of

irrelevant meanings and proposes to interpret this as a case of meaning construction, a process that

would differ from simple meaning selection in that the former but not the latter would involve the

intervention of effortful attentional processes. Specifically, while in interpreting homonyms

irrelevant meanings appear to be suppressed early and in an automatic manner, on the other hand

“suppressing the literal meaning of a novel metaphor would require the operation of later,

attentional processes”, and this is why “we are usually aware of figurative language use” (Rubio

Fernandez 2007: 364). In the same line, Carston (2010) proposes that complex and extended

metaphors might involve a more controlled and reflective mode of processing than conventional

ones.

I agree that conscious attention may have a role to play in these and other cases of

comprehension (see Mazzone 2013). However, this does not prove that complex conceptual

combinations cannot be implemented by simple associative processing, with no intervention of

attentional processes.6 My suggestion is that conscious attention adds stability to such constructive

processes, but it is the schematic organization that I described earlier what makes them possible in

the first place. This claim would require a more detailed examination, but for the present purposes a

couple of considerations will suffice.

In the first place, there are reasons to think that sophisticated cognitive processes can occur

even without the intervention of conscious working memory, although working memory is crucial

in order to ensure stability over time to those processes. An interesting case in point is evidence

6 It has even been suggested that attentional processing emerges itself “from the operations and interactions of very elementary processing units” (Shanks 2010: 275), that is, from basic associative mechanisms. However, for the present purposes we can leave this aside.

from goal pursuit. Although the traditional view conceived goal pursuit essentially as a conscious

and effortful process, decades of research in social psychology starting with Bargh (1989; 1990)

have shown that goal pursuit can be automatic and unconscious. Our previous experience shapes

“associative networks that include contexts, goals that are regularly pursued in these contexts, and

means that one usually uses to attain these goals” (Hassin, Aarts, Eitam, Kusters and Kleiman 2009:

550-551). But then, these associative networks allow for goal pursuit via simple spreading of

activation. In this line, Huang and Bargh (in press) have recently insisted on the existence of deep

similarities between conscious (controlled) and unconscious (automatic) processing of goal-directed

behaviour. In practice, conscious and unconscious action control seem to differ from each other

only in their respective dynamics of activation, with the former being more stable over time than the

latter.

Another line of argument comes from Miller and Cohen's (2001) influential model of

executive functions. According to that model, working memory, conscious attention and cognitive

control are just different facets of a dynamic based in our prefrontal cortex (PFC). Now, the role of

the PFC is described as essentially “modulatory” (Miller and Cohen 2001: 183), in the sense that it

does not perform some proprietary kind of elaboration, it rather ensures that some already occurring

cognitive elaboration is maintained and shielded from distracting and competing processing.

These considerations are easily applied to a simple case of conceptual combination.

Assuming that SHARK and FINS are linked by a spatial part-whole schema, then the activation of

this schema together with its components can be seen as a temporary conceptual configuration

which is able in turn to cause further activations in the network. Whenever such a configuration

succeeds in entering working memory, it reaches a (relatively) stable activation thus activating the

related representations for all the time needed for task execution. But even when the conceptual

configuration is activated without gaining access to conscious working memory, it nevertheless may

play a role in further processing by affecting the activation of related information.

In sum, if our previous considerations are correct, there is no reason to assume that

conceptual combinations necessarily require conscious attention and working memory, although in

the normal case these are present. Specifically, my suggestion is that they give stability to schematic

associative processing. Let us finally turn to the case of emergent properties.

6. Ad hoc concepts with emergent properties

Within the pragmatic literature concerning ad hoc concept construction, emergent properties

– i.e., properties that are not present in the concepts presumed to provide the lexical meaning of

words – have been recognised as an especially difficult issue (e.g., Wilson and Carston 2006; 2007).

This issue is also addressed in psychological literature on concept combination. Hampton and

Jonsson (2012) for instance raise the issue of how, when the concepts BIRD and PET are combined

together in the complex concept PET BIRD, the emergent property LIVES IN CAGES – which is

not present in either of the constituent concepts – can be formed.7 They propose two routes by

which this might occur. One route is what they call extensional feedback: “The attribute is clearly

made available by people recalling visits to pet shops or to friends' homes where they have seen pet

birds”. This amounts to the inductive justification for inferences we spoke of earlier. Through

experience a schema is formed such as PET BIRD LIVES IN CAGES, which then justifies the

inference from the complex concept PET BIRD to the property LIVES IN CAGES. But Hampton

and Jonsson also mention another possibility: “Alternatively people might use their 'background

theories' to guess that if a bird was not kept in a cage, it would not remain a pet for very long”. The

question is, what does it mean that people use background theories and how do these theories

provide non-inductive justification for inferences such as that from PET BIRD to LIVES IN

CAGES?

In my view, the answer can be found in the notion of schema. This is the direction suggested

by Hampton himself when he describes how background theories could be incorporated into

7 In Barsalou's account, ad hoc concepts are expressed by phrasal structures such as, for instance, “things to pack in a suitcase”. However, a concept combination such as PET BIRDS can be re-described in terms of the phrasal structure “birds which are also pets”.

prototype representations (Hampton 2006: 4):

The crucial difference is that similarity to prototype is not a simple function of matching attributes, but involves deeper causal information. One way to think of this is to suppose that in addition to having a set of features, a theory-based prototype has a set of information about the relations between those features. If an item has the features, but does not have them in the right relations to each other [...], then its similarity to the prototype will be poor.

As in the notion of schema we considered earlier, the crucial idea is that the conceptual system must

encode the relationships between its components. We have already seen that this is essential in order

to have inferences conceived as pattern completions. In addition, Hampton and Jonsson suggest that

background theories – and therefore, according to Hampton (2006), relations between features – are

key to the explanation of emergent properties. My proposal is that this might occur as a form of

complex pattern completion based on chains of patterns.

An interesting indication of how this mechanism could work is found again in semantic

network models, an important feature of which was inheritance of properties from superordinate to

subordinate categories. For one example, if BIRD is a superordinate category with respect to

ROBIN and if (HAS) WINGS is encoded as a property of BIRD, then the network licences the

conclusion that that property applies to ROBIN as well. In such a case, there is no encoded pattern

directly connecting ROBIN and (HAS) WINGS and therefore no pattern completion in the sense we

considered earlier. Nevertheless, there is a chain of two patterns – ROBIN ISA BIRD and BIRD

HAS WINGS – such that ROBIN is indirectly connected to HAS WINGS thanks to that chain: this

can be called a case of complex pattern completion.

It should be noticed, however, that inheritance in semantic networks is dependent on the

specific labelled links involved: properties can only be transferred downwards along ISA links. Can

this be explained on the basis of simple spreading of activation along the network? Although a

complete account is beyond the scope of this paper, we should keep in mind the possibility of a

more complex dynamic than activation simply spreading in all directions. Specifically, we should

consider the possibility of a rich network involving inhibitory as well as excitatory links (e.g., see

Waltz and Pollack 1985). For example, superordinate categories might have inhibitory links with

properties that are specific of subordinate categories, so that those properties cannot be transferred

upwards along ISA links. For another example, the inference from ROBIN to (IS NOT) DOVE

might be licensed by lateral inhibitory links between concepts at the same hierarchical level.

To be sure, the transfer of activation from ROBIN to (HAS) WINGS does not amount to the

explicit proposition ROBIN HAS WINGS. Even more so, the inhibition of DOVE as caused by the

activation of ROBIN does not amount to the explicit proposition ROBIN IS NOT DOVE. However,

my suggestion is, first, that the dynamic of activation and inhibition together with the schematic

organization of conceptual networks can explain implicit inferential processing and, second, that

conscious working memory adds stability to this inferential process. In other words, our cognitive

system can make assessments of consistency between conceptual contents on the basis of schematic

relationships which are at its disposal also in automatic associative processing. When conscious

working memory enters into the picture, assessments of consistency become explicit since the

subject focusses attentively on schemata and chains of schemata connecting the contents at issue. It

is also possible that occasional schemata are formed in working memory as a result of contents

being connected by means of chains of encoded schemata. For instance, when the subject focusses

attentively on the sequence of two schemata ROBIN ISA BIRD and BIRD HAS WINGS, she might

form the occasional schema ROBIN HAS WINGS as a result of their combination. For another

example, by attentively focussing on the schemata ROBIN ISA BIRD and DOVE ISA BIRD the

subject might form the occasional schema ROBIN IS NOT DOVE in working memory.

With all this in mind, we can sketch an explanation of how the emergent property LIVES IN

CAGES can be inferred starting from the combination of the concepts PET and BIRD. Let me

emphasise that what follows is just a sketch, intended to give a general idea of how complex pattern

completion might work in a case like this. We should also keep in mind that any attempt to provide

explicit representations of our conceptual systems suffers from the difficulty of doing this in a non-

arbitrary way. For the sake of simplicity I will use the sketchy notation adopted by Recanati (2004),

in which there is no explicit representation of the relationships between concepts.

I propose to consider two chains of schemata, one starting from the concept BIRD, the other

from the concept PET:

(1) a BIRD (x) → FLY (x)

b FLY (x) → FREE TO GO AWAY (x)

(2) a PET (x) → PEOPLE WANT TO PREVENT (FREE TO GO AWAY (x))

b PEOPLE WANT TO PREVENT (FREE TO GO AWAY (x)) → PEOPLE KEEP

(PRISONER (x))

c PRISONER (x) → IN CAGE (x)

By way of complex pattern completion, the concept BIRD allows one to infer FREE TO GO AWAY,

which is also a component of the property PEOPLE WANT TO PREVENT (FREE TO GO AWAY

(x)) associated to the concept PET. Thus, the activation of both PET and BIRD can be expected to

cause strong activation of this property, which in turn activates the sequence ending with the

concept IN CAGE.

To be sure, as a result of the activation of the concepts PET and BIRD, other properties than

(LIVES) IN CAGES can be activated as well. As a matter of fact, in free property listing tasks a

variety of sparsely related properties is supplied by subjects. However, this appears to depend on the

specific nature of the task: free property listing is a very special cognitive task in which there is no

specific goal to be pursued except listing properties itself. Thus, it is no surprise that subjects supply

scarcely systematic lists of properties. On the contrary, in normal cases linguistic and non-linguistic

contexts can be expected to constrain concept construction in a number of ways, which include

evidence of the specific goals pursued by others – for instance, the goal pursued by a speaker when

she speaks of “pet birds” on a particular occasion. In accordance with RT's view exposed in section

2, contextual evidence about communicative goals presumably affects concept construction by

means of backward inferences to the explicit content of utterances. In other words, goals put a

rational constraint on the interpretation, by triggering a search for the concepts apt to figure in the

explicit content if this has to act as a premise drawing to the contextually expected conclusion(s).

To sum up, my general point is that such backward and forward inferences can be explained

as cases of pattern completion based on the schematic structure of associative networks of concepts.

Let me insist: this is not to say that the spreading of activation forwards and backwards in

associative networks is all there is to conceptual processing. On the contrary, my suggestion is that

most of the times conscious working memory interact with automatic activation adding stability to

the inferential process. As a consequence, (some of) the inferential relationships might be grasped

consciously.

7. Conclusions

Relevance theorists have suggested that associative relationships are not sufficient to explain

pragmatic inferences, in that associations are too unconstrained to licence warranted conclusions.

On the contrary, I have claimed that associative relationships have enough structure to drive and

constrain inferential processing. Specifically, I have tried to provide an associative account of the

inferences by which ad hoc concepts involving emergent properties could be constructed.

My proposal is based on the claim, that is widespread in psychology, that our conceptual

system contains rich representations and that those representations license inferences. More

specifically, I have analysed the notion of inference in terms of statistical pattern completion based

on schemata. In turn, schemata have been analysed as patterns constituted by concepts and the

specific relationships between them, with concepts specifying ranges of variability for their

application. Finally, I have argued that those inferential processes may occur automatically and that,

although conscious attention normally cooperates with them, it does not change drastically their

nature but only adds stability to the process.

References

Bargh, J. A. (1989). Conditional automaticity: Varieties of automatic influence on social

perception and cognition. In J. Uleman & J. Bargh (Eds.), Unintended thought (pp. 3–51). New

York: Guilford Press.

Bargh, J. A. (1990). Auto-motives: Preconscious determinants of social interaction. In E. T.

Higgins & R. M. Sorrentino (Eds.), Handbook of motivation and cognition (Vol. 2, pp. 93–130).

New York: Guilford Press.

Barsalou, L. W. (1983). Ad hoc categories. Memory & Cognition, 11, 21–27.

Barsalou, L.W. (2005). Situated conceptualization. In H. Cohen & C. Lefebvre (Eds.),

Handbook of categorization in cognitive science (pp. 619-650). St. Louis: Elsevier.

Barsalou, L.W. (2010). Ad hoc categories. In P. Hogan (Ed.), The Cambridge Encyclopaedia

of the Language Sciences (pp. 86-87). Cambridge: Cambridge University Press.

Bartlett, F. C. (1932). Remembering: An experimental and social study. Cambridge:

Cambridge University Press.

Carston, R. (2007). How many pragmatic systems are there? In M.J. Frapolli (Ed.), Saying,

Meaning, Referring. Essays on the Philosophy of François Recanati (pp. 1-17). Palgrave, New

York.

Carston, R. (2010). Metaphor: Ad hoc concepts, literal meaning and mental images.

Proceedings of the Aristotelian Society, 110(3), 295–321.

Dehaene, Stanislas, Jean Pierre Changeux, Lionel Naccache, Jerome Sackur & Claire Sergent

(2006). Conscious, preconscious, and subliminal processing: a testable taxonomy. Trends in

Cognitive Sciences, 10(5), 204–11.

Frege, G. (1952a). Function and Concept. In P. Geach and M. Black (Eds. and trans.),

Translations from the Philosophical Writings of Gottlob Frege (pp. 21-41). New York:

Philosophical Library.

Frege, G. (1952b). On Concept and Object. In P. Geach and M. Black (Eds. and trans.),

Translations from the Philosophical Writings of Gottlob Frege (pp. 42-56). New York:

Philosophical Library.

Hampton, J.A. (2006). Concepts as Prototypes. In B.H.Ross (Ed.), The Psychology of

Learning and Motivation: Advances in Research and Theory, Vol. 46 (pp. 79-113). Amsterdam:

Elsevier.

Hampton, J.A., & Jönsson, M.L (2012). Typicality and compositionality: The logic of

combining vague concepts. In M.Werning, W.Hintzen, and E.Machery (Eds.), Oxford Handbook of

Compositionality (pp. 385-402). Oxford: Oxford University Press.

Hassin, R. R., Aarts, H., Eitam, B., Custers, R., & Kleiman, T. (2009). Non-conscious goal

pursuit and the effortful control of behavior. In E. Morsella, J. A. Bargh, & P. Gollwitzer (Eds.),

Oxford handbook of human action (pp. 549–566). New York: Oxford University Press.

Huang, J.Y., Bargh, J.A. (in press). The Selfish Goal: Autonomously Operating Motivational

Structures as the Proximate Cause of Human Judgment and Behavior. Behavioral and Brain

Sciences.

Kintsch, Walter (2000). Metaphor comprehension: a computational theory. Psychonomic

Bulletin & Review, 7, 257–266.

Kintsch, Walter (2001). Predication. Cognitive Science, 25, 173–202.

Mandler, G. (1979). Organization and repetition: Organization principles with special

reference to rote learning. In L. Nilsson (Ed.), Perspectives on memory research (pp. 293-327).

Hillsdale, NJ: Erlbaum.

Mazzone, M. (2011). Schemata and associative processes in pragmatics. Journal of

Pragmatics 43, 2148–2159.

Mazzone, M. (2013). Attention to the speaker. The conscious assessment of utterance

interpretations in working memory. Language & Communication, 33, 106–114.

Mazzone, M. (2014). The continuum problem: Modified Occam's Razor and

conventionalisation of meaning, International Review of Pragmatics, 6, 29-58.

Miller, E. K., Cohen, J. D. (2001). An integrative theory of prefrontal cortex function. Annual

Review of Neuroscience, 24, 67–202.

Murphy, G.L. (2002). The Big Book of Concepts. Cambridge, MA: The MIT Press.

Recanati, F. (2004). Literal Meaning. Cambridge: Cambridge University Press.

Recanati, F. (2007). Reply to Carston. In M.J. Frapolli (Ed.), Saying, Meaning, Referring.

Essays on the Philosophy of François Recanati (pp. 49-54). Palgrave, New York.

Rubio Fernandez, Paula (2007). Suppression in Metaphor Interpretation: Differences between

Meaning Selection and Meaning Construction. Journal of Semantics, 24(4), 345–371.

Rumelhart, D. E., & Ortony, A. (1977). The representation of knowledge in memory. In R. C.

Anderson, R. J. Spiro, & W. E. Montague (Eds.), Schooling and the acquisition of knowledge (pp.

97-135). Hillsdale, N.J.: Lawrence Erlbaum Associates.

Schank, R. C. & Abelson, R. (1977). Scripts, plans, goals and understanding. Hillsdale, NJ:

Lawrence Erlbaum Associates.

Shanks, D. R. (2010). Learning: From association to cognition. Annual Review of Psychology,

61, 273–301.

Smith, E. E. (1989). Concepts and induction. In M. I. Posner (Ed), Foundations of cognitive

science (pp. 501-526). Cambridge, MA, US: The MIT Press.

Sperber, D., Wilson, D. (2008). A deflationary account of metaphors. In R. W. Gibbs (Ed.),

The Cambridge Handbook of Metaphor and Thought. (pp. 84-108). New York: Cambridge

University Press.

Waltz, D.L., Pollack, J.B. (1985). Massively Parallel Parsing: A Strongly Interactive Model of

Natural Language Interpretation. Cognitive Science, 9, 51–74.

Wilson, D., Carston, R., (2006). Metaphor, relevance and the emergent property issue. Mind

and Language, 21(3), 404–433.

Wilson, D., Carston, R., (2007). A unitary approach to lexical pragmatics: relevance, inference

and ad hoc concepts. In N. Burton-Roberts (Ed.), Pragmatics (pp. 230–260). New York: Palgrave.

Wood, J.N., Glynn, D.D., Phillips, B.C., Hauser, M.D. (2007). The Perception of Rational,

Goal-Directed Action in Nonhuman Primates. Science, 317, 1402-5.

Crossing the associative/inferential divide: ad hoc concepts and the inferential power of schemata

Documents

Transcript of Crossing the associative/inferential divide: ad hoc concepts and the inferential power of schemata