Analyzing psychotherapy process as intersubjective sensemaking: An approach based on discourse...

19
PLEASE SCROLL DOWN FOR ARTICLE This article was downloaded by: [Nitti, Mariangela] On: 27 July 2010 Access details: Access Details: [subscription number 924820690] Publisher Routledge Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office: Mortimer House, 37- 41 Mortimer Street, London W1T 3JH, UK Psychotherapy Research Publication details, including instructions for authors and subscription information: http://www.informaworld.com/smpp/title~content=t713663589 Analyzing psychotherapy process as intersubjective sensemaking: An approach based on discourse analysis and neural networks Mariangela Nitti a ; Enrico Ciavolino b ; Sergio Salvatore a ; Alessandro Gennaro a a Department of Pedogogy, Psychology, and Teaching, b Department of Philosophy and Social Sciences, University of Salento, Lecce, Italy First published on: 27 July 2010 To cite this Article Nitti, Mariangela , Ciavolino, Enrico , Salvatore, Sergio and Gennaro, Alessandro(2010) 'Analyzing psychotherapy process as intersubjective sensemaking: An approach based on discourse analysis and neural networks', Psychotherapy Research,, First published on: 27 July 2010 (iFirst) To link to this Article: DOI: 10.1080/10503301003641886 URL: http://dx.doi.org/10.1080/10503301003641886 Full terms and conditions of use: http://www.informaworld.com/terms-and-conditions-of-access.pdf This article may be used for research, teaching and private study purposes. Any substantial or systematic reproduction, re-distribution, re-selling, loan or sub-licensing, systematic supply or distribution in any form to anyone is expressly forbidden. The publisher does not give any warranty express or implied or make any representation that the contents will be complete or accurate or up to date. The accuracy of any instructions, formulae and drug doses should be independently verified with primary sources. The publisher shall not be liable for any loss, actions, claims, proceedings, demand or costs or damages whatsoever or howsoever caused arising directly or indirectly in connection with or arising out of the use of this material.

Transcript of Analyzing psychotherapy process as intersubjective sensemaking: An approach based on discourse...

PLEASE SCROLL DOWN FOR ARTICLE

This article was downloaded by: [Nitti, Mariangela]On: 27 July 2010Access details: Access Details: [subscription number 924820690]Publisher RoutledgeInforma Ltd Registered in England and Wales Registered Number: 1072954 Registered office: Mortimer House, 37-41 Mortimer Street, London W1T 3JH, UK

Psychotherapy ResearchPublication details, including instructions for authors and subscription information:http://www.informaworld.com/smpp/title~content=t713663589

Analyzing psychotherapy process as intersubjective sensemaking: Anapproach based on discourse analysis and neural networksMariangela Nittia; Enrico Ciavolinob; Sergio Salvatorea; Alessandro Gennaroa

a Department of Pedogogy, Psychology, and Teaching, b Department of Philosophy and Social Sciences,University of Salento, Lecce, Italy

First published on: 27 July 2010

To cite this Article Nitti, Mariangela , Ciavolino, Enrico , Salvatore, Sergio and Gennaro, Alessandro(2010) 'Analyzingpsychotherapy process as intersubjective sensemaking: An approach based on discourse analysis and neural networks',Psychotherapy Research,, First published on: 27 July 2010 (iFirst)To link to this Article: DOI: 10.1080/10503301003641886URL: http://dx.doi.org/10.1080/10503301003641886

Full terms and conditions of use: http://www.informaworld.com/terms-and-conditions-of-access.pdf

This article may be used for research, teaching and private study purposes. Any substantial orsystematic reproduction, re-distribution, re-selling, loan or sub-licensing, systematic supply ordistribution in any form to anyone is expressly forbidden.

The publisher does not give any warranty express or implied or make any representation that the contentswill be complete or accurate or up to date. The accuracy of any instructions, formulae and drug dosesshould be independently verified with primary sources. The publisher shall not be liable for any loss,actions, claims, proceedings, demand or costs or damages whatsoever or howsoever caused arising directlyor indirectly in connection with or arising out of the use of this material.

Analyzing psychotherapy process as intersubjective sensemaking: Anapproach based on discourse analysis and neural networks

MARIANGELA NITTI1, ENRICO CIAVOLINO2, SERGIO SALVATORE1, &

ALESSANDRO GENNARO1

1Department of Pedogogy, Psychology, and Teaching & 2Department of Philosophy and Social Sciences, University of Salento,

Lecce, Italy

(Received 30 August 2009; revised 13 January 2010; accepted 19 January 2010)

AbstractThe authors propose a method for analyzing the psychotherapy process: discourse flow analysis (DFA). DFA is a techniquerepresenting the verbal interaction between therapist and patient as a discourse network, aimed at measuring the therapist�patient discourse ability to generate new meanings through time. DFA assumes that the main function of psychotherapy is toproduce semiotic novelty. DFA is applied to the verbatim transcript of the psychotherapy. It defines the main meaningsactive within the therapeutic discourse by means of the combined use of text analysis and statistical techniques.Subsequently, it represents the dynamic interconnections among these meanings in terms of a ‘‘discursive network.’’ Thedynamic and structural indexes of the discursive network have been shown to provide a valid representation of the patient�therapist communicative flow as well as an estimation of its clinical quality. Finally, a neural network is designed specificallyto identify patterns of functioning of the discursive network and to verify the clinical validity of these patterns in terms oftheir association with specific phases of the psychotherapy process. An application of the DFA to a case of psychotherapy isprovided to illustrate the method and the kinds of results it produces.

Keywords: process research; discourse flow analysis; competitive neural networks; health services quality

The present work is an attempt to improve quality

research into the therapeutic process by means of the

combination of statistical methodologies, enabling

clinical interaction to be analyzed.

Among several approaches to psychotherapeutic

research, a common way of considering the clinical

process is to look at it as an intersubjective dynamic

of co-construction of new meanings between thera-

pist and patient. Discourse flow analysis (DFA) was

designed by a group of researchers (Salvatore et al.,

2007, in press) to develop an empirical model of

the psychotherapeutic process as intersubjective

sensemaking. According to this model, meaning is

not statically held in signs (e.g., words, gestures).

Rather, it is the product of the way people use signs

in order to regulate their communication and activity

(Harre & Gillett, 1994). People do not choose a

ready-to-use meaning; rather, they co-construct it in

the sense that the interpretation of the signs is a

matter of situated agreement among the participants

of the conversation, performed through the dialogue

and grounded on socially shared generalized mean-

ings (Valsiner, 2007) working as taken-for-granted

assumptions (Rommetveit, 1992). In sum, sense-

making is intrinsically dialogical (Linell, 2009).1

The vision of the psychotherapy process as

intersubjective sensemaking is a meta-theoretical

framework crossing several clinical approaches: psy-

chodynamic (Storolow, Atwood, & Brandchaft,

1994; Hoffman, 1998) and cognitive (Dimaggio &

Semerari, 2004) as well as humanistic (Hermans &

Hermans-Jansen, 1995) and narrative (Santos,

Goncalves, Matos, & Salvatore, 2009). Although

very different in many respects, these approaches

share the basic idea of the clinical exchange as a co-

construction of meanings (i.e., intersubjective sense-

making) aimed at changing the patients’ symbolic

(affective and/or cognitive) modality of interpreting

their experience (Gennaro et al., 2010). Psychother-

apy can, therefore, be seen as a ‘‘transformative

dialog’’ (Gergen, 1999, p. 250), where new mean-

ings are elaborated, new categories are developed,

Correspondence concerning this article should be addressed to Mariangela Nitti, Universita del Salento, Dipartimento di Scienze

Pedagogiche, Psicologiche e Didattiche, Via Stampacchia, Lecce, 73100 Italy. E-mail: [email protected]

Psychotherapy Research

2010, 1�18, iFirst article

ISSN 1050-3307 print/ISSN 1468-4381 online # 2010 Society for Psychotherapy Research

DOI: 10.1080/10503301003641886

Downloaded By: [Nitti, Mariangela] At: 14:29 27 July 2010

and one’s presuppositions (Chambers & Bickhard,

2007) are transformed within an intersubjective

context.

To achieve this aim, DFA analyzes patient�therapist

dialogue in formal terms*that is, by depicting the

structural global qualities of their communicational

exchange*rather than merely in terms of the seman-

tic contents exchanged within their dialogue. In fact,

even though most analysis methods used in psy-

chotherapy process research focus on the semantic

content of the clinical exchange, semantic content

analysis is important, but not sufficient, for a real

understanding of the clinical exchange.

One of the limitations of the semantic content

analysis is the so-called indexicality of linguistic

signs: The semantic content of a sign needs the

context to be understood (Nightingale & Cromby,

1999), so a high level of inference is required.

Furthermore, the indexicality of a sign concerns

not only the discourse semantic dimension but also

the pragmatic sphere: Because producing a sign is

not a mere linguistic operation but rather a speech

act (Austin, 1962), the psychological meaning of a

sign is related to its communicative function as

performed in discourse activity. This communicative

function is not immanent to the sign, but depends on

the way it is used in relation with the other signs

within the intersubjective circumstances of the

discourse (Harre & Gillett, 1994; Wittgenstein,

1953). The temporal dimension of the discourse is,

therefore, an outstanding factor in the process of

constructing the sense.

Consider the following sequences of thematic

contents as characterizing the dialogue of two

patients:

Patient 1 sequence: experience of frustration0anger0pain0desire to be helped by therapist

Patient 2 sequence: pain0desire to be helped by

therapist0experience of frustration0anger

The content of the talk is the same in the two cases;

yet the sequences are different. In addition, it is

evident that the two sequences differ as concerns

their global meaning. The first sequence seems to

concern the patient’s request to be supported by the

therapist in dealing with the negative emotive reac-

tion to a frustrated desire. The second sequence

seems to concern a patient expressing a negative

feelings aroused by his or her desire to be helped by

the therapist.

Among the contributions present in literature, the

matter of the indexicality and the temporal dynamics

of sensemaking has been ignored (researchers have

focused only on the semantic dimension of sense-

making) or dealt with by adopting a qualitative

discourse analysis in order to properly consider the

context and the communicative function of the signs

and analyze the discourse dynamic over time.

Our contribution seeks to construct a robust,

flexible, and automatic, although not uncritical,

procedure for seriously taking into account the

temporal dynamics of the discourse, which is a basic

(although not exhaustive) aspect of sensemaking.

As the procedure leads to the synthetic description

of the discourse network through some indexes,

such indexes are processed by a competitive neural

network that classifies them based on their common

properties. Each discourse network will be then

interpreted with reference to the average character-

istics of the category to which it belongs.

This work is organized in two parts. The first is

devoted to the description of the DFA method. The

latter concerns the exemplificative application of

DFA to a case: Pietro’s psychotherapy.

Discourse Flow Analysis

DFA was designed to develop an empirical model of

the psychotherapy process as intersubjective sense-

making. In order to go beyond the limits of semantic

content analysis described previously, DFA allows a

dynamic analysis of sensemaking, focusing on the

temporal patterns of meanings rather than on the

survey of discrete contents, by adopting an auto-

mated low-inferential procedure of content analysis,

yet at the same time is able to take the contextuality

of meaning into account. Let us now consider how

DFA operates in order to reach such results.

The main assumption of this model is the idea that,

because the relationships among contents basically

arise from associations for temporal adjacency,

sensemaking strongly depends on the way contents

are combined throughout the discourse flow.

Coherently, sensemaking is a matter not of con-

tents but of sequential combination of contents.

Starting from this assumption, DFA maps the

psychotherapeutic dialogue in terms of associations

for adjacency between semantic contents occurring

within the clinical exchange. In order to operate in

this way, DFA refers to the concept of discourse

network. A discourse network is made of nodes, each

of which represents one of the units of semantic

contents, those that are active in the communi-

cational exchange between patient and therapist.

Reducing the communicational exchange in terms of

network allows us to analyze the systemic and

dynamic structure of sensemaking.

To construct and analyze the clinical process as a

discourse network, the DFA method works in three

steps and is applied to the verbatim transcript of

therapist�patient dialogue. First, a content analysis

2 M. Nitti et al.

Downloaded By: [Nitti, Mariangela] At: 14:29 27 July 2010

identifies and categorizes the semantic contents

active in the text. Second, a sequence analysis is

applied to the results of the first step in order to

calculate the probability of pairs of contents being

adjacent; this procedure is designed to construct the

discourse network. Third, through quantitative

analysis, it is possible to obtain a synthetic descrip-

tion of the way the discourse network functions by

means of structural-dynamic indexes. Finally, the

quantitative indexes describing the discourse net-

work are processed, in the fourth step, into a

competitive neural network that can classify the

psychotherapy sessions on the basis of the index

patterns characterizing them. The steps of the

analysis are shown in Figure I.

Step 1: Content Analysis

Content analysis is applied to the integral verbatim

transcription (henceforth corpus) of the patient�therapist talk. Using text analysis software (T-Lab

5.3 Beta; Lancia, 2005), the lexical corpus is

transformed into a digital matrix (co-occurrence

matrix) and subjected to a multidimensional proce-

dure of analysis.

Preliminary Operations

First, T-Lab performs the segmentation of the text.

It divides the whole text into elementary context

units (ECUs), each of which represents a meaningful

sentence or group of sentences. To this end, T-Lab

adopts the following criterion: An ECU ends

with the first punctuation mark (‘‘.’’ or ‘‘!’’ or ‘‘?’’)

after the 250th character and it does not exceed

500 characters in length.

At the same time, T-Lab operates the lemmatization

of the lexical corpus. This substep is carried out in

terms of the several subprocedures, discussed next.

Listing. All the textual forms present in the text

are listed just as they appear in the corpus. T-Lab

considers a form to be any string of characters (even

if made up of one character) between two blank

spaces (or a blank space and a punctuation mark).

Filtering. Punctuation is eliminated from the

analysis. At the end of this procedure, the original

list of textual forms is filtered into a list of lexical

forms.

Disambiguation. This procedure is designed to

change the lexical forms that can have more than one

semantic content (e.g., ‘‘subject’’). It is performed by

modifying the lexical form in the various contexts

where it occurs. In this way, the original form is

differentiated into two or more forms with a single

semantic content. For instance, the original form

‘‘subject’’ could be differentiated into two new

forms: ‘‘subject_p’’ when its meaning is ‘‘person’’

and ‘‘subject_k’’ when, according to the context, it

means ‘‘branch of knowledge.’’ Once the disambi-

guation has been performed, listing and filtering

need to be repeated. Finally, it is worth noting that

the disambiguation may be performed only on the

most frequent words and/or on the lexical forms that

are relevant for the aim of the analysis (e.g., in the

case of an analysis of the psychotherapy process, it

could be relevant to differentiate between ‘‘sense’’ as

concerning perception and ‘‘sense’’ as synonymous

of meaning).2

Lemmatization. The lexical forms are grouped

according to their common lexical root. In other

words, the various forms of the same word are

turned into their corresponding lemma. This sub-

procedure entails the application of a dictionary to

the list of lexical forms. A dictionary is a list of the

main lemmas of a given language (e.g., English,

Italian), each of them put in correspondence with

Figure I. Steps of the analysis.

Analysis of psychotherapy through statistics 3

Downloaded By: [Nitti, Mariangela] At: 14:29 27 July 2010

the set of forms constituting its lexical variant (i.e., in

the dictionary, the lemma ‘‘do’’ is in correspondence

with the lexical forms ‘‘does,’’ ‘‘done,’’ ‘‘did,’’ and so

forth). To empower the coverage of the lemmatiza-

tion (i.e., the percentage of textual forms recognized

by the dictionary and, therefore, lemmatized), we

integrated the T-Lab dictionary with another one

elaborated by us. The DFA dictionary was produced

by listing the textual forms of a large number of texts

retrieved from a variety of discursive fields and

sources (sport, politics, novels, verbatim transcripts

of psychotherapy sessions). Eventually, the auto-

matic lemmatization is checked and, if necessary,

completed by the researcher.

Not all the lemmas are kept in the analysis.

Because of T-Lab’s computational constraints, only

the 500 more frequent lemmas are used for the

analysis.

Identification of the Co-Occurrence of ECUs for the Sake

of the Definition of the Thematic Nuclei

As these preliminary text operations are done, it is

possible to proceed with the content analysis. Con-

tent analysis is realized by using criteria of categor-

ization of the text intrinsic to the text itself rather

than external parameters of classification. The

method operates the text classification by means

of a composite multidimensional procedure of ana-

lysis combining multiple correspondence analysis

(MCA; Benzecri, 1973) and cluster analysis (CA;

Anderberg, 1973).

To perform these analyses, T-Lab produces a co-

occurrence matrix with ECU as rows and lemmas as

columns.3 In the generic cell xij we will find the value

‘‘1’’, if the jth lemma is present in the ith ECU, ‘‘0’’

otherwise, as shown in Table I.

This matrix undergoes MCA. Each factorial

dimension extracted by MCA maps the joint behavior

of groups of lemmas. Generally speaking, MCA

breaks down the lexical variability into patterns of

co-occurrence of lemmas (i.e., joint presence of

lemmas within the same ECU). Each factorial

dimension represents a pattern of this kind: a rather

stable combination of words present through the

text (or parts of the text). Accordingly, a factorial

dimension can be interpreted as a semantic micro-

dimension active in the text.

The factorial dimensions are then used as criteria

of classification in the subsequent CA (in T-Lab

language, the thematic analysis of elementary con-

text). Thus, CA groups the ECUs (and the lemmas)

according to their similarity on the semantic micro-

dimensions. In other words, a cluster represents a

subset of utterances (i.e., ECUs) that share similar

patterns of co-occurrence of lemmas; or, in a

complementary way, each cluster represents a subset

of words tending to occur in the same sentences.

Therefore, each cluster can be understood as a

thematic nucleus (or semantic content or node),

made up of a set of words whose aggregation reflects

the ‘‘isotopy’’ of semantic traits (Lancia, 2005).

To clarify this concept, let us take an example

from a previous application of DFA (Lisa’s case;

Salvatore et al., in press). Table II shows the three

Table I. Co-occurrence Matrix

ECU/Lemma Be Your Name Pen Red

What is your name? 1 1 1 0 0

The pens are red. 1 0 0 1 1

Note. ECU, elementary context unit.

Table II. Selected Part of One Cluster: Thematic Nucleus

Resulting from the Application of the DFA Method to a

Transcript of Psychotherapy (Lisa’s Case)

ECU 1 (in Session 9; score: 185.32)

T: How would you do that?

L: Uh, just I either stay in the house or do housecleaning or

whatever needs to be done um.

T: How would you do that right now if you were to just sort of like

put her aside, put her, what do you do, put her in a box almost?

L: Um.

T: Trap her. Try to do that now, to trap her and put her away.

L: Okay, um, just, just stay home.

ECU 2 (in Session 12; score: 172.70)

T: Tell her she scares you and you feel small, tell her about what

it’s like, speak from, it’s almost like saying ‘‘I’m small and

I feel . . .’’ Finish it off.

L: I’m ah. I’m small and I feel ah, ah, just helpless um.

T: Mm-hm.

L: Just giving out myself.

T: Mm-hm, a little lost?

L: Yeah, I feel um lost.

T: Tell her.

L: InsECUre.

ECU 3 (in Session 9; score 160.59)

T: So you, you sort of keep her, shield her from the people, from

the world in a sense.

L: Yeah.

T: Okay, be the shield, be the shield and speak from that and tell

her what you feel.

L: Um, I don’t want you to desert me, or just, just stay with me

and and we’ll make it through together.

T: Mm-hm. Tell her how you protect her.

Note. The score is a chi-square parameter depicting the associa-

tion between the ECU (token) and the cluster (type). The higher

the index, the higher the representativeness of the ECU with

respect to the cluster. Underlined words are word forms belonging

to the characteristic lemmas of the identified cluster (adjusted

from Salvatore et al., in press). ECU, elementary context unit; T,

therapist; L, Lisa.

4 M. Nitti et al.

Downloaded By: [Nitti, Mariangela] At: 14:29 27 July 2010

most representative ECUs of one of the clusters

defined by the analysis (the cluster in question

globally encompasses 94 ECUs, corresponding to

5.25% of the classified ECUs in the text; cf.

Salvatore et al., in press). The underlined words

are the ones whose co-occurrence made the ECU

cluster together. The thematic interpretation of the

cluster entails identifying a thematic area overlap-

ping/passing through the clustered ECUs. In the

case of the three ECUs reported in Table II, it seems

that they share a common thematic area concerning

the recognition of needs/desires in relations with

others.4

Text Indexing

The thematic nuclei identified as active in the corpus

by the combination of MCA-CA are then used for

the subsequent text indexing carried out as follows.

Each ECU is assigned to the cluster (therefore to a

thematic nucleus) with which the ECU has the

highest index of association. In this way, each ECU

is marked with the most representative thematic

nuclei identified through content analysis. Thus,

the output of this step is the transformation of

the text into a sequence of thematic nuclei, each of

which represents the semantic content of a nucleus.

Table III shows an example of the output of a

content analysis applied on a hypothetical text

segmented into a sequence of 20 ECUs and assigned

to a set of four clusters.

Evidence Supporting the Procedure of Content Analysis

In the last 20 years, there has been a dramatic

development of automatic procedure of textual

analysis and textual mining. Nowadays, we have

major evidence of the validity of this kind of

approach. Here we briefly recall two kinds of

findings, converging to highlight the power of the

automatic process of content analysis.

One source of evidence is represented by studies

dealing with the modeling of the cognitive processes

entailed in the mapping and understanding of mean-

ing, especially in the case of the reading of written

texts. Some authors within this stream of studies

have developed specific procedural models that

have proved able to simulate the competence of a

human being in mapping meanings (Andersen,

2001; Landauer & Dumais, 1997).

The other source of evidence is given by the

studies more specifically aimed at testing the validity

and reliability of automatic procedures of content

analysis. A strong tradition in this sense is provided

by the many studies concerning the validity of the

procedure of data mining for the classification of

documents. In a study aimed at comparing models

of cluster analysis applied to the classification of

documents, Steinbach, Karypis, and Kumar (2000)

provided estimates of the efficacy of various algo-

rithms of cluster analysis related to a previously

established classification. Based on their analysis,

according to the type of documents, all the proce-

dures range between .58 to .85, with most of the

performance greater than .70 (1�perfect capacity

of classification; that is the clustering capacity of

fully reproducing the normative classification).

Rosenberg, Schnurr, and Oxman (1990) have com-

pared two automatic procedures of content analysis,

based on the calculation of the frequencies of

occurrences of words (one not context sensitive,

the other context sensitive), and a clinically informed

method of content analysis based on human coding.

The three methods were applied to the transcripts

of interviews with 71 patients from four clinical

groups (cancer, depression, somatization, paranoid)

and compared regarding their capacity of rightly

classifying the subjects in accordance to the diag-

nosis. At this end, the authors performed three

separate stepwise discriminant analyses, one for

each method. The classification performances of

the three methods were then compared in a mixed

analysis of variance design, with clinical group as a

between-subjects factor and method as a within-

subject factor. The two computerized methods

showed significantly better performance than the

method based on human coding.

Table III. Cluster and ECU Sequences

ECU sequence Cluster sequence

ECU 1 CL3

ECU 2 CL1

ECU 3 CL2

ECU 4 CL1

ECU 5 CL4

ECU 6 CL1

ECU 7 CL3

ECU 8 CL2

ECU 9 CL3

ECU 10 CL1

ECU 11 CL2

ECU 12 CL4

ECU 13 CL1

ECU 14 CL3

ECU 15 CL2

ECU 16 CL3

ECU 17 CL1

ECU 18 CL2

ECU 19 CL4

ECU 20 CL1

Note. ECU, elementary context unit.

Analysis of psychotherapy through statistics 5

Downloaded By: [Nitti, Mariangela] At: 14:29 27 July 2010

With regard to DFA, the reliability of the proce-

dure of content analysis adopted by the method was

studied within the framework defined by the line of

study we just mentioned (Salvatore et al., 2010).

Two complementary approaches were adopted. On

the one hand, trained judges were asked to evaluate

the output of the content analysis provided by the

DFA procedure. On the other hand, we compared

the content analysis produced by the DFA with the

series of content analysis provided by judges. Both

kinds of tests were performed on a sample of

transcripts randomly and blindly extracted from an

Italian language psychotherapy (Katia; see Nicolo &

Salvatore, 2007). The main findings are as follows:

1. The judges were asked to rate the semantic

similarity of a set of 70 ECUs (i.e., utterances;

see prior discussion), five for each of the

14 clusters produced by the DFA procedure

of content analysis. The intracluster similarity

(i.e., the mean of the rates of similarity of each

couple of the five ECUs grouped in the same

cluster) was significantly higher than the inter-

cluster similarity (i.e., the mean of the rates of

similarity of each couple of ECUs belonging to

different clusters). This means that the DFA

procedure seems to be able to differentiate the

utterances and group them in classes that

are semantically meaningful. In other words,

DFA puts together what a human coder con-

siders more semantically homogeneous and

separates what the human coder considers

more dissimilar.

2. DFA procedures calculate an index of the

similarity between each couple of the clusters

defined: the Euclidian distance between the

centroids of the clusters. The higher this index,

the more dissimilar the cluster. We then com-

pared the judgments of similarity between each

couple of clusters with this DFA index. We

found a significant negative correlation. This

means that the DFA’s way of representing the

relationship of (dis)similarity among the ECUs,

and therefore among the clusters, tends to over-

lap the one produced by the human coders. If

one considers that the human coders in this case

were engaged in a task of semantic coding, one

can conclude that the DFA procedure yields a

classification of ECUs reflecting a network of

relationships of similarity�dissimilarity consis-

tent with the one provided by the human coders.

3. Three judges were asked to group the 70 ECUs

on the basis of their semantic similarity in terms

of the same partition produced by DFA (14

clusters, each of five ECUs). Then we com-

pared the interrater agreement between each

couple of judges and between the judges and

the DFA procedure. The comparison between

the judge�judge agreement and DFA�judge

agreement could not be differentiated; in all

cases, we recorded a fair level of agreement

(Cohen’s k between .30 and .40). Note that in

order to appreciate the level of such agreement

one has to take into account the large amount

of degrees of freedom associated with the

classification at stake. As a matter of fact, the

probability that the classifier orders 70 ECUs

(n) in 14 groups (g) of five items (k) is equal

to 6,19�100, as resulting from the formula:

P(n,g,k)�(k! (n�k)!n!)g�(5! (70�5)!70!)14.

In sum, different sources of results support the

conclusion that automatic procedures of information

retrieval are able to perform content analysis with

at least a sufficient level of reliability. Within this

scenario, the procedure of content analysis adopted

by DFA seems to be able to discriminate the

utterances in accordance with their semantic simi-

larity and to classify the semantic content of the text

in a way that does not differ from a trained human

coder (as measured by the interrater agreement).

Step 2: Creation of the Discourse Network

The clusters of ECU are used for the definition

of the discourse network. In its most elementary

version, a network is a collection of elements (nodes)

linked to each other: In our case, the nodes are

represented by the cluster of ECUs, and the con-

nections between them represent the strength of

their association. The strength of a cluster’s associa-

tion is calculated using a sequence analysis.

To explain simply how the sequence analysis

works, we give an example in Table III, which

considers 20 ECUs grouped into four clusters. The

four clusters represent, as we said, the nodes of the

discourse network. Given the sequence of clusters in

Table III, it is possible to calculate the transition

matrix deriving from the matrix of the successors.

Table IV shows the matrix of the successors. This

matrix has the ECU clusters both in rows (prede-

cessors) and columns (successors) and shows, in its

generic cell, the number of times the jth cluster

follows the ith cluster. For instance, the number of

Table IV. Frequencies of Successors

Cluster CL1 CL2 CL3 CL4

CL1 0 3 2 1

CL2 1 0 2 2

CL3 3 2 0 0

CL4 3 0 0 0

6 M. Nitti et al.

Downloaded By: [Nitti, Mariangela] At: 14:29 27 July 2010

times Cluster 1 (CL1) follows Cluster 3 (CL3) is

equal to 3, as is shown in the tables. The total

transition number is equal to the number of ECUs

minus 1 (score of 19 in Table IV), because the first

item of the sequence has no predecessors.

The resulting transition matrix, similarly to the

matrix of successors, has the clusters both in rows and

columns and shows in its generic cell the strength of

the association between two clusters. The strength

of that association is the probability, calculated by

a Markovian algorithm, that a cluster i follows j

immediately afterward: This probability is nothing

but the relative frequency of the successor, that is, the

ratio between the absolute frequency of a cluster’s

successor (the frequency in the successor matrix cell)

and the total number of the cluster’s successors (the

marginal row of the matrix). In our example, the

resulting transition matrix is shown in Table V.

The values contained in the transition matrix are

the strength of the associations among the discourse

network nodes. The direction of the connections

goes from the predecessor (i) to the successor (j).

The intensity of the association is represented by the

thickness of the arrow connecting two clusters: In

Figure II, CL4 and CL1 have the maximum

association (equal to 1; that is, every time CL4

appears throughout the discourse, it will be certainly

be followed by CL1), represented by the thickest

arrow; CL4 has less probability of following CL1

(0.167) and so the arrow is very thin.

Moreover, Markovian sequence analysis produces

as output, for each node, a set of essential informa-

tion, such as:

. Frequency of occurrence (how many times a

node is present through the discourse). In the

example, CL3 appears five times during the

interaction.

. Incoming and outgoing nodes (including the

node considered, because a node can be asso-

ciated with itself) and the corresponding fre-

quencies. In the example, CL3 has two

predecessors (CL1 and CL2), with a frequency

equal to four, and two successors (CL1 and

CL2), with a frequency of five.

This descriptive information is useful for carrying

out the quantitative analysis of the discourse network.

The aim of this step is to measure the structural and

dynamic properties of the network and to interpret

them as relevant aspects of the sensemaking dynamic.

It is worth emphasizing that sequence analysis is

applied to a single time ‘‘window,’’ consisting of a

specific temporal period of the psychotherapy under

analysis. The sequence analysis can, therefore, be

applied to single sessions of psychotherapy as well

as to part of a session (i.e., the first half hour) or to

a group of sessions. According to the aim and the

conditions of the analysis, the windows can be

defined in order to cover all the sessions or to

sample parts of them. In sum, both discursive

networks and windows of analysis will be defined

by the segmentation and/or the sampling of the

transcript the research produces.

Step 3: Quantitative Analysis of the Discourse

Network

The quantitative analysis of the discourse network is

mainly carried out by means of the following

indexes: connectivity, activity, and super-order

nodes.

Connectivity (C) is the density of the association

among nodes, that is, the relative amount of con-

nections among the nodes. It is calculated as the

percentage of connections that are active in the text

compared with the theoretical number of connec-

tions. It results from:

C�c=K2 (1)

where c is the number of connections active in the

text and K2 is the square of the number of nodes

(every node can be theoretically associated with

every other node, including itself). In our example,

the number of active connections is nine (the

number of cells of Table IV with a non-zero value)

and the theoretical number of connections is 42�16,

so the connectivity is equal to (9/16)�0.5625.

Activity (A) is the global network ability to extend

the spectrum of associations among nodes thro-

ugh time. High levels of activity depict discourse

Table V. Transition Matrix

Cluster CL1 CL2 CL3 CL4

CL1 0 0.5 0.333 0.167

CL2 0.2 0 0.400 0.400

CL3 0.6 0.4 0 0

CL4 1 0 0 0

CL1

CL3 CL4

CL2

Figure II. Example of discourse network.

Analysis of psychotherapy through statistics 7

Downloaded By: [Nitti, Mariangela] At: 14:29 27 July 2010

dynamics capable of expanding the paths of sense-

making by enriching the possible combinations

among meanings. It is calculated as the ratio

between network generative and absorbing power:

A�GEN=ABS (2)

These two parameters express the capability of the

discourse flow to, respectively, increase and decrease

the meaning variability over time. It is based on

the measure of the single node’s contribution to

the discourse meaning variability.

To calculate the contribution of a node, S(CNTj),

the sum and the difference of the frequencies of

outgoing and incoming connections (respectively, ouj

and inj) are multiplied and weighted by the number

of theoretically possible connections (two times the

number of nodes, or 2K) and the frequency of the

node (freqj):

S(CNTj)�(ou j � in j)

2K�

(ou j � in j)

freq j

(3)

According to the measure of the contribution,

nodes can be classified as ‘‘generative,’’ ‘‘relay,’’ or

‘‘absorbing.’’ The distinction is based on the differ-

ence between outgoing and incoming connections.

In a generative node, the difference is positive; in an

absorbing node, negative; and in a relay node, null.

The network’s generative power (GEN) and absorb-

ing power (ABS) are calculated as the average

contribution of the generative nodes and of the

absorbing nodes, respectively.

Super-order nodes (SN) are particular nodes

carrying out the function of super-ordered meaning

working as taken-for-granted assumptions (e.g.,

concepts of self and others, affective schemata,

metacognitive modalities, relational and attachment

strategies, unconscious plans) regulating patients’

interpretation of the experience (see Teasdale

& Barnard, 1993) and, more in general, sense-

making*in psychotherapy dialogue too.

The super-order meanings’ regulative function

results from the high frequency of occurrence and

the strong connectivity they have with other nodes.

The index is calculated as the percentage of highly

frequent nodes with high (generative or absorb-

ing) contribution as well. DFA defines the high-

frequency node as a node with a frequency higher

than 1.5 times the average ratio between token and

type. The contribution is considered high if a node

has outgoing or incoming connections with more

than 33% of the nodes in the network.

All these indexes, taken singly, give no information

on the quality of the psychotherapy process. They,

therefore, have to be taken into account together, in

their interactions, with the function of describing the

internal dynamic of the discourse network, without

any specific reference to the clinical significance of

this dynamic.

Step 4: Neural Network for the Sessions’

Classification

Neural networks can be a useful instrument for the

analysis of the psychotherapy process. As discussed

in the previous steps, the implementation of DFA

leads to the representation of the whole commu-

nicative exchange between therapist and patient in

terms of discourse networks and provides a descrip-

tion of such networks by means of structural and

dynamic indexes. The aim of this fourth step is to

provide an interpretation of these indexes based on

their classification into homogeneous classes.

Therefore, the neural network we are about to

construct has to be able to:

1. Discriminate patterns of indexes according to

regularities and differences between them;

2. Classify the sessions in classes in accordance to

these patterns of indexes; and

3. Compare the classification provided by the

network with a theoretical based classification

in order to test the convergence between the

two classifications.

The building block of an artificial neural network

is a processing unit called neuron (or neurode, node,

or unit), which captures many essential features of

biological neurons. It receives input from other

neurons or from an external source. Each input has

an associated weight (w), which can be modified so

as to model synaptic learning. The outcome of each

unit (a) is merely the result of a function like this:

ai �f (XR

j�1

wij pj )

where p is the incoming input signal and the function

argument is called net input (n) to unit i. A neural

network is composed of such units and the weighted

unidirectional connections between them; in some

neural nets, the number of units may be in the

thousands. For a correct functioning of the network,

a further element is introduced in the network, the

so-called bias. The bias is much like a weight, except

that it has a constant input of 1; like weights, biases

are scalar parameters of the neuron, adjustable by

the learning rule used. This element is nothing but a

threshold term, regulating the ‘‘activation’’ or the

‘‘inhibition’’ of the neuron (for more details, see

Floreano, 1996); for a linear output unit, the bias

term is equivalent to an intercept in a regression

model.

8 M. Nitti et al.

Downloaded By: [Nitti, Mariangela] At: 14:29 27 July 2010

The output of one unit typically becomes an input

for another; they may also be units with external

inputs/outputs.

To simplify the notation, we can consider all the

items of the network in terms of vectors and

matrixes.

a�f (wp)�f (n)

Function f is called activation function or transfer

function; in the simplest case, f is the identity

function, and the unit output is just its net input

(this is called ‘‘linear unit’’). According to the goals

of the network, it is possible to choose among many

activation functions; one of the most used is the

threshold function: If the net input is greater than a

given constant, the activation value is 1 and is 0

otherwise.

Commonly, one neuron, even with many inputs,

may not be sufficient; we might need five or 10,

operating in parallel, in what we will call a layer (see

Figure III).

A single-layer network of S neurons includes the

weight matrix (W), the sum operators, the transfer

function boxes, and the output vector a. It is possible

to design a single layer of neurons having different

transfer functions.

A multilayer network is simply the combination in

sequence of layers: In this architecture, the input

vector p is connected to each neuron through the

weight matrix W, and the resulting output vector a

represents, in turn, the input to the following layer.

The second layer may be, respectively, a hidden layer,

if at least one other layer comes directly after it, or an

output layer, if its output corresponds to the network

final outcome.

The most interesting aspect of the way neural

networks function is their autonomy in learning

information and correct behaviors. In a neural

network, knowledge is represented by the weights of

the connections among units, so learning consists

of modifying such weights. There are many types of

neural network learning rules, or training algorithms.

They fall into two broad categories: supervised

learning and unsupervised learning.

In supervised learning, the learning rule is pro-

vided with a set of examples, called training set, of

proper network behavior. The training set is made up

of pairs of input�target (desired output). The output

of the function can be a continuous value or can

predict a class label of the input object. The task of a

network based on supervised learning is to predict

the value of the function for any valid input (or, most

probably, input vector) after having seen a consider-

able number of training examples. The network then

has to generalize from the presented data to unseen

situations in a ‘‘reasonable’’ way. The learning

process operates as follows: As the inputs are applied

to the network, the network outputs are compared

with the targets. The learning rule is then used to

adjust the weights of the network in order to move the

network outputs closer to the targets. The mechan-

ism of adjusting is defined by the algorithm adopted

(e.g., perceptron learning rule, back-propagation).

In contrast, in unsupervised learning, the weights

are modified in response to network inputs only.

There are no target outputs available. Most of these

algorithms perform some kind of clustering opera-

tion. They learn to categorize the input patterns

into a finite number of classes (this is the case, for

instance, of competitive networks, which is the

model we adopt in this work).

The abilities of different networks are then related

to their structure and learning methods5; each type

of network has a different strength particular to its

applications. In the case study, we present a network

that can classify patterns by discovering regularities

or differences among them. Such a network has a

single-layer structure, and its learning is based on a

competitive mechanism.

We now have to detect the proper architecture of

the network and the corresponding learning rule.

First, the network should perform a pattern recogni-

tion: The network responds when an input vector

close to a learned vector is presented.

Because we want to verify the hypothesis of

different phases in psychotherapy, we cannot provide

the network with correct behavior examples, so data-

driven recognition is required (Floreano, 1996).

We believe that these tasks could be effectively

performed by a competitive network for these main

reasons:

1. It is a self-organizing network: The structure

and the classification parameters are defined

merely by the input vector presented to the

network.

2. After training, each neuron is in competition

with the others in order to represent a subset ofFigure III. A layer of neurons.

Analysis of psychotherapy through statistics 9

Downloaded By: [Nitti, Mariangela] At: 14:29 27 July 2010

input patterns; this competition is won by the

most representative neuron.

The mechanism of competition (Hagan, Demuth,

& Beale, 1996) between neurons is very simple

and effective. The network is supplied with a great

number of example patterns of inputs; such exam-

ples are called characteristics and occur in the space of

characteristics. A suitable distance function, applied on

each pair of items, calculates the degree of likeness

between them.

At the end of the training, the network will

respond to the input by activating only one neuron,

the one closest to the input presented. This neuron is

called the winning neuron (i*), and its output, ai is the

average of the characteristics of the cluster (subset of

inputs) represented by the neuron.

The network, therefore, groups the examples

together into subsets with the highest inner homo-

geneity and the highest outer unlikeness. To do so,

the following transfer function is defined:

a�compet (n) (4)

where the network outcome (a) results from the

application of the transfer function compet to the

net input (n), that is, the inner product between

the input vector and the weights matrix.

The compet function operates by assigning ‘‘1’’ to

the neuron with the highest net input and ‘‘0’’ to the

others. The reason for this choice is that the higher

the inner product between input vector and weights,

the lower the distance between them and the

characteristics space. The winner is, therefore, the

neuron whose weights are most similar to the inputs

presented to the network.

In a competitive network, each neuron corresponds

to a subset of inputs: For example, if the network

consists of three neurons, it will classify inputs into

three classes; if there are seven neurons, the network

will fit a seven-class classification. The researcher,

therefore, has to set the structure of the network on

the basis of the number of classes expected. Gen-

erally, this is a limitation for those who want to

discover relations between data in an explorative way.

However, in our case, we need to adopt a confirmative

approach: We want to verify the existence of regular

trends within different phases of the therapy.

Theoretical Based Classification

We adopt as the criterion of comparison the two-

stage semiotic model (TSSM; Gennaro et al., 2010;

Salvatore et al., in press). This model considers the

psychotherapy process to be composed of two phases,

with different patterns of functioning. The TSSM

sees psychotherapy as a dialogical process involving

the deconstruction of old meanings and the creation

of new ones. The patient arrives at psychotherapy

with a predefined, more or less rigid system of*explicit, implicit, and latent*assumptions (e.g.,

concepts of self and others, affective schemata,

metacognitive modalities, relational and attachment

strategies, unconscious plans), which are taken for

granted and work as super-ordered meanings regulat-

ing the interpretation of experience (see Teasdale &

Barnard, 1993). This system of assumptions can be

conceived of as the source of the patient’s psycholo-

gical problems: symptoms, at the intrapsychic as well

as at the relational level. One of the main therapeutic

activities consists, therefore, of triggering the reorga-

nization of the patient’s super-ordered meanings.

TSSM claims that the change process in the

communicational system made up of patient�therapist interaction follows two stages. In the first

stage, the patient�therapist exchange works funda-

mentally as an external source of limitation on the

patient’s system of assumptions. The first stage is,

therefore, fundamentally a deconstructive process,

with therapeutic dialogue aimed at placing con-

straints on the regulative activity of the patient’s

problematic assumptions. The weakening of the

patient’s critical super-ordered meaning opens the

way for the emergence of new super-ordered ones.

This is what happens in the second, constructive,

stage, when the patient�therapist dialogue imple-

ments new super-ordered meanings, replacing the

previous ones in regulating the meaning-making

experience.6

Following the TSSM, we expect that the incidence

of the regulative assumption (i.e., of the super-

ordered meaning depicted by the DFA’s index:

super-ordered nodes) will decrease in the first stage

(deconstructive stage) and then increase in the

second (constructive stage).

Following TSSM, we differentiate the psychother-

apy process into two macrophases, a deconstructive

and a constructive phase, adopting as a cutoff point

the session with the first negative peak of the SN

index.7

Finally, we compare the TSSM-based classifica-

tion and the one provided by the operation of pattern

recognition performed by the neural network (we use

a nonparametric test* Fisher’s exact test*for this

purpose). Insofar as these two classifications prove to

be consistent (i.e., their association is statistically

significant), then it could be concluded that the

sessions of each of the two phases defined on the

basis of the TSSM criterion are characterized by a

specific pattern of functioning of the psychotherapy

process. On the other hand, such a result would

mean that the neural network is able to discriminate

the sessions in a clinically meaningful way.

10 M. Nitti et al.

Downloaded By: [Nitti, Mariangela] At: 14:29 27 July 2010

A Case Study: Pietro’s Psychotherapy

To explain better the way the DFA method works,

we proceed with the examination of an empirical

case. The analysis is based on the verbatim tran-

scriptions of 43 sessions of a single case of supportive

psychodynamic psychotherapy.

At the time of the therapy, Pietro was a 22-year-

old undergraduate student at the University of

Padua who sought psychological counseling at the

university’s psychological support service because of

his difficulties with human relationships, resulting,

in his opinion, from insecurity and self-confidence

problems. Pietro received therapy approximately

once a week for 2 years, for a total of 79 sessions.

Results of several outcome measures indicate that

Pietro’s therapy was successful.

We now show how the DFA method operates in

practice by transforming therapy�patient conversa-

tions in discourse networks and evaluating them

in terms of dynamic systems able to generate new

intersubjective meanings. To achieve this aim, the

steps described in the previous section are followed

and some considerations about the outcomes pro-

vided.

Step 1: Content Analysis

The corpus consists of the verbatim transcriptions of

43 psychotherapy audiotaped sessions (randomly

sampled from the total of 79 sessions). Table VI

shows the output of the preliminary treatment of the

corpus (see section above).

A distinction must be made between word token

and word type: The former is the total amount of

words present in the text; the latter is the number of

words that are different from each other, subjected

to lemmatization.

The co-occurrence matrix derived by this pre-

liminary treatment of the corpus is made up of 5,367

rows (ECU) and 4,337 columns (lemmas). The

subsequent application of the MCA and CA to this

matrix led to the identification of 20 clusters/

thematic nuclei (inertia between clusters, in our

case equal to 49.1% on the total inertia). A sample

of the most representative ECUs of one of the

20 clusters is given in the Appendix. The following

text indexing was performed in accordance with

them, each ECU being attributed to the cluster with

which it was mainly associated.

Step 2: Creation of the Discourse Networks

The indexing of the ECU represents a transforma-

tion of the text into a sequence of thematic nuclei.

This sequence is the input of sequence analysis. In

this study, we have adopted the single session as a

window. That is, we performed a sequence analysis

separately for each sequence of clusters correspond-

ing to a session. This means that we created 43

discourse networks.

Step 3: Quantitative Analysis of the Discourse

Network

Indexes of the discourse network for each of the

43 sessions were calculated. Here we focus on the

main ones: connectivity, activity, and frequencies of

super-order nodes (SN) (it is worth remembering

that a proper interpretation of the results needs a

joint evaluation of all index trends).

Activity: As shown in Figure IV, the activity can be

depicted as having two phases. While in the first

part of the therapy (Sessions 1�23) we find a

continuous basic line of low activity, from the

middle of the therapy there are precise, discrete

bursts of variability (see Sessions 24, 34, and 38).

SN: The SN globally follows a U-shape trend

(Figure V). In particular, one can distinguish a

first period (roughly from Session 1 to 18, with

Session 15 being the first negative peak) with a

downward trend, followed by a middle period

(roughly Sessions 19�30) characterized by a flat

trend (with the exception of some deviant ses-

sions, in particular Session 19). In the last third of

the sessions, the trend shows a slight inversion: a

weak rise associated with a considerable increment

of the variability between the sessions.

Connectivity: The shape of the connectivity index

curve (with exception of the peaks in Sessions

19 and 32) is slowly decreasing (Figure VI).

Taken together, these trends depict a double-

phase course. Whereas connectivity is constantly

Table VI. Output of Preliminary Treatment of the Corpus of

Pietro’s Therapy

Descriptive parameter Amount

Number of sessions 43

Number of ECUs 5,367

Mean number of ECUs per session 124.81

Number of occurrences in the text

(token)

140,869

Number of lemmas the text (type) 8,053

Token-type ratio 17.49

Number of lemmas in analysis 4,616

Frequency threshold for selecting the

lemma for analysis

5

Note. ECU, elementary context unit.

Analysis of psychotherapy through statistics 11

Downloaded By: [Nitti, Mariangela] At: 14:29 27 July 2010

decreasing throughout the therapy, activity and SN

vary during that time. The first phase is character-

ized by low and constant activity and a slow but

constant reduction of SN; the second phase presents

positive peaks of activity and a stabilization or slight

increase of the SN level.

Based on the meaning of the indexes, the two

phases can be interpreted as follows: (a) In the first

part of the therapy, a reduction of the connections

among meanings occurs (decreasing connectivity)

and, at the same time, the patient’s assumptions lose

ground (reduced SN); (b) in the second part, the

discourse dynamic develops the capability of recom-

bining meanings (increasing activity variability),

without, however, this leading to the development

of new assumptions (SN level becomes stable).

This interpretation leads to the following scenario

as a plausible description of the therapy process.

Presumably, at the beginning of the therapy, Pietro

explicitly and implicitly expressed and acted out his

super-order meanings, in particular the ones con-

cerning his system of beliefs associated with his

problems. As the clinical dialogue proceeded, these

assumptions were subjected to constraints.8 Once

this process of constraining ended, a condition of

stability was achieved (cf. the stabilization of SN in

the second phase). This new state allowed space for

new ways of thinking and speaking to emerge,

enabling the patient�therapist dyad to access a

higher level of production of meaning variability

(see the increasing variability of activity in the second

phase).

0,000

2,000

4,000

6,000

8,000

10,000

12,000

14,000

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43

Z S

core

Activity

Figure IV. Activity trend.

y = 0,0002x2 - 0,0115x + 0,3193R2 = 0,1776

0,000

0,050

0,100

0,150

0,200

0,250

0,300

0,350

0,400

0,450

0,500

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 323 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43

Z S

core

SN Poli. (SN)

Figure V. SN trend.

12 M. Nitti et al.

Downloaded By: [Nitti, Mariangela] At: 14:29 27 July 2010

It is worth emphasizing that the picture just

provided is consistent with the supportive, rather

than elaborative, aim of the therapy (Gabbard,

2000). In a supportive therapy, the clinic dialogue,

coherently with its goals, works as a limitation on the

patient’s beliefs instead of creating new assumptions.

In fact, supportive therapy is aimed at increasing the

strength of the patient’s ego rather than changing its

structural way of thinking and interpreting the

experience. The limited incidence of the construc-

tive stage can, therefore, be interpreted in the light of

this characteristic of the clinical setting.

Step 4: Construction and Training of the

Network

We proceed now with the construction of the

competitive network for the classification of the

input vector into two classes. In other words,

the first output of this step is the classification by

the neural network of the sessions in two classes,

each of them characterized by a specific pattern of

values of the three DFA indexes.

After this, we compare the classification per-

formed by the neural network with the one based

on the TSSM. Following the TSSM, we adopt

Session 15 (i.e., the first negative peak of the SN

trend; cf. Figure IV) as a cutoff point. Therefore, we

take Sessions 1 to 15 as the first stage characterized

by a deconstructive dynamics and Sessions 30 to 43

as the second, constructive, stage.9

The neural network proposed consists of one layer

of two neurons; the input vector is a group of three

indexes combined resulting from the quantitative

analysis of the discourse network obtained using the

DFA method. The output number corresponds to

the number of index patterns; each output is

represented by ‘‘1’’ if the pattern belongs to the first

class of indexes or ‘‘2’’ if it belongs to the second

class. The architecture of the network is shown in

Figure VII.

As we said, the transfer function is:

a�compet (n)

which finds the neuron with the highest net input

and assigns it a ‘‘1’’ and the others ‘‘0’’ (winner-take-

all competition).

The network has been created with the statistical

software MATLAB version 7.1 using the parameters

given in Table VII. The table presents two elements

to improve the learning process: the Kohonen learning

rate and conscience. The former is a training para-

meter that controls the size of weights and bias

changes during learning. The latter allows the so-

called dead neurons to be avoided: Some neuron

weight vectors may start out far from any input

vectors and never win the competition, so that their

weight is never learned. To stop this from happening,

biases are used to give neurons that rarely win the

competition an advantage over neurons that often

win. Once the neuron’s weights have moved into a

group of input vectors and the neuron is winning

consistently, its bias will decrease to 0.

Figure VII. Competitive network for the classification into two

classes.

10,000

15,000

20,000

25,000

30,000

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43

Z Sc

ore

Connectivity

Figure VI. Connectivity trend.

Analysis of psychotherapy through statistics 13

Downloaded By: [Nitti, Mariangela] At: 14:29 27 July 2010

As regards the input ranges, those reported in

the table are [0, 1] for both the neurons, while the

indexes resulting from the quantitative analysis have

different values, as shown in Table VIII.

The real index values have been converted into a

[0, 1] range through the expression:

Inew�Iold � Imin

Imax � Imin

(5)

This solution, adopted each time new inputs are

introduced, allows different vectors to give compar-

able results obtained from different sets of data.

The network weights are initialized, by means of

the midpoint function, as the center of the input

variation range (0, 5). The biases are assigned in a

random way, through the initcon function.

Once the weights and the biases are set, there is

nothing left to do but train the network for the

classification task. The transfer function detects

the winning neuron, but this is not sufficient for

effective learning: The weights of the neuron that

win the competition need to be modified in order to

make it even closer to the input vector. To this end,

the following learning rule is applied on the winning

neuron’s weights:

iwnew�i w

old �a (pnew�iwold )

�(1�a) iwold �apnew (6)

Using this rule, the weight vector of the winner is

moved toward the input vector along a line between

the old weight vector and the input vector. The

distance the weight vector moves depends on the

value of a (the learning rate). The winning weight

vector then ‘‘learns’’ the input vector, so that it will

most probably win the competition when a similar

vector is presented to the network.

The training function trainr operates as follows:

At each training stage, all training vectors are

presented singly in random order; weights and biases

are updated at each repetition. Once the training is

ended, the inputs are presented to the network and

the classification starts.

For each session of the therapy, we presented the

corresponding index pattern as input of the network;

this input, passing through the network, was as-

signed to a subset by the corresponding winning

neuron. Table IX shows the correspondence be-

tween each session and index pattern class. The

result of the classification is discussed next.

Comparison between Network and TSSM Classification

Is it possible to state that two distinct stages of the

therapy really exist? If they do exist, what are their

characteristics?

The answer to the first question is provided by

Fisher’s exact test, which will demonstrate whether

or not there is statistical independence between the

two classifications, that is between each stage of the

therapy and one of the index classes. As mentioned

previously, if there is no independence, we can state

that the therapy can be divided into two phases, each

characterized by the prevalence of a specific class of

indexes and, therefore, by a specific trend of the

communicative exchange.

In our case, the one-sided Fisher’s test has a value

equal to 0.023 (i.e., the probability of the contin-

gency table analyzed is equal to 2.3%). This prob-

ability is small enough to reject the null hypothesis of

independence between the two variables. In other

words, a strong connection between one period of

the therapy and a specific class of indexes exists, and

it is, therefore, possible to clearly discriminate two

different stages of the therapy.

Table VIII. Index Variation Ranges

Variable Minimum Maximum

Connectivity 11.000 31.953

Activity 0.159722 14.000

SN 0.000 0.320833

Table IX. Results of the Classification

Stage I No. Stage II No.

Session 2 1 Session 45 2

Session 3 1 Session 50 2

Session 4 1 Session 51 2

Session 5 1 Session 52 1

Session 6 1 Session 56 1

Session 7 1 Session 57 2

Session 9 1 Session 58 2

Session 10 1 Session 61 1

Session 11 2 Session 64 1

Session 13 1 Session 65 2

Session 16 1 Session 71 2

Session 20 2 Session 72 1

Session 21 2 Session 76 2

Session 22 1 Session 77 1

Session 23 2 Session 78 2

Session 79 2

Table VII. Parameters of the Network

Network type Input range No. neurons Kohonen learningrate Conscience learning rate

Competitive [0, 1; 0, 1; 0, 1] 2 0.01 0.001

14 M. Nitti et al.

Downloaded By: [Nitti, Mariangela] At: 14:29 27 July 2010

We now have to answer the second question: How

is each stage of the psychotherapeutic process

characterized? As described previously, at the end

of the training, each neuron represents the average

characteristics of the input patterns it classifies. In a

neural network, memory and knowledge are stored

in weights and biases of the neurons. Thus, in a

competitive network, the weights and the biases are

simply the prototypical index configurations found

in the patterns presented.

The final network parameters leading to our index

classification are presented in Table X. As regards

weights, each row corresponds to a neuron (a class)

and each column to a quantitative index: respec-

tively, connectivity, activity, and frequencies of SN.

The first neuron classifies the patterns of indexes

that are prevalent in the first stage of the therapy,

whereas the second neuron classifies those dominant

in the second stage.

We can summarize the main characteristics of

each stage as follows: The first stage sessions present

a high level of connectivity (0.555 on a 0�1 scale)

and SN (0.6) and a very low activity level (0.061); in

a mirror image, connectivity and SN drastically

decrease while the activity increases, even if it does

not reach a very high level. Therefore, in the second

stage of the therapy, a constraint on the discourse

network structure intervenes: The connections

among nodes become loose, with an important effect

on the regulative function of meanings (SN). The

reduction of super-order nodes does not involve a

reduction of the discursive dynamic, which, in fact,

is strengthened (as shown by the increasing activity).

It is finally possible to interpret the outputs of the

network in terms of clinical effects that the therapy is

able to produce on the patient. In the first phase of

the therapy, the interaction between therapist and

patient operates toward a limitation of the patient’s

beliefs and ways of perceiving experiences. The

therapy starts with a consistent wealth of meanings

regulating the patient’s interpretation of experience;

this is confirmed by the high level of super-order

nodes at the beginning of the therapy. During the

first stage, this level slowly decreases: The initial

assumptions are renegotiated through the discourse.

The reduction of super-order meanings is an effect

of the high connectivity among meanings and, only

marginally, of the activity.

The second one is the stage of re-elaboration of

sense. In this phase, the discourse dynamic enriches

the variability of the connections among meanings.

As a matter of fact, the second neuron, correspond-

ing to the second stage of the therapy, classifies

pattern combination with higher levels of activity.

Then, in the second stage, the communicative

exchange between patient and therapist becomes

capable of enlarging the paths of sensemaking by

enriching the combination among meanings that

these paths are made of. The activity then leads to

the recombination of the super-order meanings.

In summary, the competitive neural network

constructed here detects, among the index patterns

presented, such regularities that allow two classes to

be distinguished, corresponding with the two stages

(limitation and recombination) of the psychotherapy

process.

Conclusions and Potential Developments

The work presented here is an attempt to contribute

to the development of a technique, conceived and

tested in recent years, for analyzing the psychother-

apeutic process: discourse flow analysis.

The main strength and originality of such an

approach is the attempt to depict a model representing

the psychotherapeutic process with its temporal-

dynamic dimensions, by means of integrated statis-

tical methods allowing the creation of an automatic

procedure of analysis. Such methods are here

used in an innovative way: The cluster analysis,

usually adopted for the aims of classification, be-

comes a precious instrument for the content analysis;

the sequence analysis is implemented to discover

the sensemaking dynamics and represents the dis-

course network; the neural networks allow us to

identify the patterns characterizing the psychotherapy

process.

It is worth highlighting that because of its auto-

matic procedures of analysis the method consumes

relatively little time. Depending on the specific

operation involved, the function of researchers is to

define the parameters of the analysis and control or

complete the intermediate output (e.g., the case of

the lemmatization).

The introduction of these devices, and in parti-

cular of the neural network, represents an innovative

contribution to psychotherapy research. Researchers

are aware of the necessity of developing dynamic

approaches to the study of the psychotherapy

process so that it can be analyzed in terms of pattern

modification through time. Nevertheless, this theo-

retical awareness has not yet been translated into

consistent strategies and procedures of empirical

Table X. Parameters of the Network

Weights

Variable Connectivity Activity SN Biases

1st neuron 0.555 0.061 0.60 5.442

2nd neuron 0.214 0.173 0.18 5.430

Analysis of psychotherapy through statistics 15

Downloaded By: [Nitti, Mariangela] At: 14:29 27 July 2010

analysis. The DFA has been elaborated in order to

favor the overcoming of this methodological gap.

Needless to say, the encouraging results reported

in this study must not let its limits be under-

estimated. Our study is based on just one case. We

need more analyses to validate the DFA in terms of

both construct and convergent validity. We have

already obtained some positive results in this direc-

tion (Gennaro et al., 2010; Salvatore et al., 2007, in

press). Other case analyses have been planned to test

with a larger variety of types of psychotherapies,

clinical problems, and outcome.

Moreover, we are aware of several criticisms that

can be prompted by DFA. First, the method is quite

abstract and highly demanding of computational

competence. One could be led to ask: Where is the

clinical side in all this? How could it be useful for

clinicians?

We recognize the relevance of this issue. However,

the fact that a given technical competence is thought

of as sector-based or part of the common ground is a

matter of historical evolution of the field; for

instance, the statistical competence shared among

clinical researchers nowadays are very different in

quantity and quality compared with 20 years ago.

Above all, we believe that the development of a

clinical understanding of the psychotherapy process

can be enhanced by contributions coming from more

abstract and formal models. A method has to be

evaluated not on its complexity but on the mean-

ingfulness of its results in terms of implications for

clinical thought. So the right question is: Could DFA

be useful for the clinical side?

This study allows us to think that the answer to this

question could be positive. The DFA can contribute

to elaborate and validate a general model of the

psychotherapy process (TSSM), a goal that has

theoretical as well as methodological implications.

From a theoretical point of view, DFA proposes an

intersubjective and semiotic view of the psychother-

apy process. According to this view, the clinical

quality of psychotherapy as well as the mechanisms

of change need to be sought at the level of the

functional, structural, and dynamic characteristics

of the communication between therapist and patient

rather than in isolated aspects concerning the

patient’s and/or therapist’s state of mind (feeling,

attitude, thoughts, and the like) or behaviors. From a

methodological point of view, the DFA highlights

the need to move psychotherapy process research

toward a more dynamic and pattern-focused

approach (Greenberg, 1991, 1994; Russell, 1994)

to make it closer and more consistent to the actual

clinical experience and psychotherapists’ professional

culture.

A second critical issue concerns the DFA’s way of

interpreting the indexes. Focusing on the most

important example, we have to say that the inter-

pretation of the SN index as super-ordered meaning,

and thus in terms of the patient’s assumptions,

cannot be considered more than a plausible hypoth-

esis that requires further testing. This means devel-

oping the qualitative side of the DFA. In other words,

it means retrieving the content of the super-order

nodes and analyzing their discursive characteristics

and functions. For the present, what we can only say

is that our preliminary results are consistent with the

hypothesis grounding the TSSM model and the DFA

method.

Finally, we have to come back to the claim made at

the beginning of the study concerning the capability

of DFA to take the contextuality of meanings into

account. Needless to say, DFA has a limited rather

than absolute sensibility to the context. Many con-

textual dimensions and sources (e.g., paralinguistic

cues, nonverbal behavior, and syntactic, lexical, and

pragmatic markers) are not considered, because the

method focuses on just one aspect of contextuality:

the temporal dynamics of the discursive flow. Ob-

viously, no method can fully grasp the contextuality

of communication; suffice to say that this complete-

ness is already lost in the translation from the clinical

exchange to the transcript. On the other hand, the

levels of contextuality are not fully orthogonal among

themselves. Rather, because of the redundant char-

acter of the language, there is systematically a certain

extent of vicariousness through the dimensions of

the context. For instance, a patient striving to keep

himself emotively distant from the content of his

utterances could perform this communicative feat by

using redundantly a variable set of elements (e.g., an

impersonal syntactic expression, a distancing tone,

an abstract lexicon, and a controlled paralinguistic

and nonverbal behavior).

However, it is not possible to say to what extent

the redundancy of the language is a comfortable and

stable ground for the validity of the DFA with the

findings produced as far. The next fundamental step

in the development of the DFA will address this

point, it being the focal issue of any effort toward a

low inferential and time-efficient procedure of tex-

tual analysis aiming to deal with the complexity,

contingency, and dynamism of communication, in

particular the communication in a clinical setting.

Notes1 More precisely, the dialogical nature of sensemaking concerns

three complementary aspects (authors disagree in terms of the

relevance given to each of them). First, any sign (i.e., a word, an

utterance) takes place in the flow of other signs mobilized by the

participants in the dialogue. Every time people use signs, for

16 M. Nitti et al.

Downloaded By: [Nitti, Mariangela] At: 14:29 27 July 2010

speaking as well as for thinking or whatever use (e.g., as an

answer to the earlier signs of the flow, as an anticipation of the

future signs), those utterances are made by a present inter-

locutor, or by an imagined or imaginary other (Linell, 2009). In

that sense, sensemaking*also when it is a product of an

individual thinker*is itself always and already a dialogical

activity, inherently social. Second, signs specify their meanings

according to the position in that flow. The same utterance can

produce different meanings depending on which other utter-

ances come before and after. In that sense, sensemaking is

carried out through dialogue. Third, sensemaking is not merely

a matter of description of the reality. Rather, it has an inherent

social purpose: It is shaped by the intersubjective circumstances

motivating its activation. To use Wittgenstein’s terminology, the

meaning of a sign is its function of regulating the forms of life

carried on by linguistic games. Of course, broader contextual

dimensions (cultural models, ideologies, or structures of power)

are as much constitutive of the dialogical nature of thinking and

communicating as the situated circumstances of these activities

(Ratner, 2008). However, these dimensions have to be activated

within the contingence of the life of people influenced by them,

and it is at this level that we examine them here.2 However, one can observe that the method of content analysis

that DFA adopts makes the disambiguation not strictly

required. Indeed, the method is based on the co-occurrences

of the word within the utterances (see later discussion).

Consequently, the words are interpreted in accordance with

the context of their occurrence (i.e., the other words they tend

to occur with in the same utterance). Therefore, the disambi-

guation can be performed downhill, at the moment of the

interpretation of the cluster of co-occurring words. For instance,

imagine that the word ‘‘subject’’ has not been undertaken to

disambiguation. Moreover, imagine that the output of the

analysis shows that it is associated with words such as ‘‘under-

stand,’’ ‘‘to study,’’ ‘‘book,’’ and ‘‘examination.’’ One can then

conclude that in this context of co-occurrence the semantic

content that is pertinent is probably the one concerning ‘‘branch

of knowledge’’ rather than ‘‘person.’’3 Each lemma is considered a category of the ‘‘discourse’’ variable

and each ECU*that is, the context where a specific lemma is

used*is the statistical unit of analysis.4 Needless to say, this interpretation needs to be deepened and

perhaps articulated through a comparison of the other ECUs of

the cluster as well as of the whole distribution of the ECU of the

other clusters, a task that goes beyond the illustrative aim of our

discussion here (for details, see Salvatore et al., 2010).5 Another important aspect is the dynamic of the network, that is,

the direction of the information flow between and/or within

layers.6 Needless to say, the two stages are not totally distinct and

mutually exclusive. Both of them are present throughout the

whole psychotherapy, within every session, although to different

extents. However, the two-stage assumption asserts that, at the

macroanalytical level, in a clinically efficacious psychotherapy

process, it is possible to discriminate between a first stage

where deconstructive sensemaking is dominant and a second

one where the dynamics of sensemaking take on a constructive

function.7 Obviously, this criterion is valid insofar as the SN trend is

consistent with the U-shape model expected by the TSSM.8 The clinical literature provides various conceptualizations of the

clinical exchange in terms of a dialectical source of constraints

on a patient’s beliefs (inter alia; Salvatore, Tebaldi, & Potı,

2009).9 We have excluded from the analysis the sessions of the middle

period (Sessions 16�30). This is because the variability of

the SN trend in this period leads us to think that, at least in

the case of Pietro, the middle period may have worked as a

phase of transition, with no clear and discriminative pattern

emerging.

References

Anderberg, M. R. (1973). Cluster analysis for applications. New

York: Academic Press.

Andersen, S. (2001). The emergence of meaning: Generating

symbols from random sounds*A factor analytic model. Journal

of Quantitative Linguistics, 8(2), 101�136.

Austin, J. (1962). How to do things with words. Oxford, UK: Oxford

University Press.

Benzecri, J. P. (1973). L’Analyse des donnees. Tome I: La taxinomie.

Tome II: Correspondances [Data analysis. Volume I: The taxonomy.

Volume II: Correspondences]. Paris: Dunod.

Chambers, J., & Bickhard, M. (2007). Culture, self and identity.

Interactivist contributions to a metatheory for cultural psychol-

ogy. Culture and Psychology, 13(3), 259�295.

Dimaggio, G., & Semerari, A. (2004). Disorganized narratives:

The psychological condition and its treatment. In L. Angus &

J. McLeod (Eds.), The handbook of narrative and psychotherapy.

practice, theory and research (pp. 263�282). Thousand Oaks,

CA: Sage.

Floreano, D. (1996). Manuale sulle reti neurali [Handbook on neural

networks]. Bologna: Il Mulino.

Gabbard, F. (2000). Psychodynamic psychiatry in clinical practice

(3rd ed). Washington, DC: American Psychiatric Publishing Inc.

Gennaro, A., Al-Radaideh, A., Gelo, O., Manzo, S., Nitti, M.,

Auletta, A., et al. (2010). Modelling the psychotherapy process

as a meaning-making dynamics. The two stage semiotic model

(TSSM) and the discourse flow analyzer (DFA). In S.

Salvatore, J. Valsiner, A. Gennaro, & J. B. Traves Simon

(Eds.), YIS: Yearbook of idiographic science 2009, Volume 2.

Rome: Firera.

Gergen, K. J. (1999). An invitation to social construction. London:

Sage.

Greenberg, L. E. (1991). Research on the process of change.

Psychotherapy Research, 1(1), 3�16.

Greenberg, L. E. (1994). The investigation of change: Its

measurement and explanation. In R. L. Russell (Ed.), Reasses-

sing psychotherapy research (pp. 114�143). New York: Guilford

Press.

Hagan, M. T., Demuth, H. B., & Beale, M. (1996). Neural network

design. Boston: PWS Publishing.

Harre, R., & Gillett, G. (1994). The discursive mind. London: Sage.

Hermans, H. J. M., & Hermans-Jansen, E. (1995). Self-narratives.

The construction of meaning in psychotherapy. New York: Guilford

Press.

Hoffman, I. Z. (1998). Ritual and spontaneity in the psychoanalytic

process. Hillsdale, NJ: Analytic Press.

Lancia, F. (2005). Word co-occurrence and similarity in meaning.

Retrieved July 2, 2006 from http://www.mytlab.com/wctheory.

pdf.

Landauer, T. K., & Dumais, S. (1997). A solution to Plato’s

problem: The latent semantic analysis theory of acquisition,

induction and representation of knowledge. Psychological Re-

view, 104, 211�240.

Linell, P. (2009). Rethinking language, mind, and world dialogically.

Charlotte, NC: Information Age.

Nicolo, G., & Salvatore, S. (Eds.). (2007). La ricerca sui risultati e

sul processo in psicoterapia [Research in psychotherapy outcome

and process]. Rome: Edizioni Carlo Amore.

Nightingale, D. J., & Cromby, J. (Eds.). (1999). Social construc-

tionist psychology. A critical analysis of theory and practice.

Buckingham, UK: Open University Press.

Analysis of psychotherapy through statistics 17

Downloaded By: [Nitti, Mariangela] At: 14:29 27 July 2010

Ratner, C. (2008). Cultural psychology and qualitative methodol-

ogy: Scientific and political considerations. Culture and Psychol-

ogy, 14(3), 259�288.

Rommetveit, R. (1992). Outlines of a dialogically based social�cognitive approach to human cognition and communication. In

A. H. Wold (Ed.), The dialogical alternative towards a theory of

language and mind (pp. 19�44). Oslo: Scandinavian University

Press.

Rosenberg, S. D., Schnurr, P. P., & Oxman, T. E. (1990). Content

analysis: A comparison of manual and computerized systems.

Journal of Personality Assessment, 5(1�2), 298�310.

Russell, R. L. (Ed.). (1994). Reassessing psychotherapy research.

New York: Guilford Press.

Salvatore, S., Gelo, O., Gennaro, A., Manzo, S., & Al-Radaideh,

A. A. M. (in press). Looking at the psychotherapy process as an

intersubjective dynamic of meaning making. A case study with

discourse flow analysis. Journal of Constructivist Psychology.

Salvatore, S., Gennaro, A., Auletta, A., Al-Radaideh, A. A. M.,

Aloia, N., Masiello, F., et al. (2010, June). Perspectives for the

textual data analysis: A contextual approach. Paper presented at

the 41st International Meeting of the Society for Psychotherapy

Research, Pacific Grove, CA.

Salvatore, S., Gennaro, A., Grassi, R., Manzo, S., Melgiovanni,

S., Mossi, P., et al. (2007). Lo studio del processo terapeutico

in termini di analisi della dinamica discorsiva. Applicazione del

DFA (discourse flow analyzer) al caso K [The study of the

therapeutic process in terms of the discourse dynamic. The

application of DFA (discourse flow analyzer) to case K]. In G.

Nicolo & S. Salvatore (Eds.). La ricerca sui risultati e sul processo

della psicoterapia [Psychotherapy outcome and process research] (pp.

355�397). Rome: Firera & Liuzzo Group.

Salvatore, S., Tebaldi, C., & Potı, S. (2009). The discursive

dynamic of sensemaking. In S. Salvatore, J. Valsiner, S. Strout,

& J. Clegg (Eds.), YIS: Yearbook of idiographic science 2008,

Volume 1 (pp. 39�72). Rome: Firera. Retrieved September 20,

2008, from http://www.valsiner.com/articles/salvatore.htm.

Santos, A., Goncalves, M., Matos, M., & Salvatore, S. (2009).

Innovative moments and change pathways: A good outcome

case of narrative therapy. Psychology and Psychotherapy: Theory,

Research and Practice, 82, 449�466.

Steinbach, M., Karypis, G., & Kumar, V. (2000). A comparison of

document clustering techniques (University of Minnesota Techni-

cal Report No. 00-34). Retrieved December 4, 2006 from

http://www.cs.umn.edu/tech_reports_upload/tr2000/00-034.pdf

Storolow, R. D., Atwood, G. E., & Brandchaft, B. (1994). The

intersubjective perspective. Northvale, NJ: Jason Aronson.

Teasdale, J. D., & Barnard, P. J. (1993). Affect, cognition, and

change: Re-modeling depressive thought. Hillsdale, NJ: Erlbaum.

Valsiner, J. (2007). Culture in minds and societies. Foundations of

cultural psychology. New Delhi: Sage.

Wittgenstein, L. (1953). Philosophische Untersuchungen [Philosophi-

cal investigations] (2nd ed.). Oxford, UK: Basil Blackwell.

18 M. Nitti et al.

Downloaded By: [Nitti, Mariangela] At: 14:29 27 July 2010