Vocal congruence in mother-infant play

15
Journal of Psycholinguistic Research, Vol. 17, No. 3, 1988 Vocal Congruence in Mother-Infant Play Beatrice Beebe, 1 Diane Alson, 1 Joseph Jaffe, 2 Stanley Feldstein, 3 and Cynthia Crown 3 Accepted January 22, 1988 Turn-taking is the fundamental temporal structure of adult dialogue. This structure defines two types of joint silence: intrapersonal pause (silence bounded by the vocalizations of a single speaker) and switching pause (silence bounded by the vocalizations of different speakers). Switching pauses mark the boundaries of the turn exchange. In adult conversation the mean durations of both types of pause are charaeteristically matched between partners. This matching, termed "vocal congruence," occurs developmentally earlier in the case of switching pauses. We hypothesized and confirmed that mothers and infants match switching pauses but not intrapersonal pauses at 4 months, even though the infants' vocalizations are prelinguistic. Second, since there are known affective correlates of vocal congruence in adult conversation, we hypothesized a similar affective correlate for mother-infant vocal congruence. We found, for the intrapersonal pause only, that the degree of matching within a dyad correlates with infant affective engagement. We conclude, from switching pause congruence, that a turn- taking dialogic structure is being regulated in the mother-infant pair at 4 months in the same way as seen in adult conversation. Thus, both the temporal structure of aduh dialogue and its affective correlate are prelinguistic. Partners in adult conversation tend to match certain timing parameters of their speech (Jaffe & Feldstein, 1970). This phenomenon has been termed vocal congruence. To the extent that this matching occurs, the partners rate each other as warm and similar (Welkowitz & Kuc, 1973; see Feldstein & Welkowitz, 1978, for a review). We now document that Supported in part by NIMH grant No. 41675. 1Ferkauf Graduate School of Psychology, Yeshiva University, 1300 Morris Park, Bronx, N.Y. 10461. 2New York State Psychiatric Institute and Columbia University. 3University of Maryland Baltimore County. 245 0900-6905/88/0500-0245506.00/0 1988Plenum Publishing Corporation

Transcript of Vocal congruence in mother-infant play

Journal of Psycholinguistic Research, Vol. 17, No. 3, 1988

Vocal Congruence in Mother-Infant Play

Beatrice Beebe, 1 Diane Alson, 1 Joseph Jaffe, 2 Stanley Feldstein, 3 and Cynthia Crown 3

Accepted January 22, 1988

Turn-taking is the fundamental temporal structure of adult dialogue. This structure defines two types of joint silence: intrapersonal pause (silence bounded by the vocalizations of a single speaker) and switching pause (silence bounded by the vocalizations of different speakers). Switching pauses mark the boundaries of the turn exchange. In adult conversation the mean durations of both types of pause are charaeteristically matched between partners. This matching, termed "vocal congruence," occurs developmentally earlier in the case of switching pauses. We hypothesized and confirmed that mothers and infants match switching pauses but not intrapersonal pauses at 4 months, even though the infants' vocalizations are prelinguistic. Second, since there are known affective correlates of vocal congruence in adult conversation, we hypothesized a similar affective correlate for mother-infant vocal congruence. We found, for the intrapersonal pause only, that the degree of matching within a dyad correlates with infant affective engagement. We conclude, from switching pause congruence, that a turn- taking dialogic structure is being regulated in the mother-infant pair at 4 months in the same way as seen in adult conversation. Thus, both the temporal structure of aduh dialogue and its affective correlate are prelinguistic.

Partners in adult conversation tend to match certain timing parameters of their speech (Jaffe & Feldstein, 1970). This phenomenon has been termed vocal congruence. To the extent that this matching occurs, the partners rate each other as warm and similar (Welkowitz & Kuc, 1973; see Feldstein & Welkowitz, 1978, for a review). We now document that

Supported in part by NIMH grant No. 41675. 1Ferkauf Graduate School of Psychology, Yeshiva University, 1300 Morris Park, Bronx, N.Y. 10461.

2New York State Psychiatric Institute and Columbia University. 3University of Maryland Baltimore County.

245

0900-6905/88/0500-0245506.00/0 �9 1988 Plenum Publishing Corporation

246 Beebe, Aison, Jaffe, Feldstein, and Crown

Soeok.rA m TqPr-- Pr Pr sP i i I I

T i m e I t I I I I I I I I I I I I I I I 1 I I I I I I 4 2 3 4 5 6 7 8 9 t 0 t I t 2 t 3 t 4 15 t 6 t 7 t 8 19 2 0 2 t ; )2 2 3

Fig. 1. A diagramatic representation of a conversational sequence. The numbered line at the bottom represents time in 300-msec units. Thus, the duration of the segment is 6.9 seconds. V stands for vocalization, P for pause, and SP for switching pause (the silence that frequently occurs immediately prior to a change in the speaking turn). The arrows that point down denote the end of speaker A's turns; the arrows that point up denote the end of speaker B's turns. ISS and NSS stand for interruptive and noninterruptive simultaneous speech, respectively. (Adapted from Jaffe and Feldstein, 1970).

mothers and infants, when engaged in social play at 4 months, match the temporal patterns of their vocal exchange in a manner similar to that observed in adult conversation; furthermore, this matching has an affective correlate analogous to that observed in adult conversation. The theoretical importance of the prelinguistic appearance of adultlike pat- terns of communication will be discussed.

We used the Jaffe and Feldstein model (1970), which classifies the sound-silence patterns of a conversation in terms of five parameters: speaking turns, vocalizations, intrapersonal pauses, switching pauses, and simultaneous speech. The first four parameters are relevant to this study.

These parameters are illustrated in Figure 1. Turn-taking is the fundamental temporal structure of adult dialogue, in which speaker and listener exchange roles. In this model, when a speaker utters a unilateral sound, he gains the turn and retains it until his partner utters a unilateral sound, at which moment the turn is switched. Thus, the duration of a turn is the time between speaker switches. A vocalization is a segment of uninterrupted sound (speech) uttered by the speaker who has the turn.

The turn-taking structure defines two types of joint silence: inter- personal pauses, which are joint silences bounded by the vocalization of the same speaker, and switching pauses, which are joint silences bounded by the vocalization of different speakers, i.e., terminated by a speaker switch. The switching pause is assigned to the speaker who loses the turn.

Although the vocalizations of 4-month infants are prelinguistic, the rhythm of the vocal interactions between mothers and infants of this age

Vocal Congruence 247

can be analyzed as a conversational structure (Stern, Jaffe, Beebe, & Bennett, 1975; Bateson, 1975; Jaffe, Anderson, & Stern, 1979). All the parameters described for linguistically mature conversations are obtain- able from mother-infant vocal interactions.

The phenomenon of vocal congruence in adult conversation includes the matching of both intrapersonal and switching pauses. Both types of congruence are sensitive to interpersonal and affective processes (Feld- stein & Welkowitz, 1978). Vocal congruence has been related to psychological differentiation (Marcus, Welkowitz, Feldstein, & Jaffe, 1970), social desirability (Natale, 1975), enjoyment of degree of social contact (Feldstein & Welkowitz, 1978), and ratings of warmth (Welko- witz & Kuc, 1973). Whether it is the interpersonal pause, the switching pause, or both that show an interpersonal correlate varies from study to study.

Nevertheless, there are differences between the two types of vocal congruence. Switching pause matching is a more robust phenomenon, appearing earlier developmentally and remaining intact in the face of psychopathology. Jasnow and Feldstein (1986) found matching of switching pauses, but not intrapersonal pauses, in mother-infant interac- tions at 9 months, as we also document in the current study at 4 months. In a study of peers, 5 1/2 to 7 years of age, switching pause matching appears earlier than pause matching (Welkowitz, Cariffe, & Feldstein, 1976). In a study of adult depressives, switching pause matching persists when intrapersonal pause matching is lost (Jaffe & Anderson, in press). Adolescent autistics match switching pauses with an experimenter but not with their parents; intrapersonal pauses are not matched with either adult (Feldstein, Konstantareas, Oxman, & Webster, 1982).

Therefore, in the current study, we expected to find differences between switching pause and intrapersonal pause matching. Specifically, since Welkowitz et al. (1976) find switching pause congruence a developmentally earlier phenomenon in child peers, and Jasnow and Feldstein (1986) find switching pause matching but not intrapersonal pause matching at 9 months, we expected that mothers and infants would match switching pauses to a greater degree than intrapersonal pauses.

Our purpose was to determine whether intrapersonal pause matching and switching pause matching occurred at this early point in develop- ment. Second, analogous to the affective correlates of vocal congruence in adult partners, we hypothesized an affective correlate of vocal congruence. Regardless of whether significant matching occurred in the group, we hypothesized that the degree of vocal congruence for any given dyad would be positively correlated with an independent assessment of the infant's "affective engagement" (Beebe & Gerstman, 1980) during face-to-face play.

248 Beebe, Alson, Jaffe, Feldstein, and Crown

Subjects

Fifteen 4 middle-class infant-mother pairs participated as subjects in the study. The mothers were volunteers from local Lamaze classes. Their infants were first-born females whose ages ranged from 3 1/2 to 4 months and whose Apgar scores ranged from 8 to 10. The developmental course of the children was normal to the time of the study.

Procedure

During a home visit 2 weeks prior to the start of the data-collection phase of the study, mothers were familiarized with its goals and procedures. The mothers were told that the intent of the study was to examine the social development of normal infants.

Each mother-infant pair was videotaped in a standard face-to-face free-play paradigm (Field, 1977; Stern, 1974) in a laboratory setting. A mother and an infant were seated in a bare room opposite each other in the same visual plane, with the infant in an infant seat. Two TV cameras were used, focused on the mother's and infant's face and upper torso, respectively. A special-effects generator combined the two views into one split-screen frontal view of both partners. Mothers were instructed to play with their infants as they would at home, and the pair was left alone to interact.

In order to provide optimal conditions for each pair, a 4-hour block of time was get aside to provide ample opportunity for feeding and naps as necessary. The videotaped interaction lasted approximately 20 minutes per pair, with several breaks for caretaking, so that pairs were not filmed for more than 10 minutes at a stretch. Despite the mother's natural desire to "perform" well for the study, the assumption of previous research (Stern, 1974, 1977; Tronick, Als, & Adamson, 1979) is that the basic organization of behavior will proceed in characteristic fashion.

A separate audiotape of mother and infant vocalizations was recorded during the entire play session for each pair.

4Although an N of 15 is relatively small, it is considered reasonable in view of the labor-intensive techniques of audiotape and videotape analysis used in this study.

Vocal Congruence 249

Variables

Vocal Matching Measures. Obtained from the audiotape, the first 10 minutes of vocal interaction, or a total of 10 minutes collected around any time-outs, constituted the vocal sample for a mother-infant pair. Time-outs were called when unavoidable difficulties arose in the video- taping, such as when the baby spit up, or the mother repositioned the baby, accidentally moving the baby out of the camera's view. Any maternal singing was omitted from the sample since these sounds have a different timing structure than speech. Five minutes is considered an adequate vocal sample (Breskin & Jaffe, 1970). Previous research on adult dialogue has generally seated the two subjects sufficiently far apart to obtain separation of their respective vocal signals on a two-track magnetic tape, which is then computer-analyzed for temporal parameters. Since mother and infant of necessity must be seated facing each other less than a few feet apart, it was not possible to obtain two separate vocal tracks. Therefore, frequencies of all vocal parameters were obtained from one vocal track using a stopwatch.

Anticipating this problem of one vocal track for two participants, Cassotta, Feldstein, and Jaffe (1967; see also Jaffe & Feldstein, 1970, p. 132) performed systematic studies of the comparative reliability of manual timing of parameters and a computer-automated system of timing these vocal parameters. Highly significant correlation coefficients were obtained for the manual timing of parameters, with the exception of those involving simultaneous speech. (The reliability estimates cited in Jaffe and Feldstein (1970, p. 132) for the frequency distributions of switching pause and intrapersonal pause durations, respectively, were .986 and .939.) For this reason, the simultaneous vocalization parameters were not analyzed in this study. In view of the high reliability of manual timing of parameters, a stopwatch was used to code the remaining vocal parame- ters. Two different raters achieved an interrater reliability of 96% agreement within 0.3 seconds on judgments of the onset and offset of vocal parameters.

Three measurements yielded a mean duration of each parameter (vocalization, intrapersonal pause, switching pause) for each mother and each infant. Thus, the data consist of three pairs of mean duration scores per dyad: mother and infant mean vocalization, mother and infant mean intrapersonal pause, and mother and infant mean switching pause.

Vocal congruence (or matching) was assessed in a group analysis across dyads, as it has been in the studies of vocal congruence in adults.

250 Beebe, Alson, Jaffe, Feldstein, and Crown

Matching is defined as a significant product-moment correlation across dyads between mother and infant mean values for a given parameter (vocalization, intrapersonal pause, switching pause). 5

Irrespective of the presence of significant matching across the group for any parameter, the degree of match of vocal parameters per dyad was created as a variable to assess the relationship of vocal congruence to infant affective engagement (measured as described below). Absolute difference scores per dyad between the means of mother and infant vocal parameters were calculated. These per dyad difference scores were then used in correlations with infant engagement.

Infant Affective Engagement Scale. The affective engagement of infants is assessed by an ordinal engagement scale of specific combina- tions of orientation to mother, gaze at mother, and facial expression display during face-to-face play (Beebe & Stern, 1977; Beebe & Gerstman, 1980). Levels of engagement are ordinalized in a 9-point scale and not only imply "magnitudes" of relatedness but are better concep- tualized as complex modes of relatedness across a continuum from high positive engagement to an inhibition of responsivity altogether. Table I shows the criteria of the scale.

The entire videotaped play session per dyad (approximately 20 minutes) was coded for infant engagement levels by a form of micro- analysis. Numbers showing minutes, seconds, and 30ths of seconds were superimposed on the videotape. The coder watched the videotape running in real time and pressed the "pause" button to freeze the picture at each occasion a change in infant engagement was observed, noting time and infant level. This procedure yields a profile of the proportion of time in each engagement level. Interrater reliability ranged from 85 to 99%, with a mean of 94%. The timing of the two raters was considered reliable if the change of the second rater occurred within 0.3 seconds of the time of the change coded by the first author. Mean engagement level and percentage time in "positive" engagement (levels 70, 80, 90) were calculated for each infant.

5Congruence was estimated by Pearson product-moment correlation coefficients and not by a more stringent criterion of match, an intraclass correlation. Thus, the congruence or match is an assessment of whether the durations move together, rather than an estimate of the extent to which the average durations are similar.

Vocal Congruence 251

Table I. Infant Engagement Level Scale

Level Attention Orientation

Facial expressivity

Degree mouth open

M.O.

Degree mouth widen

M.W. Other

90 High Look Vis-a-vis M.O. 4 positive

85 Medium Look Vis-h-vis M.O. 3 high positive

80 Medium Look Vis4t-vis M.O. 2 positive

70 Low Look Vis-tt-vis (a)M.O. 1 positive or(b)M.O. 0

60 Positive Look Vis-h-vis (a)M.O. 1 or 2 attention or(b)M.O. 0

50 Neutral Look Vis-a-vis M.O. 0 40 Negative Look Vis-a-vis

attention

30 Oriented, Look Vis-a-vis M.O. 0 not look away

20 Avert Look Orient M.O. 0 away away

10 Inhibition Look Vis-h-vis M.O. 0 of responsivity away or away,

or body limp eyes closed

Bowing

Bowing

M.W. 2 o r l

M.W. 1 M.W. o r 2 M.W. 0

M.W. slight M.W. 0 Grimace

or "line too." or frown

M.W. 0 or negative

M.W. 0 or negative

M.W. 0

The "percent time in positive engagement" measure is more sensitive to the infant's use of the higher positive levels of the scale than is the mean. Every infant showed a bimodal distribution, with major modes at engagement levels 30 and 60, which essentially captures the distinction between not looking and looking at the mother. Nevertheless, there are considerable individual differences in the infants' usage of the various levels of the scale. The group mean engagement level was 48.87, with a standard deviation of 6.45. The individual infants ranged from a mean engagement level of 39.35 to a mean of 60.75. The percentage positive score had a group mean of 15.02, with a standard deviation of 12.29, twice that of the mean engagement level and therefore potentially a more sensitive measure. The individual infants ranged from 43.50 to 0.44 percentage of total time in positive (levels 70, 80, 90).

252 Beebe, Alson, Jaffe, Feldstein, and Crown

Table II . Mean Vocalization and Pause Durations (in Seconds) for Adult Speech and Mother and Infant Vocalization

Mean duration Adult Mother Infant

Vocalization 1.42 b 0.92 c 0.47 a 1.10 e O.59 e 1.07 f 0.65 r

Intrapersonal pause 0.59 b 1.64 C 0.91 a 1.00 e 0.53 e 0.88 f 0 . 6 4

Switching Pause a 0.66 b 0.80 e 0.70 e 0.88 r 0.78 r

aDurations of switching pauses have not been reported in previous studies of maternal speech with the exception of Jasnow (1983) and Jasnow and Feldstein (1986).

bjaffe and Feldstein (1970). cStern (1977). aStern, Beebe, Jaffe, and Bennett (1977). eCurrent report; Alson (1982). f Jasnow and Feldstein (1986).

R E S U L T S 6

Table II presents the mean durations of vocalizations and pauses of the current group of mothers in comparison with previous reports. All maternal vocalizations are shorter, and all maternal pauses are longer, than the adult comparisons. These data confirm previous findings (Stern, Beebe, Jaffe, & Bennett, 1977; Stern, 1977) that mothers alter their speech to infants by shortening the duration of vocalizations and elongating the duration of pauses, in comparison with adult speech durations as reported by Jaffe and Feldstein (1970).

To assess whether mother and infant match the durations of vocalizations, pauses, and switching pauses, Pearson product-moment correlation coefficients were computed that compared the mother and infant mean durations of these vocal parameters. These scores are provided in Table III. Nonsignificant correlations were obtained for vocalization durations (r = . 19) and intrapersonal pause durations (r = .41), but a significant correlation (r = .75, p < .005) was obtained for

6portions of these results were first presented in Alson (1982).

Vocal Congruence 253

Table III, Mean Durations (in Seconds) of Vocalizations, Pauses, and Switching Pauses for Mother and Infant

Mother- Mother Infant infant Mother Infant Mother Infant switching switching pair vocalization vocalization pause C pause c pause pause

A 0.783 0.722 0.860 0.796 0.633 0.775 13 1.309 0.663 0.553 0.500 0.525 0.500 C 2.856 0.745 1.236 0.667 0.755 0.644 D 0.968 0.374 0.958 0.257 0.933 0.687 E 0.958 0.611 0.882 0.515 0.628 0.614 F 0.841 0.560 0.920 0.267 0.588 0.541 G 0.798 0.512 1.246 0.438 0.910 0.824 H 0.386 0.661 1.258 0.592 1.026 0.875 I 0.471 0.470 1.879 0.979 1.216 1.178 J 1.225 0.647 0.647 0.267 0.578 0.511 K 0.882 0.488 0.913 0.294 0.695 0.689 L 0.893 0.604 0.901 1.200 0.607 0.517 M 2.062 0.464 0.968 0.400 1.295 0.867 N 1.292 0.562 0.894 0.383 1.086 0.629 O 0.796 0.727 1.020 0.373 0.537 0.690

Mean 1.101 0.587 1.009 0.529 0.801 0.703 SD 0.626 0.110 0.311 0.278 0.257 0.181 r a .19 .41 .75 b

ar refers to the product-moment correlation of mother and infant values on each vocal parameter.

b p < .005. CPause refers to intrapersonal pause.

switching pause durations. 7 Thus, as a group, these mothers and infants match each other's durations of switching pauses only.

To assess whether variations in the degree to which dyads match vocal parameters are systematically related to infant affective engage- ment, Pearson product-moment correlations were computed between the mother-infant per dyad difference scores of each vocal parameter and two indices of infant engagement: mean and percentage positive engagement (levels 70, 80, 90). Table IV shows correlations between the difference scores of the mean vocal parameters and the two infant engagement indices. Only the correlation between the difference scores of intraper- sonal pauses and the infant engagement indices is significant. The

7From the work on adult conversations (Jaffe and Feldstein, 1970), we would not expect vocal durations to match.

254 Beebe, Alson, Jaffe, Feldstein, and Crown

Table IV. Correlations of Vocal Congruence Scores and Infant Engagement

Infant engagement Mother-infant per dyad absolute difference score Mean % positive

Vocalization - . 16 .10 Intrapersonal pause - . 5 9 a - . 6 8 b Switching pause - . 0 6 .21

< .05. ~P< .005.

negative correlation indicates that the higher the infant engagement, the smaller the difference score of mother and infant intrapersonal pause means--that is, the closer the match--and vice versa. Thus, although as a group mother and infant do not significantly match their intrapersonal pause durations, nevertheless the more similar the durations of their intrapersonal pauses on a per dyad basis, the higher the infant engage- ment. Conversely, there is a significant group correlation in switching pause durations, but the per dyad degree of similarity of switching pause durations is not systematically related to level of infant engagement. 8

DISCUSSION

Turn-taking is the fundamental temporal structure of dialogue, and switching pauses mark the boundaries of the turn exchange. The switching pause occurs precisely at the point at which speaker and listener exchange roles. The clearest similarity between adult and mother-infant

Sin order to assess the possible role of switching pause matching in the prediction of infant engagement, a multiple regression using the variable of the absolute difference of intraper- sonal pauses and the absolute difference of switching pauses was used to predict infant percentage positive engagement. Both predictor variables were entered simultaneously. The results of the multiple regression indicate that similarity of switching pauses does not contribute to the prediction of infant engagement. The regression coefficient for intrapersonal pauses is B = -33 .43 (p < .01), and for switching pauses, B = -12 .76 (p = .50). The multiple R = .698 (p < .018) and the multiple r 2 = .49 (n.s.). Together, the amount of variance explained by the two variables is 49%. However, 4% of the variance explained is due to switching pauses, and 96% of the variance explained is due to intrapersonal pauses. The significant prediction of infant engagement is thus based solely on similarity of intrapersonal pauses.

Vocal Congruence 255

vocal "dialogue" is the matching of the average durations of switch- ing pauses. Since these switching pauses occur at the turn boundaries, the matching of their durations between partners means that the exchange of turns is being regulated similarly: Each partner pauses for a simi- lar duration before the other takes a turn. Jasnow and Feldstein (1986) also found a correlation of switching pauses in mother-infant vocal exchanges at 9 months. Furthermore, by using a method of time-series analysis, they showed that this correlation occurred due to mutual influence.

Evidence that the infant can make the discriminations necessary to match durations such as those of switching pauses has been established by DeCasper and his colleagues. They have shown that the neonate is able to estimate intervals of time and to discriminate small differences in intervals, and to alter the durations of his own behavior, when rewarded by the sound of his mother's voice or music (DeCasper & Fifer, 1980; DeCasper & Carstens, 1980).

The presence of a turn-taking structure in mother-infant exchanges has been noted by numerous investigators (for example, Bateson, 1975; Brazelton, Kozlowski & Main, 1974; Kaye & Wells, 1980; Schaffer, Collis, & Parsons, 1977; Spitz, 1963; Stern et al., 1975; Tronick et al., 1979; Mayer & Tronick, 1985). That the smooth exchange of turns is essential to the acquisition of affective, cognitive, and linguistic compe- tence is also generally agreed (Als, 1975; Kaye, 1982; Spitz, 1963; Schaffer et al., 1977). Less is understood, however, about what is responsible for the smooth exchange of turns--that is, what constitutes the interactive "rules" of turn-taking.

Kaye and Wells (1980) found that in the neonate, both mother and infant contingently respond to the other and thus promote a "dialogue": The mother responds to the infant's pause during sucking by jiggling, and the infant responds to the mother's cessation of jiggling by resuming sucking. Mayer and Tronick (1985) use the turn-yielding signals charac- teristic of adult conversations to document that mother and infant use some of the same cues for their turn exchange. Schaffer et al. (1977) suggest that smooth turn-taking comes about through the sensitivity of the mother to the infant's patterning, such that the mother lets herself be paced by the infant and fits into his pauses.

Our data point to the importance of matching of temporal patterns as a further regulatory mechanism in the turn exchange. Although the hypothesis of rhythmic matching is frequently proposed to account for the development of turn-taking and affective involvement (Bullowa, 1979; Jaffe & Feldstein, 1970; Jaffe et al., 1979; Jaffe & Anderson, 1979;

256 Beebe, Alson, Jaffe, Feldstein, and Crown

Beebe, Stern, & Jaffe, 1979), there is rarely a clear demonstration of matching of explicitly defined parameters in infancy (see Beebe et al., 1979, as an exception). The matching of temporal patterns of behavior implies a timing mechanism (such as a biological clock) that can be adjusted to the partner. This mechanism is operative at the points of the turn exchange in early infancy.

Intrapersonal pause matching is also a strong congruence finding in adult conversation. However, we do not find it in mother-infant interactions at 4 months. Jasnow and Feldstein (1986) also failed to find a correlation of intrapersonal pause durations in mother-infant interac- tions at 9 months. The finding that mothers and infants match switching pauses but not intrapersonal pauses parallels the Welkowitz et al. (1976) finding that switching pauses reflect matching earlier (age 5 1/2) than do intrapersonal pauses (age 6 1/2) in peer conversations. Matching switch- ing pauses is thus a developmentally earlier phenomenon than matching intrapersonal pauses.

There is considerable evidence to suggest that the matching of these two types of pause functions differently. Switching pause matching is a ubiquitous phenomenon in both normal and pathological behavior and at all levels of development. In adult depressives (Jaffe & Anderson, in press) and adolescent autistics (Feldstein et al., 1982), switching pause matching continues to be exhibited when intrapersonal pause matching is absent. Thus, switching pause matching seems to be an intrinsic aspect of the turn-taking mechanism of dialogue and is biologically well "buffered" against perturbations in development. An additional differ- ence between the two types of pause is that the switching pause occurs at the moment of an exchange of speaker/listener roles, when the brains of both participants are reciprocally reorganizing from receptive to expres- sive modes, or vice versa. Contrast this with the intrapersonal pause, where the speaker retains the turn, and his brain is organized purely in the expressive mode, "running free" so to speak.

In infancY the two types of pause are further differentiated by the presence of an affective correlate in one but not the other. In adults both types of pause can show an affective correlate, using self-report tech- niques. The matching of both types of pause has been shown to be associated with measures of perceived similarity and warmth (Welkowitz & Kuc, 1973), social desirability (Natale, 1975), and degree of enjoy- ment (Feldstein & Welkowitz, 1978). However, a study of affective behavioral correlates comparable to the present study has not been done with adults.

In summary, intrapersonal pause and switching pause are more

Vocal Congruence 257

differentiated in mother-infant interaction than in normal adult conver- sation. Whereas in adult conversation both types of pause are matched, and both can have affective correlates, in the mother-infant exchange only the switching pause is matched across subjects, and only a per dyad degree of intrapersonal pause matching has an affective correlate.

We conclude that a dialogic structure is already in evidence prior to speech onset, and that it is being regulated in a manner that is similar to adult conversation. The temporal organization of turn-taking and its affective correlates can thus be seen as one major scaffolding on which later linguistic development is erected.

REFERENCES

Als, H. (1975). An ethological study of mother-infant interaction. Unpublished doctoral dissertation, University of Pennsylvania.

Alson, D. (1982). Maternal empathy in relation to infant affective engagement of Jbur months. Unpublished doctoral dissertation, Yeshiva University.

Bateson, M.C. (1975). Mother-infant exchanges: The epigenesis of conversational interaction. Annals of the New York Academy of Sciences, 263, 101-113.

Beebe, B., & Gerstman, L. (1980). The "packaging" of maternal stimulation in relation to infant facial-visual engagement. Merrill-Palmer Quarterly, 26, 321-339.

Beebe, B., & Stern, D.N. (1977). Engagement-disengagement and early object experiences. In N. Freedman & S. Grand (Eds.), Communicative structures and psychic structures: A psychoanalytic interpretation of communication. New York: Plenum Press.

Beebe, B., Stern, D., & Jaffe, J. (1979). The kinesic rhythm of mother-infant interactions. In A.W. Siegman & S. Feldstein (Eds.), Of speech and time: Temporal patterns in interpersonal contexts. Hillsdale, NJ: Erlbaum.

Brazelton, T.B., Koslowski, B., & Main, M. (1974). The origins of reciprocity: The early mother-infant interaction. In M. Lewis and L. Rosemblum (Eds.), The effect of the infant on the caregiver. New York: Wiley.

Breskin, S., & Jaffe, J. (1970). Temporal patterns of speech and sample size. Journal of Speech and Hearing Research, 13(3), 667-668.

Bullowa, M. (1979). Before speech: The beginnings of human communication. Cambridge: Cambridge University Press.

Cassotta, L., Feldstein, S., & Jaffe, J. (1967). The stability and modifiability of individual vocal characteristics in stress and nonstress interviews (Research Bulletin No. 2). New York: William Alanson White Institute.

DeCasper, A., & Carstens, A. (1980). Contingencies of stimulation: Effect on learning and emotion in neonates. Infant Behavior and Development, 4, 19-36.

DeCasper, A., & Filer, W. (1980). Of human bonding: Newborns prefer their mothers' voices. Science, 208, 1174.

Feldstein, S., Konstantareas, M., Oxman, J., & Webster, C. (1982). The chronography of interactions with autistic speakers: An initial report. Journal of Communication Disorders 15, 451-460.

Feldstein S., & Welkowitz, J. (1978). A chronography of conversation: In defense of an

258 Beebe, Alson, Jaffe, Feldstein, and Crown

Field,

Jaffe,

Jaffe,

objective approach. In A.W. Siegman & S. Feldstein (Eds.), Nonverbal behavior and communication. Hillsdale, NJ: Erlbaum. T.M. (1977). Effects of early separation, interactive deficits and experimental

manipulation on infant-mother face-to-face interaction. Child Development, 48, 763- 771. J., & Anderson, S. (1979). Communication rhythms and the evolution of language, tn A.W. Siegman & S. Feldstein (Eds.), Of speech and time. Hillsdale, NJ: Erlbaum. J. & Anderson, S. (in press). Speech rate studies in major depressive disorders: Prediction of response to medication. In S. Feldstein, C. Crown, & J. Welkowitz (Eds.), Speech sounds and silences: A social-psychological approach to clinical concerns. Hillsdale, N.J.: Erlbaum~

Jaffe, J., Anderson, S., & Stern, D. (1979). Conversational rhythms. In D. Aronson & R. Rieber (Eels.), Psycholinguistic research. New York: Erlbanm.

Jaffe, J., & Feldstein, S. (1970). Rhythms of dialogue. New York: Academic Press. Jasnow, M. (1983). Temporal accommodation in vocal behavior in mother-infant dyads.

Doctoral dissertation, George Washington University. Jasnow, M. & Feldstein, S. (1986). Adult-like temporal characteristics of mother-infant

interactions. Child Development, 57, 754-761. Kaye, K. (1982). The mental and social life of babies. Chicago: University of Chicago Press. Kaye, K., & Wells, A. (1980). Mothers' jiggling and the burst-panse pattern in neonatal

sucking. Infant Behavior and Development, 3, 29-46. Marcus, E., Welkowitz, J., Feldstein, S., & Jaffe, J. (1970). Psychological differentiation and

the congruence of temporal speech patterns. Paper presented at the Eastern Psycho- logical Association Meetings, April, Atlantic City.

Mayer, N.K., & Tronick, E.Z. (1985). Mothers' turn-giving signals and infant turn-taking in mother-infant interaction. In T. Field & N. Fox (Eds.), Social perception in infancy. Norwood, NJ: Ablex.

Natale, M. (1975). Convergence of mean vocal intensity in dyadic communication as a function of social desirability. Journal of Personality and Social Psychology, 32, 790-804.

Schaffer, H.R., Collis, G.M., & Parsons, G. (1977). Vocal interchange and visual regard in verbal and pre-verbal children. In H.R. Schaffer (Ed.), Studies in mother-infant interaction. New York: Academic Press.

Spitz, R. (1963). The evolution of the dialogue. In M. Schur (Ed.), Drives, affects, behavior (Vol. 2). New York: International Universities Press.

Stern, D.N. (1974). Mother and infant at play: The dyadic interaction involving facial, vocal and gaze behaviors. In M. Lewis & L. Rosenblum (Eds.), The effect of the infant on its caregiver. New York: Wiley.

Stern, D.N. (1977). Thefirst relationship. In J. Bruner, M. Cole, & B. Lloyd (Series Eds.), The developing child. Cambridge, MA: Harvard University Press.

Stern, D.N., Beebe, B., Jaffe, J., & Bennett, S.L. (1977). The infant's stimulus world during social interaction: A study of caregiver behavior with particular reference to repetition and timing. In H.R. Schaffer (Ed.), Studies in mother-infant interaction. New York: Academic Press.

Stern, D.N., Jaffe, J., Beebe, B., & Bennett, S.L. (1975). Vocalizing in unison and in alteration: Two modes of communication within the mother-infant dyad. Annals of the New York Academy of Sciences: Developmental Psycholinguistics and Communication Disorders, 263, 89-100.

Tronick, E., Als, H., & Adamson, L. (1979). The communicative structure of early

Vocal Congruence 259

face-to-face interactions. In M. Bullowa (Ed.), Before speech. Cambridge: Cambridge University Press.

Welkowitz, J., Cariffe, G., & Feldstein, S. (1976). Conversational congruence as a criterion of socialization in children. Child Development, 47, 269-272.

Welkowitz, J., & Kuc, M. (1973). Interrelationships among warmth, genuineness, empathy, and temporal speech patterns in interpersonal interaction. Journal of Consulting and Clinical Psychology, 41, 472-473.