The factorial structure of the Chronic Liver Disease Questionnaire (CLDQ)

10
The factorial structure of the Chronic Liver Disease Questionnaire (CLDQ) Karl-Heinz Schulz Sylvia Kroencke Heike Ewers Holger Schulz Zobair M. Younossi Accepted: 12 March 2008 Ó Springer Science+Business Media B.V. 2008 Abstract Objective The Chronic Liver Disease Questionnaire (CLDQ) is a disease-specific instrument designed to assess health-related quality of life in patients with chronic liver disease. The aim of this paper is to present the psycho- metric properties of a German version of this questionnaire. A special focus is placed on the analysis of the CLDQ’s factorial structure. Methods Five hundred and twenty-four patients com- pleted the CLDQ from May 1999 to October 2006. The results were subject to item analysis, reliability and validity assessments, and confirmatory and exploratory factor analysis. Results The distribution characteristics on the item and scale level were satisfactory. Internal consistency was good to excellent; retest reliability acceptable. Validity could be confirmed by characteristic subscale correlations with other quality-of-life scales. Confirmatory factor analysis could not sufficiently reproduce the original factor structure. Exploratory factor analysis suggested five out of six sub- scales of the original version and yielded a new subscale: ‘‘sleep.’’ Conclusion CLDQ’s reliability and validity have been confirmed. In addition, the demonstrated practical admin- istration of the questionnaire suggests that it should serve as a routine quality of life assessment of patients with chronic liver disease. Keywords Chronic liver disease Á Liver cirrhosis Á Disease-specific questionnaire Á Factor analysis Á Health-related quality of life Á Liver transplantation Á Reliability Á Validity Á Living liver donation Abbreviations CLDQ Chronic Liver Disease Questionnaire HRQL Health-related quality of life SF-36 36-Item Short-Form Health Survey HADS Hospital Anxiety and Depression Scale Introduction Health-related quality of life (HRQL) is a multidimen- sional construct that includes the psychological, social, and functional aspects of an illness as well as its physical aspects. HRQL is usually assessed with the help of self- reported measures, which reflect the patient’s appraisal of his or her own illness and its accompanying impairments [1]. Generic measures allow comparisons between groups of patients, but disease-specific instruments identify the characteristic impairments of a specific illness and are considered to be more responsive to change [2]. The Chronic Liver Disease Questionnaire (CLDQ) is a specific HRQL instrument that was systematically devel- oped by Younossi et al. [3] for patients with chronic liver disease of all etiologies and all stages of cirrhosis. The K.-H. Schulz (&) Á S. Kroencke Á H. Ewers Department of Hepatobiliary and Transplant Surgery, Center for Transplantation Medicine, University Medical Center Eppendorf, Martinistr. 52, S 35, 20246 Hamburg, Germany e-mail: [email protected] K.-H. Schulz Á S. Kroencke Á H. Ewers Á H. Schulz Department of Medical Psychology, Center for Psychosocial Medicine, University Medical Center Eppendorf, Hamburg, Germany Z. M. Younossi Center for Liver Diseases, Inova Fairfax Hospital, Annandale, VA, USA 123 Qual Life Res DOI 10.1007/s11136-008-9332-7

Transcript of The factorial structure of the Chronic Liver Disease Questionnaire (CLDQ)

The factorial structure of the Chronic Liver DiseaseQuestionnaire (CLDQ)

Karl-Heinz Schulz Æ Sylvia Kroencke ÆHeike Ewers Æ Holger Schulz Æ Zobair M. Younossi

Accepted: 12 March 2008

� Springer Science+Business Media B.V. 2008

Abstract

Objective The Chronic Liver Disease Questionnaire

(CLDQ) is a disease-specific instrument designed to assess

health-related quality of life in patients with chronic liver

disease. The aim of this paper is to present the psycho-

metric properties of a German version of this questionnaire.

A special focus is placed on the analysis of the CLDQ’s

factorial structure.

Methods Five hundred and twenty-four patients com-

pleted the CLDQ from May 1999 to October 2006. The

results were subject to item analysis, reliability and validity

assessments, and confirmatory and exploratory factor

analysis.

Results The distribution characteristics on the item and

scale level were satisfactory. Internal consistency was good

to excellent; retest reliability acceptable. Validity could be

confirmed by characteristic subscale correlations with other

quality-of-life scales. Confirmatory factor analysis could

not sufficiently reproduce the original factor structure.

Exploratory factor analysis suggested five out of six sub-

scales of the original version and yielded a new subscale:

‘‘sleep.’’

Conclusion CLDQ’s reliability and validity have been

confirmed. In addition, the demonstrated practical admin-

istration of the questionnaire suggests that it should serve

as a routine quality of life assessment of patients with

chronic liver disease.

Keywords Chronic liver disease � Liver cirrhosis �Disease-specific questionnaire � Factor analysis �Health-related quality of life � Liver transplantation �Reliability � Validity � Living liver donation

Abbreviations

CLDQ Chronic Liver Disease Questionnaire

HRQL Health-related quality of life

SF-36 36-Item Short-Form Health Survey

HADS Hospital Anxiety and Depression Scale

Introduction

Health-related quality of life (HRQL) is a multidimen-

sional construct that includes the psychological, social, and

functional aspects of an illness as well as its physical

aspects. HRQL is usually assessed with the help of self-

reported measures, which reflect the patient’s appraisal of

his or her own illness and its accompanying impairments

[1]. Generic measures allow comparisons between groups

of patients, but disease-specific instruments identify the

characteristic impairments of a specific illness and are

considered to be more responsive to change [2].

The Chronic Liver Disease Questionnaire (CLDQ) is a

specific HRQL instrument that was systematically devel-

oped by Younossi et al. [3] for patients with chronic liver

disease of all etiologies and all stages of cirrhosis. The

K.-H. Schulz (&) � S. Kroencke � H. Ewers

Department of Hepatobiliary and Transplant Surgery, Center

for Transplantation Medicine, University Medical Center

Eppendorf, Martinistr. 52, S 35, 20246 Hamburg, Germany

e-mail: [email protected]

K.-H. Schulz � S. Kroencke � H. Ewers � H. Schulz

Department of Medical Psychology, Center for Psychosocial

Medicine, University Medical Center Eppendorf, Hamburg,

Germany

Z. M. Younossi

Center for Liver Diseases, Inova Fairfax Hospital, Annandale,

VA, USA

123

Qual Life Res

DOI 10.1007/s11136-008-9332-7

original instrument was tested within a group of 133

patients and showed appropriate reliability and validity [3].

Since then, several studies have confirmed the ability of the

CLDQ to discriminate between different stages of disease

severity according to Child class [4–10]. Another study

found that the CLDQ could discern between patients with

and without comorbid diseases [11]. Furthermore, the

CLDQ has proven to be responsive after liver transplan-

tation [9, 12] as well as after application of self-care and

educational programs [13, 14]. Between 2004 and 2006, a

Thai [8], an Italian [11], a Spanish [9], and a Lithuanian

[10] adaptation of the CLDQ were published. A German

adaptation of the CLDQ was developed by Hauser et al. in

2001. The results from their sample of 203 patients with

chronic liver disease were published in 2004 [7]. In con-

trast to the Italian [11] and the Spanish [9] validation, no

factor analysis was computed in the Hauser et al. version to

analyze the factorial structure. The Thai [8] and Lithuanian

[10] validation did not include factor analysis either.

The two studies [9, 11] in which the factorial structure

of the CLDQ was reassessed yielded considerably different

results. Whereas the Spanish [9] factorial solution was

relatively close to the original [3], although a new factor,

‘‘sleep,’’ was found and some items showed higher load-

ings on other factors than the original ones, the Italian [11]

solution consisted of a five-factor model not compatible

with the original version.

Independently of the work of Hauser et al. [7], we

developed a German version of the CLDQ that was pro-

spectively applied to patients with chronic liver disease

seen in our surgical outpatient clinic and the medical

department since 1999. This paper presents the psycho-

metric properties of the questionnaire and reports our

experiences with the questionnaire in clinical practice. As

data concerning the factorial structure of the CLDQ is

inconsistent, a special focus is placed on factor analysis.

The results are compared to the original version by

Younossi et al. [3] and to the two available factorial

solutions reported by Ferrer et al. [9] and Rucci et al. [11].

Patients and methods

CLDQ

The original CLDQ is a self-administered questionnaire

that consists of 29 items on a seven-point Likert scale, with

higher scores indicating the minimum frequency of

symptoms and therefore a better HRQL. Answers reflect

the 2 weeks prior to testing. Factor analysis yielded six

subscales: fatigue, activity, emotional function, abdominal

symptoms, systemic symptoms, and worry. Means are

computed for all subscales and for the overall score.

Translation

The original CLDQ was translated in 1999 by three bilingual

professionals (two German native speakers and one English

native speaker) using the forward and backward translation

method. Differing translations were discussed until a con-

sensus was reached. Earlier versions of the questionnaire

were discussed with patients regarding understandability and

relevance of items. Additionally, we developed a slightly

modified version of the CLDQ for living liver donors.

Patients

The CLDQ was first distributed among patients with

chronic liver disease visiting the surgical outpatient clinic

in 1999. Beginning in 2000, patients from the medical

department and the surgical outpatient clinic whose disease

had advanced so that transplantation became necessary

received the questionnaire as part of their routine psycho-

logical evaluation before being admitted to the waiting list.

Liver tumor patients without cirrhosis and patients with

polycystic disease were not included in the study, because

these patients do not exhibit typical symptoms of chronic

liver disease. In 2004, we began applying the questionnaire

to the evaluation of living liver donor candidates.

Other questionnaires

Two other questionnaires were also used in the context of

the psychological evaluation for liver transplantation and

living liver donation: the 36-Item Short-Form Health Sur-

vey (SF-36), and the Hospital Anxiety and Depression

Scale—German version (HADS-D).

36-Item Short-Form Health Survey (SF-36)

The SF-36 [15] is an internationally used instrument to

measure generic HRQL. It is often used in the field of

gastroenterology [16] and is considered reliable and valid.

The German version was adapted by Bullinger and Kir-

chberger [17]. It consists of 36 items combined into eight

scales, out of which two summary scales can be computed:

‘‘physical composite score’’ and ‘‘mental composite

score.’’ Each of the eight scales results in a score ranging

from 0 to 100, with a higher score indicating a better

HRQL. T scores with a mean of 50 and a standard devia-

tion (SD) of 10 are computed for the summary scales.

Hospital Anxiety and Depression Scale—German version

(HADS-D)

The original version of the HADS was developed by

Zigmond and Snaith [18]; the German adaptation was

Qual Life Res

123

developed by Herrmann et al. [19]. It is a widely used

instrument for assessing anxiety and depression in clinical

medicine, with good reliability and validity. Each scale is

made up of seven items, which are summed to a score

ranging from 0 to 21. Scores lower than 7 are not consid-

ered clinically relevant.

Procedure and statistical analysis

Demographics and liver disease etiology were described

for the whole sample of patients with chronic liver disease

examined between 1999 and 2006. Item analysis was based

on calculated: means, SDs, floor effects (proportion of

patients with lowest HRQL scores), ceiling effects (pro-

portion of patients with highest HRQL scores), and item

discrimination (correlation of the item with its subscale).

Furthermore, the proportion of patients who did not answer

an item (‘‘missing’’) was used as a reflection of

‘‘acceptance.’’

Further analysis was conducted on a subscale level. For

each subscale, means and SDs were calculated as well as

the observed range of item scores, floor and ceiling effects,

and the proportion of patients with missing items. Reli-

ability was assessed through internal consistency

(Cronbach’s a) and retest reliability (Pearson correlation

coefficient).

Convergent and discriminant validity were assessed

through correlations (Pearson coefficient) of CLDQ

domains with SF-36 and HADS-D subscales in a multitrait,

multimethod matrix. It was hypothesized that different

scales measuring similar constructs would show high cor-

relations, whereas scales measuring different or

noncomparable traits would not correlate strongly. High

correlations (r [ .70) were hypothesized for the following

subscales: ‘‘fatigue’’ and ‘‘activity’’ (CLDQ) with ‘‘vital-

ity’’ (SF-36); ‘‘emotional function’’ (CLDQ) with ‘‘mental

health’’ (SF-36); and ‘‘emotional function’’ (CLDQ) with

‘‘anxiety’’ and with ‘‘depression’’ (HADS-D). Low corre-

lations (r \ .50) were hypothesized for the CLDQ subscale

‘‘abdominal symptoms’’ with SF-36 scales ‘‘role-emo-

tional’’ and ‘‘mental health’’ and with ‘‘anxiety’’ and

‘‘depression’’ (HADS-D); and ‘‘worry’’ (CLDQ) with

‘‘physical functioning,’’ ‘‘role-physical,’’ and ‘‘bodily

pain’’ (SF-36).

Validity was further estimated by comparing patients

with chronic liver disease to living liver donor candidates,

a sample selected for good physical health. Comparisons of

CLDQ means were based on independent samples t test.

The demographics and etiology of liver disease were

described separately for subsamples of patients for factor

analysis and retesting, and for the sample of living liver

donor candidates. The subsamples of patients with chronic

liver disease were compared with patients who were not

included in the analysis by using independent-samples t

tests or chi-square tests (according to scale level). The

demographics of living liver donor candidates and patients

with chronic liver disease were compared using the same

tests.

A confirmatory factor analysis (CFA) was computed as

another means to assess validity. The hypothesized model

according to Younossi et al. [3] was examined with AMOS

4.0 [20] using maximum likelihood (ML) estimation. The

following model fit indices were considered: the compar-

ative fit index (CFI), the standardized root mean square

residual (SRMR), and the root mean square error of

approximation (RMSEA). To show a good fit for the

model, CFI values should be .95 or higher, and the SRMR

and the RMSEA should be less than .08 and .06, respec-

tively [21]. It is generally accepted practice to evaluate fit

through a combination of fit indices; confidence in the

results increases as agreement among the fit statistics

increases.

In addition, an exploratory factor analysis (principal

component analysis with varimax rotation) was conducted.

Only factors with an eigenvalue [1 explaining the maxi-

mum cumulative variance were interpreted.

To minimize influences of missing data on the results of

factor analyses, list-wise exclusion of cases was preferred.

This resulted in a considerably reduced sample. Therefore,

demographics and liver disease etiology were described

separately for this sample and compared with the patients

not included in this analysis (see above).

Results

The demographics and liver disease etiology of 524

patients with chronic liver disease who filled out the

questionnaire from May 1999 to October 2006 are pre-

sented in Table 1. Patients reported no special problems

related to completing the questionnaire. However, we

repeatedly observed that patients who were hospitalized for

decompensated liver disease did not return the question-

naire. Among patients who did complete the questionnaire,

many asked questions about item no. 14 (‘‘bothered by a

limitation of your diet’’).

After list-wise exclusion of cases, the subsample for

factor analysis consisted of 401 patients (Table 1). This

subsample did not differ significantly from the patients

excluded from the analysis with regard to age (t = 1.05,

P = .29), gender (v2 = .44, P = .51), or liver disease

etiology (v2 = .62, P = .89).

Retest reliability was calculated utilizing a subsample of

55 patients who filled out the questionnaire a second time

(Table 1). The mean time between testing was 82 days

(SD = 41). This subsample’s mean age did not differ

Qual Life Res

123

significantly from the patients not included in this analysis

(t = .91, P = .36), but there were more women

(v2 = 4.24, P = .04) in the retest subsample. Liver disease

etiology did not differ significantly (v2 = 7.51, P = .06).

The sample of living liver donor candidates (40) is also

described in Table 1. Living liver donors were different

from patients with chronic liver disease with regard to

mean age (t = 14.94, P \ .001) but not gender (v2 = .18,

P = .68).

Item analysis (Table 2) showed that the whole range of

possible scores was utilized. There was only one item (no.

9) with a proportion of 21% of worst HRQL scores. For

all the other items, the proportion of worst HRQL scores

was lower than 15%. In comparison, there were seven

items with a proportion of best HRQL scores higher than

20% and seven more items with a proportion exceeding

even 30%. Item discrimination was good for all items.

However, two items had a noticeable number of missing

values: 41 patients did not complete item no. 14 (‘‘both-

ered by a limitation of your diet’’), and 23 patients did not

complete item no. 9 (‘‘trouble lifting or carrying heavy

objects’’).

Analysis on subscale level (Table 3) showed means

ranging between 4.11 and 5.04. No floor effects could be

determined. The subscale ‘‘activity’’ showed the largest

proportion of patients with best HRQL scores. Missing

values on this subscale were mainly related to items no. 14

and 9. For the whole questionnaire, the proportion of

patients with at least one missing item was 23.5%.

Internal consistency coefficients ranged from a = .72

for ‘‘activity’’ to a = .92 for ‘‘fatigue,’’ attaining a = .95

for the whole questionnaire. Retest reliability was also

good, with coefficients between r = .69 and r = .79,

except for the subscale ‘‘fatigue’’ (r = .58).

Table 4 shows the intercorrelations of the CLDQ sub-

scales and correlations of CLDQ subscales with SF-36 and

HADS-D subscales in a multitrait, multimethod matrix.

The CLDQ subscale intercorrelations ranged from r = .42

to r = .76. Regarding correlations of CLDQ subscales with

scales of the other two questionnaires, those hypothesized

as high showed coefficients between r = .69 and r = .85,

whereas correlations hypothesized as low were between

r = .33 and r = .48. The pattern of correlations between

the different subscales of the questionnaires is plausible

with regard to content.

Comparing patients with chronic liver disease with liv-

ing liver donor candidates (Fig. 1) yielded highly

significant differences of mean values (P \ .001) for all

subscales and for the total score.

Factor analyses

CFA revealed an only partial fit of the data: SRMR = .076

fulfilled the recommended cutoff criterion, whereas the

other two goodness-of-fit indices (CFI = .85, RMSEA =

.092) were too far out of range of the recommended cutoff

criteria for retaining a hypothesized model. On the item

level, especially items 16 and 20 in factor ‘‘emotional

function,’’ had particularly low standardized factor

loadings.

Therefore, exploratory factor analysis was conducted. It

revealed six factors with an eigenvalue[1, explaining 70%

of cumulative variance. Most items showed the highest

factor loadings on the original factors (Table 5). The scale

‘‘systemic symptoms’’ proved difficult to reproduce, with

two items (no. 3 and 6) showing higher loadings on more

than one other factor. Item no. 3 (‘‘bodily pain’’) had a

substantially higher loading on ‘‘abdominal symptoms,’’

whereas item no. 6 (‘‘shortness of breath in daily activi-

ties’’) showed almost equally high loadings on ‘‘fatigue’’

and ‘‘abdominal symptoms.’’ The scale ‘‘worry’’ corre-

sponded fully to the original. The factor ‘‘activity,’’

however, which consists of three items, could not be

reproduced at all. Item no. 7 (‘‘not able to eat as much as

you would like’’) and item no. 14 (‘‘bothered by a limita-

tion of your diet’’) had the highest loading on ‘‘abdominal

symptoms’’ and item no. 9 (‘‘trouble lifting or carrying

heavy objects’’) on ‘‘fatigue.’’ A new factor named ‘‘sleep’’

was derived from two items (no. 16 and 20) of the original

subscale ‘‘emotional function.’’

Table 1 Characteristics of patients with chronic liver disease, including subsamples, and of living liver donor candidates

All patients (N=524) Sub-sample for factor analysis (N=401) Retest subsample (N=55) Living liver donors (N=40)

Age: mean/SD 51.5/10.6 51.3/10.7 50.3/11.6 33.3/7.1

Gender: males 306 (58%) 231 (58%) 25 (46%) 22 (55%)

Etiology

Viral 127 (24%) 94 (23%) 15 (27%) –

Alcoholic 242 (46%) 187 (47%) 20 (36%) –

Autoimmunologic 100 (19%) 77 (19%) 17 (31%) –

Other 55 (11%) 43 (11%) 3 (6%) –

SD standard deviation

Qual Life Res

123

Table 6 describes means, SDs, and Cronbach’s a-coef-

ficients for the new subscales according to our factor

analysis, composed of the items with the highest factor

loadings on the respective scale.

Discussion

In the following, first the results of item analysis and

on scale level are discussed. Then, results concerning

reliability and validity including factor analysis are focused

upon. Finally, general considerations and limitations of the

study are addressed.

The Hamburg version of the CLDQ demonstrated a

satisfactory distribution of item and subscale statistics. The

whole range of possible answers was utilized, and although

some items showed ceiling effects (comparatively large

proportion of patients with best HRQL scores), this was not

the case on a subscale level. Note that some patients must

Table 2 Item analysis

CLDQ items (number) Mean SD Floora (%) Ceilinga (%) Item discrimination Missingb

Fatigue

(2) Tired or fatigued 3.63 1.65 11.7 4.2 .84 4

(4) Sleepy during the day 4.02 1.71 8.0 8.2 .78 1

(8) Decreased strength 4.02 1.90 11.6 11.0 .82 8

(11) Decreased level of energy 4.01 1.78 8.1 7.9 .86 5

(13) Drowsy 5.51 1.54 .8 35.8 .64 15

Emotional function

(10) Anxious 5.47 1.62 1.9 38.5 .64 7

(12) Unhappy 4.86 1.78 4.0 22.4 .71 5

(15) Irritable 5.14 1.53 1.6 21.5 .68 8

(16) Difficulty sleeping 4.18 1.99 10.7 15.5 .63 0

(19) Mood swings 4.68 1.62 3.5 12.0 .74 6

(20) Unable to fall asleep 4.35 1.95 6.7 18.7 .62 16

(24) Depressed 4.95 1.65 2.7 21.2 .78 14

(26) Problems concentrating 5.02 1.56 2.1 18.6 .55 7

Worry

(18) Impact on family 3.88 1.83 11.7 11.1 .72 10

(22) Symptoms developing into major problems 4.01 1.82 10.6 11.2 .84 6

(25) Condition getting worse 3.88 1.79 10.0 8.3 .85 6

(28) Never feeling any better 4.68 1.85 5.4 20.6 .68 9

(29) Availability of a liver 4.08 1.96 14.6 14.6 .66 9

Abdominal symptoms

(1) Abdominal bloating 4.34 1.78 8.6 10.5 .42 2

(5) Abdominal pain 5.46 1.65 2.1 37.9 .69 7

(17) Abdominal discomfort 4.84 1.85 5.0 23.8 .67 2

Activity

(7) Not able to eat as much as you would like 5.03 2.00 7.2 33.5 .59 8

(9) Trouble lifting or carrying heavy objects 3.79 2.15 21.0 15.4 .53 23

(14) Limitation of diet 5.39 1.90 5.4 40.0 .50 41

Systemic symptoms

(3) Bodily pain 5.05 1.77 4.9 27.6 .50 9

(6) Shortness of breath in daily activities 5.16 1.83 5.2 34.0 .53 6

(21) Muscle cramps 5.32 1.69 2.9 34.4 .50 7

(23) Dry mouth 4.64 1.85 7.6 18.8 .59 8

(27) Itching 4.97 1.90 5.8 29.0 .43 6

CLDQ, Chronic Liver Disease Questionnaire; SD, standard deviationa Proportion of patients with worst (floor, item score: 1) and best (ceiling, item score: 7) HRQL scoresb Number of patients with item missing

Nmax = 524

Qual Life Res

123

occupy the highest and lowest scores for the questionnaire

to cover the entire impairment range. Floor effects were not

apparent in our sample, perhaps because patients with

strongly impaired health did not feel able to fill out the

questionnaire. Although distribution and item discrimina-

tion proved satisfactory, there were problems regarding

acceptance of two items of the subscale ‘‘activity.’’ Item

no. 14 (‘‘How much of the time during the last two weeks

have you been bothered by a limitation of your diet?’’) was

not completed by 8% of patients, and no. 9 (‘‘How much of

the time during the last two weeks have you had trouble

lifting or carrying heavy objects?’’) was not completed by

4% of patients. Some patients did not feel that the

questions applied to them because they did not have a

limitation of their diet and did not even try to lift or carry

heavy objects, so they refrained from answering the ques-

tions. These items might benefit from an additional

sentence such as: ‘‘If the question does not apply to you,

please chose ‘none of the time’.’’ A considerable propor-

tion of patients with one or more items missing in the

whole questionnaire can be attributed to items no. 9 and 14.

Nevertheless, the proportion in the Hamburg sample is

relatively large compared with other reports of missing

items: 0.4% [7] and 3.4% [9], respectively. Perhaps this is

because our data were gathered over a long period in the

context of the psychological evaluation of patients for liver

Table 3 Analysis on subscale level and reliability

Subscale Mean SD Range Floora (%) Ceilinga (%) Missingb (%) Cronbach’s a Retest reliability

Abdominal symptoms 4.89 1.44 1–7 1.6 12.9 2.1 .76 .71

Fatigue 4.24 1.48 1–7 3.2 5.1 5.9 .92 .58

Systemic symptoms 5.04 1.27 1.2–7 .6 12.0 5.0 .75 .79

Activity 4.72 1.62 1–7 3.1 17.3 13.0 .72 .70

Emotional function 4.83 1.28 1.4–7 .4 9.6 8.2 .89 .69

Worry 4.11 1.56 1–7 5.1 6.9 5.9 .90 .78

Total score 4.62 1.20 1.4–7 .7 5.2 23.5 .95 .76

SD standard deviationa Proportion of patients with worst (floor, score: 1) and best (ceiling, score: 7) HRQL scoresb Proportion of patients with any item missing on the subscale

Nmax = 524

Table 4 Chronic Liver Disease

Questionnaire (CLDQ) subscale

intercorrelations and

correlations of CLDQ subscales

with 36-Item Short Form Health

Survey (SF-36) and Hospital

Anxiety and Depression

Scale—German version

(HADS-D) subscales (multitrait

multimethod matrix)

Correlations hypothesized to be

high are marked in bold.

Correlations hypothesized to be

low are marked in italic

Abdominal

symptoms

Fatigue Systemic

symptoms

Activity Emotional

function

Worry

CLDQ

Abdominal symptoms

Fatigue .63

Systemic symptoms .65 .69

Activity .60 .76 .62

Emotional function .56 .73 .65 .64

Worry .42 .58 .51 .55 .66

SF-36

Physical functioning .40 .64 .56 .69 .52 .48

Role-physical .38 .65 .52 .62 .52 .47

Bodily pain .59 .53 .62 .55 .50 .37

General health .43 .61 .48 .51 .51 .53

Vitality .54 .85 .60 .70 .67 .56

Social functioning .40 .54 .45 .53 .56 .46

Role-emotional .33 .54 .47 .51 .55 .49

Mental health .46 .62 .53 .58 .80 .61

HADS-D

Anxiety -.42 -.52 -.47 -.48 -.69 -.61

Depression -.37 -.54 -.45 -.44 -.69 -.55

Qual Life Res

123

transplantation and not in the setting of a specifically

designed study where patients could get assistance in

completing the questionnaire if necessary. Ferrer et al. [9]

reported that 21% of their patients needed help completing

the questionnaire.

Internal consistency was good to excellent. Retest reli-

ability showed acceptable scores, especially when

considering the comparatively long interval between tests.

Hauser et al. [7] found slightly higher coefficients, but the

time between testing ranged from 3 to 8 days in their

sample. Ferrer et al. [9] applied the second questionnaire

after 2 weeks. Our results indicate that a longer interval for

retesting might also be acceptable. However, HRQL is

likely to worsen during this period. The fact that the sub-

scale ‘‘fatigue’’ shows a lower coefficient (r = .58) is

understandable, because ‘‘fatigue’’ is likely to be the most

unstable factor.

CLDQ subscale intercorrelations were moderate to high,

which on the one hand allows interpretation of the different

subscales but on the other hand also makes a calculation of

a global score appropriate. Whether these interrelations are

due to the structure of the instrument itself or reflect

covariance of the underlying constructs has to be deter-

mined in future studies.

The pattern of correlations between the CLDQ subscales

and SF-36 and HADS-D subscales is further evidence for

the convergent and discriminant validity of this question-

naire. Similar patterns were found by Ferrer et al. [9] and

Hauser et al. [7]. High correlations of three CLDQ sub-

scales (‘‘fatigue,’’ ‘‘activity,’’ ‘‘emotional function’’) with

two SF-36 scales (‘‘vitality,’’ ‘‘mental health’’) and of one

CLDQ subscale (‘‘emotional function’’) with HADS scales

do not contradict the assertion of the CLDQ being a dis-

ease-specific measure. However, studies with patients with

different diseases are necessary to support the disease

specificity of the instrument. Furthermore, three of the

CLDQ subscales (‘‘abdominal symptoms,’’ ‘‘systemic

symptoms,’’ ‘‘worry’’) show only low to moderate corre-

lations with the other questionnaires. Therefore, these

subscales can be regarded as adding specific information to

the generic questionnaires.

The observation that living liver donor candidates show

significantly better HRQL scores than patients with chronic

liver disease confirms the ability of the CLDQ to dis-

criminate between different groups of patients. It should be

noted that living liver donors are significantly younger than

patients with chronic liver disease. However, we believe

that this difference is due to the necessary good health

status of these subjects and that the age difference is of

only subordinate importance. Previous studies show sig-

nificant mean differences according to the severity of liver

disease [3–10] and for patients with and without comorbid

diseases [11]. In this context, additionally we point out that

there is no specific HRQL instrument for living liver

donors. A modified version of the CLDQ may be able to fill

this gap and be applied to this field as well. Because living

donors undergo liver surgery, they may develop postoper-

ative liver-related symptoms. Therefore, the questionnaire

may help to assess the impact of living liver donation and

the course of the recovery process.

Confirmatory factor analysis revealed that the factor

solution reported by Younossi et al. [3] is not sufficiently

reproducible in our sample. Therefore, an exploratory

factor analysis was conducted, yielding in a six-factor

model. The original factor structure in part was con-

firmed; however, the factor ‘‘activity’’ could not be

reproduced and a new factor, ‘‘sleep,’’ was identified. In

their sample of 149 patients, Ferrer et al. [9] were able to

reproduce all original factors. They also found a new

factor, ‘‘sleep,’’ composed of the same two items as in our

0

1

2

3

4

5

6

7

Abdominalsymptoms

Fatiguesymptoms

Activityfunction

Worry

Patients with chronic liver disease Living liver donors

Systemic Emotional Total score

Fig. 1 Mean Chronic Liver

Disease Questionnaire (CLDQ)

scores (with standard

deviations) of patients with

chronic liver disease and of

living liver donors. Higher

scores indicate a better quality

of life

Qual Life Res

123

Hamburg adaptation

Original CLDQ

Factor1

FA

Factor2

WO

Factor3

EF

Factor4

AS

Factor5

SS

Factor6

Sleep

FATIGUE (FA)(2) tired or fatigued .73 .23 .32 .24 .22 .07(4) sleepy during the day .70 .16 .32 .15 .27 .13(8) decreased strength .76 .28 .23 .29 .13 .18(11) decreased level of energy .79 .21 .31 .23 .15 .12(13) drowsy .42 .16 .49 .27 .31 .07EMOTIONAL FUNCTION (EF)(10) anxious .13 .26 .58 .40 .03 .21(12) unhappy .25 .38 .64 .31 -.08 .22(15) irritable .25 .12 .69 .09 .33 .16(16) difficulty sleeping .19 .13 .21 .14 .15 .87(19) mood swings .24 .33 .72 .18 .18 .12(20) unable to fall asleep .15 .18 .22 .09 .16 .88(24) depressed .18 .40 .72 .19 .13 .18(26) problems concentrating .39 .18 .50 .08 .46 .06WORRY (WO)(18) impact on family .19 .77 .27 .12 -.02 .06(22) symptoms developing into major problems .16 .81 .27 .16 .14 .11

(25) condition getting worse .19 .84 .23 .11 .16 .09(28) never feeling any better .17 .63 .35 .14 .32 .15(29) availability of a liver .26 .75 .04 .06 .23 .10ABDOMINAL SYMPTOMS (AS)(1) abdominal bloating .21 .13 .13 .37 .49 -.09(5) abdominal pain .17 .06 .19 .83 .24 .04(17) abdominal discomfort .22 .16 .25 .72 .25 .07ACTIVITY(7) Not able to eat as much as you would like .41 .18 .14 .56 .05 .21

(9) trouble lifting or carrying heavy objects .70 .29 .04 .31 .14 .13

(14) limitation of diet .31 .27 .22 .41 -.09 .24SYSTEMIC SYMPTOMS (SS)(3) bodily pain .35 .05 .21 .57 .28 .05(6) shortness of breath in dailyactivities

.42 .16 .02 .41 .36 .24

(21) muscle cramps .12 .05 .23 .16 .64 .06(23) dry mouth .30 .15 .16 .32 .49 .23(27) itching .09 .25 .02 .09 .68 .20

NOTES: item no. in parentheses; highest factor loadings for each factor are in bold script; factor loadings corresponding to the original factors are marked with a grey background; factor loadings with a black background indicate highest loadings on other factors than the original ones; the factor “activity” could not be reproduced; a new factor “sleep” was found

Table 5 Exploratory factor analysis of the Hamburg version of the Chronic Liver Disease Questionnaire (CLDQ) (N=401)

Qual Life Res

123

analysis. Even though they were able to reproduce the

factor ‘‘activity,’’ they found by far the lowest Cronbach’s

a coefficient for this subscale, and one of the three items

(no. 9) showed a substantially higher factor loading on

‘‘fatigue.’’ In our sample, ‘‘activity’’ also demonstrated

the lowest coefficient of internal consistency and item

no. 9 a high factor loading on ‘‘fatigue.’’ Furthermore, the

factor ‘‘systemic symptoms’’ was difficult to reproduce in

both studies. Ferrer et al. [9] found three items (nos. 3, 6,

23) showing considerably higher loadings on more than

one other factor. In our study, two of these items (nos. 3

and 6) showed multiple loadings on different factors.

Regarding both studies, these items could not be assigned

clearly to any dimension. Considering the factor ‘‘worry,’’

Ferrer et al. [9] identified two items (nos. 22 and 25) that

loaded higher on ‘‘emotional function,’’ although the

loadings on ‘‘worry’’ were also still high. In our sample,

‘‘worry’’ corresponded fully to the original.

Rucci et al. [11] found a five-factor solution (347)

remarkably different from the others but with some simi-

larities. A new factor, ‘‘somatic symptoms,’’ combined all

items of the scales ‘‘abdominal symptoms’’ and ‘‘fatigue,’’

all but one item from ‘‘systemic symptoms,’’ and one item

each from ‘‘activity’’ (no. 9) and ‘‘emotional function’’ (no.

26). Another factor, named ‘‘anxiety/depression,’’ consisted

of four items of the original subscale ‘‘emotional function,’’

and a new factor, ‘‘well-being,’’ combined one item from

‘‘activity’’ (no. 7) and one from ‘‘emotional function’’ (no.

12). In accordance with our findings, the factor ‘‘activity’’

could not be reproduced either. Furthermore, Rucci et al.

[11] also found a factor, ‘‘sleep problems,’’ consisting of the

same items as in our and the Spanish sample [9] plus one

item from ‘‘systemic symptoms’’ (no. 21, ‘‘muscle

cramps’’). They also found a factor, ‘‘worries,’’ corre-

sponding to the original subscale plus one item from

‘‘activity’’ (no. 14). Their relatively different factorial

structure might be explained by the fact that their sample

consisted solely of patients with hepatitis C.

As the compared factorial solutions originated from

different countries, some of the discrepancies could be due

to the linguistic adaptations and the different cultural

backgrounds. Replications of the factor analysis of the

original American version and for the different linguistic

adaptations would help to clarify this question.

We do not yet want to recommend a change of the

subscales derived from the original factorial solution

because a direct comparison between the different versions

would no longer be possible. However, there is evidence

that at least some items should be assigned differently or,

alternatively, be eliminated from the questionnaire because

of multiple loadings. Additional studies are needed to

determine which factorial structure should be favored.

For interested parties, we describe means and SDs for

the new subscales according to our factor analysis

(Table 6). Internal consistency for these subscales is very

good except for the subscale ‘‘systemic symptoms,’’ which

contains the most heterogeneous group of items.

Concerning the comparison with the German adaptation

by Hauser et al. [7], only a few discrepancies regarding the

translation of certain items can be observed. In particular,

translation of the items concerning ‘‘abdominal bloating’’

(no. 1), ‘‘drowsiness’’ (no. 13), and ‘‘limitation of diet’’

(no. 14) differed between the two versions. At present, we

recommend the use of the version described in this paper

because the factorial structure is available, which is based

on a larger sample size. As to differences to the original

American version, until now, the factorial structure repor-

ted by Younossi et al. [3] could not be sufficiently

confirmed in different countries [9, 11]. Further studies are

necessary to clarify whether these differences are due to

linguistic ambiguities or cultural differences in dealing

with the illness [22].

We found the CLDQ to be very useful in the psycho-

logical evaluation of patients for liver transplantation. In

this context, it can be applied before and after liver trans-

plantation and serve as an outcome measure in prospective

studies. It has already shown its responsiveness after liver

transplantation [9, 12] and its ability to provide additional

relevant information to generic HRQL instruments, which

lack sensitivity for liver-specific symptoms [9].

Table 6 Suggested new Chronic Liver Disease Questionnaire (CLDQ) subscales according to factor analysis (N=401)

Subscale Mean Standard deviation Cronbach’s a Items contained (no.)

Fatigue 4.11 1.54 .92 2, 4, 6, 8, 9, 11

Worry 4.07 1.57 .90 18, 22, 25, 28, 29

Emotional function 5.08 1.29 .91 10, 12, 13, 15, 19, 24, 26

Abdominal symptoms 5.14 1.41 .82 3, 5, 7, 14, 17

Systemic symptoms 4.81 1.29 .68 1, 21, 23, 27

Sleep 4.26 1.91 .94 16, 20

Total score 4.58 1.19 .95 1–29

New subscale not contained in the original CLDQ marked in italic

Qual Life Res

123

Our study did have some limitations. First, no compari-

sons according to severity of liver disease (e.g., Child A, B,

and C) could be provided because this data was not avail-

able. However, comparison of different severity groups is

just one way to determine the validity of the instrument, and

several studies [4–10] have already confirmed the CLDQ’s

ability to discriminate between Child classes.

Second, the ability of the CLDQ to discriminate

between patients with chronic liver disease and living liver

donors was expected because the CLDQ has already been

shown to discriminate between different stages of disease

severity. However, this was the first time the CLDQ was

applied to living donors, and therefore, new information is

provided.

In conclusion, these data represent the largest sample of

patients assessed with the CLDQ in the literature. The

results reconfirm the reliability and validity of the CLDQ.

Furthermore, our data confirms the practical administration

of the questionnaire as part of the routine psychological

evaluation of patients with chronic liver disease. Compar-

ing the different factorial solutions, the subscales

‘‘activity’’ and ‘‘systemic symptoms’’ seem to be the most

difficult to reproduce, whereas the fact that all studies were

able to reproduce the factor ‘‘worry’’ emphasizes its

robustness. The factorial structure of the questionnaire

should be subject to international discussion and, if possi-

ble, a consensus should be reached by means of further

factor-analytical studies.

Acknowledgements This study was supported by the Deutsche

Forschungsgemeinschaft (DFG) (SCHU 974/5-1/2). We thank

Susanne Kjer and Stefanie Kraft for their help with data collection.

References

1. Irvine, E. J. (1999) The quality of quality of life. Gut, 44, 450–

451.

2. Younossi, Z. M., & Guyatt, G. (1998). Quality-of-life assess-

ments and chronic liver disease. The American Journal ofGastroenterology, 93, 1037–1041.

3. Younossi, Z. M., Guyatt, G., Kiwi, M., Boparai, N., & King, D.

(1999). Development of a disease specific questionnaire to

measure health related quality of life in patients with chronic liver

disease. Gut, 45, 295–300.

4. Younossi, Z. M., Kiwi, M. L., Boparai, N., Price, L. L., & Guyatt,

G., (2000). Cholestatic liver diseases and health-related quality of

life. The American Journal of Gastroenterology, 95, 497–502.

5. Younossi, Z. M., Boparai, N., Price, L. L., Kiwi, M. L.,

McCormick, M., & Guyatt, G., (2001). Health-related quality of

life in chronic liver disease: The impact of type and severity of

disease. The American Journal of Gastroenterology, 96, 2199–

2205.

6. Atiq, M., Gill, M. L., & Khokhar, N., (2004). Quality of life

assessment in Pakistani patients with chronic liver disease.

Journal of the Pakistan Medical Association, 54, 113–115.

7. Hauser, W., Schnur, M., Steder-Neukamm, U., Muthny, F. A., &

Grandt, D. (2004). Validation of the German version of the

Chronic Liver Disease Questionnaire. European Journal ofGastroenterology & Hepatology, 16, 599–606.

8. Sobhonslidsuk, A., Silpakit, C., Kongsakon, R., Satitpornkul, P.,

& Sripetch, C. (2004). Chronic Liver Disease Questionnaire:

Translation and validation in Thais. World Journal of Gastro-enterology, 10, 1954–1957.

9. Ferrer, M., Cordoba, J., Garin, O., et al. (2006). Validity of the

Spanish version of the Chronic Liver Disease Questionnaire

(CLDQ) as a standard outcome for quality of life assessment.

Liver Transplantation, 12, 95–104.

10. Sumskiene, J., Sumskas, L., Petrauskas, D., & Kupcinskas, L.

(2006). Disease-specific health-related quality of life and its

determinants in liver cirrhosis patients in Lithuania. WorldJournal of Gastroenterology, 12, 7792–7797.

11. Rucci, P., Taliani, G., Cirrincione, L., et al. (2005) Validity and

reliability of the Italian version of the Chronic Liver Disease

Questionnaire (CLDQ-I) for the assessment of health-related

quality of life. Digestive and Liver Disease, 37, 850–860.

12. Younossi, Z. M., McCormick, M., Price, L. L., et al. (2000).

Impact of liver transplantation on health-related quality of life.

Liver Transplantation, 6, 779–783.

13. Zandi, M., Adib-Hajbagheri, M., Memarian, R., Nejhad, A. K., &

Alavian, S. M. (2005). Effects of a self-care program on quality

of life of cirrhotic patients referring to Tehran Hepatitis Center.

Health and Quality Life Outcomes, 3, 35.

14. Sharif, F., Mohebbi, S., Tabatabaee, H. R., Saberi-Firoozi, M., &

Gholamzadeh, S. (2005). Effects of psycho-educational inter-

vention on health-related quality of life (QOL) of patients with

chronic liver disease referring to Shiraz University of Medical

Sciences. Health and Quality Life Outcomes, 3, 81.

15. Ware, J. E., & Sherbourne, C. D. (1992). The MOS 36-Item

Short-Form Health Survey (SF-36). I. Conceptual framework and

item selection. Medical Care, 30, 473–483.

16. Yacavone, R. F., Locke, G. R., Provenzale, D. T., & Eisen, G. M.

(2001). Quality of life measurement in gastroenterology: What is

available? The American Journal of Gastroenterology, 96, 285–

297.

17. Bullinger, M., & Kirchberger, I. (1998). SF-36 Fragebogen zum

Gesundheitszustand (pp. 1–155). Gottingen: Hogrefe.

18. Zigmond, A. S., & Snaith, R. P. (1983). The Hospital Anxiety and

Depression Scale. Acta Psychiatr Scand, 67, 361–370.

19. Herrmann, C., Buss, U., & Snaith, R. P. (1994). HADS-D Hos-pital Anxiety and Depressions Scale—Deutsch version (pp. 1–

43). Bern: Huber.

20. Arbuckle, J. L., & Wothke, W., (1999) Amos 4.0 users’ guide.

Chicago: SmallWaters.

21. Hu, L., & Bentler, P. M. (1999). Cutoff criteria for fit indexes in

covariance structure analysis: Conventional criteria versus new

alternatives. Structural Equation Modeling, 6, 1–55.

22. Guillemin, F., Bombardier, C., & Beaton, D. (1993). Cross-cul-

tural adaptation of health-related quality of life measures:

Literature review and proposed guidelines. Journal of ClinicalEpidemiology, 46, 1417–1432.

Qual Life Res

123