Illustrating Bayesian evaluation of informative hypotheses for regression models
Bayesian model selection for heteroskedastic models
Transcript of Bayesian model selection for heteroskedastic models
BAYESIAN MODEL SELECTION FOR
HETEROSKEDASTIC MODELS
∗Cathy W. S. Chen(1), Richard Gerlach(2), and Mike K. P. So(3)
(1)Department of Statistics, Feng Chia University, Taiwan.
(2)Econometrics and Business Statistics, University of Sydney, Australia.
(3)Information and Systems Management, Hong Kong University of Science & Technology
Abstract
It is well known that volatility asymmetry exists in financial markets. This pa-
per reviews and investigates recently developed techniques for Bayesian estimation
and model selection applied to a large group of modern asymmetric heteroskedastic
models. These include the GJR-GARCH, threshold autoregression with GARCH
errors, threshold GARCH and Double threshold heteroskedastic model with aux-
iliary threshold variables. Further we briefly review recent methods for Bayesian
model selection, such as: reversible jump Markov chain Monte Carlo, Monte Carlo
estimation via independent sampling from each model and importance sampling
methods. Seven heteroskedastic models are then compared, for three long series
of daily Asian market returns, in a model selection study illustrating the preferred
model selection method. Major evidence of nonlinearity in mean and volatility is
found, with the preferred model having a weighted threshold variable of local and
international market news.
Keywords: asymmetric volatility model; Markov chain Monte Carlo; posterior model
probability, parallel sampling.
∗Corresponding author is Cathy W. S. Chen, Department of Statistics, Feng Chia University, Taiwan.
Email: [email protected]; Ph: +886 4 24517250 ext. 4412; Fax: +886 4 24517092.
1 Introduction
The family of autoregressive conditional heteroskedastic (ARCH) models by Engle (1982)
and generalized ARCH (GARCH) by Bollerslev (1986), have become the widely accepted
volatility models. However, they only allow symmetric responses to past shocks. Poon
and Granger (2003) described a list of prominent characteristics or stylized facts for
financial time series. This includes dynamic volatility and volatility persistence, fat tailed
(compared to normality), mean stationary returns and also asymmetric volatility: in
response to past positive and negative returns.
The first asymmetric volatility model was the Exponential GARCH (EGARCH) of
Nelson (1991). Since then numerous models such as the GJR-GARCH of Glosten et
al. (1993), Threshold GARCH (TGARCH) of Zakoian (1994), and quadratic ARCH
(QGARCH) of Sendana (1995) have been proposed. Many of these are based on the
nonlinear threshold autoregressive (TAR) model of Tong (1978). Li and Li (1996) in-
troduced a double threshold ARCH (DT-ARCH) model, to capture volatility and mean
asymmetry, and this was extended to the DT-GARCH by Brooks (2001). Chen, Chiang
and So (2003) proposed an exogenous mean factor and an exogenous threshold variable in
a double threshold DTX-GARCH model, highlighting mean and volatility asymmetries in
response to US market news. Chen and So (2006) proposed a nonlinear GARCH model
that employed a weighted average of local and exogenous factors as the threshold vari-
able, also including t-distributed errors to capture fat-tailed returns. An advantage of a
weighted average threshold is that information from different markets or industries can
be combined to influence the change in regimes. The relative importance of these sources
is then assessed by the estimated weights.
The aforementioned studies have confirmed the asymmetric volatility phenomenon:
volatility in international stock markets is higher following ’bad’ local or international
market news, i.e. negative returns. They have also extended the phenomenon of asym-
metry, finding that volatility persistence, and the spillover effect of international market
news on local mean returns, are higher, and that average returns are often lower, following
bad local or international market news.
Practitioners are increasingly turning to Bayesian methods for the analysis of com-
1
plicated heteroskedastic models. This move seems due to the advent of inexpensive high
speed computers and the development of stochastic integration methodology, especially
Markov Chain Monte Carlo (MCMC) approaches. MCMC is a computationally intensive
simulation method for numerical integration developed in the 1980s, making it possible
to tackle more complex, realistic models and problems. Bayesian methods have been
successfully applied in similar nonlinear GARCH models by Geweke (1994), Bauwens
and Lubrano (1998), Vrontos, Dellaportas and Politis (2000) and many others. These
methods have the further advantage of being valid under the stationarity and positivity
parameter constraints usually required for such models (see Silvapulle and Sen 2004 for
problems with large sample theory under such constraints) and the ability to do joint
finite sample inference on all model parameters, including delay lag and threshold cut-off
level, which are alternately often estimated by information criteria (Li and Li, 1996) or
simply ascribed a value (Brooks, 2001).
However, comparison across models may not proceed in a completely similar fash-
ion, due to the fact that formal model comparison via Bayes factors remains difficult
(Berg, Meyer and Yu 2004). We briefly review three Bayesian model comparison meth-
ods, focusing on a recent sample relevant to heteroskedastic models. These include :
importance sampling to produce posterior model odds ratios, as in Geweke (1995) and
Gerlach, Carter and Kohn (1999), also applied to choose between four competing GARCH
models in Chen and So (2006) and between GARCH and stochastic volatility models in
Gerlach and Tuyl (2006); the reversible jump (RJ) MCMC method of Green (1995), ap-
plied to choose between a symmetric and nonlinear GARCH model in So, Chen and Chen
(2005), between GARCH and DTGARCH in Chen, So and Gerlach (2005) and between a
GARCH and EGARCH model in Vrontos, Dellaportas and Politis (2000); and the recent
direct posterior model probability method of Congdon (2006). All of these methods in-
volve MCMC extensions to allow estimation of either marginal model likelihoods, Bayes
factors or posterior model probabilities directly. Since in most cases, the computation
of marginal likelihoods involves high-dimensional integration, the computational time re-
quired is usually quite long; see the discussion in Carlin and Chib (1995), Chib (1995)
and Godsill (2001). Recently, however, Congdon (2006) introduced a more efficient ap-
proach to compute posterior model probabilities, following on from work in Carlin and
2
Chib (1995), Godsill (2001) and Scott (2002). We discuss these three methods and make
a recommendation on the preferred option in Section 3.
The purpose of this paper is to review model selection methods and then recommend
a method for heteroskedastic models. This method will be applied to choose between
seven competing heteroskedastic models for some financial return data. We thus intend
to find the optimal heteroskedastic model in these financial markets, and in doing so fully
investigate and analyse the apparent nonlinear behaviour in these markets.
The rest of the paper is set out as follows. Section 2 describes several nonlinear
GARCH models, and in particular a general double threshold GARCH model specification
with auxiliary variables. Section 3 discusses the three model comparison methods. Section
4 contains an analysis of returns from three major Asian stock indices, while Section 5
concludes.
2 Threshold Nonlinear Heteroskedastic Models
We consider seven modern heteroskedastic models in this paper, all with t-distributed
errors, with details given in this section.
We consider single and double threshold models, including models with no asymmetry,
only mean asymmetry, only volatility asymmetry and both mean and volatility asymme-
try. The simplest is the ARX-GARCH-t model in So, Chen and Liu (2006) which assumes
an autoregressive (AR) mean process with exogenous (X) variables. Other asymmetric
generalizations in the mean or variance equations only are also in our list. For example, the
ARX-GJR-GARCH-t model in Chen, Gerlach and So (2006), a threshold ARX or TARX-
GARCH-t model (Li and Lam 1995’s model is a particular case) and ARX-TGARCH-t
model in So, Chen and Chen (2005), where ’GJR’ stands for the asymmetric variance
specification proposed by Glosten, Jagannathan and Runkle (1993). Finally three double
threshold models are considered: a DTX-GARCH with exogenous threshold (Chen, Ger-
lach, and So, 2006), a DT-GARCH with local market or self-exciting threshold (Chen, So
and Gerlach, 2005) and a DTX-GARCH whose threshold is a linear combination of the
local and exogenous variables, as in Chen and So (2006).
Building on the work of Tong (1978), Li and Li (1996), Brooks (2001) and Chen,
3
Chiang and So (2003), Chen and So (2006) introduced the following general threshold
heteroskedastic model:
yt = φ(j)0 +
pj∑
i=1
φ(j)i yt−i +
qj∑
k=1
lj∑
i=1
ψ(j)ki xkt−i + at, if rj−1 ≤ zt−d < rj,
at =√
htεt, εti.i.d.∼ D(0, 1),
ht = α(j)0 +
dj∑
i=1
α(j)i a2
t−i +cj∑
l=1
β(j)l ht−l,
where
zt = w1z1t + · · ·+ wmzmt, 0 ≤ wi ≤ 1,m∑
i=1
wi = 1,
for j = 1, . . . , g and D(0, 1) is an i.i.d. error distribution with mean 0 and variance 1. Here
yt are the observed data; ht is the conditional volatility of yt|y1, . . . , yt−1; zt is the threshold
variable; d is a positive integer, commonly referred to as the delay lag; and the number of
regimes is set as g; the model orders pj, qj, dj, cj are non-negative integers. The threshold
values rj satisfy −∞ = r0 < r1 < . . . < rg = ∞, so the intervals [rj−1, rj), j = 1, · · · , gform a partition of the space of the threshold variable zt−d. Empirical evidence in the
literature shows that εt tends to be more fat-tailed than normal, as assumed in Li and Li
(1996), Brooks(2001); e.g. see Chen, Chiang and So (2003). Here a Student-t distribution,
standardised to have unit variance, is fitted to εt to capture this empirical leptokurtosis.
A feature of this model is that the threshold variable zt is a linear combination of
auxiliary variables zit; i = 1, . . . ,m. These can be any function of exogenous xit or
endogenous variables yt, . . . , y1. Some examples are:
1. z1t = yt and zit = xit for i > 1. w1 = 1 and wi = 0 for i > 1 give the models of Li
and Li (1996) and Brooks (2001).
2. z1t = at, instead of yt, where
at =
yt − φ
(j)0 −
pj∑
i=1
φ(j)i yt−i −
qj∑
k=1
lj∑
i=1
ψ(j)ki xkt−i
I(rj−1 ≤ zt−d < rj)
is a function of exogenous variables and yt’s and again w1 = 1 and wi = 0 for i > 1.
The GJR-GARCH model falls into this category.
3. zit = xit, the threshold is a linear combination of exogenous variables only.
4
For identifiability and stationarity, we follow Chen and So (2006) to assume that
φ(j)pj6= 0, the g autoregressive vectors (φ
(j)0 , ..., φ(j)
pj), j = 1, . . . , g, are distinct and
p∑
i=1
maxj|φ(j)
i | < 1,
where p = max{p1, ..., pg} and φ(j)i = 0 for i > pj. We also have the following standard
restrictions on the variance parameters which guarantee positivity and stationary:
α(j)0 > 0, α
(j)i , β
(j)l ≥ 0 and
∑dj
i=1 α(j)i +
∑cj
l=1 β(j)l < 1.
3 Model selection
We consider several competing parametric Bayesian models for the same observation
matrix y1,n. In Bayesian model hypothesis testing, the decision between two models Mi
versus Mj is made by posterior odds ratio:
Choose Mi if PORij > 1 where
PORij =p(Mi|y1,n)
p(Mj|y1,n)=
p(y1,n|Mi)
p(y1,n|Mj)
p(Mi)
p(Mj),
p(Mi) is the prior probability of model Mi and p(y1,n|Mi) is the marginal (or integrated)
likelihood, defined as:
p(y1,n|Mj) =∫
p(y1,n|θj,Mj)p(θj|Mj) dθj, (1)
where θj is the parameter vector for Mj and p(y1,n|θj,Mj), p(θj|Mj) are the sampling
density function and the prior density function under Mj, respectively. This method can
be applied equally well to nested or non-nested models, a clear advantage over frequentist
model selection criteria, e.g. information criteria. Without any prior information on
model choice, the prior odds ratio=1 and the PORij then becomes the well-known Bayes
factor (Kass and Raftery, 1995). When there are more than two models, the model with
highest posterior probability is chosen.
Marginal likelihoods have proven a challenge to estimate in the Bayesian and frequen-
tist literature, see Kass and Raftery (1995) for a review of the main issues. This is mainly
5
because they involve a multi-dimensional integration over the parameter space, as in (1),
which often has high dimension (being the number of parameters in the model), especially
when θj involves latent variables or many exogenous factors. For many modern statistical
models it is not possible to analytically complete this integral. Often the likelihood is not
of a known distributional form in the parameters and even if it was, non-conjugate priors
would make a closed form solution to the integral generally impossible. Kass and Raftery
(1995) recommended using a multivariate Laplacian approximation to the integrand in
(1), but this can be inaccurate for complex multi-parameter models. Thus usually the
integral is done numerically, but it has proven a challenge to do so efficiently and accu-
rately. Standard approaches like adaptive quadrature are usually not sufficient. Monte
Carlo approaches, which usually become MCMC because of the high dimension involved,
are now the common approach. Even then, before Congdon (2006), the suggested ap-
proaches were very difficult to implement and often involved the inverse of the likelihood
function, which can become numerically unstable. We review three recently suggested
methods below.
3.1 Importance sampling
To estimate the marginal likelihood for each model, we firstly consider the importance
sampling MCMC method proposed in Geweke (1995) and Gerlach, Carter and Kohn
(1999). When k ≥ t,
p(yt|y1,t−1,Mj) =
N∑i=1
p(yt|y1,t−1, θ[i]k ,Mj)/p(yt,k|y1,t−1,θ
[i]k ,Mj)
N∑i=1
1/p(yt,k|y1,t−1, θ[i]k ,Mj)
,
where θ[i]k is the ith MCMC iterate from the posterior p(θj|y1,k, Mj). This estimator can
be calculated by running separate MCMC sampling schemes while increasing the sample
size k sequentially, say k = 100, 200,. . . , n, then evaluating p(yt|y1,t−1,Mj) with k− t not
greater than the increments. An estimate of the marginal likelihood is then evaluated as:
p(y1,n|Mj) =n∏
t=1
p(yt|y1,t−1,Mj) ; j = 1, . . . , k.
This method was employed successfully in Chen and So (2006); Gerlach and Tuyl (2006)
used it to compare a range of symmetric and Markov switching GARCH and stochastic
6
volatility models; and Gerlach et al. (2006) to choose between two competing asymmetric
DT-GARCH models. However, the method is quite computationally intensive, in com-
parison to those presented next, since it requires the MCMC sampling scheme to be run
more than once (actually n/100, rounded up, times). Further, since the method involves
an inverse likelihood term in both the numerator and denominator, it can be prone to
numerical difficulties and instability, especially for smaller sample sizes k. A similar ap-
proach was adopted by Osiewalski, Pajor, and Pipien (2006) who compared bivariate
GARCH and stochastic volatility models. They computed the marginal likelihood by the
harmonic mean method of Newton and Raftery (1994):
p(y1,n|Mj) =
{∫
θj
[p(y1,n|θj,Mj)]−1dP (θj|Mj,y
1,n)
}−1
where P (θj|Mj, y1,n) denotes the posterior cumulative distribution function. The authors
pointed out that this method can be computationally very demanding, and we note it can
again be numerically unstable. There is also the possibility that the standard error does
not exist for these estimators. As such, Bayesian comparison of models may not be ideal
under this method.
3.2 Reversible-jump (RJ) MCMC method
We next consider the RJMCMC method of Green (1995) which was employed to choose
between pairs of GARCH models by Vrontos, Dellaportas and Politis (2000), So, Chen
and Chen (2005) and Chen, So and Gerlach (2005). This method adds a model indicator
to the MCMC sampling scheme and then ’jumps’ between the (potentially non-nested)
models whilst maintaining the ’detailed balance’ conditions ensuring convergence of the
Markov chain.
To apply the RJMCMC method we must choose prior specifications, a jumping rule
and proposal distributions. We consider jumps between Models Mi and Mj, requiring a
one-to-one bijective transformation between the two models. In general this bijection can
be quite complex, however, we define ui = θj and uj = θi, thus implying a simple trans-
formation Jacobian of |∂(θj ,uj)
∂(θi,ui)| = 1; ensuring the necessary condition that the dimensions
of (θi, ui) and (θj, uj) are the same. i.e. d(θi) + d(ui) = d(uj) + d(θj). Such a Jacobian
7
was employed by Vrontos, Dellaportas and Politis (2002), though others are possible. The
detailed process is as follows:
Step 1 : Simulate a proposal θj from a proposal density qi(ui);
Step 2 : The jump to Mj is accepted with the probability min{1, ℘} where
℘ =L(y1,n|Mj,θj)Π(θj|Mj)p(Mj)J(Mi, Mj)qj(uj|θj)
L(y1,n|Mi, θi)Π(θi|Mi)p(Mi)J(Mj, Mi)qi(ui|θi)
∣∣∣∣∣∂(θj, uj)
∂(θi, ui)
∣∣∣∣∣ ; (2)
If the jump is not accepted, stay in Mi and update the parameters θi.
The term L(y1,n|Mj,θj) is the likelihood for model Mj, Π(θj|Mj) is the prior distribu-
tion and p(Mj) is the prior probability for each model. We have generally set the prior
probability of a jump from model Mi to Mj to one, i.e. J(Mi, Mj) = 1 allowing jumps
at each MCMC iteration and p(Mi) = 0.5 to reflect prior model ignorance (again other
choices are viable); the reversible jump acceptance probability for such a move can then
be reduced to
min
(1,
L(y1,n|Mj, θj)Π(θj|Mj)qj(uj)
L(y1,n|Mi,θi)Π(θi|Mi)qi(ui)
),
with the kernels qi and qj being independent of θi and θj, respectively. The posterior
model probability estimate for Mj is the proportion of times the MCMC sample chose
model Mj. This method has enjoyed recent popularity in the literature, but often ap-
plied to choose between two models only. There are also many technical issues in its
implementation. The most important of these concerns the choice of proposal densities.
The method can be quite sensitive to this choice, especially in more complex models
such as those considered here, and the complexity and difficulty in implementation can
increase significantly when considering more than two models at one time. As such we
will recommend and employ in this paper the final method considered below, by Congdon
(2006).
3.3 Direct posterior model probability estimation
This method, proposed by Congdon (2006) following Carlin and Chib (1995), Godsill
(2001) and Scott (2002), again uses a Monte Carlo, but this time direct, approximation
8
to the posterior probability for each model. There is no increase in complexity for this
method as the number of models increases beyond two.
We consider K competing models for the same observation matrix y1,n, with MCMC
samples {θ(j) = (θ(j)1 , . . . , θ
(j)K ), j = M +1, N} from p(θ|y1,n) = ΠK
k=1p(θk|y1,n). A Monte
Carlo estimate of p(Mi|y1,n) is obtained as
p(Mi|y1,n) =N∑
j=M+1
p(Mi|y1,n,θ(j))/(N −M),
where
p(Mi|y1,n,θ(j)) ∝ p(y1,n|θ(j)i ,Mi)p(θ
(j)i |Mi)p(Mi), (3)
with θi are the parameters from model i; θ(j)i is the jth MCMC iterate from the posterior
distribution of model i; p(Mi) is the prior probability in favour of model i; p(Θi|Mi) is
the prior distribution for model i and p(y1,n|θ(j)i ,Mi) is the likelihood function for model
Mi. The normalising factor in (3) is the sum of the total model probabilities:
G(j)k = p(y1,n|θ(j)
k ,Mk)p(θ(j)k |Mk)p(Mk).
We could set p(Mk) = 1/K to reflect prior model ignorance. For numerical efficiency, the
calculation in (3) in practice is based on scaled versions of the likelihoods. The scaling
employs the maximum L(j)max of the model log-likelihoods at each iteration and subtracts it
from each model likelihood, i.e. = L(j)k −L(j)
max. Exponentiating the scaled log-likelihoods
then gives the scaled total model likelihoods G(j)k , so that:
P (Mi|y1,n,θ(j)) = G(j)i /
K∑
k=1
G(j)k .
As noted by Congdon (2006), ’while parallel sampling of the models is not necessary, such
a form of sampling facilitates model averaging and assessing the impact of individual
observations on the overall estimated Bayes factor.’ In summary, the estimator can be
formulated as:
p(Mi|y1,n) =∫
p(Mi,θi|y1,n)dθi =∫
P (Mi|θi, y1,n)p(θi|y1,n)dθi
≈ 1
N −M
N∑
j=M+1
P (Mi|y1,n,θ(j)i )
=1
N −M
N∑
j=M+1
p(y1,n|θ(j)i ,Mi)p(θ
(j)i |Mi)p(Mi)
∑Kk=1 p(y1,n|θ(j)
k ,Mk)p(θ(j)k |Mk)p(Mk)
.
9
This method is far simpler and less computationally intensive to employ than either
importance sampling or RJMCMC. It simply requires individual independent or parallel
MCMC samples from each model to be compared.
4 Empirical study
We illustrate the MCMC and model selection methods using the daily Nikkei 225 index,
Hang Seng index (HSI) and the Taiwan stock market index (TAIEX), all obtained from
Datastream International, from January 4, 1996, to October 17, 2006. We employ the
US S&P500 composite index return as an exogenous variable in the mean equation in
all models, and as a potential auxiliary variable for the threshold models. All subsequent
analyses are performed on the daily log returns, yt=(log pt − log pt−1) × 100, where pt
is the price index at time t. To understand the characteristics of the data, summary
statistics and time series plots are provided in Table 1 and Figure 1, respectively. The
seven competing heteroskedastic models are described as follows:
Table 1: Summary statistics: stock index returns for the period January 4, 1996, to October17, 2006.
Excess Normalityn Mean Std Skewness Kurtosis Min. Max. test*
Nikkei 225 2733 -0.0079 1.4170 -0.0374 1.9827 -7.2339 7.66048 448.29(0.0000)
Hang Seng 2594 0.0144 1.6540 0.1540 11.7140 -14.7347 17.2471 14841.16(0.0000)
TAIEX 2568 0.0076 1.5968 -0.1216 2.5147 -9.9360 8.5198 682.94(0.0000)
SP500 2733 0.02899 1.1163 -0.1053 3.4621 -7.1127 5.5732 1369.95(0.0000)
*Jarque-Bera (1987) normality test.
Model 1 is the ARX-GARCH-t model.
yt = φ0 + φ1yt−1 + ψ1xt−1 + at,
at =√
htεt, εt ∼ tν ,
10
Returns of Nikkei 225
0 500 1000 1500 2000 2500
−5
05
Returns of Hang Seng Index
0 500 1000 1500 2000 2500
−15
−10
−5
05
1015
Returns of TAIEX
0 500 1000 1500 2000 2500
−10
−5
05
Figure 1: Daily returns for the period January 4, 1996, to October 17, 2006.
ht = α0 + α1a2t−1 + β1ht−1.
Model 2 is the ARX-GJR-GARCH model. The mean equation is the same as Model 1.
The volatility is given as follows:
ht = α0 + (α1 + γS−t−1)a2t−1 + β1ht−1,
where S−t−1 =
1 if at−1 ≤ 0,
0 if at−1 > 0.
Model 3 is the TARX-GARCH model. The volatility equation is the same as Model 1.
The mean equation is given as follows:
yt =
φ(1)0 + φ
(1)1 yt−1 + ψ
(1)1 xt−1 + at, yt−d ≤ r
φ(2)0 + φ
(2)1 yt−1 + ψ
(2)1 xt−1 + at, yt−d > r.
11
Model 4 is the ARX-TGARCH model. The mean equation is the same as Model 1. The
volatility is asymmetry follows
ht =
α(1)0 + α
(1)1 a2
t−1 + β(1)1 ht−1, yt−d ≤ r
α(2)0 + α
(2)1 a2
t−1 + β(2)1 ht−1 yt−d > r.
Model 5 is the DT-GARCH-t with an exogenous threshold variable.
yt =
φ(1)0 + φ
(1)1 yt−1 + ψ
(1)1 xt−1 + at, xt−d ≤ r
φ(2)0 + φ
(2)1 yt−1 + ψ
(2)1 xt−1 + at, xt−d > r,
at =√
htεt, εt ∼ tν
ht =
α(1)0 + α
(1)1 a2
t−1 + β(1)1 ht−1, xt−d ≤ r
α(2)0 + α
(2)1 a2
t−1 + β(2)1 ht−1, xt−d > r.
Model 6 is the DT-GARCH-t with domestic threshold variable. The model is the same
as Model 5 except the threshold variable is yt−d instead of xt−d.
Finally, Model 7 is DT-GARCH-t with a weighted threshold variable.
yt =
φ(1)0 + φ
(1)1 yt−1 + ψ
(1)1 xt−1 + at, zt−d ≤ r
φ(2)0 + φ
(2)1 yt−1 + ψ
(2)1 xt−1 + at, zt−d > r,
at =√
htεt, εt ∼ tν
ht =
α(1)0 + α
(1)1 a2
t−1 + β(1)1 ht−1, zt−d ≤ r
α(2)0 + α
(2)1 a2
t−1 + β(2)1 ht−1, zt−d > r,
where zt = w1yt + w2xt, 0 ≤ wi ≤ 1, w1 + w2 = 1.
Models 3 and 4 are special cases of Model 6. Model 5 is similar to Model 6 except
that yt−d is replaced by xt−d. Model 7 is from Chen and So (2006) which formulates the
threshold variable as a linear combination of auxiliary variables. These are all asymmetric
models, with the final two being double threshold models, one with local or domestic
market return threshold and one whose threshold is a weighted combination of the local
and US market returns.
4.1 Estimation results
We implemented diagnostic tools, MCMC trace plots (or history plots) and autocorrela-
tion plots of MCMC iterates to check convergence for each model, which seemed to be
12
satisfied in each case after a few hundred iterations. All delay lags were chosen as 1 with
high probability (close to 1). All other parameter estimates for Models 1 to 7 are given
in Tables 2-4 for the Nikkei, Hang Seng and TAIEX respectively.
Table 2: Estimates of the parameters for the Nikkei 225 for each model; standard errors are inparentheses.
M1 M2 M3 M4 M5 M6 M7
φ(1)0 0.0150 -0.0028 -0.0882 0.0111 -0.1678 -0.0681 -0.3086
(0.0212) (0.0211) (0.0910) (0.0220) (0.1019) (0.0999) (0.1320)φ
(1)1 -0.0667 -0.0641 -0.0768 -0.0637 -0.0732 -0.1083 -0.2087
(0.0184) (0.0180) (0.0346) (0.0186) (0.0567) (0.0596) (0.0633)ψ
(1)1 0.4281 0.4244 0.4552 0.4253 0.4310 0.4131 0.3769
(0.0233) (0.0216) (0.0509) (0.0218) (0.0679) (0.0421) (0.0604)α
(1)0 0.0225 0.0304 0.0224 0.1256 0.2285 0.1883 0.2507
( 0.0079 ) (0.0090) (0.0080 ) (0.0338) (0.0656) (0.0457) (0.0546)α
(1)1 0.0730 0.0384 0.0744 0.0831 0.0542 0.0883 0.0659
(0.0108) (0.0105) (0.0107) (0.0221) (0.0248) (0.0191) (0.0182)β
(1)1 0.9170 0.9069 0.9158 0.9013 0.9017 0.8868 0.9064
(0.0121) (0.0128) (0.0122) (0.0266) (0.0419) (0.0304) (0.0314)γ 0.0799
( 0.0192)φ
(2)0 0.0628 0.0418 -0.0135 0.0306
(0.0332) (0.0231) (0.0296) (0.0274)φ
(2)1 -0.0847 -0.0815 -0.0518 -0.0794
(0.0253) (0.0235) (0.0301) (0.0259)ψ
(2)1 0.4091 0.4167 0.4336 0.3984
(0.0431) (0.0371) (0.279) (0.0300)α
(2)0 0.0090 0.0145 0.0111 0.0196
(0.0078) (0.0085) (0.0083) (0.0092)α
(2)1 0.0660 0.0728 0.0470 0.0465
(0.0118) (0.0122) (0.0115) (0.0112)β
(2)1 0.9060 0.9058 0.9023 0.9018
(0.0140) (0.0145) (0.0127) (0.0137)r -0.3718 -0.3958 -0.8935 -0.5626 -0.5844
(0.1930) (0.0702) (0.0555) (0.1175) (0.0433)ν 9.6435 10.0294 9.4794 10.6876 10.8622 10.7217 11.8812
(1.7018 ) (1.8858) (1.6779) (2.1688) (2.2513) (2.21.74) (2.7948)w1 0.5831
(0.0689)d 1 1 1 1 1
These models and estimates combine to describe the features of the mean and volatility
of these data sets. In particular Models 1 to 7 all agree that the mean persistence in the
13
Table 3: Estimates of the parameters for the Hang Seng index for each model; standard errorsare in parentheses.
M1 M2 M3 M4 M5 M6 M7
φ(1)0 0.0372 0.0283 0.0171 0.0376 0.0104 0.0206 -0.0585
(0.0206) (0.0206) (0.0661) (0.0214) (0.1599) (0.0542) (0.0911)φ
(1)1 -0.0139 -0.0102 -0.0701 -0.0106 -0.0680 -0.0862 -0.1425
(0.0181) (0.0176) (0.0431) (0.0189) (0.0509) (0.0380) (0.0472)ψ
(1)1 0.5234 0.5156 0.5582 0.5286 0.5606 0.5572 0.5562
(0.0229) (0.0223) (0.0467) (0.0236) (0.0991) (0.0446) (0.0568)α
(1)0 0.0120 0.0160 0.0120 0.0966 0.1594 0.0722 0.1033
(0.0044) (0.0052) (0.0044) (0.0281) (0.0491) (0.0294) (0.0339)α
(1)1 0.0655 0.0401 0.0660 0.1029 0.1160 0.1020 0.1082
(0.0108) (0.0108) (0.0107) (0.0222) (0.0310) (0.0210) (0.0280)β
(1)1 0.9294 0.9246 0.9289 0.8894 0.8701 0.8902 0.8805
(0.0114) (0.0119) (0.0114) (0.0241) (0.0333) (0.0222) (0.0306)γ 0.0545
(0.0169)φ
(2)0 0.0253 0.0537 0.0270 0.0365
(0.0352 ) (0.0228) (0.0256) (0.0232)φ
(2)1 0.0255 -0.0039 0.0191 0.0171
(0.0296) (0.0194) (0.0228) (0.0217)ψ
(2)1 0.4858 0.4973 0.5026 0.5125
(0.0467) (0.0333) (0.0319) (0.0280)α
(2)0 0.0061 0.0112 0.0071 0.0111
(0.0047) (0.0056) (0.0048) (0.0055)α
(2)1 0.0580 0.0607 0.0401 0.0454
(0.0130) (0.0125) (0.0119) (0.0122)β
(2)1 0.9160 0.9172 0.9395 0.9343
(0.0138) (0.0142) (0.0139) (0.0135)r -0.1322 -0.4281 -0.8831 -0.4853 -0.7037
(0.3607) (0.0922) (0.0890) (0.1901) (0.0865)ν 8.1804 8.3535 8.0297 8.6639 8.5556 8.5865 8.4692
(1.2913) (1.3853) (1.2269) (1.4695) (1.4675) (1.4759) (1.4476)w1 0.6857
(0.1648)d 2 1 1 2 2
Nikkei is small, but significant, and negative. This parameter is again always negative,
but only significant for models 6-7, for the Hang Seng index, while it is never significant
for the TAIEX, but it is largest in magnitude for Models 6-7 again. There does not seem
to be a strong asymmetric effect for the AR parameter in these markets for models 3,
5-7. These mostly negative estimates and symmetric effects are in agreement with results
14
Table 4: Estimates of the parameters for the TAIEX for each model; standard errors are inparentheses.
M1 M2 M3 M4 M5 M6 M7
φ(1)0 0.0310 0.0183 0.0003 0.0324 -0.1548 -0.1648 -0.2169
(0.0245) (0.0240) (0.0883) (0.0245) (0.0743) (0.1331) (0.1608)φ
(1)1 0.0139 0.0179 -0.0362 0.0146 0.0100 -0.0817 -0.1020
(0.0191) (0.0190) (0.0465) (0.0185) (0.0520) (0.0617) (0.0675)ψ
(1)1 0.3379 0.3329 0.3396 0.3404 0.3525 0.3096 0.3044
(0.0263) (0.0253) (0.0462) (0.0258) (0.0545) (0.0516) (0.0607)α
(1)0 0.0197 0.0275 0.0208 0.1493 0.2522 0.2786 0.3081
(0.0077) (0.0109) (0.0084) (0.0462) (0.0943) (0.0635) (0.0720)α
(1)1 0.0622 0.0360 0.0631 0.0586 0.0577 0.0571 0.0540
(0.0109) (0.0113) (0.0107) (0.0250) (0.0248) (0.0178) (0.0182)β
(1)1 0.9317 0.9247 0.9304 0.9271 0.9168 0.9321 0.9333
(0.0120) (0.0156) (0.0119) (0.0269) (0.0343) (0.0213) (0.0218)γ 0.0593
(0.0174)φ
(2)0 0.0597 0.0687 0.0256 0.0279
(0.0682 ) (0.0282) (0.0289) (0.0282)φ
(2)1 0.0141 0.0039 0.0219 0.0241
(0.0460) (0.0226) (0.0280) (0.0279)ψ
(2)1 0.3239 0.3355 0.3501 0.3456
(0.0479) (0.0282) (0.0297) (0.0310)α
(2)0 0.0105 0.0182 0.0257 0.0302
(0.0088) (0.0111) (0.0124) (0.0131)α
(2)1 0.0683 0.0687 0.0461 0.0482
(0.0168) (0.0139) (0.0121) (0.0129)β
(2)1 0.9086 0.9075 0.9019 0.9002
(0.0177) (0.0166) (0.0167) (0.0177)r -0.1623 -0.4552 -0.8057 -0.7798 -0.8087
(0.5437) (0.0859) (0.1536) (0.0399) (0.0868)ν 6.5824 6.9101 6.5274 6.7973 6.7648 7.2934 7.3345
(0.8440) (0.9966) (0.8524) (0.9424) (0.9157) (1.0402) (1.1065)w1 0.9115
(0.0719)d 1 2 2 1 1
in Li and Lam (1995) and Chen and So (2006). The mean spillover effect from the US
return is strong, positive and highly significant, in all regimes, models and markets, but
again seems not to be asymmetric. The volatility persistence is strong and positive (i.e.
α1 + β1 ≈ 1) in all regimes across all models and markets, agreeing with the well-known
empirical fact, but again quite symmetric in effect.
15
Asymmetric effects also seem strong and clear in all nonlinear models in each market.
For the Nikkei and TAIEX indices, models with mean asymmetry (Models 3, 5 - 7) clearly
indicate that the mean return is lower and negative following bad news from the US or the
domestic market (or combined as in Model 7), with higher and mostly positive average
returns following good news. This is because φ(1)0 < φ
(2)0 ; φ
(1)1 ≈ φ
(2)1 ; ψ
(1)1 ≈ ψ
(2)1 in Models
3 and Models 5 to 7 in these two markets. This effect is not apparent in Hong Kong,
however. The threshold cut-offs for Models 3 to 7 were all estimated between -0.37 and
-0.9 for Nikkei; -0.13 and -0.88 for Hang Seng and -0.16 to -0.81 for Taiwan. These are
almost all significantly lower than the usual value of 0 employed in similar studies. i.e.
bad news is not ’financially significant’ until a threshold somewhat below zero and closer
to -0.5% (on average).
Asymmetric volatility was also very strong and clear across markets. In particular
volatility levels are significantly higher following bad news, as highlighted by γ > 0 in
Model 2 and α(1)0 > α
(2)0 ; α
(1)1 ≈ α
(2)1 ; β
(1)1 ≈ β
(2)1 in Models 4 to 7. It seems the asymmetric
effects are clear and apparent in the mean and variance intercept terms, but perhaps not
present at all in the other parameters, except possibly in the ARCH terms α1 in the Hang
Seng market. This is highlighted in Figures 2 and 3, showing boxplots of the MCMC
iterates for the mean parameters (Figure 2) and the volatility parameters (Figure 4)
from Model 7. From left to right, the first three boxplots are the parameters in regime
1, while the next three are from regime 2. It is quite apparent that the only clearly
significant differences between regimes are in the intercept parameters φ0 and α0, except
for differences in α1 for the Hang Seng market.
For Model 7 the weight chosen for the US return in the weighted threshold variable
is 0.58 (Nikkei) and 0.69 (Hang Seng), but these are not significantly different from 0.5.
The weight for the TAIEX was estimated to be 0.91 and significantly different to 0.5.
We should not interpret these weights too exactly, since the two threshold variables have
slightly different variances (although both on the same scale of percentage returns it is
debatable as to whether these returns should be standardised in the threshold variable,
we chose not to), however it seems each market has ’close’ to an equal effect on this
threshold variable that drives asymmetric nonlinear behaviour in Japan and Hong Kong,
but that the local TAIEX has a far stronger influence than the US in driving asymmetric
16
Figure 2: Boxplots for parameter estimates of (φ(1)0 , φ
(1)1 ψ
(1)1 , φ
(2)0 , φ
(2)1 ψ
(2)1 ) in Model 7 for the
Nikkei market.
behaviour in Taiwan, again around a significantly negative threshold value of about -0.8%.
Finally, all degrees of freedom parameters are estimated between 6 and 12 across markets
and models, far below what would be expected from normality and justifying the use of
t-distributed residuals.
4.2 Model selection results
We apply the method of Congdon (2006) to estimate posterior model probabilities for all
seven models considered in each market. These are shown in Table 5 and Figure 4. For
exposition, two additional scenarios are shown for Japan, where only the first five and
17
Figure 3: Boxplots for parameter estimates of (α(1)0 , α
(1)1 , β
(1)0 , α
(2)0 , α
(2)1 , β
(2)0 ) in Model 7 for the
Nikkei market.
then the first six models are considered. The same MCMC iterates are used in all three
scenarios. We first summarize what we get from the analysis of the Nikkei market.
Considering the first scenario, as shown in the first row of Table 5, Models 1 to 5
are compared via their estimated posterior probabilities under Congdon’s method. In
contrast to results in Chen, Gerlach and So (2006), the GJR-GARCH model, with local
shock threshold, threshold value of 0 and a linear mean equation, is preferred to the
double threshold DTGARCH model with exogenous threshold and potentially non-zero
threshold value. We note that in Chen, Gerlach and So (2006) there was no exogenous
term in the mean equation of the GJR model. Its inclusion in this study may partly explain
18
Table 5: Posterior probabilities for each model across the three markets.M1 M2 M3 M4 M5 M6 M7
Nikkei 225 5 candidates 0.2166 0.4076 0.1464 0.0462 0.1832Nikkei 225 6 candidates 0.1158 0.2415 0.0793 0.0221 0.1018 0.4396Nikkei 225 7 candidates 0.0637 0.1432 0.0437 0.0100 0.0492 0.2699 0.4203
Hang Seng 7 candidates 0.1328 0.2108 0.0865 0.0178 0.0597 0.2848 0.2077
TAIEX 7candidates 0.0585 0.1642 0.0513 0.0177 0.0297 0.3307 0.3479
our result, but also not exactly the same data series have been used. The DTGARCH
and symmetric GARCH are favoured next, followed closely by the TARX-GARCH, with
little weight given to the ARX-TGARCH model. This suggests that there is a strong
asymmetric ARCH effect, as described in the GJR model, driven by local market shocks.
Figure 4: The estimated posterior model probabilities for the Nikkei market, over three scenar-
ios.
Next we add the DTGARCH model with local threshold variable. While the GJR-
GARCH is still favoured as 2nd in this list, the DTGARCH is clearly the superior model
for this dataset. Little weight is given to the other four models. Again we see a strong
nonlinear local market effect, but this time the (only) model allowing for both asymmetry
19
in mean and volatility, in response to local market returns, is clearly favoured. Again we
note that a US exogenous mean effect is included in the DTGARCH model with local
threshold, again explaining our differing result from that in Chen et al (2006).
Finally we add Model 7 which allows a weighted combination of US and local returns
as the threshold. While some weight is still given to the GJR-GARCH model, the battle
is now clearly between the DTGARCH with local threshold and the DTGARCH with
weighted threshold. Model 7 (weighted threshold) wins out with posterior probability
0.42, compared to 0.27 for Model 6. The data clearly favour nonlinearity in mean and
volatility being driven by the weighted threshold variable. This makes sense from a
financial viewpoint, since this threshold uses all the news information available, i.e. local
and US returns, from the previous trading day. For the Hang Seng index and TAIEX,
we find supporting evidence that again M6 and M7 are preferable. The marginally best
model for HSI is M6, indicating that the domestic return threshold variable is slightly
more appropriate than a weighted average threshold variable. Although we find very
marginal superior performance of M7 to M6 for TAIEX, the difference is small. This is
not surprising as ω1 is so close to one in M7 that these models are almost the same in
practical applications.
5 Conclusion
We briefly review in this paper three computational methods for estimating marginal
likelihoods and highlight the advantages of Congdon (2006)’s approach for heteroskedastic
models. The proposed approach is a special case of composite space MCMC sampling
where jumps between models are not involved and so only evidence on the model ratios
need to be accumulated. The construction of proposal densities for RJMCMC are also
avoided in this approach. Seven asymmetric volatility models, which differ in the mean
structure, in the volatility structure or in the construction of the threshold variable, are
evaluated. Our results show that the weighted average threshold formulation introduced
in Chen and So (2006) is preferred for the Nikkei 225 and the TAIEX market data with
a local threshold and double threshold model also scoring highly. This latter model is
preferred for the Hang Seng index. These results can initiate new discussion on how
20
to combine information from different sources in determining regimes for asymmetric
heteroskedastic models. It is easy to apply the proposed method to compare complex
models and to the comparison of non-nested models. All the methods discussed here
can also be applied to other volatility models, like Markov switching GARCH models
and stochastic volatility models, retaining the advantages discussed. Choice of auxiliary
variables in Chen and So (2006)’s specification and the determination of the number of
regimes (beyond two) in threshold processes are important topics for further research.
Acknowledgment: C. W. S. Chen acknowledges research support from National Science
Council of Taiwan grant 95-2118-M-035-001.
References
Bauwens, L. and Lubrano, M. (1998) Bayesian inference on GARCH models using Gibbs
sampler, Econometrics Journal, 1, 23-46.
Berg, A., Meyer, R., and Yu, J. (2004) Deviance Information Criterion for Comparing
Stochastic Volatility Models. Journal of Business and Economic Statistics, 22, 107-
120.
Bollerslev, T. (1986) Generalized autoregressive heteroskedasticity. Journal of Econo-
metrics, 31, 307-327.
Brooks, C. (2001) A double-threshold GARCH Model for the French Franc/Deutschmark
Exchange Rate. Journal of Forecasting, 20, 135-143.
Carlin, B. P. and Chib, S. (1995) Bayesian model choice via Markov chain Monte Carlo.
Journal of the Royal Statistical Society, Series B, 57, 473-484.
Chen, C. W. S., Chiang, T. C. and So, M. K. P. (2003) Asymmetrical Reaction to
US Stock-return News: Evidence from Major Stock Markets Based on a Double-
Threshold Model. The Journal of Economics and Business. Special issue on Glob-
alization in the New Millennium: Evidence on Financial and Economic Integration,
55, 487-502.
21
Chen, C. W. S., Gerlach, R., and So, M. K. P. (2006) Comparison of non-nested asym-
metric heteroscedastic models, Computational Statistics & Data Analysis. Special
issue on Nonlinear Modelling & Financial econometrics, 51, 2164-2178.
Chen, C. W. S. and So, M. K. P. (2006) On a threshold heteroscedastic model. Interna-
tional Journal of Forecasting, 22, 73-89.
Chen, C. W. S., So, M. K. P. and, Gerlach, R. H.(2005) Assessing and testing for thresh-
old nonlinearity in stock returns. Australian & New Zealand Journal of Statistics,
47, 473-488.
Chib, S. (1995) Marginal likelihood from the Gibbs output. Journal of the American
Statistical Association, 90, 1313-1321.
Congdon, P. (2006) Bayesian model choice based on Monte Carlo estimates of posterior
model probabilities. Computational Statistics and Data Analysis, 50, 346-357.
Engle, R. F. (1982) Autoregressive, conditional heteroskedasticity with estimates of the
variance of United Kingdom inflation. Econometrica, 50, 987-1007.
Gerlach, R., Carter, C. K., and Kohn, R. (1999) Diagnostics for time series analysis,
Journal of Time Series Analysis, 20, 309-330.
Gerlach, R., Chen, C.W.S., Lin, S.Y., and Huang, M.H. (2006) Asymmetric responses
of international markets to trading volume’, Physica A: Statistical Mechanics and
its Applications, 360, 422-444.
Gerlach, R. and Tuyl, F. (2006) MCMC Methods for Comparing Stochastic Volatility
and GARCH Models, International Journal of Forecasting, 22, 91-107.
Glosten, L., Jagannathan, R., & Runke, D. (1993) On Relationship between the expected
value and the volatility of the nominal excess return on stocks. Journal of Finance,
487, 1779-1801.
Geweke J. (1995) Bayesian comparison of econometric models. Working Paper 532.
Research Department, Federal Reserve Bank of Minneapolis.
22
Geweke, J. (1996) Variable Selection and Model Comparison in Regression, in Bayesian
Statistics V, eds. J. Bernardo, J. Berger, A. Dawid, and A. Smith, Oxford: Oxford
University Press, 609-620.
Godsill, S. J. (2001) On the relationship between Markov chain Monte Carlo methods
for model uncertainty. Journal of Computational and Graphical Statistics, 10, 1-19.
Green, P.J. (1995) Reversible jump MCMC computation and Bayesian model determi-
nation. Biometrika, 82, 711-732.
Kass and Raftery (1995) Bayes Factors, Journal of the American Statistical Association,
90, 773-795.
Li, W. K. and Lam K. (1995) Modelling asymmetry in stock returns by a threshold
ARCH model. The Statistician 44, 333-341.
Li, C. W. and Li, W. K. (1996) On a double-threshold autoregressive heteroscedastic
time series model. Journal of Applied Econometrics, 11, 253-274.
Nelson, D. B. (1991) Conditional heteroskedasticity in asset returns: A new approach.
Econometrica, 59, 347- 370.
Newton M. A. and Raftery A. E., (1994) Approximate Bayesian inference by the weighted
likelihood bootstrap (with discussion), Journal of the Royal Statistical Society, Se-
ries B, 56, 3-48.
Osiewalski, Pajor, and Pipien (2006) Bayesian comparison of bivariate GARCH and SV
models, Acta Universitatis Lodziensis - Folia Oeconomica, forthcoming.
Poon, S.H. and Granger, C.W.J. (2003) Forecasting volatility in financial markets: A
review. Journal of Economic Literature, 41, 478-539.
Scott, S. (2002) Bayesian methods for hidden Markov models: recursive computing in
the 21st Century, Journal of the American Statistical Association, 97, 337-351.
Sentana, E. (1995) Quadratic ARCH models. Review of Economic Studies, 62, 639-661.
23
Silvapulle, M. J. and Sen, P. K. (2004) Constrained Statistical Inference: Inequality,
Order, and Shape Restrictions, Wiley, New York.
So, M. K. P., Chen, C. W. S., and Chen, M. T. (2005) A Bayesian threshold nonlinearity
test in financial time series, Journal of Forecasting, 24, 61-75.
So, M. K. P., Chen, C. W. S., and Liu, F.C. (2006) Best subset selection of autore-
gressive models with exogenous variables and generalized autoregressive conditional
heteroscedasticity errors, Journal of the Royal Statistical Society, Series C, 55, 201-
224.
So, M. K. P. and Li, W. K. (1999) Bayesian unit-root testing in stochastic volatility
models. Journal of Business & Economic Statistics, 17, 491-496.
Tong, H. (1978) On a Threshold Model in Pattern Recognition and Signal Processing,
ed. C. H. Chen, Sijhoff & Noordhoff: Amsterdam.
Vrontos, I. D., Dellaportas, P. and Politis, D. N. (2000) Full Bayesian inference for
GARCH and EGARCH models. Journal of Business & Economic Statistics, 18,
187-198.
Zakoian, J. M. (1994) Threshold heteroskedastic models. Journal of Economic Dynamics
and Control, 18, 931-955.
24