Multiantenna GLR Detection of Rank-One Signals With Known Power Spectrum in White Noise With Unknown...
Transcript of Multiantenna GLR Detection of Rank-One Signals With Known Power Spectrum in White Noise With Unknown...
1
Multiantenna GLR detection of rank-one signalswith known power spectrum in white noise with
unknown spatial correlationJosep Salaโ , Senior Member, IEEE, Gonzalo Vazquez-Vilarโก, Student Member, IEEE, Roberto
Lopez-Valcarceโก, Member, IEEE,โ Dept. of Signal Theory and Communications, Technical University of Catalonia, 08034 Barcelona, SPAIN
E-mail: [email protected]โกDept. of Signal Theory and Communications, University of Vigo, 36310 Vigo, SPAIN
E-mail: {gvazquez,valcarce}@gts.uvigo.essubmitted (minors after AQ) to: IEEE Transactions on Signal Processing
EDICS: SPC-DETC (Detection, Estimation and Demodulation), SSP-DETC (Detection)
AbstractโMultiple-antenna detection of a Gaussian signalwith spatial rank one in temporally white Gaussian noise witharbitrary and unknown spatial covariance is considered. Thisis motivated by spectrum sensing problems in the context ofDynamic Spectrum Access in which several secondary networkscoexist but do not cooperate, creating a background of spatiallycorrelated broadband interference. When the temporal correla-tion of the signal of interest is assumed known up to a scale factor,the corresponding Generalized Likelihood Ratio Test is shown toyield a scalar optimization problem. Closed-form expressions ofthe test are obtained for the general signal spectrum case inthe low signal-to-noise ratio (SNR) regime, as well as for signalswith binary-valued power spectrum in arbitrary SNR. The tworesulting detectors turn out to be equivalent. An asymptoticapproximation to the test distribution for the low-SNR regime isderived, closely matching empirical results from spectrum sensingsimulation experiments.
Index TermsโGLR test, detection, multiantenna array, corre-lated noise, noise uncertainty, cognitive radio, spectrum sensing,spectral flatness measure, Capon beamformer
I. INTRODUCTION
Array processing for signal detection and parameter esti-mation has been a topic of deep research interest for decades.Many techniques have been proposed under the assumptionthat the noise is Gaussian and spatially uncorrelated (or hasa known spatial structure that allows prewhitening) [1]โ[5].However, in many practical cases the noise field may presentunknown spatial correlation, due to co-channel interference incommunications applications, and to clutter and jamming inradar array processing [6]โ[10]. Our main motivation stemsfrom the problem of spectrum sensing for Dynamic SpectrumAccess (DSA) in licensed bands [11], [12]. DSA aims at amore efficient usage of spectrum by allowing unlicensed (orsecondary) users to opportunistically access channels in thosebands, as long as interference to licensed (or primary) usersis avoided; this can be achieved by dynamically tuning to
This work has been jointly supported by projects 2009SGR1236 (AGAUR),2009/062-2010/85 (Consolidation of Research Units), DYNACS (TEC2010-21245-C02/TCM) and COMONSENS (CONSOLIDER-INGENIO 2010CSD2008-00010) financed by the Catalan, Galician and Spanish Governmentsand the European Regional Development Fund (ERDF).
different carrier frequencies to sense the radio environment forunused channels. Spectrum sensing schemes must be robust towireless propagation phenomena such as large- and small-scalefading as well as to interference. The first two issues can bedealt with by resorting to cooperative detection strategies [13]and multiantenna detectors [14]โ[17], respectively.
Multiantenna spectrum sensing methods usually exploitspatial correlation of the signal, assuming a spatially uncor-related noise field. Whereas this assumption may be adequatein early stages of DSA-based schemes, in which a singlesecondary network coexists with the primary system, this is notnecessarily so when several independent secondary systemsaccess the same frequency band. Unless strict coordinationamong such systems is imposed, no quiet periods will beavailable in order to sense primary activity in a given channel.Hence, such activity will have to be detected in the presenceof a background noise consisting not only of thermal noise,but also the aggregate interference from the rest of secondarynetworks. The physical layer of secondary systems is likelyto make use of channel fragmentation, aggregation, and/orbonding in order to enhance spectrum utilization (such is thecase for e.g. the IEEE 802.22 standard for opportunistic accessto TV white spaces [18]), so that the background interferencewill likely be broadband (with respect to the bandwidth ofa primary channel), and potentially correlated in space, dueto the well-localized origin of secondary transmissions. Otherpotential sources of broadband interference of course exist, e.g.the emissions of Broadband over Power Line (BPL) systems inthe HF and VHF bands [19]. After bandlimiting by the channelselection filter of the receiver, the contribution due to thebroadband interference will tend to be Gaussian distributed,and will be modeled as temporally uncorrelated in this work.More sophisticated noise models, e.g. autoregressive in timeand/or space [20], may be of interest in certain scenarios butfall out of the scope of this paper.
On the other hand, it may well be possible for the spec-trum sensor to have knowledge about the (normalized) PowerSpectrum Density (PSD) of primary signals, as the emissionmasks of many licensed services (e.g., broadcast and cellularnetworks) are in the public domain. Knowledge of the signal
2
PSD has been exploited in [21] assuming a single antennaand known noise variance; and in [22], [23] for multiple an-tennas with spatially uncorrelated noise with known commonvariance, and respectively known and unknown channel gains.
Motivated by the above considerations, we derive the Gen-eralized Likelihood Ratio (GLR) detector and its ReceiverOperating Characteristic (ROC) for a random signal withspatial rank one and known PSD in temporally white Gaussiannoise of arbitrary spatial covariance. The nuisance parametersare the unknown antenna gains for the signal of interest andthe unknown spatial covariance of the noise. No structure isassumed on the latter, thus avoiding artificial constraints onthe unknown spatial rank of the interference, for the sake ofrobustness and detector complexity. With this model, intuitionsuggests that detection should be feasible as long as thetemporal correlation of the signal differs from that of thenoise process, i.e., the signal is not temporally white. Indeed,the performance of the GLR detector will be shown to bedependent on the spectral flatness measure at the output ofthe minimum variance beamformer.
Following common practice, the Gaussian model is adoptedfor the primary signal. The resulting model is tractable andleads to useful detectors even under different signal distri-butions. In addition, if the primary system uses multicarriermodulation, the Gaussian assumption becomes quite accuratefor the number of subcarriers usually found in practice [24].
Previous works in the literature on signal detection arerelated to the scheme presented in this paper. The single-antenna scenario considering real- rather than complex-valueddata in the proposed signal model has been treated in [30]using the Expectation-Maximization (EM) algorithm. On theother hand, the problem of signal detection with multiple an-tennas and spatially correlated noise has been addressed by theGeneralized Multivariate Analysis of Variance (GMANOVA)approach [25] when the signal is deterministic dependenton unknown parameters. Adaptive subspace detectors canbe used if the signal has low spatial rank and signal-freetraining data are available for estimating the unknown noisecovariance; see [26] for the case of deterministic signals,and [27] for Gaussian signal components. Other approachesfor the Gaussian signal model exploit rotational invariance ofthe noise field (if it exists) [28] or assume a parametric modelof the noise [20]. When the spatial correlation of noise isunstructured, and lacking training data, as in our model, itmay be possible to exploit the temporal correlation propertiesof signal and noise as in [29], where it is assumed that noisehas a much shorter temporal correlation length than that ofthe signals. However, none of these works derives the GLRtest under the proposed signal model. Moreover, the detectorherein described can be validated with the result in [30] inthe single antenna setting, leading to a simpler univariateoptimization scheme, and can significantly outperform otherdetectors which apply under the same model, as shown bysimulation results for that from [29].
The paper is structured as follows. The signal model isgiven in Sec. II, together with a summary of the mainresults to be developed in the sequel. The derivation of theexact GLR detector is carried out in Sec. III. The resulting
scheme involves the optimization of a data-dependent termwith respect to a scalar variable, which cannot be solved inclosed form in general. Nevertheless, in Sec. IV two importantparticular instances are considered that will result in a closed-form scheme, namely the low-SNR and binary-valued signalPSD cases. The asymptotic performance of the GLR detectoris analytically derived in Sec. V. Numerical results are givenin Sec. VI, and Sec. VII presents some closing remarks.
Notation: lower- and upper-case boldface symbols denotevectors and matrices, respectively. The ๐-th unit vector isdenoted by ๐๐. The trace, determinant, transpose, conjugate,conjugate transpose (Hermitian), adjoint, and Moore-Penrosepseudoinverse of ๐จ are denoted by tr๐จ, det๐จ, ๐จ๐ , ๐จโ,๐จ๐ป , adj(๐จ) and ๐จโ respectively. diag(๐จ) is a diagonalmatrix with diagonal equal to that of ๐จ. The Kroneckerproduct of ๐จ and ๐ฉ is denoted by ๐จโ๐ฉ. The column-wisevectorization of ๐จ is denoted by vec(๐จ). For Hermitian ๐จ, itslargest (resp. smallest) eigenvalue is denoted by ๐max[๐จ] (resp.๐min[๐จ]); if ๐จ is positive (semi)definite, ๐จ1/2 denotes itsunique Hermitian square root. The natural (base ๐) logarithmis denoted by log. Finally, ๐ โผ ๐๐ฉ (๐,๐ท ) indicates that ๐ iscircularly complex Gaussian with mean ๐ and covariance ๐ท .
II. PROBLEM FORMULATION
A. Signal Model
The general signal model considered in this paper is asfollows:
๐๐ = ๐๐ ๐ + ๐๐ โ โ๐ , ๐ โ {1, โ โ โ , ๐}, (1)
where ๐๐ represents a snapshot (sample of sensor outputs attime ๐) from an array of ๐ sensors, ๐ โ โ๐ is the unknownspatial signature vector, and {๐ ๐} โ โ, {๐๐} โ โ๐ areindependent zero-mean complex circular Gaussian processes.The signal vector ๐
.= [ ๐ 1 โ โ โ ๐ ๐ ]๐ โ โ๐ is zero
mean with known temporal covariance ๐ช.= ๐ผ[๐๐๐ป ], i.e.,
๐ โผ ๐๐ฉ (0,๐ช). The process {๐ ๐} is assumed wide-sensestationary with PSD ๐๐ ๐ (๐
๐๐); thus, ๐ช is Hermitian Toeplitz.In addition, we assume that ๐ผ[โฃ๐ ๐โฃ2] = 1 without loss ofgenerality (since any scaling factor can be absorbed into ๐),so that ๐ช has an all-ones diagonal. On the other hand, theprocess {๐๐} models the background noise and interference.It is assumed to be temporally white1, but spatially correlatedwith unknown and unstructured spatial covariance matrix ฮฃ:๐ผ[๐๐๐
๐ป๐ ] = ฮฃ if ๐ = ๐, and zero otherwise. Therefore,
defining the ๐๐ ร 1 data and noise vectors respectively as๐
.= [ ๐๐
1 โ โ โ ๐๐๐ ]๐ and ๐
.= [ ๐๐
1 โ โ โ ๐๐๐ ]๐ , one
can rewrite (1) as
๐ = ๐โ ๐+ ๐, (2)
which is Gaussian with covariance ๐น.= ๐ผ[๐๐๐ป ] = ๐ช โ
๐๐๐ป + ๐ฐ๐ โฮฃ.The problem considered is the detection of the presence
of signal ๐ given the data vector ๐. Thus, the corresponding
1If ๐๐ is temporally nonwhite but with known temporal correlation, theproposed model can be obtained after a temporal prewhitening step.
3
hypothesis test is given by
โ0 : ๐ โผ ๐๐ฉ (0,๐น0), ๐น0 = ๐ฐ๐ โฮฃ, (3)
โ1 : ๐ โผ ๐๐ฉ (0,๐น1), ๐น1 = ๐ช โ ๐๐๐ป + ๐ฐ๐ โฮฃ
๐ โ= 0, (4)
where ฮฃ is an unknown, unstructured, Hermitian positivedefinite matrix under both hypotheses.
Observe from (3)-(4) that a necessary condition for theproblem to be well-posed is that ๐ช โ= ๐ฐ๐ . Otherwise, thecovariance matrix under โ1 could be written as ๐ฐ๐ โ ฮฃ withฮฃ = ๐๐๐ป + ฮฃ; the fact that both ฮฃ and ฮฃ are unknown,together with the lack of structure in these matrices, yieldsthe same admissible set of covariance matrices under bothhypotheses, which then become indistinguishable.
In order to cope with the unknown parameters ๐ and ฮฃ, asensible approach is the Generalized Likelihood Ratio (GLR)test [31], in which these parameters are substituted by theirMaximum Likelihood (ML) estimates under each hypothesis:
๐ =maxฮฃ,๐ ๐(๐ โฃฮฃ,๐)maxฮฃ ๐(๐ โฃฮฃ,0)
โ1
โทโ0
๐พ, (5)
where the probability density function (p.d.f.) ๐ is given by
๐(๐ โฃฮฃ,๐) = 1
๐๐๐ det๐นexp{โ๐๐ป๐นโ1๐}. (6)
B. Summary of Main Results
Before delving into the derivation of the GLR test (5), wesummarize and discuss our main results. To this end, let usintroduce the data matrix ๐ โ โ๐ร๐ and its (economy-size)singular value decomposition (SVD):
๐.= [ ๐1 โ โ โ ๐๐ ] = ๐ผ๐บ๐ฝ ๐ป , (7)
where ๐ผ โ โ๐ร๐ is unitary, ๐บ โ โ๐ร๐ is positive definitediagonal, and ๐ฝ โ โ๐ร๐ is semi-unitary, i.e., ๐ฝ ๐ป๐ฝ = ๐ฐ๐ .Note that ๐ is related to the data vector ๐ โ โ๐๐ by ๐ =vec(๐ ). Then we have the following.
Theorem 1: The GLR test (5) can be written as follows,with ๐พ a suitable threshold:
๐ = max๐โฅ0
๐ก(๐)โ1
โทโ0
๐พ, ๐ก(๐).=๐โ๐
min
[๐ฝ ๐ป(๐ฐ๐ + ๐๐ชโ)โ1๐ฝ
]det(๐ฐ๐ + ๐๐ชโ)
,
(8)which involves a scalar optimization problem in the variable๐.
Note that the dependence of (8) with the data is onlythrough the semi-unitary matrix ๐ฝ , which is related to thetemporal dimension of the data. This is due to the lack ofknowledge about the spatial covariance of the noise-plus-interference process ๐๐. In addition, it is readily checked that if๐ช = ๐ฐ๐ then the statistic in (8) becomes ๐ = 1 independentlyof the data, reflecting again the fact that white signals are notdetectable under this model.
The parameter ๐ featuring in (8) is related to the Signal-to-Noise Ratio (SNR), as will be shown in the sequel. To thebest of our knowledge, there is no closed-form solution for thegeneral maximization problem (8). Nevertheless, the followingresult applies in the asymptotic regime of low SNR.
Lemma 1: For vanishingly small SNR, the GLR test (8)becomes equivalent to the following test:
๐ โฒ .= ๐max[๐ฝ๐ป๐ชโ๐ฝ ]
โ1
โทโ0
๐พโฒ. (9)
In order to illustrate the meaning of (9), let ๐ช = ๐ญฮ๐ญ๐ป
with ฮ = diag(๐0, โ โ โ , ๐๐โ1) be an eigenvalue decomposi-tion (EVD) of ๐ช . It is well known [31] that as ๐ โ โ theeigenvalues of ๐ช approach ๐๐ โ ๐๐ ๐ (๐
๐ 2๐๐๐ ), 0 โค ๐ โค ๐โ1,
whereas the matrix of eigenvectors ๐ญ approach the ๐ ร ๐orthonormal Inverse Discrete Fourier Transform (IDFT) matrix(this can be formally justified in terms of the asymptoticequivalence between sequences of matrices and the asymptoticeigenvalue distribution of circulant and Toeplitz matrices [32]).Let us now write ๐ฝ โ = [ ๐1 โ โ โ ๐๐ ]. The (๐, ๐) elementof ๐ฝ ๐ป๐ชโ๐ฝ is therefore ๐๐ป
๐ ๐ช๐๐ = (๐ญ๐ป๐๐)๐ปฮ(๐ญ๐ป๐๐),
which for ๐ โ โ can be thought of as a spectrally weightedfrequency-domain crosscorrelation between the outputs ofthe ๐-th and ๐-th orthonormalized data streams (since ๐ญ๐ป๐๐
approaches the ๐ -point DFT of ๐๐). The spectral weightsare given by the eigenvalues of ๐ช , i.e., the sampled PSDof the signal process. Thus, the test statistic ๐ โฒ in (9) isthe largest eigenvalue of this spectrally weighted frequency-domain correlation matrix. The discussion above also showsthat an approximation to ๐ฝ ๐ป๐ชโ๐ฝ for ๐ large can be ef-ficiently computed by means of the FFT operation, similarlyto [21], with no significant performance loss even for moderatevalues of ๐ .
If ๐ = 1, i.e., only one antenna is available, then it isreadily checked that
๐ โฒ =๐๐ป๐ช๐
๐๐ป๐, (10)
which is the ratio of the spectrally weighted energy to totalenergy, and can be seen as a generalization to the unknownnoise power case of the spectral correlation-based detectorfrom [21].
The following result gives the exact expression of theGLR test in closed form for a particular family of temporalcovariance matrices ๐ช .
Lemma 2: Assume that {0, ๐} are the only eigenvalues of๐ช , with ๐ > 0. Then the GLR test (8) is equivalent to theclosed-form test (9), independently of the SNR value.
Thus, for the case of a process {๐ ๐} with a flat bandpassPSD, the asymptotic (for low SNR) GLR test is also thecorresponding GLR test for all SNR values. This suggests thatthe loss incurred by the detector (9) with other types of signalspectra at moderate SNR may be small. Numerical results willattest to this remark.
To close this section, we note that it is possible to analyti-cally derive the asymptotic distribution of the test statistic ๐under both hypotheses. The corresponding expressions will bepresented in Sec. V.
III. DERIVATION OF THE GLR TEST
In this section we obtain the required ML estimates for thederivation of the GLR detector (5).
4
A. Preliminaries
Under โ0, (6) can be written as
๐(๐ โฃฮฃ,0) =[exp{โ tr(ฮฃโ1๏ฟฝ๏ฟฝ)}
๐๐ detฮฃ
]๐
, (11)
where we have introduced the sample covariance matrix
๏ฟฝ๏ฟฝ.=
1
๐๐ ๐ ๐ป . (12)
It is well known [5] that (11) is maximized for ฮฃ0 = ๏ฟฝ๏ฟฝ,yielding
maxฮฃ
๐(๐ โฃฮฃ,0) = [(๐๐)๐ det ๏ฟฝ๏ฟฝ]โ๐ . (13)
Obtaining the ML estimates under โ1 is more involved. Letus begin by introducing
๐โฅ.= ฮฃโ1๐, ๐
.= ๐๐ปฮฃโ1๐, ๐ฎ(๐)
.= (๐ฐ๐ + ๐๐ช)โ1.
(14)Vector ๐โฅ is the Capon beamformer [33], whereas ๐ is themaximum SNR that can be obtained at the output of a linearcombiner ๐ฅ๐ = ๐๐ป๐๐, attained precisely at ๐ = ๐โฅ. Notethat ๐ฎ(๐) and ๐ช share the same eigenvectors, and thereforethey commute. It will be also convenient to introduce the unitnorm vectors
๏ฟฝ๏ฟฝ.=
๐โ๐๐ป๐
, ๏ฟฝ๏ฟฝโฅ.=
๐โฅโ๐๐ปโฅ ๐โฅ
. (15)
Now we need expressions for the determinant and inverse of๐น1 = ๐ชโ๐๐๐ป+๐ฐ๐โฮฃ, featuring in the p.d.f. (6). ApplyingSylvesterโs determinant theorem [37] and the properties of theKronecker product,
det๐น1 = (detฮฃ)๐ det(๐ฐ๐ + ๐๐ช). (16)
Regarding the inverse of ๐น1, it is shown in Appendix A that
๐นโ11 = ๐ฐ๐ โฮฃโ1 โ๐ช๐ฎ(๐)โ ๐โฅ๐๐ป
โฅ . (17)
With this, the quadratic form ๐๐ป๐นโ11 ๐ can be written as (see
Appendix B):
๐๐ป๐นโ11 ๐ = tr[๐พโฅ๐ ๐ ๐ป ] + tr[๐พโฅ๐ ๐ฎโ(๐)๐ ๐ป ], (18)
where the matrices ๐พโฅ, ๐พโฅ are defined as
๐พโฅ.= ๐โ1๐โฅ๐๐ป
โฅ (19)
= ฮฃโ1[๐โ1๐๐๐ป
โฅ], (20)
๐พโฅ.= ฮฃโ1 โ ๐โ1๐โฅ๐๐ป
โฅ (21)
= ฮฃโ1[๐ฐ๐ โ ๐โ1๐๐๐ป
โฅ]. (22)
Note that ฮฃโ1 = ๐พโฅ +๐พโฅ, and that ๐พโฅ has rank one andis positive semidefinite. In addition, ๐พโฅ๐ = 0, and thus therank of ๐พโฅ is at most ๐ โ 1. The geometric interpretationof these matrices is as follows. For any ๐ โ โ๐ , write ๐ =๐๐+ ๏ฟฝ๏ฟฝ for some ๐ โ โ, ๏ฟฝ๏ฟฝ โ โ๐ with ๏ฟฝ๏ฟฝ๐ป๐โฅ = 0. Note thatas long as ๐ = ๐๐ป
โฅ ๐ โ= 0, this decomposition exists and isunique. (This condition means that the Capon beamformer isnot orthogonal to the spatial signature, i.e., it does not block
the signal at its output completely. It is satisfied for full rankฮฃ, as ๐ = ๐๐ปฮฃโ1๐ > 0). Then[
๐โ1๐๐๐ปโฅ]๐ = ๐๐,
[๐ฐ๐ โ ๐โ1๐๐๐ป
โฅ]๐ = ๏ฟฝ๏ฟฝ, (23)
i.e., ฮ .= ๐ฐ๐ โ๐โ1๐๐๐ป
โฅ is the oblique projector along ๐ ontothe subspace orthogonal to ๐โฅ.
Note that ๐๐ปฮฃโ1๐ = โฃ๐โฃ2๐๐ปฮฃโ1๐ + ๏ฟฝ๏ฟฝ๐ป๐พโฅ๏ฟฝ๏ฟฝ, which ispositive for ๐ โ= 0. Then for ๐ = 0, one has that ๐ = ๏ฟฝ๏ฟฝ is in asubspace of dimension ๐โ1, and ๐๐ปฮฃโ1๐ = ๏ฟฝ๏ฟฝ๐ป๐พโฅ๏ฟฝ๏ฟฝ > 0for ๏ฟฝ๏ฟฝ โ= 0, showing that ๐พโฅ is positive semidefinite of rank๐ โ 1.
For completeness, we provide an expression for ฮฃ in termsof these matrices, given by
ฮฃ = ฮ ๐พ โ โฅฮ
๐ป + ๐โ1๐๐๐ป , (24)
which is proved in Remark 2 (Appendix D).The procedure whereby two different orthogonal spaces
are considered for the signal and the noise plus interferencesubspaces in the spatial domain for this non-collaborative spec-trum sensing scenario constitutes a recurrent and successfulapproach in multiantenna detection [30] [34], array signalprocessing [35] as well as in spectral estimation contexts [36].The use of suitable parameter transformations in detectionschemes has also been addressed in [30].
B. A convenient change of variables
The decomposition of the unstructured inverse spatial co-variance ฮฃโ1 = ๐พโฅ + ๐พโฅ into two structured positivesemidefinite matrices will be useful in order to obtain the MLestimates. To this end, we introduce a change of variablesin order to replace the original set of unstructured unknownparameters ฮฉ
.= {๐,ฮฃ} by another set ฮฉโฒ based on ๐พโฅ and
๐พโฅ. In order to account for the structure of these matrices,let us introduce the following (economy-size) EVDs:
๐พโฅ = ๐พโฅ๏ฟฝ๏ฟฝโฅ๏ฟฝ๏ฟฝ๐ปโฅ , ๐พโฅ = ๐ผโฅฮโฅ๐ผ๐ป
โฅ , (25)
where ๐พโฅ โ โ is positive, ๏ฟฝ๏ฟฝโฅ โ โ๐ has unit norm (note from(19) that ๏ฟฝ๏ฟฝโฅ is the unit-norm Capon beamformer and ๐พโฅ theinverse of the noise power at its output), ฮโฅ โ โ(๐โ1)ร(๐โ1)
is positive definite diagonal, and ๐ผโฅ โ โ๐ร(๐โ1) has or-thonormal columns. In addition, we require that (i) ๐ผ๐ป
โฅ ๏ฟฝ๏ฟฝ = 0(to ensure that ๐พโฅ๏ฟฝ๏ฟฝ = 0); and (ii) ๏ฟฝ๏ฟฝ๐ป
โฅ ๏ฟฝ๏ฟฝ โ= 0 (to ensure that๐พโฅ + ๐พโฅ has full rank). Note that [๐ผโฅ ๏ฟฝ๏ฟฝ ] โ โ๐ร๐ isunitary.
With these, the new parameter space we consider is givenby
ฮฉโฒ .= {๐, ๏ฟฝ๏ฟฝ, ๐พโฅ, ๏ฟฝ๏ฟฝโฅ,ฮโฅ,๐ผโฅ}. (26)
It is readily checked that under the standing assumptions ๐ โ=0, ฮฃ = ฮฃ๐ป full rank, there is a one-to-one correspondence2
between the parameter spaces ฮฉ and ฮฉโฒ, which is summarizedin Table I. Hence, the maximization of the likelihood functioncan be carried out over either ฮฉ or ฮฉโฒ with identical result.
2Apart from irrelevant rotational ambiguities in ๏ฟฝ๏ฟฝ and ๏ฟฝ๏ฟฝโฅ, and rota-tional/ordering ambiguities in the EVD of ๐พโฅ.
5
In order to write the likelihood function in terms of the newparameters, note from (16) and (18) that
โ log ๐(๐ โฃฮฃ,๐)= ๐๐ log ๐ +๐ log detฮฃโ log det๐ฎโ(๐)+ ๐ tr[๐พโฅ๏ฟฝ๏ฟฝ] + tr[๐พโฅ๐ ๐ฎโ(๐)๐ ๐ป ]. (27)
The following result links the determinant of the unstructuredspatial covariance matrix with the new parameters; see Ap-pendix C for the proof.
Lemma 3: Under the parameterization (25)-(26), it holdsthat det(ฮฃโ1) = ๐พโฅ
โฃโฃ๏ฟฝ๏ฟฝ๐ป ๏ฟฝ๏ฟฝโฅโฃโฃ2 det(ฮโฅ).
Then (27) becomes
โ log ๐(๐ โฃฮฃ,๐)= ๐๐ log ๐ โ๐ log ๐พโฅ โ๐ log
โฃโฃ๏ฟฝ๏ฟฝ๐ป ๏ฟฝ๏ฟฝโฅโฃโฃ2
โ ๐ log detฮโฅ โ log det๐ฎโ(๐)+ ๐ tr[๐ผโฅฮโฅ๐ผ๐ป
โฅ ๏ฟฝ๏ฟฝ] + ๐พโฅ๏ฟฝ๏ฟฝ๐ปโฅ ๐ ๐ฎโ(๐)๐ ๐ป ๏ฟฝ๏ฟฝโฅ. (28)
C. ML estimation under โ1
We proceed now to minimize (28) with respect to theparameters in (26). The optimum values of ๐พโฅ and ฮโฅ arereadily found:
๐พโฅ =๐
๏ฟฝ๏ฟฝ๐ปโฅ ๐ ๐ฎโ(๐)๐ ๐ป ๏ฟฝ๏ฟฝโฅ
, ฮโฅ =[diag(๐ผ๐ป
โฅ ๏ฟฝ๏ฟฝ๐ผโฅ)]โ1
.
(29)Substituting (29) back in (28), we obtain3
โ log ๐
= ๐
(๐ + log
๐๐
๐
)โ๐ log
โฃโฃ๏ฟฝ๏ฟฝ๐ป ๏ฟฝ๏ฟฝโฅโฃโฃ2 โ log det๐ฎโ(๐)
+ ๐ log det[diag(๐ผ๐ป
โฅ ๏ฟฝ๏ฟฝ๐ผโฅ)]
+ ๐ log(๏ฟฝ๏ฟฝ๐ปโฅ ๐ ๐ฎโ(๐)๐ ๐ป๏ฟฝ๏ฟฝโฅ
). (30)
The optimum value of ๐ผโฅ is provided by the following result,whose proof is given in Appendix D:
Lemma 4: Let us consider the following cost: ๐ฝ(๐ผโฅ).=
det[diag(๐ผ๐ป
โฅ ๏ฟฝ๏ฟฝ๐ผโฅ)]. Then the solution of
min๐ผโฅ
๐ฝ(๐ผโฅ) subject to ๐ผ๐ปโฅ ๐ผโฅ = ๐ฐ๐โ1, ๐ผ๐ป
โฅ ๏ฟฝ๏ฟฝ = 0 (31)
is given by ๐ฝmin = ๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝโ1๏ฟฝ๏ฟฝ โ det ๏ฟฝ๏ฟฝ, and is attained when๐ผโฅ is an orthonormal basis for the range space of (๐ฐ๐ โ๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ๐ป)๏ฟฝ๏ฟฝ(๐ฐ๐ โ ๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ๐ป).Substituting the value of ๐ฝmin into (30), one has
โ log ๐
= ๐
(๐ + log
๐๐
๐+ log det ๏ฟฝ๏ฟฝ
)โ๐ log
โฃโฃ๏ฟฝ๏ฟฝ๐ป ๏ฟฝ๏ฟฝโฅโฃโฃ2
โ log det๐ฎโ(๐)
+ ๐ log(๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝโ1๏ฟฝ๏ฟฝ) +๐ log(๏ฟฝ๏ฟฝ๐ปโฅ ๐ ๐ฎโ(๐)๐ ๐ป ๏ฟฝ๏ฟฝโฅ
).(32)
3For the sake of clarity and with some abuse of the notation, we denotethe value of ๐(๐ โฃฮฃ,๐) obtained after maximization w.r.t. a parameter in ฮฉโฒsimply by ๐ .
Now, minimizing (32) with respect to ๏ฟฝ๏ฟฝ and ๏ฟฝ๏ฟฝโฅ (unit-normsignature vector and Capon beamformer respectively) amountsto minimizing
(๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝโ1๏ฟฝ๏ฟฝ) โ (๏ฟฝ๏ฟฝ๐ปโฅ ๐ ๐ฎโ(๐)๐ ๐ป ๏ฟฝ๏ฟฝโฅ)โฃโฃ๏ฟฝ๏ฟฝ๐ป ๏ฟฝ๏ฟฝโฅ
โฃโฃ2 , (33)
subject to ๏ฟฝ๏ฟฝ๐ป ๏ฟฝ๏ฟฝโฅ โ= 0. To this end, the following result willbe applied; see Appendix E for the proof.
Lemma 5: Let ๐จ1, ๐จ2 be two ๐ ร ๐ positive definiteHermitian matrices. The minimum value of
๐น (๐1,๐2).=
(๐๐ป1 ๐จ1๐1) โ (๐๐ป
2 ๐จ2๐2)
โฃ๐๐ป1 ๐2โฃ2 (34)
subject to ๐๐ป1 ๐2 โ= 0 is given by ๐นmin = ๐min(๐จ1๐จ2) =
๐min(๐จ2๐จ1), and is attained if ๐1 (resp. ๐2) is a minimumright eigenvector4 of ๐จ2๐จ1 (resp. ๐จ1๐จ2).Therefore, from the SVD (7), the minimum value of (33) isfound to be ๐min[๐ ๐ฎโ(๐)๐ ๐ป๏ฟฝ๏ฟฝโ1] = ๐๐min[๐ฝ
๐ป๐ฎโ(๐)๐ฝ ].Using ๐ โ = ๐ฝ ๐บโ1๐ผ๐ป , the corresponding ML estimates ห๐and ห๐โฅ are seen to be the unit-norm minimum eigenvectors ofthe matrices ๐ ๐ฎโ(๐)๐ โ and (๐ ๐ป)โ ๐ฎโ(๐)๐ ๐ป , respectively.
Substitution of these optimum values into (32) finally yields
maxฮฃ,๐
๐(๐ โฃฮฃ,๐) (35)
= [(๐๐)๐ det ๏ฟฝ๏ฟฝ]โ๐ โ max๐โฅ0
{(๐๐min[๐ฝ
๐ป๐ฎโ(๐)๐ฝ ]
det๐ฎโ(๐)
)โ1},
which, together with (13), proves that the GLR test is indeed(8) as in Theorem 1.
The ML estimates of the parameters are summarized in Ta-ble I. The expression for the ML estimate ๏ฟฝ๏ฟฝโฅ = ๏ฟฝ๏ฟฝโฅฮโฅ๏ฟฝ๏ฟฝ๐ป
โฅgiven in Table I is justified in Remark 1 of Appendix D whilethat of ฮฃ stems from (24).
As previously mentioned, there is no closed-form expressionfor the ML estimate of ๐, which has to be computed bynumerical means, e.g. a gradient search as in Sec. VI. Thisraises the issue of local optima, about which the followingcan be said. First, experimental evidence suggests that thelikelihood function is unimodal in ๐. Second, the asymptotic(as ๐ โ โ) likelihood function does turn out to be unimodal;see Remark 3 in Appendix F. And finally, for certain kinds oftemporal covariance matrices unimodality can be analyticallyestablished, as shown in the next section.
IV. GLR TEST FOR LOW-SNR AND BINARY-VALUED
EIGENSPECTRUM CASES
In this section we first explore the behavior of the GLRtest (8) for asymptotically low SNR, with arbitrary (but non-white) temporal correlation of the signal of interest. After this,we particularize the GLR test (8) to the case in which theeigenvalues of the temporal covariance matrix ๐ช reduce to{0, ๐} with ๐ > 0 (recall that these eigenvalues approach thesamples of the signal PSD as ๐ โ โ; thus, this situationincludes the case of a PSD occupying only a fraction ofthe Nyquist bandwidth, in which it is constant), showing
4In the sequel we shall refer to right eigenvectors simply as eigenvectors.
6
Transformed parameters: ฮฉโฒ = {๐, ๏ฟฝ๏ฟฝ, ๐พโฅ, ๏ฟฝ๏ฟฝโฅ,ฮโฅ,๐ผโฅ} ML estimates of transformed parameters
๐ = ๐๐ปฮฃโ1๐ ๐ = argmin๐โฅ0
{๐๐
min
[(๐ ๐ฎโ(๐)๐ ๐ป)(๐ ๐ ๐ป)โ1
]
det๐ฎโ(๐)
}
๏ฟฝ๏ฟฝ = ๐/โ๐๐ป๐ ห๐ = unit-norm minimum eigenvector of ๐ ๐ฎโ(๐)๐ โ
๏ฟฝ๏ฟฝโฅ = (ฮฃโ1๐)/โ๐๐ปฮฃโ2๐ ห๐โฅ = unit-norm minimum eigenvector of (๐ ๐ป)โ ๐ฎโ(๐)๐ ๐ป
๐พโฅ = (๐๐ปฮฃโ2๐)/(๐๐ปฮฃโ1๐) ๐พโฅ =(
1๐
ห๐๐ปโฅ ๐ ๐ฎโ(๐)๐ ๐ป ห๐โฅ
)โ1
๐ผโฅฮโฅ๐ผ๐ปโฅ = ฮฃโ1 โ (๐๐ปฮฃโ1๐)โ1(ฮฃโ1๐)(ฮฃโ1๐)๐ป ๏ฟฝ๏ฟฝโฅฮโฅ๏ฟฝ๏ฟฝ๐ป
โฅ =[(๐ฐ๐ โ ห๐ ห๐๐ป )๏ฟฝ๏ฟฝ(๐ฐ๐ โ ห๐ ห๐๐ป )
]โ Original parameters: ฮฉ = {ฮฃ,๐} ML estimates of original parameters
๐ = (โ
๐/๐พโฅ/โฃ๏ฟฝ๏ฟฝ๐ปโฅ ๏ฟฝ๏ฟฝโฃ) โ ๏ฟฝ๏ฟฝ ๏ฟฝ๏ฟฝ =
(โ๐/๐พโฅ/โฃ ห๐๐ป
โฅห๐โฃ)โ ห๐
ฮฃโ1 = ๐ผโฅฮโฅ๐ผ๐ปโฅ + ๐พโฅ๏ฟฝ๏ฟฝโฅ๏ฟฝ๏ฟฝ๐ป
โฅ ฮฃโ1 = ๏ฟฝ๏ฟฝโฅฮโฅ๏ฟฝ๏ฟฝ๐ปโฅ + ๐พโฅ ห๐โฅ ห๐๐ป
โฅฮฃ = ฮ (๐ผโฅฮโ1
โฅ ๐ผ๐ปโฅ )ฮ ๐ป + (๐พโฅโฃ๏ฟฝ๏ฟฝ๐ป
โฅ ๏ฟฝ๏ฟฝโฃ2)โ1๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ๐ป ฮฃ = ฮ ๏ฟฝ๏ฟฝฮ ๐ป + ๐โ1๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ๐ป
ฮ .= ๐ฐ๐ โ (๏ฟฝ๏ฟฝ๐ป
โฅ ๏ฟฝ๏ฟฝ)โ1๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ๐ปโฅ ฮ = ๐ฐ๐ โ (ห๐๐ป
โฅห๐)โ1 ห๐ ห๐๐ป
โฅ
TABLE ITRUE PARAMETERS (LEFT) AND ML ESTIMATES (RIGHT) UNDER โ1 , WITH ๏ฟฝ๏ฟฝ = 1
๐๐ ๐ ๐ป , ๐ฎ(๐)
.= (๐ฐ๐ + ๐๐ช)โ1 .
that a closed-form expression for the detector exists in thiscase, valid for all SNR values. Remarkably, the closed-formdetectors obtained in both cases are the same.
A. GLR test for asymptotically low SNR
From (8), the GLR statistic is ๐ = max๐โฅ0 ๐ก(๐). For small๐, the following first-order approximations
(๐ฐ๐+๐๐ชโ)โ1 โ ๐ฐ๐โ๐๐ชโ, det(๐ฐ๐+๐๐ชโ) โ 1+๐ tr๐ชโ
(36)hold. Therefore,
๐ก(๐) โ ๐โ๐min [๐ฝ
๐ป(๐ฐ๐ โ ๐๐ชโ)๐ฝ ]
1 + ๐ tr๐ชโ =(1 โ ๐๐max[๐ฝ
๐ป๐ชโ๐ฝ ])โ๐
1 + ๐ tr๐ชโ .
(37)Taking logarithms and using the approximation log(1+๐ฅ) โ ๐ฅfor small โฃ๐ฅโฃ, one has
log ๐ก(๐) โ ๐๐
(๐max[๐ฝ
๐ป๐ชโ๐ฝ ]โ 1
๐tr๐ชโ
). (38)
Suppose now that ๐ were known; in that case, the GLRstatistic is directly ๐ก(๐), and (38) shows that, in low SNR, theGLR test is equivalent to comparing ๐max[๐ฝ
๐ป๐ชโ๐ฝ ] againsta threshold. This amounts to saying that for vanishingly smallSNR, knowledge of ๐ becomes irrelevant, and the GLR test canbe rephrased as in (9) since tr๐ชโ is a constant independentof the data, thus establishing Lemma 1.
B. Binary-valued eigenspectrum
We now set to prove Lemma 2. Consider the EVD ๐ช =๐ญฮ๐ญ๐ป , and assume that the only eigenvalues of ๐ช are ๐and zero, with multiplicities ๐ฟ and ๐ โ๐ฟ respectively. Sinceit is assumed that ๐ช has ones on its diagonal, it follows thattr๐ช = ๐ฟ๐ = ๐ . Letting ๐
.= ๐ฟ/๐ < 1 denote the fraction
of nonzero eigenvalues, then ๐ = ๐โ1. The GLR statistic ๐from (8) can be written as
๐ = max๐โฅ0
{๐โ๐
min
[(๐ญ โ๐ฝ )๐ป(๐ฐ๐ + ๐ฮ)โ1๐ญ โ๐ฝ
]det(๐ฐ๐ + ๐ฮ)
}(39)
It is readily checked that (๐ฐ๐ + ๐ฮ)โ1 = ๐ฐ๐ โ ๐1+๐โ1๐ฮ and
det(๐ฐ๐ + ๐ฮ) = (1 + ๐โ1๐)๐๐ . Therefore,
๐ 1/๐ (40)
= max๐โฅ0
โงโจโฉ๐โ1
min
[(๐ญ โ๐ฝ )๐ป(๐ฐ๐ โ ๐
1+๐โ1๐ฮ)๐ญ โ๐ฝ]
(1 + ๐โ1๐)๐
โซโฌโญ
=
{min๐โฅ0
[(1โ ๐๏ฟฝ๏ฟฝ
1 + ๐โ1๐
)(1 + ๐โ1๐)๐
]}โ1
, (41)
where ๏ฟฝ๏ฟฝ.= ๐max[๐ฝ
๐ป๐ชโ๐ฝ ]. It is straightforward to show thatthe minimum in (41) is attained at
๐ = max
{0,
๏ฟฝ๏ฟฝโ 1
๐โ1 โ ๏ฟฝ๏ฟฝ
}, if 0 โค ๏ฟฝ๏ฟฝ โค ๐โ1, (42)
whereas no minimum exists if ๏ฟฝ๏ฟฝ > ๐โ1. However, thiscase need not be considered: since ๐ฝ ๐ป๐ฝ = ๐ฐ๐ , by thePoincare separation theorem [38] one has ๐max[๐ฝ
๐ป๐ชโ๐ฝ ] โค๐max[๐ช
โ] = ๐โ1. Thus, (41) becomes
๐ 1/๐ =
{1, if 0 โค ๏ฟฝ๏ฟฝ โค 1,
๏ฟฝ๏ฟฝโ๐(
1โ๐1โ๐๏ฟฝ๏ฟฝ
)1โ๐
, if 1 < ๏ฟฝ๏ฟฝ โค ๐โ1,(43)
which is a non-decreasing function of ๏ฟฝ๏ฟฝ โ [0, ๐โ1]. Hence, theGLR test for binary-valued eigenspectrum is equivalent to thetest (9), as was to be shown. It has a nice geometrical inter-pretation: if we collect in ๐ญ the columns of ๐ญ correspondingto the nonzero eigenvalue ๐ = ๐โ1, then ๐ช = ๐โ1๐ญ๐ญ๐ป , and
๐max[๐ฝ๐ป๐ชโ๐ฝ ] = ๐โ1๐max[(๐ญ
๐ป๐ฝ โ)๐ป(๐ญ๐ป๐ฝ โ)]= ๐โ1 cos2 ๐, (44)
where ๐ is the minimum principal angle [39] between thesubspaces spanned by ๐ฝ โ and ๐ญ . The GLR statistic is thusa measure of the largest achievable projection of a vector inthe range space of ๐ฝ โ (data subspace) onto the range spaceof ๐ญ (reference subspace). Note that cos2 ๐ > ๐ in order tohave ๐ > 1.
7
V. PERFORMANCE ANALYSIS IN LOW SNR
The following result, whose proof is given in AppendixF, provides the asymptotic distributions of the GLR statisticunder each hypothesis, assuming that the SNR is small. Thus,it enables us to compute the probabilities of detection ๐D andfalse alarm ๐FA for a given threshold.
Theorem 2: As ๐ โ โ and for sufficiently small SNR,the GLR statistic (8) is asymptotically distributed as
2 log๐ โผ{
๐22๐โ1, under โ0,
๐โฒ22๐โ1(๐ผsph(๐)), under โ1,
(45)
๐ผsph(๐).= 2๐ log
1๐ tr(๐ฐ๐ + ๐๐ช)
[det(๐ฐ๐ + ๐๐ช)]1/๐, (46)
where ๐.= ๐๐ปฮฃโ1๐, and ๐2
๐, ๐โฒ2๐ (๐ผ) denote respectively
central and non-central chi-square distributions with ๐ degreesof freedom and non-centrality parameter ๐ผ.
The argument of the log in (46) is the ratio of the arithmeticmean (AM) to geometric mean (GM) of the eigenvalues of๐ฐ๐ + ๐๐ช , which is the temporal covariance matrix at theoutput of the Capon beamformer. By the AM-GM inequality,this ratio is no less than one, and it is equal to one iffall eigenvalues are equal, i.e., iff ๐ช = ๐ฐ๐ . In that case๐ผsph(๐) = 0 for all ๐ and the asymptotic distributions under โ1
and โ0 coincide, which is consistent with the fact that whitesignals are not detectable under this signal model. ApplyingSzegoโs theorem [40], the asymptotic value of the AM-GMratio can be written in terms of the signal PSD:
lim๐โโ
1๐ tr(๐ฐ๐ + ๐๐ช)
[det(๐ฐ๐ + ๐๐ช)]1/๐(47)
=12๐
โซ ๐
โ๐[1 + ๐๐๐ ๐ (๐๐๐)]d๐
exp{
12๐
โซ ๐
โ๐ log[1 + ๐๐๐ ๐ (๐๐๐)]d๐} ,
which is the inverse of the Spectral Flatness Measure (SFM)[41] associated with the power spectrum 1 + ๐๐๐ ๐ (๐
๐๐). Itsminimum value is 1 for ๐ = 0, increasing monotonically with๐ toward the inverse of the SFM associated to ๐๐ ๐ (๐
๐๐) [42].We could ask for the signal eigenvalue distribution maxi-
mizing the performance of the GLR detector at a given SNR.In view of Theorem 2, this amounts to maximizing ๐ผsph(๐).Recalling that tr๐ช = ๐ due to signal power normalization,and with {๐0, . . . , ๐๐โ1} the eigenvalues of ๐ช , the problembecomes
min{๐๐}
๐โ1โ๐=0
(1 + ๐๐๐) (48)
subject to๐โ1โ๐=0
๐๐ = ๐, ๐๐ โฅ 0, ๐ = 0, . . . , ๐ โ 1,
whose solutions are ๐๐ = ๐ for some ๐ โ {0, 1, . . . , ๐ โ 1}and ๐๐ = 0 for ๐ โ= ๐ (independently of ๐), as can be easilyshown following the same reasoning as in [21, Sec. IV]. For๐ โ โ, this implies that the optimum PSD concentrates allits power at a single frequency. This is not surprising, as thiskind of peaky spectra minimize the SFM and are easier todetect in the presence of noise.
0 5 10 15 20 2510
โ4
10โ2
100
Statistic 2 logT
Simulation (ฯ = 0)Analytical
(a)
0 10 20 30 40 50 6010
โ4
10โ2
100
Statistic 2 logT
Simulation (ฯ = 0.05)Simulation (ฯ = 0.1)Simulation (ฯ = 0.2)Analytical
(b)
Fig. 1. Distribution of the statistic 2 log ๐ for ๐ = 4 and ๐ = 128. (a)Under โ0 (b) Under โ1.
Finally, note from Theorem 2 that, as desired, the asymptoticperformance of the GLR detector does not depend on thespecific spatial correlation profile of the noise, but only onthe operational SNR ๐.
VI. NUMERICAL RESULTS
We proceed to examine the performance of the proposed de-tectors via Monte Carlo simulations and check the accuracy ofthe analytical approximations. In all experiments, the signaturevector ๐ and the noise spatial covariance matrix ฮฃ = ๐ฏ๐ฏ๐ป
are randomly generated at each Monte Carlo run, with ๐ scaledin order to obtain a given SNR value ๐ = ๐๐ปฮฃโ1๐. Theentries of ๐ and ๐ฏ โ โ๐ร๐ are independent zero-meancircular Gaussian. We refer to the two proposed detectors as:
1) Iterative GLRT: the exact GLR test from (8), in whichthe optimization w.r.t. ๐ is performed using a gradientdescent algorithm detailed in Appendix G.
2) Asymptotic GLRT: the closed-form detector from (9),which has been shown to coincide with the GLR detectorfor vanishing SNR or for a binary-valued eigenspectrum(PSD) of the primary signal.
For each signal type used in the simulations, the autocorre-lation of the signal is estimated from a register of 106 noise-free samples, and the Toeplitz matrix ๐ช is built from theseestimates.
In order to check the accuracy of the theoretical distributionsof the GLR statistic given in Theorem 2, these are compared inFig. 1 against the empirical histograms of the iterative GLRTscheme, for a setting with ๐ = 4 antennas and ๐ = 128.The PSD of the signal is constant within its support, whichequals half the Nyquist bandwidth. A very good agreement isobserved, even for this moderate value of the sample size ๐ .
8
To establish suitable benchmarks for the proposed schemes,we additionally consider the three following detectors, all ofwhich incorporate knowledge of ๐ช .
1) A direct generalization of the single-antenna detectorof Quan et al. [21], in which the following statistic iscompared against a threshold:
๐Quan et al. = tr(๐ ๐ช๐ ๐ป). (49)
2) An energy ratio (ER) detector in which (49) is normal-ized by the total observed energy, in order to cope withthe noise uncertainty issue. The corresponding statisticis
๐ER =tr(๐ ๐ช๐ ๐ป)
tr(๐ ๐ ๐ป), (50)
which is a direct generalization of (10) to the multi-antenna setting.
3) The โrule Tโ detector proposed by Stoica & Cedervallin [29], which exploits that the temporal correlationlength of the noise is much shorter than that of thesignal of interest. Its specification is too lengthy and thereader is referred to [29] for more information. Using thenotation from [29], our implementation assumes ๏ฟฝ๏ฟฝ = 1(signal spatial rank), ๐ = 16 (truncation point) and๐พ = 1 (number of correlation lags).
A. Detection performance in the low SNR regime
The empirical ROC curves of the different detectors con-sidered are shown in Fig. 2, together with the analyticalapproximation from Theorem 2, for a setting with ๐ = 4,๐ = 512 and ๐ = 0.2. Two different kinds of signals areconsidered. The first one is an OFDM-modulated digital TVbaseband signal with a bandwidth of 3.805 MHz, sampled at16 Msps and quantized to 9-bit precision. The signal samplesare approximately Gaussian distributed, with a PSD that isalmost constant within its support (about 48% of the Nyquistbandwidth). The second signal uses a 16-QAM constellationand square-root raised cosine pulses with roll-off factor 1,and it is sampled at twice its baud rate with random timingoffset. In this case, the corresponding samples do not followa Gaussian distribution. In accordance with the signal model,frequency-flat channels are assumed; the impact of frequencyselectivity will be considered in Sec. VI-C.
As can be seen in Fig. 2, the proposed schemes signifi-cantly outperform the detectors of Quan et al. and Stoica &Cedervall, as well as the Energy Ratio detector. The SNR issufficiently low for the asymptotic GLRT to achieve the sameperformance as the iterative GLRT detector. The agreement ofthe empirical and theoretical ROC curves for the GLR test isagain quite good. Also note that the GLR detectors performnoticeably better with the OFDM signal than with the QAMsignal. This is explained by the shape of the respective PSDs:in this setting, the inverse SFM given by (47) equals 1.01525and 1.00705 respectively for the OFDM and QAM signals.
B. Detection performance vs. SNR
The exact GLR detector (8) and its asymptotic version (9)were tested with the same OFDM and QAM signals of the
10โ2
10โ1
100
10โ2
10โ1
100
PFA
1โ
PD
Analytical (ฮปsph)iterative-GLRTasymptotic-GLRTQuan et al.ER detectorStoica-Cedervall
(a)
10โ2
10โ1
100
10โ2
10โ1
100
PFA
1โ
PD
Analytical (ฮปsph)iterative-GLRTasymptotic-GLRTQuan et al.ER detectorStoica-Cedervall
(b)
Fig. 2. ROC curves for ๐ = 4, ๐ = 512 and ๐ = 0.2: (a) OFDM and (b)square root raised cosine signals.
โ14 โ12 โ10 โ8 โ6 โ4 โ210
โ4
10โ3
10โ2
10โ1
100
Average SNR [dB]
1โ
PD
iterative-GLRTasymptotic-GLRTAnalytical (ฮปsph)
Sqrt Raised Cosine(rolloff = 1)
OFDM
Fig. 3. Missed detection probability vs. SNR for fixed ๐FA = 0.05, ๐ = 4and ๐ = 128.
previous section, but now varying the operational SNR in orderto check the performance loss incurred by the asymptotic GLRscheme. Fig. 3 shows the probability of missed detection interms of the average SNR per antenna, ๐/๐ , for ๐ = 4, ๐ =128 and fixed ๐FA = 0.05. Notice the good match between
9
โ17 โ16 โ15 โ14 โ13 โ12 โ11 โ10 โ9 โ810
โ3
10โ2
10โ1
100
Average SNR [dBs]
1โ
PD
iterative-GLRTasymptotic-GLRT
โ1 0 1โ40
โ30
โ20
โ10
0
10
Normalized frequency
psd
[dB
]
Fig. 4. Detection performance vs. SNR for fixed ๐FA = 0.05, with ๐ = 4and ๐ = 256. Lines: frequency-flat channels. Markers: Frequency-selectiveWINNER II channel model.
the empirical and analytical (asymptotic) results over the SNRrange considered, which is not limited to just the very lowSNR region. The better results observed for the OFDM signalare again explained by the fact that the corresponding PSDis โless flatโ. Also note that the two versions of the GLRdetector yield the same result with the OFDM signal; this isas expected, since the PSD of the OFDM signal is very close tothe binary-valued eigenspectrum case for which both detectorsare equivalent (cf. Sec IV-B). On the other hand, for the raisedcosine PSD the asymptotic form of the GLR detector presentsa performance loss that increases with the SNR. Hence, withnon-binary valued power spectra, this detector offers a tradeoffbetween complexity and performance.
C. Effect of frequency selective channels
To close this section, we consider a realistic scenario basedon the parameters of the GSM system. The separation betweenGSM channels is 200 kHz, although in a given geographicalarea two adjacent channels cannot be active in order to avoidinterference. Thus, the effective channel separation is 400 kHz,which is the same as the approximate bandwidth of the GSMsignal, based on Gaussian Minimum Shift Keying (GMSK)[43]. We thus synthesized samples of a baseband GMSKwaveform based on GSM parameters at a sampling rate of400 ksps.
Frequency-selective channels were generated according tothe WINNER Phase II model [44] with Profile C1 (suburban),and Non-line-of-sight (NLOS). The central frequency is 1.8GHz and the channel bandwidth is 400 kHz. Transmitter andreceiver are randomly distributed on a 10 km-side square. Forthe receiver we take a linear array with ๐ = 4 antennaswith inter-element separation of 10 cm. This channel is re-scaled to obtain a fixed operational SNR and is applied tothe GSM waveform. Fig. 4 shows the probability of detectionof the two proposed schemes vs. average per antenna SNRfor fixed ๐FA = 0.05. The inset also shows the PSD ofthe GMSK waveform. It is observed that both the exact
and asymptotic versions of the GLR detector are robust tofrequency selectivity effects, and in fact the results obtainedvirtually match those of a frequency-flat scenario.
We note that some small degree of non-whiteness in thecombined spectral content of noise plus interfering broadbandsources should be expected within the detectorโs bandwidthdue to random frequency selectivity affecting the secondaryusers (assuming a uniform PSD at the transmitter over thedetectorโs bandwidth). Nonetheless, for the detection of acomparatively narrow-band primary signal in the simulatedWINNER Phase II channel model, this effect (fluctuations inthe tenths of dB range over a span of 400 kHz) is less criticalthan the mitigation of spatially correlated interference. Weshould remark as reported in [45], that under all real-worldconditions, unavoidable model uncertainties appear in termsof an SNR wall that establishes a trade-off between primaryuserโs SNR and detector robustness. That is, below a minimumSNR, any detector ceases to operate reliably regardless ofthe duration of the observation window. Although worthmentioning, a detailed statistical analysis of these degradingeffects5 falls out of the scope of this paper and has not beenincluded for reasons of space.
VII. CONCLUSIONS
The multiantenna GLR detector for a Gaussian signal ofknown (up to a scaling) PSD and spatial rank one in tempo-rally white Gaussian noise-plus-interference of arbitrary andunknown spatial correlation has been derived in terms of aunivariate optimization problem over an SNR parameter. Ananalytical expression for the statistical distributions of thetest in the low SNR regime has been obtained, resulting inclose agreement with empirical results for realistic scenar-ios featuring practical (non-Gaussian) communication signals.This is attributed to the fact that, as the detector statistic isbased on sample correlations, the underlying distribution ofindividual samples becomes asymptotically irrelevant for largedata records. The spectral flatness measure at the output ofthe minimum variance beamformer was found to determinethe performance of the GLR detector, such that less spectrallyflat signals enjoy improved detectability. At the other extreme,temporally white signals are not detectable under the modelconsidered. The asymptotic version of the GLR detector forlow SNR was also derived, resulting in a closed-form testwith much lower computational cost to which the exact GLRdetector also reduces if the signal PSD is binary-valued.
APPENDIX
A. Proof of (17)
Letting ๐.= ฮฃโ1/2๐, one has
๐นโ11 (51)
= [๐ช โ ๐๐๐ป + ๐ฐ๐ โฮฃ]โ1
= (๐ฐ๐ โฮฃโ1/2)[๐ช โ ๐๐๐ป + ๐ฐ๐๐ ]โ1(๐ฐ๐ โฮฃโ1/2).
5For example, detection within the steep spectral roll-off region of thebroadband interferer(s).
10
Recalling the definition (14) of ๐ฎ(๐), we now claim that
[๐ช โ ๐๐๐ป + ๐ฐ๐๐ ]โ1 = ๐ฐ๐๐ โ๐ช๐ฎ(๐)โ ๐๐๐ป , (52)
which can be checked by directly multiplying the right-handside of (52) by ๐ชโ๐๐๐ป+๐ฐ๐๐ to see that it yields the identitymatrix (using that ๐ช and ๐ฎ(๐) share the same eigenvectors).Substituting (52) back into (51) and noting from (14) thatฮฃโ1/2๐ = ๐โฅ, the desired result is obtained.
B. Proof of (18)
From (17), and using the property tr(๐จ๐1 ๐จ
๐ป2 ๐จ3๐จ4) =
vec(๐จ2)๐ป(๐จ1 โ๐จ3)vec(๐จ4) [47], one has
๐๐ป๐นโ11 ๐ (53)
= vec(๐ )๐ป(๐ฐ๐ โฮฃโ1 โ๐ช๐ฎ(๐)โ ๐โฅ๐๐ปโฅ )vec(๐ )
= tr(ฮฃโ1๐ ๐ ๐ป)โ tr(๐โฅ๐๐ปโฅ ๐ (๐ช๐ฎ(๐))๐๐ ๐ป). (54)
Applying now the identity ๐ช๐ฎ(๐) = ๐โ1[๐ฐ๐ โ๐ฎ(๐)], whichfollows from the definition (14) of ๐ฎ(๐), noting that ๐ฎ๐ (๐) =๐ฎโ(๐), and then rearranging terms yields the desired result.
C. Proof of Lemma 3
First, note that ๐พโฅ = [๐ผโฅ ๏ฟฝ๏ฟฝ ]ฮโฅ[๐ผโฅ ๏ฟฝ๏ฟฝ ]๐ป constitutes afull EVD of ๐พโฅ, where
ฮโฅ.=
[ฮโฅ
0
]. (55)
Then the determinant of ฮฃโ1 can be written as
det(ฮฃโ1)
= det(๐พโฅ +๐พโฅ)
= det(๐พโฅ๏ฟฝ๏ฟฝโฅ๏ฟฝ๏ฟฝ๐ป
โฅ + [๐ผโฅ ๏ฟฝ๏ฟฝ ]ฮโฅ[๐ผโฅ ๏ฟฝ๏ฟฝ ]๐ป)
(56)
= det(๐พโฅ[๐ผโฅ ๏ฟฝ๏ฟฝ ]๐ป ๏ฟฝ๏ฟฝโฅ๏ฟฝ๏ฟฝ๐ป
โฅ [๐ผโฅ ๏ฟฝ๏ฟฝ ] + ฮโฅ)
(57)
= det(ฮโฅ) + ๐พโฅ๏ฟฝ๏ฟฝ๐ปโฅ [๐ผโฅ ๏ฟฝ๏ฟฝ ] adj(ฮโฅ)[๐ผโฅ ๏ฟฝ๏ฟฝ ]๐ป ๏ฟฝ๏ฟฝโฅ,(58)
where in the last step we used the fact that det(๐จ+ ๐๐๐ป) =det๐จ+๐๐ป adj(๐จ)๐ [46]. Note now from (55) that det(ฮโฅ) =0, whereas adj(ฮโฅ) = det(ฮโฅ)๐๐๐๐ป๐ . The desired resultthen follows.
D. Proof of Lemma 4
Let ๐โฅ.= (๐ฐ๐ โ ๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ๐ป)๐ be the orthogonal projection
of the data matrix onto the subspace orthogonal to ๐. Let๏ฟฝ๏ฟฝโฅ
.= 1
๐๐โฅ๐ ๐ปโฅ be the corresponding sample covariance
matrix, with (economy-size) EVD ๏ฟฝ๏ฟฝโฅ = ๐ธ๐ซ๐ธ๐ป (i.e., ๐ซ โโ(๐โ1)ร(๐โ1) is positive diagonal and ๐ธ โ โ๐ร(๐โ1)
has orthonormal columns). Note that since ๐ผ๐ปโฅ ๏ฟฝ๏ฟฝ = 0, then
๐ผ๐ปโฅ ๏ฟฝ๏ฟฝ๐ผโฅ = ๐ผ๐ป
โฅ ๏ฟฝ๏ฟฝโฅ๐ผโฅ. Therefore, by virtue of Hadamardโsinequality [46], the cost ๐ฝ satisfies
๐ฝ(๐ผโฅ) = det[diag(๐ผ๐ป
โฅ ๐ธ๐ซ๐ธ๐ป๐ผโฅ)]
โฅ det(๐ผ๐ปโฅ ๐ธ๐ซ๐ธ๐ป๐ผโฅ), (59)
with equality holding in (59) iff the matrix between parenthe-ses is diagonal. This happens if ๐ผโฅ = ๐ธ, which is feasible
since its columns are orthonormal and ๐ธ๐ป ๏ฟฝ๏ฟฝ = 0. Thus๐ฝmin = det๐ซ.
Let now ๏ฟฝ๏ฟฝ.= [๐ธ ๏ฟฝ๏ฟฝ ], which is unitary. Write the inverse
of ๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ in terms of the adjoint matrix as
(๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ)โ1 =adj(๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ)
det(๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ), (60)
which can be rewritten as
๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝโ1๏ฟฝ๏ฟฝ =adj(๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ)
det ๏ฟฝ๏ฟฝ. (61)
The (๐,๐) element of this matrix is therefore
๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝโ1๏ฟฝ๏ฟฝ =๐๐ป๐ adj(๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ)๐๐
det ๏ฟฝ๏ฟฝ=
det(๐ธ๐ป๏ฟฝ๏ฟฝ๐ธ)
det ๏ฟฝ๏ฟฝ. (62)
Finally, observe that ๐ซ = ๐ธ๐ป๏ฟฝ๏ฟฝโฅ๐ธ = ๐ธ๐ป๏ฟฝ๏ฟฝ๐ธ because๐ธ๐ป ๏ฟฝ๏ฟฝ = 0. This, together with (62), yields the desired result.
Remark 1: Note from (29) that the ML estimate of ฮโฅ isgiven by
ฮโฅ =[diag
(๏ฟฝ๏ฟฝ๐ป
โฅ ๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝโฅ)]โ1
= ๐ซโ1, (63)
since the ML estimate of ๐ผโฅ is ๏ฟฝ๏ฟฝโฅ = ๐ธ. Hence, theML estimate of ๐พโฅ = ๐ผโฅฮโฅ๐ผ๐ป
โฅ turns out to be ๏ฟฝ๏ฟฝโฅ =๏ฟฝ๏ฟฝโฅฮโฅ๏ฟฝ๏ฟฝ๐ป
โฅ = ๐ธ๐ซโ1๐ธ๐ป = ๏ฟฝ๏ฟฝโ โฅ.
Remark 2: We note that the Moore-Penrose pseudoinverse๐พ โ
โฅ = ๐ผโฅฮโ1โฅ ๐ผ๐ป
โฅ fulfils ๐พ โ โฅ๐ = 0, with ๐พโฅ๐พ
โ โฅ the
projector onto the subspace orthogonal to ๐ and ๐พโฅ๐พโ โฅ +
๐๐๐ป
= I๐ . Then, to verify (24), we premultiply by ฮฃโ1:ฮฃโ1(ฮ ๐พ โ
โฅฮ ๐ป + ๐โ1๐๐H) = ๐พโฅ๐พ
โ โฅฮ
๐ป + I๐ โ ฮ ๐ป =
I๐ โ ๐๐Hฮ ๐ป . As ฮ ๐ = 0, the previous result is I๐ ,
verifying that (24) is a valid expression for ฮฃ.
E. Proof of Lemma 5
Since ๐น (๐1,๐2) is invariant to scalings in ๐1 and ๐2, itsminimization is equivalent to the problem
min๐1,๐2
(๐๐ป1 ๐จ1๐1)(๐
๐ป2 ๐จ2๐2) subject to โฃ๐๐ป
1 ๐2โฃ2 = ๐2,
(64)with ๐2 > 0 an arbitrary positive constant. Thus, consider theLangrangian
โ = (๐๐ป1 ๐จ1๐1)(๐
๐ป2 ๐จ2๐2)โ ๐(โฃ๐๐ป
1 ๐2โฃ2 โ ๐2), (65)
with ๐ the Lagrange multiplier. Equating the gradient of โw.r.t. ๐1, ๐2 to zero, we respectively obtain
(๐๐ป2 ๐จ2๐2)๐จ1๐1 = ๐(๐๐ป
2 ๐1)๐2, (66)
(๐๐ป1 ๐จ1๐1)๐จ2๐2 = ๐(๐๐ป
1 ๐2)๐1. (67)
To discard the possibility of having ๐๐ป1 ๐2 = 0, note that
this would mean that the left-hand sides of (66)-(67) are bothzero, which can only happen if ๐1 = ๐2 = 0 since ๐จ1, ๐จ2
are positive definite. Thus, assuming ๐1 โ= 0, ๐2 โ= 0, solvingfor ๐ in (66)-(67) yields
๐ =(๐๐ป
1 ๐จ1๐1)(๐๐ป2 ๐จ2๐2)
โฃ๐๐ป1 ๐2โฃ2 , (68)
11
that is, the Lagrange multiplier takes the value of the attainedcost. Substituting now the value of ๐2 from (66) and of ๐from (68) into (67), one has ๐จ2๐จ1๐1 = ๐๐1. Similarly, italso holds that ๐จ1๐จ2๐2 = ๐๐2. Thus ๐ is an eigenvalue of๐จ2๐จ1 (resp. ๐จ1๐จ2) with associated eigenvector ๐1 (resp.๐2). Hence the cost is minimized when ๐ is the smallesteigenvalue of these matrices. Equivalently, ๐1 (resp. ๐2) isa generalized eigenvector [39] of the pair (๐จ1,๐จ
โ12 ) [resp.
(๐จ2,๐จโ11 )].
F. Proof of Theorem 2
The asymptotic distributions of the GLR statistic ๐GLR inthe weak signal regime are given in [31, Sec. 6.5] for a testof the form
โ0 : ๐ฝ๐ = ๐ฝ๐0 , ๐ฝ๐ โ1 : ๐ฝ๐ โ= ๐ฝ๐0 , ๐ฝ๐ , (69)
where ๐ฝ๐ โ โ๐, ๐ฝ๐ โ โ๐ . One wishes to test whether ๐ฝ๐ =๐ฝ๐0 as opposed to ๐ฝ๐ โ= ๐ฝ๐0 , and ๐ฝ๐ is a vector of nuisanceparameters which are unknown (but the same) under eitherhypothesis. The distributions are
2 log๐GLR โผ{
๐2๐ , under โ0,
๐โฒ2๐ (๐ผ), under โ1,
(70)
where the non-centrality parameter ๐ผ is given in [31, eq.(6.24)]. In our case, the nuisance parameters are given by theelements of the noise spatial covariance ฮฃ. On the other hand,the parameter to be tested is ๐ = 0 versus ๐ โ= 0. However,since the p.d.f. depends on ๐ only through ๐๐๐ป , which isinvariant to multiplication of ๐ by a unit-magnitude complexscalar, we can fix the imaginary part of the last element of ๐(say) to zero. Thus the vector ๐ฝ๐ comprises the real parts ofthe elements of ๐, plus the imaginary parts of the elements of๐ save the last one. The size of ๐ฝ๐ is therefore ๐ = 2๐ โ 1.
Direct application of [31, eq. (6.24)] to determine ๐ผ isinvolved for the model at hand. We propose an alternativeapproach based on the following result, whose proof will bepresented in turn.
Lemma 6: Consider the GLR statistic ๐ from (8). Then onehas
lim๐โโ
๐ผ[โฃ๐ โ ๐ (๐0)โฃ2] = 0, (71)
where ๐0 = ๐๐ปฮฃโ1๐ = ๐๐ปโฅ ๐ is the true value of the SNR,
and
๐ (๐).=
[ 1๐ tr(๐ฐ๐ + ๐๐ช)
[det(๐ฐ๐ + ๐๐ช)]1/๐
]๐. (72)
From Lemma 6, it follows that
lim๐โโ
๐ผ[2 log๐ ] = lim๐โโ
2 log๐ (๐0) (73)
= lim๐โโ
2๐ log1๐ tr(๐ฐ๐ + ๐0๐ช)
[det(๐ฐ๐ + ๐0๐ช)]1/๐.
On the other hand, note that if ๐ฅ โผ ๐โฒ2๐ (๐ผ), then ๐ผ[๐ฅ] = ๐+๐ผ.
Hence,
lim๐โโ
๐ผ[2 log๐ ] = (2๐ โ 1) + lim๐โโ
๐ผ. (74)
It follows from (73)-(74) that for sufficiently large ๐ , ๐ผ โซ2๐โ1 and one may approximate ๐ผ โ 2 log๐ (๐0) = ๐ผsph(๐0).
Proof of Lemma 6: Write the data matrix as ๐ = ๐๐๐ +ฮฃ1/2๐พ , where the entries of ๐พ
.= ฮฃโ1/2[ ๐1 ๐2 โ โ โ ๐๐ ]
are independent zero-mean complex circular Gaussian randomvariables with unit variance. It follows that for ๐จ โ โ๐ร๐ ,
๐ผ[๐ ๐จ๐ ๐ป ] = tr(๐ชโ๐จ)๐๐๐ป + tr(๐จ)ฮฃ. (75)
Now note that 1๐๐ ๐จ๐ ๐ป is a consistent estimator of its mean
for the matrices ๐จ in this paper, and hence
1
๐๐ ๐จ๐ ๐ป var
=tr(๐ชโ๐จ)
๐๐๐๐ป +
tr(๐จ)
๐ฮฃ, (76)
wherevar= denotes stochastic convergence in variance [48],
i.e., ๐๐var= ๐๐ iff lim๐โโ ๐ผ[โฃ๐๐ โ ๐๐ โฃ2] = 0, applying
componentwise for matrices. Let us rewrite the GLR statistic๐ as
๐ = max๐โฅ0
๐โ๐min
[( 1๐๐ ๐ ๐ป)โ1( 1
๐๐ ๐ฎโ(๐)๐ ๐ป)]
[det๐ฎโ(๐)]โ1. (77)
Then, taking into account that tr๐ชโ = tr ๐ฐ๐ = ๐ , theasymptotic behavior of ๐ can be found:
๐var= (78)
max๐โฅ0๐โ๐
min [1๐ tr๐ฎโ(๐)โ (ฮฃ+๐๐๐ป )โ1(ฮฃ+๐(๐)๐๐๐ป )]
[det๐ฎโ(๐)]โ1 =
max๐โฅ0
[1๐ tr๐ฎโ(๐)
[det๐ฎโ(๐)]1/๐ โ ๐min
[๐ฐ๐ โ 1โ๐(๐)
1+๐0๐โฅ๐๐ป
]]โ๐
(79)
where
๐(๐).=
tr(๐ชโ๐ฎโ(๐))tr๐ฎโ(๐)
. (80)
In order to obtain (79), the Matrix Inversion Lemma has beenapplied. Denote now the matrix in brackets in (79) by ๐ฉ. Then๐ฉ has an eigenvalue equal to one with multiplicity ๐ โ 1,since ๐ฉ๐ = ๐ for any ๐ such that ๐๐ป๐ = 0. The remainingeigenvalue is associated to the eigenvector ๐โฅ and is given by
๐ =1 + ๐0๐(๐)
1 + ๐0. (81)
We claim that ๐(๐) โค 1 for all ๐ โฅ 0, so that ๐ in (81) isless than one and hence it is the minimum eigenvalue of ๐ฉ.First, note that ๐(0) = tr๐ชโ/ tr ๐ฐ๐ = 1. On the other hand,for ๐ > 0, since ๐ช and ๐ฎ(๐) share the same eigenvectors, wecan write ๐(๐) in (80) in terms of the eigenvalues {๐๐}๐โ1
๐=0
of ๐ช as
๐(๐) =
โ๐โ1๐=0
๐๐
1+๐๐๐โ๐โ1๐=0
11+๐๐๐
(82)
=1
๐
[๐โ๐โ1
๐=01
1+๐๐๐
โ 1
], (83)
where (83) follows from the fact that ๐ =โ
๐1
1+๐๐๐+
๐โ
๐๐๐
1+๐๐๐. Note now that for positive numbers {๐ฅ๐}๐โ1
๐=0 ,the Cauchy-Schwarz inequality can be invoked to show that(
1๐
โ๐ ๐ฅ๐
) (1๐
โ๐
1๐ฅ๐
)โฅ 1. Hence, taking ๐ฅ๐ = 1 + ๐๐๐,
and noting thatโ
๐ ๐๐ = ๐ , one has (1+๐)๐
โ๐
11+๐๐๐
โฅ 1.
12
Applying this to (83) yields ๐(๐) โค 1, as desired. Therefore,from (79), one has
๐var=
[min๐โฅ0
[1๐ tr๐ฎโ(๐)
det1/๐ ๐ฎโ(๐)โ 1 + ๐0๐(๐)
1 + ๐0
]]โ๐
. (84)
Note from (80) that 1 + ๐0๐(๐) can be written as
1 + ๐0๐(๐) =tr[(๐ฐ๐ + ๐0๐ช
โ)๐ฎโ(๐)]tr๐ฎโ(๐)
. (85)
Substituting (85) in (84),
๐var=
(1
๐(1 + ๐0)min๐โฅ0
๐ป(๐)
)โ๐
, (86)
where we have introduced
๐ป(๐).=
tr[(๐ฐ๐ + ๐0๐ชโ)๐ฎโ(๐)]
det1/๐ ๐ฎโ(๐). (87)
Since ๐ฎโ(๐) = (๐ฐ๐ + ๐๐ชโ)โ1, applying Jensenโs inequalityone has
log๐ป(๐)
= log๐โ1โ๐=0
1 + ๐0๐๐1 + ๐๐๐
+1
๐
๐โ1โ๐=0
log(1 + ๐๐๐) (88)
โฅ log๐ +1
๐
๐โ1โ๐=0
log1 + ๐0๐๐1 + ๐๐๐
+1
๐
๐โ1โ๐=0
log(1 + ๐๐๐)
(89)
= log๐ +1
๐
๐โ1โ๐=0
log(1 + ๐0๐๐) (90)
= log๐ป(๐0), (91)
showing that the minimum is attained at ๐ = ๐0. Hence,
๐var=
โโโ
[โ๐โ1๐=0 (1 + ๐0๐๐)
]1/๐1 + ๐0
โโโ
โ๐
= ๐ (๐0), (92)
as was to be shown.Remark 3: Asymptotic unimodality in ๐. In addition to
being minimized at ๐ = ๐0, the function ๐ป(๐) in (87)has no other local minimum in ๐ โฅ 0 provided that thedetectability condition ๐ช โ= ๐ฐ๐ holds, as we show next.Differentiating (88),
d log๐ป(๐)
d๐=
1
๐
๐โ1โ๐=0
๐๐1 + ๐๐๐
โ[๐โ1โ๐=0
1 + ๐0๐๐1 + ๐๐๐
]โ1 ๐โ1โ๐=0
๐๐(1 + ๐0๐๐)
(1 + ๐๐๐)2. (93)
Let now
๐ฟ.= ๐0โ๐, ๐ฅ๐
.=
๐๐1 + ๐๐๐
โ 1 + ๐0๐๐1 + ๐๐๐
= 1+ ๐ฟ๐ฅ๐,
(94)
so that (93) becomes
d log๐ป(๐)
d๐(95)
=1
๐
โ๐
๐ฅ๐ โ[โ
๐
(1 + ๐ฟ๐ฅ๐)
]โ1 โ๐
๐ฅ๐(1 + ๐ฟ๐ฅ๐)
= โ[โ
๐
(1 + ๐ฟ๐ฅ๐)
]โ1
๏ธธ ๏ธท๏ธท ๏ธธ.=๐1
โกโฃโ
๐
๐ฅ2๐ โ 1
๐
(โ๐
๐ฅ๐
)2โคโฆ
๏ธธ ๏ธท๏ธท ๏ธธ.=๐2
โ ๐ฟ.
(96)
Note from (94) that ๐1 > 0 for all ๐ โฅ 0. On the other hand,from the Cauchy-Schwarz inequality one has ๐2 โฅ 0, withequality iff all ๐ฅ๐ are equal. But this would imply that alleigenvalues of ๐ช are equal, i.e., ๐ช = ๐ฐ๐ ; thus, ๐2 > 0.We conclude that the derivative (96) is negative if ๐ฟ > 0and positive if ๐ฟ < 0, which shows that ๐ป(๐) has a uniqueminimum at ๐ = ๐0.
G. Gradient descent computations
The goal is to maximize ๐ก(๐) in (8) over ๐ โฅ 0. Equivalently,we can minimize ๐ก0(๐)
.= ๐กโ1/๐ (๐) by means of an iterative
gradient descent of the form ๐๐+1 = ๐๐ โ ๐๐๐กโฒ0(๐๐), with
๐๐ > 0 a suitable stepsize sequence. In the simulations, weinitialized ๐0 = 1, ๐0 = 100. The stepsize is reduced as per๐๐ = 0.25๐๐โ1 whenever there is a sign change in the descentdirection. The stopping criterion adopted is โฃ๐กโฒ0(๐๐)โฃ < 10โ5
with a maximum of 100 iterations.The required derivative is obtained as follows. Denote the
minimum unit-norm eigenvector of ๐ฝ ๐ป(๐ฐ๐ + ๐๐ชโ)โ1๐ฝ by๐0(๐). Then we can write
๐0(๐).= ๐min[๐ฝ
๐ป(๐ฐ๐ + ๐๐ชโ)โ1๐ฝ ] (97)
= ๐๐ป0 (๐)
[๐ฝ ๐ป(๐ฐ๐ + ๐๐ชโ)โ1๐ฝ
]๐0(๐), (98)
and therefore, using the expressions for eigenvalues derivativesin [47], one has
๐กโฒ0(๐)
= [det(๐ฎโ(๐))]โ1/๐ โ โโ๐
(๐๐ป0 (๐)
[๐ฝ ๐ป๐ฎโ(๐)๐ฝ
]๐0(๐)
)+ ๐0(๐) โ โ
โ๐[det(๐ฎโ(๐))]โ1/๐
= โ [det(๐ฎโ(๐))]โ1/๐ โ โ (
๐๐ป0 (๐)
[๐ฝ ๐ป๐ฎโ(๐)๐ชโ๐ฎโ(๐)๐ฝ
]๐0(๐)
)+ ๐0(๐) โ [det(๐ฎโ(๐))]โ1/๐ 1
๐tr (๐ฎโ(๐)๐ชโ)
=
[โ๐๐ป
0 (๐)๐จ(๐)๐0(๐) +1
๐๐0(๐) โ tr (๐ฎโ(๐)๐ชโ)
]โ
โ [det(๐ฎโ(๐))]โ1/๐,
where ๐จ(๐).= ๐ฝ ๐ป๐ฎโ(๐)๐ชโ๐ฎโ(๐)๐ฝ . Using the EVD ๐ช =
๐ญฮ๐ญ๐ป and the fact that ๐ญ๐ป approaches the orthonormal DFTmatrix for large ๐ , one can precompute ๐ญ๐ป๐ฝ โ efficiently byapplying the FFT to the columns of ๐ฝ โ. The elements of๐จ(๐) are then obtained as weighted crosscorrelations between
13
the columns of ๐ญ๐ป๐ฝ โ, where the weights are of the form๐๐/(1 + ๐๐๐)
2. A similar approach can be used to compute๐ฝ ๐ป๐ฎโ(๐)๐ฝ efficiently.
REFERENCES
[1] M. Wax, T. Kailath, โDetection of signals by information-theoreticcriteria,โ IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-33,pp. 387โ392, Apr. 1985.
[2] M. Viberg, B. Ottersten, T. Kailath, โDetection and estimation in sensorarrays using weighted subspace fitting,โ IEEE Trans. Signal Process.,vol. 39, pp. 2436โ2449, Nov. 1991.
[3] L. L. Scharf, B. Friedlander, โMatched subspace detectors,โ IEEE Trans.Signal Process., vol. 42, pp. 2146โ2157, Aug. 1994.
[4] O. Besson, S. Kraut, L. L. Scharf, โDetection of an unknown rank-onecomponent in white noise,โ IEEE Trans. Signal Process., vol. 54, pp.2835โ2839, Jul. 2006.
[5] A. Leshem, A.-J. van der Veen, โMultichannel detection of Gaussiansignals with uncalibrated receivers,โ IEEE Signal Process. Lett., vol. 8,pp. 120โ122, Apr. 2001.
[6] S. A. Vorobyov, A. Gershman, Kon Max Wong, โMaximum likelihooddirection-of-arrival estimation in unknown noise fields using sparsesensor arrays,โ IEEE Trans. Signal Process., vol. 53, pp. 34โ43, Jan.2005.
[7] M. Agrawal, S. Prasad, โA modified likelihood function approach toDOA estimation in the presence of unknown spatially correlated Gaus-sian noise using a uniform linear array,โ IEEE Trans. Signal Process.,vol. 48, pp. 2743โ2749, Oct. 2000.
[8] Yuehua Wu, Kwok-Wai Tam, Fu Li, โDetermination of number ofsources with multiple arrays in correlated noise fields,โ IEEE Trans.Signal Process., vol. 50, pp. 1257โ1260, Jun. 2002.
[9] P. Stoica, Kon Max Wong, Qiang Wu, โOn a nonparametric detectionmethod for array signal processing in correlated noise fields,โ IEEETrans. Signal Process., vol. 44, pp. 1030โ1032, Apr. 1996.
[10] M. Viberg, P. Stoica, B. Ottersten, โMaximum likelihood array pro-cessing in spatially correlated noise fields using parameterized signals,โIEEE Trans. Signal Process., vol. 45, pp. 996โ1004, Apr. 1997.
[11] I. Akyildiz, Won-Yeol Lee, M. Vuran, S. Mohanty, โA survey onspectrum management in cognitive radio networks,โ IEEE Commun.Mag., vol. 46, pp. 40โ48, Apr. 2008.
[12] J. M. Peha, โSharing spectrum through spectrum policy reform andcognitive radio,โ Proc. IEEE, vol. 97, pp. 708โ719, Apr. 2009.
[13] J. Unnikrishnan, V. Veeravalli, โCooperative sensing for primary detec-tion in cognitive radio,โ IEEE J. Sel. Topics Signal Process., vol. 2, pp.18โ27, Feb. 2008.
[14] A. Taherpour, M. Nasiri-Kenari, S. Gazor, โMultiple antenna spectrumsensing in cognitive radios,โ IEEE Trans. Wireless Commun., vol. 9, pp.814โ823, Feb. 2010.
[15] Rui Zhang, Teng Lim, Ying-Chang Liang, Yonghong Zeng, โMulti-antenna based spectrum sensing for cognitive radios: A GLRT ap-proach,โ IEEE Trans. Commun., vol. 58, pp. 84โ88, Jan. 2010.
[16] Pu Wang, Jun Fang, Ning Han, Hongbin Li, โMultiantenna-assistedspectrum sensing for cognitive radio,โ IEEE Trans. Veh. Technol., vol.59, pp. 1791โ1800, May 2010.
[17] D. Ramฤฑrez, G. Vazquez-Vilar, R. Lopez-Valcarce, J. Vฤฑa, I. Santamarฤฑa,โDetection of rank-P signals in cognitive radio networks with uncal-ibrated multiple antennas,โ IEEE Trans. Signal Process., vol. 59, pp.3764โ3774, Aug. 2011.
[18] IEEE 802.22 WRAN Working Group, IEEE 802.22-2011 Standard forCognitive Wireless Regional Area Networks (RAN) for Operation in TVBands. Piscataway, NJ: IEEE Standards Association, Jul. 2011.
[19] National Telecommunications and Information Administration, Report04-413: Potential interference from broadband over power line (BPL)systems to Federal Government radiocommunications at 1.7-80 MHz.Washington, D.C.: NTIA, Apr. 2004.
[20] J. P. Le Cadre, โParametric methods for spatial signal processing in thepresence of unknown colored noise fields,โ IEEE Trans. Acoust., Speech,Signal Process., vol. ASSP-37, pp. 965โ983, Jul. 1989.
[21] Zhi Quan, Wenyi Zhang, S. J. Shellhammer, A. H. Sayed, โOptimalspectral feature detection for spectrum sensing at very low SNR,โ IEEETrans. Commun., vol. 59, pp. 201โ212, Jan. 2011.
[22] Wenyi Zhang, H. Poor, Zhi Quan, โFrequency-domain correlation: anasymptotically optimum approximation of quadratic likelihood ratiodetectors,โ IEEE Trans. Signal Process., vol. 58, pp. 969โ979, Mar.2010.
[23] G. Vazquez-Vilar, R. Lopez-Valcace, J. Sala, โMultiantenna spectrumsensing exploiting spectral a priori information,โ IEEE Trans. WirelessComm.,, Vol.10, No.12, pp. 4345โ4355, Dec. 2011.
[24] J. Tellado, Multicarrier modulation with low PAR: Applications to DSLand wireless. Norwell, MA: Kluwer Academic, 2000.
[25] A. Dogandzic, A. Nehorai, โGeneralized multivariate analysis of vari-ance - A unified framework for signal processing in correlated noise,โIEEE Signal Process. Mag., vol. 20, pp. 39โ54, Sep. 2003.
[26] S. Kraut, L. L. Scharf, L. T. McWhorter, โAdaptive subspace detectors,โIEEE Trans. Signal Process., vol. 49, pp. 1โ16, Jan. 2001.
[27] Yuanwei Jin, B. Friedlander, โA CFAR adaptive subspace detector forsecond-order Gaussian signals,โ IEEE Trans. Signal Process., vol. 53,pp. 871โ884, Mar. 2005.
[28] A. Paulraj, T. Kailath, โEigenstructure methods for direction-of-arrivalestimation in the presence of unknown noise fields,โ IEEE Trans.Acoust., Speech, Signal Process., vol. ASSP-34, pp. 13โ20, Jan. 1986.
[29] P. Stoica, M. Cedervall, โDetection tests for array processing in unknowncorrelated noise fields,โ IEEE Trans. Signal Process., vol. 45, pp. 2351โ2362, Sep. 1997.
[30] B. C. Levy, Principles of Signal Detection and Parameter Estimation,Springer, 2010.
[31] S. M. Kay, Fundamentals of Statistical Signal Processing: DetectionTheory. Englewood Cliffs, NJ: Prentice-Hall, 1998.
[32] R. M. Gray, Toeplitz and Circulant Matrices: A Review. NY:Hanover/Now, 2006.
[33] J. Capon, โHigh-resolution frequency-wavenumber spectrum analysis,โProc. IEEE, vol. 57, pp. 1408โ1418, Aug. 1969.
[34] A. J. Laub, Matrix Analysis for Scientists and Engineers, Soc. forIndustrial and Applied Math., Philadelphia, 2005.
[35] H. L. van Trees, Optimum Array Processing (Detection, Estimation andModulation Theory), Wiley, 2002.
[36] S. M. Kay, Modern Spectral Estimation: Theory and Applications,Prentice-Hall, 1987.
[37] D. E. Bernstein, Matrix Mathematics: Theory, Facts and Formulas, 2nded. Princeton University Press, 2009.
[38] R. A. Horn, C. R. Johnson, Matrix Analysis. Cambridge University Press,1986.
[39] G. H. Golub, C. F. van Loan, Matrix Computations, 2nd ed. JohnHopkins University Press, 1989.
[40] U. Grenander, G. Szego, Toeplitz Forms and Their Applications. NewYork, Chelsea Pub. Co., 1984.
[41] A. J. Gray, J. Markel, โA spectral flatness measure for studying theautocorrelation method of linear prediction of speech analysis,โ IEEETrans. Acoust., Speech, Signal Process., vol. ASSP-22, pp. 207โ217,Jun. 1974
[42] J. P. Dugre, L. L. Scharf, A. Beex, โA note on the measurement ofspectral flatness and the calculation of prediction error variances,โ Proc.IEEE Int. Conf. Acoust., Speech, Signal Process., 1980, vol. 5, pp. 607โ611.
[43] EN 300 910, โDigital cellular telecommunications system (Phase 2+);Radio transmission and reception (GSM 05.05 version 8.5.1 Release1999),โ ETSI, Nov. 2000.
[44] L. Hentila, P. Kyosti, M. Kaske, M. Narandzic, M. Alatossava, โMatlabimplementation of the WINNER Phase II Channel Model v1.1โ, Online:https://www.ist-winner.org/phase 2 model.html, 2007.
[45] R. Tandra, A. Sahai, โSNR Walls for Signal Detectionโ, IEEE Journal ofSelected Topics in Signal Processing, Vol.2, No.1, pp. 3-17, Feb. 2008.
[46] P. Lancaster, M. Tismenetsky, The Theory of Matrices, 2nd ed. SanDiego, CA: Academic Press, 1985.
[47] J. R.Magnus, H. Neudecker, Matrix Differential Calculus with Applica-tions to Statistics and Econometrics, revised ed. Chichester, England:Wiley, 1999.
[48] R. J. Serfling, Approximation Theorems of Mathematical Statistics.Chichester, England: Wiley, 1980.