Multiantenna GLR Detection of Rank-One Signals With Known Power Spectrum in White Noise With Unknown...

13
1 Multiantenna GLR detection of rank-one signals with known power spectrum in white noise with unknown spatial correlation Josep Salaโ€ , Senior Member, IEEE, Gonzalo Vazquez-Vilarโ€ก, Student Member, IEEE, Roberto Lยด opez-Valcarceโ€ก, Member, IEEE, โ€ Dept. of Signal Theory and Communications, Technical University of Catalonia, 08034 Barcelona, SPAIN E-mail: [email protected] โ€กDept. of Signal Theory and Communications, University of Vigo, 36310 Vigo, SPAIN E-mail: {gvazquez,valcarce}@gts.uvigo.es submitted (minors after AQ) to: IEEE Transactions on Signal Processing EDICS: SPC-DETC (Detection, Estimation and Demodulation), SSP-DETC (Detection) Abstractโ€”Multiple-antenna detection of a Gaussian signal with spatial rank one in temporally white Gaussian noise with arbitrary and unknown spatial covariance is considered. This is motivated by spectrum sensing problems in the context of Dynamic Spectrum Access in which several secondary networks coexist but do not cooperate, creating a background of spatially correlated broadband interference. When the temporal correla- tion of the signal of interest is assumed known up to a scale factor, the corresponding Generalized Likelihood Ratio Test is shown to yield a scalar optimization problem. Closed-form expressions of the test are obtained for the general signal spectrum case in the low signal-to-noise ratio (SNR) regime, as well as for signals with binary-valued power spectrum in arbitrary SNR. The two resulting detectors turn out to be equivalent. An asymptotic approximation to the test distribution for the low-SNR regime is derived, closely matching empirical results from spectrum sensing simulation experiments. Index Termsโ€”GLR test, detection, multiantenna array, corre- lated noise, noise uncertainty, cognitive radio, spectrum sensing, spectral ๏ฌ‚atness measure, Capon beamformer I. I NTRODUCTION Array processing for signal detection and parameter esti- mation has been a topic of deep research interest for decades. Many techniques have been proposed under the assumption that the noise is Gaussian and spatially uncorrelated (or has a known spatial structure that allows prewhitening) [1]โ€“[5]. However, in many practical cases the noise ๏ฌeld may present unknown spatial correlation, due to co-channel interference in communications applications, and to clutter and jamming in radar array processing [6]โ€“[10]. Our main motivation stems from the problem of spectrum sensing for Dynamic Spectrum Access (DSA) in licensed bands [11], [12]. DSA aims at a more ef๏ฌcient usage of spectrum by allowing unlicensed (or secondary) users to opportunistically access channels in those bands, as long as interference to licensed (or primary) users is avoided; this can be achieved by dynamically tuning to This work has been jointly supported by projects 2009SGR1236 (AGAUR), 2009/062-2010/85 (Consolidation of Research Units), DYNACS (TEC2010- 21245-C02/TCM) and COMONSENS (CONSOLIDER-INGENIO 2010 CSD2008-00010) ๏ฌnanced by the Catalan, Galician and Spanish Governments and the European Regional Development Fund (ERDF). different carrier frequencies to sense the radio environment for unused channels. Spectrum sensing schemes must be robust to wireless propagation phenomena such as large- and small-scale fading as well as to interference. The ๏ฌrst two issues can be dealt with by resorting to cooperative detection strategies [13] and multiantenna detectors [14]โ€“[17], respectively. Multiantenna spectrum sensing methods usually exploit spatial correlation of the signal, assuming a spatially uncor- related noise ๏ฌeld. Whereas this assumption may be adequate in early stages of DSA-based schemes, in which a single secondary network coexists with the primary system, this is not necessarily so when several independent secondary systems access the same frequency band. Unless strict coordination among such systems is imposed, no quiet periods will be available in order to sense primary activity in a given channel. Hence, such activity will have to be detected in the presence of a background noise consisting not only of thermal noise, but also the aggregate interference from the rest of secondary networks. The physical layer of secondary systems is likely to make use of channel fragmentation, aggregation, and/or bonding in order to enhance spectrum utilization (such is the case for e.g. the IEEE 802.22 standard for opportunistic access to TV white spaces [18]), so that the background interference will likely be broadband (with respect to the bandwidth of a primary channel), and potentially correlated in space, due to the well-localized origin of secondary transmissions. Other potential sources of broadband interference of course exist, e.g. the emissions of Broadband over Power Line (BPL) systems in the HF and VHF bands [19]. After bandlimiting by the channel selection ๏ฌlter of the receiver, the contribution due to the broadband interference will tend to be Gaussian distributed, and will be modeled as temporally uncorrelated in this work. More sophisticated noise models, e.g. autoregressive in time and/or space [20], may be of interest in certain scenarios but fall out of the scope of this paper. On the other hand, it may well be possible for the spec- trum sensor to have knowledge about the (normalized) Power Spectrum Density (PSD) of primary signals, as the emission masks of many licensed services (e.g., broadcast and cellular networks) are in the public domain. Knowledge of the signal

Transcript of Multiantenna GLR Detection of Rank-One Signals With Known Power Spectrum in White Noise With Unknown...

1

Multiantenna GLR detection of rank-one signalswith known power spectrum in white noise with

unknown spatial correlationJosep Salaโ€ , Senior Member, IEEE, Gonzalo Vazquez-Vilarโ€ก, Student Member, IEEE, Roberto

Lopez-Valcarceโ€ก, Member, IEEE,โ€ Dept. of Signal Theory and Communications, Technical University of Catalonia, 08034 Barcelona, SPAIN

E-mail: [email protected]โ€กDept. of Signal Theory and Communications, University of Vigo, 36310 Vigo, SPAIN

E-mail: {gvazquez,valcarce}@gts.uvigo.essubmitted (minors after AQ) to: IEEE Transactions on Signal Processing

EDICS: SPC-DETC (Detection, Estimation and Demodulation), SSP-DETC (Detection)

Abstractโ€”Multiple-antenna detection of a Gaussian signalwith spatial rank one in temporally white Gaussian noise witharbitrary and unknown spatial covariance is considered. Thisis motivated by spectrum sensing problems in the context ofDynamic Spectrum Access in which several secondary networkscoexist but do not cooperate, creating a background of spatiallycorrelated broadband interference. When the temporal correla-tion of the signal of interest is assumed known up to a scale factor,the corresponding Generalized Likelihood Ratio Test is shown toyield a scalar optimization problem. Closed-form expressions ofthe test are obtained for the general signal spectrum case inthe low signal-to-noise ratio (SNR) regime, as well as for signalswith binary-valued power spectrum in arbitrary SNR. The tworesulting detectors turn out to be equivalent. An asymptoticapproximation to the test distribution for the low-SNR regime isderived, closely matching empirical results from spectrum sensingsimulation experiments.

Index Termsโ€”GLR test, detection, multiantenna array, corre-lated noise, noise uncertainty, cognitive radio, spectrum sensing,spectral flatness measure, Capon beamformer

I. INTRODUCTION

Array processing for signal detection and parameter esti-mation has been a topic of deep research interest for decades.Many techniques have been proposed under the assumptionthat the noise is Gaussian and spatially uncorrelated (or hasa known spatial structure that allows prewhitening) [1]โ€“[5].However, in many practical cases the noise field may presentunknown spatial correlation, due to co-channel interference incommunications applications, and to clutter and jamming inradar array processing [6]โ€“[10]. Our main motivation stemsfrom the problem of spectrum sensing for Dynamic SpectrumAccess (DSA) in licensed bands [11], [12]. DSA aims at amore efficient usage of spectrum by allowing unlicensed (orsecondary) users to opportunistically access channels in thosebands, as long as interference to licensed (or primary) usersis avoided; this can be achieved by dynamically tuning to

This work has been jointly supported by projects 2009SGR1236 (AGAUR),2009/062-2010/85 (Consolidation of Research Units), DYNACS (TEC2010-21245-C02/TCM) and COMONSENS (CONSOLIDER-INGENIO 2010CSD2008-00010) financed by the Catalan, Galician and Spanish Governmentsand the European Regional Development Fund (ERDF).

different carrier frequencies to sense the radio environment forunused channels. Spectrum sensing schemes must be robust towireless propagation phenomena such as large- and small-scalefading as well as to interference. The first two issues can bedealt with by resorting to cooperative detection strategies [13]and multiantenna detectors [14]โ€“[17], respectively.

Multiantenna spectrum sensing methods usually exploitspatial correlation of the signal, assuming a spatially uncor-related noise field. Whereas this assumption may be adequatein early stages of DSA-based schemes, in which a singlesecondary network coexists with the primary system, this is notnecessarily so when several independent secondary systemsaccess the same frequency band. Unless strict coordinationamong such systems is imposed, no quiet periods will beavailable in order to sense primary activity in a given channel.Hence, such activity will have to be detected in the presenceof a background noise consisting not only of thermal noise,but also the aggregate interference from the rest of secondarynetworks. The physical layer of secondary systems is likelyto make use of channel fragmentation, aggregation, and/orbonding in order to enhance spectrum utilization (such is thecase for e.g. the IEEE 802.22 standard for opportunistic accessto TV white spaces [18]), so that the background interferencewill likely be broadband (with respect to the bandwidth ofa primary channel), and potentially correlated in space, dueto the well-localized origin of secondary transmissions. Otherpotential sources of broadband interference of course exist, e.g.the emissions of Broadband over Power Line (BPL) systems inthe HF and VHF bands [19]. After bandlimiting by the channelselection filter of the receiver, the contribution due to thebroadband interference will tend to be Gaussian distributed,and will be modeled as temporally uncorrelated in this work.More sophisticated noise models, e.g. autoregressive in timeand/or space [20], may be of interest in certain scenarios butfall out of the scope of this paper.

On the other hand, it may well be possible for the spec-trum sensor to have knowledge about the (normalized) PowerSpectrum Density (PSD) of primary signals, as the emissionmasks of many licensed services (e.g., broadcast and cellularnetworks) are in the public domain. Knowledge of the signal

2

PSD has been exploited in [21] assuming a single antennaand known noise variance; and in [22], [23] for multiple an-tennas with spatially uncorrelated noise with known commonvariance, and respectively known and unknown channel gains.

Motivated by the above considerations, we derive the Gen-eralized Likelihood Ratio (GLR) detector and its ReceiverOperating Characteristic (ROC) for a random signal withspatial rank one and known PSD in temporally white Gaussiannoise of arbitrary spatial covariance. The nuisance parametersare the unknown antenna gains for the signal of interest andthe unknown spatial covariance of the noise. No structure isassumed on the latter, thus avoiding artificial constraints onthe unknown spatial rank of the interference, for the sake ofrobustness and detector complexity. With this model, intuitionsuggests that detection should be feasible as long as thetemporal correlation of the signal differs from that of thenoise process, i.e., the signal is not temporally white. Indeed,the performance of the GLR detector will be shown to bedependent on the spectral flatness measure at the output ofthe minimum variance beamformer.

Following common practice, the Gaussian model is adoptedfor the primary signal. The resulting model is tractable andleads to useful detectors even under different signal distri-butions. In addition, if the primary system uses multicarriermodulation, the Gaussian assumption becomes quite accuratefor the number of subcarriers usually found in practice [24].

Previous works in the literature on signal detection arerelated to the scheme presented in this paper. The single-antenna scenario considering real- rather than complex-valueddata in the proposed signal model has been treated in [30]using the Expectation-Maximization (EM) algorithm. On theother hand, the problem of signal detection with multiple an-tennas and spatially correlated noise has been addressed by theGeneralized Multivariate Analysis of Variance (GMANOVA)approach [25] when the signal is deterministic dependenton unknown parameters. Adaptive subspace detectors canbe used if the signal has low spatial rank and signal-freetraining data are available for estimating the unknown noisecovariance; see [26] for the case of deterministic signals,and [27] for Gaussian signal components. Other approachesfor the Gaussian signal model exploit rotational invariance ofthe noise field (if it exists) [28] or assume a parametric modelof the noise [20]. When the spatial correlation of noise isunstructured, and lacking training data, as in our model, itmay be possible to exploit the temporal correlation propertiesof signal and noise as in [29], where it is assumed that noisehas a much shorter temporal correlation length than that ofthe signals. However, none of these works derives the GLRtest under the proposed signal model. Moreover, the detectorherein described can be validated with the result in [30] inthe single antenna setting, leading to a simpler univariateoptimization scheme, and can significantly outperform otherdetectors which apply under the same model, as shown bysimulation results for that from [29].

The paper is structured as follows. The signal model isgiven in Sec. II, together with a summary of the mainresults to be developed in the sequel. The derivation of theexact GLR detector is carried out in Sec. III. The resulting

scheme involves the optimization of a data-dependent termwith respect to a scalar variable, which cannot be solved inclosed form in general. Nevertheless, in Sec. IV two importantparticular instances are considered that will result in a closed-form scheme, namely the low-SNR and binary-valued signalPSD cases. The asymptotic performance of the GLR detectoris analytically derived in Sec. V. Numerical results are givenin Sec. VI, and Sec. VII presents some closing remarks.

Notation: lower- and upper-case boldface symbols denotevectors and matrices, respectively. The ๐‘˜-th unit vector isdenoted by ๐’†๐‘˜. The trace, determinant, transpose, conjugate,conjugate transpose (Hermitian), adjoint, and Moore-Penrosepseudoinverse of ๐‘จ are denoted by tr๐‘จ, det๐‘จ, ๐‘จ๐‘‡ , ๐‘จโˆ—,๐‘จ๐ป , adj(๐‘จ) and ๐‘จโ€  respectively. diag(๐‘จ) is a diagonalmatrix with diagonal equal to that of ๐‘จ. The Kroneckerproduct of ๐‘จ and ๐‘ฉ is denoted by ๐‘จโŠ—๐‘ฉ. The column-wisevectorization of ๐‘จ is denoted by vec(๐‘จ). For Hermitian ๐‘จ, itslargest (resp. smallest) eigenvalue is denoted by ๐œ†max[๐‘จ] (resp.๐œ†min[๐‘จ]); if ๐‘จ is positive (semi)definite, ๐‘จ1/2 denotes itsunique Hermitian square root. The natural (base ๐‘’) logarithmis denoted by log. Finally, ๐’™ โˆผ ๐’ž๐’ฉ (๐,๐‘ท ) indicates that ๐’™ iscircularly complex Gaussian with mean ๐ and covariance ๐‘ท .

II. PROBLEM FORMULATION

A. Signal Model

The general signal model considered in this paper is asfollows:

๐’š๐‘› = ๐’‰๐‘ ๐‘› + ๐’›๐‘› โˆˆ โ„‚๐‘€ , ๐‘› โˆˆ {1, โ‹… โ‹… โ‹… , ๐‘}, (1)

where ๐’š๐‘› represents a snapshot (sample of sensor outputs attime ๐‘›) from an array of ๐‘€ sensors, ๐’‰ โˆˆ โ„‚๐‘€ is the unknownspatial signature vector, and {๐‘ ๐‘›} โˆˆ โ„‚, {๐’›๐‘›} โˆˆ โ„‚๐‘€ areindependent zero-mean complex circular Gaussian processes.The signal vector ๐’”

.= [ ๐‘ 1 โ‹… โ‹… โ‹… ๐‘ ๐‘ ]๐‘‡ โˆˆ โ„‚๐‘ is zero

mean with known temporal covariance ๐‘ช.= ๐”ผ[๐’”๐’”๐ป ], i.e.,

๐’” โˆผ ๐’ž๐’ฉ (0,๐‘ช). The process {๐‘ ๐‘›} is assumed wide-sensestationary with PSD ๐‘†๐‘ ๐‘ (๐‘’

๐‘—๐œ”); thus, ๐‘ช is Hermitian Toeplitz.In addition, we assume that ๐”ผ[โˆฃ๐‘ ๐‘›โˆฃ2] = 1 without loss ofgenerality (since any scaling factor can be absorbed into ๐’‰),so that ๐‘ช has an all-ones diagonal. On the other hand, theprocess {๐’›๐‘›} models the background noise and interference.It is assumed to be temporally white1, but spatially correlatedwith unknown and unstructured spatial covariance matrix ฮฃ:๐”ผ[๐’›๐‘˜๐’›

๐ป๐‘™ ] = ฮฃ if ๐‘˜ = ๐‘™, and zero otherwise. Therefore,

defining the ๐‘€๐‘ ร— 1 data and noise vectors respectively as๐’š

.= [ ๐’š๐‘‡

1 โ‹… โ‹… โ‹… ๐’š๐‘‡๐‘ ]๐‘‡ and ๐’›

.= [ ๐’›๐‘‡

1 โ‹… โ‹… โ‹… ๐’›๐‘‡๐‘ ]๐‘‡ , one

can rewrite (1) as

๐’š = ๐’”โŠ— ๐’‰+ ๐’›, (2)

which is Gaussian with covariance ๐‘น.= ๐”ผ[๐’š๐’š๐ป ] = ๐‘ช โŠ—

๐’‰๐’‰๐ป + ๐‘ฐ๐‘ โŠ—ฮฃ.The problem considered is the detection of the presence

of signal ๐’” given the data vector ๐’š. Thus, the corresponding

1If ๐’›๐‘› is temporally nonwhite but with known temporal correlation, theproposed model can be obtained after a temporal prewhitening step.

3

hypothesis test is given by

โ„‹0 : ๐’š โˆผ ๐’ž๐’ฉ (0,๐‘น0), ๐‘น0 = ๐‘ฐ๐‘ โŠ—ฮฃ, (3)

โ„‹1 : ๐’š โˆผ ๐’ž๐’ฉ (0,๐‘น1), ๐‘น1 = ๐‘ช โŠ— ๐’‰๐’‰๐ป + ๐‘ฐ๐‘ โŠ—ฮฃ

๐’‰ โˆ•= 0, (4)

where ฮฃ is an unknown, unstructured, Hermitian positivedefinite matrix under both hypotheses.

Observe from (3)-(4) that a necessary condition for theproblem to be well-posed is that ๐‘ช โˆ•= ๐‘ฐ๐‘ . Otherwise, thecovariance matrix under โ„‹1 could be written as ๐‘ฐ๐‘ โŠ— ฮฃ withฮฃ = ๐’‰๐’‰๐ป + ฮฃ; the fact that both ฮฃ and ฮฃ are unknown,together with the lack of structure in these matrices, yieldsthe same admissible set of covariance matrices under bothhypotheses, which then become indistinguishable.

In order to cope with the unknown parameters ๐’‰ and ฮฃ, asensible approach is the Generalized Likelihood Ratio (GLR)test [31], in which these parameters are substituted by theirMaximum Likelihood (ML) estimates under each hypothesis:

๐‘‡ =maxฮฃ,๐’‰ ๐‘“(๐’š โˆฃฮฃ,๐’‰)maxฮฃ ๐‘“(๐’š โˆฃฮฃ,0)

โ„‹1

โ‰ทโ„‹0

๐›พ, (5)

where the probability density function (p.d.f.) ๐‘“ is given by

๐‘“(๐’š โˆฃฮฃ,๐’‰) = 1

๐œ‹๐‘€๐‘ det๐‘นexp{โˆ’๐’š๐ป๐‘นโˆ’1๐’š}. (6)

B. Summary of Main Results

Before delving into the derivation of the GLR test (5), wesummarize and discuss our main results. To this end, let usintroduce the data matrix ๐’€ โˆˆ โ„‚๐‘€ร—๐‘ and its (economy-size)singular value decomposition (SVD):

๐’€.= [ ๐’š1 โ‹… โ‹… โ‹… ๐’š๐‘ ] = ๐‘ผ๐‘บ๐‘ฝ ๐ป , (7)

where ๐‘ผ โˆˆ โ„‚๐‘€ร—๐‘€ is unitary, ๐‘บ โˆˆ โ„‚๐‘€ร—๐‘€ is positive definitediagonal, and ๐‘ฝ โˆˆ โ„‚๐‘ร—๐‘€ is semi-unitary, i.e., ๐‘ฝ ๐ป๐‘ฝ = ๐‘ฐ๐‘€ .Note that ๐’€ is related to the data vector ๐’š โˆˆ โ„‚๐‘€๐‘ by ๐’š =vec(๐’€ ). Then we have the following.

Theorem 1: The GLR test (5) can be written as follows,with ๐›พ a suitable threshold:

๐‘‡ = max๐œŒโ‰ฅ0

๐‘ก(๐œŒ)โ„‹1

โ‰ทโ„‹0

๐›พ, ๐‘ก(๐œŒ).=๐œ†โˆ’๐‘

min

[๐‘ฝ ๐ป(๐‘ฐ๐‘ + ๐œŒ๐‘ชโˆ—)โˆ’1๐‘ฝ

]det(๐‘ฐ๐‘ + ๐œŒ๐‘ชโˆ—)

,

(8)which involves a scalar optimization problem in the variable๐œŒ.

Note that the dependence of (8) with the data is onlythrough the semi-unitary matrix ๐‘ฝ , which is related to thetemporal dimension of the data. This is due to the lack ofknowledge about the spatial covariance of the noise-plus-interference process ๐’›๐‘›. In addition, it is readily checked that if๐‘ช = ๐‘ฐ๐‘ then the statistic in (8) becomes ๐‘‡ = 1 independentlyof the data, reflecting again the fact that white signals are notdetectable under this model.

The parameter ๐œŒ featuring in (8) is related to the Signal-to-Noise Ratio (SNR), as will be shown in the sequel. To thebest of our knowledge, there is no closed-form solution for thegeneral maximization problem (8). Nevertheless, the followingresult applies in the asymptotic regime of low SNR.

Lemma 1: For vanishingly small SNR, the GLR test (8)becomes equivalent to the following test:

๐‘‡ โ€ฒ .= ๐œ†max[๐‘ฝ๐ป๐‘ชโˆ—๐‘ฝ ]

โ„‹1

โ‰ทโ„‹0

๐›พโ€ฒ. (9)

In order to illustrate the meaning of (9), let ๐‘ช = ๐‘ญฮ›๐‘ญ๐ป

with ฮ› = diag(๐œ†0, โ‹… โ‹… โ‹… , ๐œ†๐‘โˆ’1) be an eigenvalue decomposi-tion (EVD) of ๐‘ช . It is well known [31] that as ๐‘ โ†’ โˆž theeigenvalues of ๐‘ช approach ๐œ†๐‘˜ โ†’ ๐‘†๐‘ ๐‘ (๐‘’

๐‘— 2๐œ‹๐‘˜๐‘ ), 0 โ‰ค ๐‘˜ โ‰ค ๐‘โˆ’1,

whereas the matrix of eigenvectors ๐‘ญ approach the ๐‘ ร— ๐‘orthonormal Inverse Discrete Fourier Transform (IDFT) matrix(this can be formally justified in terms of the asymptoticequivalence between sequences of matrices and the asymptoticeigenvalue distribution of circulant and Toeplitz matrices [32]).Let us now write ๐‘ฝ โˆ— = [ ๐’—1 โ‹… โ‹… โ‹… ๐’—๐‘€ ]. The (๐‘–, ๐‘—) elementof ๐‘ฝ ๐ป๐‘ชโˆ—๐‘ฝ is therefore ๐’—๐ป

๐‘— ๐‘ช๐’—๐‘– = (๐‘ญ๐ป๐’—๐‘—)๐ปฮ›(๐‘ญ๐ป๐’—๐‘–),

which for ๐‘ โ†’ โˆž can be thought of as a spectrally weightedfrequency-domain crosscorrelation between the outputs ofthe ๐‘–-th and ๐‘—-th orthonormalized data streams (since ๐‘ญ๐ป๐’—๐‘–

approaches the ๐‘ -point DFT of ๐’—๐‘–). The spectral weightsare given by the eigenvalues of ๐‘ช , i.e., the sampled PSDof the signal process. Thus, the test statistic ๐‘‡ โ€ฒ in (9) isthe largest eigenvalue of this spectrally weighted frequency-domain correlation matrix. The discussion above also showsthat an approximation to ๐‘ฝ ๐ป๐‘ชโˆ—๐‘ฝ for ๐‘ large can be ef-ficiently computed by means of the FFT operation, similarlyto [21], with no significant performance loss even for moderatevalues of ๐‘ .

If ๐‘€ = 1, i.e., only one antenna is available, then it isreadily checked that

๐‘‡ โ€ฒ =๐’š๐ป๐‘ช๐’š

๐’š๐ป๐’š, (10)

which is the ratio of the spectrally weighted energy to totalenergy, and can be seen as a generalization to the unknownnoise power case of the spectral correlation-based detectorfrom [21].

The following result gives the exact expression of theGLR test in closed form for a particular family of temporalcovariance matrices ๐‘ช .

Lemma 2: Assume that {0, ๐œ†} are the only eigenvalues of๐‘ช , with ๐œ† > 0. Then the GLR test (8) is equivalent to theclosed-form test (9), independently of the SNR value.

Thus, for the case of a process {๐‘ ๐‘›} with a flat bandpassPSD, the asymptotic (for low SNR) GLR test is also thecorresponding GLR test for all SNR values. This suggests thatthe loss incurred by the detector (9) with other types of signalspectra at moderate SNR may be small. Numerical results willattest to this remark.

To close this section, we note that it is possible to analyti-cally derive the asymptotic distribution of the test statistic ๐‘‡under both hypotheses. The corresponding expressions will bepresented in Sec. V.

III. DERIVATION OF THE GLR TEST

In this section we obtain the required ML estimates for thederivation of the GLR detector (5).

4

A. Preliminaries

Under โ„‹0, (6) can be written as

๐‘“(๐’š โˆฃฮฃ,0) =[exp{โˆ’ tr(ฮฃโˆ’1๏ฟฝ๏ฟฝ)}

๐œ‹๐‘€ detฮฃ

]๐‘

, (11)

where we have introduced the sample covariance matrix

๏ฟฝ๏ฟฝ.=

1

๐‘๐’€ ๐’€ ๐ป . (12)

It is well known [5] that (11) is maximized for ฮฃ0 = ๏ฟฝ๏ฟฝ,yielding

maxฮฃ

๐‘“(๐’š โˆฃฮฃ,0) = [(๐œ‹๐‘’)๐‘€ det ๏ฟฝ๏ฟฝ]โˆ’๐‘ . (13)

Obtaining the ML estimates under โ„‹1 is more involved. Letus begin by introducing

๐’–โˆฅ.= ฮฃโˆ’1๐’‰, ๐œŒ

.= ๐’‰๐ปฮฃโˆ’1๐’‰, ๐‘ฎ(๐œŒ)

.= (๐‘ฐ๐‘ + ๐œŒ๐‘ช)โˆ’1.

(14)Vector ๐’–โˆฅ is the Capon beamformer [33], whereas ๐œŒ is themaximum SNR that can be obtained at the output of a linearcombiner ๐‘ฅ๐‘› = ๐’˜๐ป๐’š๐‘›, attained precisely at ๐’˜ = ๐’–โˆฅ. Notethat ๐‘ฎ(๐œŒ) and ๐‘ช share the same eigenvectors, and thereforethey commute. It will be also convenient to introduce the unitnorm vectors

๏ฟฝ๏ฟฝ.=

๐’‰โˆš๐’‰๐ป๐’‰

, ๏ฟฝ๏ฟฝโˆฅ.=

๐’–โˆฅโˆš๐’–๐ปโˆฅ ๐’–โˆฅ

. (15)

Now we need expressions for the determinant and inverse of๐‘น1 = ๐‘ชโŠ—๐’‰๐’‰๐ป+๐‘ฐ๐‘โŠ—ฮฃ, featuring in the p.d.f. (6). ApplyingSylvesterโ€™s determinant theorem [37] and the properties of theKronecker product,

det๐‘น1 = (detฮฃ)๐‘ det(๐‘ฐ๐‘ + ๐œŒ๐‘ช). (16)

Regarding the inverse of ๐‘น1, it is shown in Appendix A that

๐‘นโˆ’11 = ๐‘ฐ๐‘ โŠ—ฮฃโˆ’1 โˆ’๐‘ช๐‘ฎ(๐œŒ)โŠ— ๐’–โˆฅ๐’–๐ป

โˆฅ . (17)

With this, the quadratic form ๐’š๐ป๐‘นโˆ’11 ๐’š can be written as (see

Appendix B):

๐’š๐ป๐‘นโˆ’11 ๐’š = tr[๐‘พโŠฅ๐’€ ๐’€ ๐ป ] + tr[๐‘พโˆฅ๐’€ ๐‘ฎโˆ—(๐œŒ)๐’€ ๐ป ], (18)

where the matrices ๐‘พโˆฅ, ๐‘พโŠฅ are defined as

๐‘พโˆฅ.= ๐œŒโˆ’1๐’–โˆฅ๐’–๐ป

โˆฅ (19)

= ฮฃโˆ’1[๐œŒโˆ’1๐’‰๐’–๐ป

โˆฅ], (20)

๐‘พโŠฅ.= ฮฃโˆ’1 โˆ’ ๐œŒโˆ’1๐’–โˆฅ๐’–๐ป

โˆฅ (21)

= ฮฃโˆ’1[๐‘ฐ๐‘€ โˆ’ ๐œŒโˆ’1๐’‰๐’–๐ป

โˆฅ]. (22)

Note that ฮฃโˆ’1 = ๐‘พโˆฅ +๐‘พโŠฅ, and that ๐‘พโˆฅ has rank one andis positive semidefinite. In addition, ๐‘พโŠฅ๐’‰ = 0, and thus therank of ๐‘พโŠฅ is at most ๐‘€ โˆ’ 1. The geometric interpretationof these matrices is as follows. For any ๐’™ โˆˆ โ„‚๐‘€ , write ๐’™ =๐‘๐’‰+ ๏ฟฝ๏ฟฝ for some ๐‘ โˆˆ โ„‚, ๏ฟฝ๏ฟฝ โˆˆ โ„‚๐‘€ with ๏ฟฝ๏ฟฝ๐ป๐’–โˆฅ = 0. Note thatas long as ๐œŒ = ๐’–๐ป

โˆฅ ๐’‰ โˆ•= 0, this decomposition exists and isunique. (This condition means that the Capon beamformer isnot orthogonal to the spatial signature, i.e., it does not block

the signal at its output completely. It is satisfied for full rankฮฃ, as ๐œŒ = ๐’‰๐ปฮฃโˆ’1๐’‰ > 0). Then[

๐œŒโˆ’1๐’‰๐’–๐ปโˆฅ]๐’™ = ๐‘๐’‰,

[๐‘ฐ๐‘€ โˆ’ ๐œŒโˆ’1๐’‰๐’–๐ป

โˆฅ]๐’™ = ๏ฟฝ๏ฟฝ, (23)

i.e., ฮ .= ๐‘ฐ๐‘€ โˆ’๐œŒโˆ’1๐’‰๐’–๐ป

โˆฅ is the oblique projector along ๐’‰ ontothe subspace orthogonal to ๐’–โˆฅ.

Note that ๐’™๐ปฮฃโˆ’1๐’™ = โˆฃ๐‘โˆฃ2๐’‰๐ปฮฃโˆ’1๐’‰ + ๏ฟฝ๏ฟฝ๐ป๐‘พโŠฅ๏ฟฝ๏ฟฝ, which ispositive for ๐’™ โˆ•= 0. Then for ๐‘ = 0, one has that ๐’™ = ๏ฟฝ๏ฟฝ is in asubspace of dimension ๐‘€โˆ’1, and ๐’™๐ปฮฃโˆ’1๐’™ = ๏ฟฝ๏ฟฝ๐ป๐‘พโŠฅ๏ฟฝ๏ฟฝ > 0for ๏ฟฝ๏ฟฝ โˆ•= 0, showing that ๐‘พโŠฅ is positive semidefinite of rank๐‘€ โˆ’ 1.

For completeness, we provide an expression for ฮฃ in termsof these matrices, given by

ฮฃ = ฮ ๐‘พ โ€ โŠฅฮ 

๐ป + ๐œŒโˆ’1๐’‰๐’‰๐ป , (24)

which is proved in Remark 2 (Appendix D).The procedure whereby two different orthogonal spaces

are considered for the signal and the noise plus interferencesubspaces in the spatial domain for this non-collaborative spec-trum sensing scenario constitutes a recurrent and successfulapproach in multiantenna detection [30] [34], array signalprocessing [35] as well as in spectral estimation contexts [36].The use of suitable parameter transformations in detectionschemes has also been addressed in [30].

B. A convenient change of variables

The decomposition of the unstructured inverse spatial co-variance ฮฃโˆ’1 = ๐‘พโˆฅ + ๐‘พโŠฅ into two structured positivesemidefinite matrices will be useful in order to obtain the MLestimates. To this end, we introduce a change of variablesin order to replace the original set of unstructured unknownparameters ฮฉ

.= {๐’‰,ฮฃ} by another set ฮฉโ€ฒ based on ๐‘พโˆฅ and

๐‘พโŠฅ. In order to account for the structure of these matrices,let us introduce the following (economy-size) EVDs:

๐‘พโˆฅ = ๐›พโˆฅ๏ฟฝ๏ฟฝโˆฅ๏ฟฝ๏ฟฝ๐ปโˆฅ , ๐‘พโŠฅ = ๐‘ผโŠฅฮ“โŠฅ๐‘ผ๐ป

โŠฅ , (25)

where ๐›พโˆฅ โˆˆ โ„ is positive, ๏ฟฝ๏ฟฝโˆฅ โˆˆ โ„‚๐‘€ has unit norm (note from(19) that ๏ฟฝ๏ฟฝโˆฅ is the unit-norm Capon beamformer and ๐›พโˆฅ theinverse of the noise power at its output), ฮ“โŠฅ โˆˆ โ„(๐‘€โˆ’1)ร—(๐‘€โˆ’1)

is positive definite diagonal, and ๐‘ผโŠฅ โˆˆ โ„‚๐‘€ร—(๐‘€โˆ’1) has or-thonormal columns. In addition, we require that (i) ๐‘ผ๐ป

โŠฅ ๏ฟฝ๏ฟฝ = 0(to ensure that ๐‘พโŠฅ๏ฟฝ๏ฟฝ = 0); and (ii) ๏ฟฝ๏ฟฝ๐ป

โˆฅ ๏ฟฝ๏ฟฝ โˆ•= 0 (to ensure that๐‘พโˆฅ + ๐‘พโŠฅ has full rank). Note that [๐‘ผโŠฅ ๏ฟฝ๏ฟฝ ] โˆˆ โ„‚๐‘€ร—๐‘€ isunitary.

With these, the new parameter space we consider is givenby

ฮฉโ€ฒ .= {๐œŒ, ๏ฟฝ๏ฟฝ, ๐›พโˆฅ, ๏ฟฝ๏ฟฝโˆฅ,ฮ“โŠฅ,๐‘ผโŠฅ}. (26)

It is readily checked that under the standing assumptions ๐’‰ โˆ•=0, ฮฃ = ฮฃ๐ป full rank, there is a one-to-one correspondence2

between the parameter spaces ฮฉ and ฮฉโ€ฒ, which is summarizedin Table I. Hence, the maximization of the likelihood functioncan be carried out over either ฮฉ or ฮฉโ€ฒ with identical result.

2Apart from irrelevant rotational ambiguities in ๏ฟฝ๏ฟฝ and ๏ฟฝ๏ฟฝโˆฅ, and rota-tional/ordering ambiguities in the EVD of ๐‘พโŠฅ.

5

In order to write the likelihood function in terms of the newparameters, note from (16) and (18) that

โˆ’ log ๐‘“(๐’š โˆฃฮฃ,๐’‰)= ๐‘€๐‘ log ๐œ‹ +๐‘ log detฮฃโˆ’ log det๐‘ฎโˆ—(๐œŒ)+ ๐‘ tr[๐‘พโŠฅ๏ฟฝ๏ฟฝ] + tr[๐‘พโˆฅ๐’€ ๐‘ฎโˆ—(๐œŒ)๐’€ ๐ป ]. (27)

The following result links the determinant of the unstructuredspatial covariance matrix with the new parameters; see Ap-pendix C for the proof.

Lemma 3: Under the parameterization (25)-(26), it holdsthat det(ฮฃโˆ’1) = ๐›พโˆฅ

โˆฃโˆฃ๏ฟฝ๏ฟฝ๐ป ๏ฟฝ๏ฟฝโˆฅโˆฃโˆฃ2 det(ฮ“โŠฅ).

Then (27) becomes

โˆ’ log ๐‘“(๐’š โˆฃฮฃ,๐’‰)= ๐‘€๐‘ log ๐œ‹ โˆ’๐‘ log ๐›พโˆฅ โˆ’๐‘ log

โˆฃโˆฃ๏ฟฝ๏ฟฝ๐ป ๏ฟฝ๏ฟฝโˆฅโˆฃโˆฃ2

โˆ’ ๐‘ log detฮ“โŠฅ โˆ’ log det๐‘ฎโˆ—(๐œŒ)+ ๐‘ tr[๐‘ผโŠฅฮ“โŠฅ๐‘ผ๐ป

โŠฅ ๏ฟฝ๏ฟฝ] + ๐›พโˆฅ๏ฟฝ๏ฟฝ๐ปโˆฅ ๐’€ ๐‘ฎโˆ—(๐œŒ)๐’€ ๐ป ๏ฟฝ๏ฟฝโˆฅ. (28)

C. ML estimation under โ„‹1

We proceed now to minimize (28) with respect to theparameters in (26). The optimum values of ๐›พโˆฅ and ฮ“โŠฅ arereadily found:

๐›พโˆฅ =๐‘

๏ฟฝ๏ฟฝ๐ปโˆฅ ๐’€ ๐‘ฎโˆ—(๐œŒ)๐’€ ๐ป ๏ฟฝ๏ฟฝโˆฅ

, ฮ“โŠฅ =[diag(๐‘ผ๐ป

โŠฅ ๏ฟฝ๏ฟฝ๐‘ผโŠฅ)]โˆ’1

.

(29)Substituting (29) back in (28), we obtain3

โˆ’ log ๐‘“

= ๐‘

(๐‘€ + log

๐œ‹๐‘€

๐‘

)โˆ’๐‘ log

โˆฃโˆฃ๏ฟฝ๏ฟฝ๐ป ๏ฟฝ๏ฟฝโˆฅโˆฃโˆฃ2 โˆ’ log det๐‘ฎโˆ—(๐œŒ)

+ ๐‘ log det[diag(๐‘ผ๐ป

โŠฅ ๏ฟฝ๏ฟฝ๐‘ผโŠฅ)]

+ ๐‘ log(๏ฟฝ๏ฟฝ๐ปโˆฅ ๐’€ ๐‘ฎโˆ—(๐œŒ)๐’€ ๐ป๏ฟฝ๏ฟฝโˆฅ

). (30)

The optimum value of ๐‘ผโŠฅ is provided by the following result,whose proof is given in Appendix D:

Lemma 4: Let us consider the following cost: ๐ฝ(๐‘ผโŠฅ).=

det[diag(๐‘ผ๐ป

โŠฅ ๏ฟฝ๏ฟฝ๐‘ผโŠฅ)]. Then the solution of

min๐‘ผโŠฅ

๐ฝ(๐‘ผโŠฅ) subject to ๐‘ผ๐ปโŠฅ ๐‘ผโŠฅ = ๐‘ฐ๐‘€โˆ’1, ๐‘ผ๐ป

โŠฅ ๏ฟฝ๏ฟฝ = 0 (31)

is given by ๐ฝmin = ๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝโˆ’1๏ฟฝ๏ฟฝ โ‹… det ๏ฟฝ๏ฟฝ, and is attained when๐‘ผโŠฅ is an orthonormal basis for the range space of (๐‘ฐ๐‘€ โˆ’๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ๐ป)๏ฟฝ๏ฟฝ(๐‘ฐ๐‘€ โˆ’ ๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ๐ป).Substituting the value of ๐ฝmin into (30), one has

โˆ’ log ๐‘“

= ๐‘

(๐‘€ + log

๐œ‹๐‘€

๐‘+ log det ๏ฟฝ๏ฟฝ

)โˆ’๐‘ log

โˆฃโˆฃ๏ฟฝ๏ฟฝ๐ป ๏ฟฝ๏ฟฝโˆฅโˆฃโˆฃ2

โˆ’ log det๐‘ฎโˆ—(๐œŒ)

+ ๐‘ log(๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝโˆ’1๏ฟฝ๏ฟฝ) +๐‘ log(๏ฟฝ๏ฟฝ๐ปโˆฅ ๐’€ ๐‘ฎโˆ—(๐œŒ)๐’€ ๐ป ๏ฟฝ๏ฟฝโˆฅ

).(32)

3For the sake of clarity and with some abuse of the notation, we denotethe value of ๐‘“(๐’š โˆฃฮฃ,๐’‰) obtained after maximization w.r.t. a parameter in ฮฉโ€ฒsimply by ๐‘“ .

Now, minimizing (32) with respect to ๏ฟฝ๏ฟฝ and ๏ฟฝ๏ฟฝโˆฅ (unit-normsignature vector and Capon beamformer respectively) amountsto minimizing

(๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝโˆ’1๏ฟฝ๏ฟฝ) โ‹… (๏ฟฝ๏ฟฝ๐ปโˆฅ ๐’€ ๐‘ฎโˆ—(๐œŒ)๐’€ ๐ป ๏ฟฝ๏ฟฝโˆฅ)โˆฃโˆฃ๏ฟฝ๏ฟฝ๐ป ๏ฟฝ๏ฟฝโˆฅ

โˆฃโˆฃ2 , (33)

subject to ๏ฟฝ๏ฟฝ๐ป ๏ฟฝ๏ฟฝโˆฅ โˆ•= 0. To this end, the following result willbe applied; see Appendix E for the proof.

Lemma 5: Let ๐‘จ1, ๐‘จ2 be two ๐‘€ ร— ๐‘€ positive definiteHermitian matrices. The minimum value of

๐น (๐’–1,๐’–2).=

(๐’–๐ป1 ๐‘จ1๐’–1) โ‹… (๐’–๐ป

2 ๐‘จ2๐’–2)

โˆฃ๐’–๐ป1 ๐’–2โˆฃ2 (34)

subject to ๐’–๐ป1 ๐’–2 โˆ•= 0 is given by ๐นmin = ๐œ†min(๐‘จ1๐‘จ2) =

๐œ†min(๐‘จ2๐‘จ1), and is attained if ๐’–1 (resp. ๐’–2) is a minimumright eigenvector4 of ๐‘จ2๐‘จ1 (resp. ๐‘จ1๐‘จ2).Therefore, from the SVD (7), the minimum value of (33) isfound to be ๐œ†min[๐’€ ๐‘ฎโˆ—(๐œŒ)๐’€ ๐ป๏ฟฝ๏ฟฝโˆ’1] = ๐‘๐œ†min[๐‘ฝ

๐ป๐‘ฎโˆ—(๐œŒ)๐‘ฝ ].Using ๐’€ โ€  = ๐‘ฝ ๐‘บโˆ’1๐‘ผ๐ป , the corresponding ML estimates ห†๐’‰and ห†๐’–โˆฅ are seen to be the unit-norm minimum eigenvectors ofthe matrices ๐’€ ๐‘ฎโˆ—(๐œŒ)๐’€ โ€  and (๐’€ ๐ป)โ€ ๐‘ฎโˆ—(๐œŒ)๐’€ ๐ป , respectively.

Substitution of these optimum values into (32) finally yields

maxฮฃ,๐’‰

๐‘“(๐’š โˆฃฮฃ,๐’‰) (35)

= [(๐œ‹๐‘’)๐‘€ det ๏ฟฝ๏ฟฝ]โˆ’๐‘ โ‹…max๐œŒโ‰ฅ0

{(๐œ†๐‘min[๐‘ฝ

๐ป๐‘ฎโˆ—(๐œŒ)๐‘ฝ ]

det๐‘ฎโˆ—(๐œŒ)

)โˆ’1},

which, together with (13), proves that the GLR test is indeed(8) as in Theorem 1.

The ML estimates of the parameters are summarized in Ta-ble I. The expression for the ML estimate ๏ฟฝ๏ฟฝโŠฅ = ๏ฟฝ๏ฟฝโŠฅฮ“โŠฅ๏ฟฝ๏ฟฝ๐ป

โŠฅgiven in Table I is justified in Remark 1 of Appendix D whilethat of ฮฃ stems from (24).

As previously mentioned, there is no closed-form expressionfor the ML estimate of ๐œŒ, which has to be computed bynumerical means, e.g. a gradient search as in Sec. VI. Thisraises the issue of local optima, about which the followingcan be said. First, experimental evidence suggests that thelikelihood function is unimodal in ๐œŒ. Second, the asymptotic(as ๐‘ โ†’ โˆž) likelihood function does turn out to be unimodal;see Remark 3 in Appendix F. And finally, for certain kinds oftemporal covariance matrices unimodality can be analyticallyestablished, as shown in the next section.

IV. GLR TEST FOR LOW-SNR AND BINARY-VALUED

EIGENSPECTRUM CASES

In this section we first explore the behavior of the GLRtest (8) for asymptotically low SNR, with arbitrary (but non-white) temporal correlation of the signal of interest. After this,we particularize the GLR test (8) to the case in which theeigenvalues of the temporal covariance matrix ๐‘ช reduce to{0, ๐œ†} with ๐œ† > 0 (recall that these eigenvalues approach thesamples of the signal PSD as ๐‘ โ†’ โˆž; thus, this situationincludes the case of a PSD occupying only a fraction ofthe Nyquist bandwidth, in which it is constant), showing

4In the sequel we shall refer to right eigenvectors simply as eigenvectors.

6

Transformed parameters: ฮฉโ€ฒ = {๐œŒ, ๏ฟฝ๏ฟฝ, ๐›พโˆฅ, ๏ฟฝ๏ฟฝโˆฅ,ฮ“โŠฅ,๐‘ผโŠฅ} ML estimates of transformed parameters

๐œŒ = ๐’‰๐ปฮฃโˆ’1๐’‰ ๐œŒ = argmin๐œŒโ‰ฅ0

{๐œ†๐‘

min

[(๐’€ ๐‘ฎโˆ—(๐œŒ)๐’€ ๐ป)(๐’€ ๐’€ ๐ป)โˆ’1

]

det๐‘ฎโˆ—(๐œŒ)

}

๏ฟฝ๏ฟฝ = ๐’‰/โˆš๐’‰๐ป๐’‰ ห†๐’‰ = unit-norm minimum eigenvector of ๐’€ ๐‘ฎโˆ—(๐œŒ)๐’€ โ€ 

๏ฟฝ๏ฟฝโˆฅ = (ฮฃโˆ’1๐’‰)/โˆš๐’‰๐ปฮฃโˆ’2๐’‰ ห†๐’–โˆฅ = unit-norm minimum eigenvector of (๐’€ ๐ป)โ€ ๐‘ฎโˆ—(๐œŒ)๐’€ ๐ป

๐›พโˆฅ = (๐’‰๐ปฮฃโˆ’2๐’‰)/(๐’‰๐ปฮฃโˆ’1๐’‰) ๐›พโˆฅ =(

1๐‘

ห†๐’–๐ปโˆฅ ๐’€ ๐‘ฎโˆ—(๐œŒ)๐’€ ๐ป ห†๐’–โˆฅ

)โˆ’1

๐‘ผโŠฅฮ“โŠฅ๐‘ผ๐ปโŠฅ = ฮฃโˆ’1 โˆ’ (๐’‰๐ปฮฃโˆ’1๐’‰)โˆ’1(ฮฃโˆ’1๐’‰)(ฮฃโˆ’1๐’‰)๐ป ๏ฟฝ๏ฟฝโŠฅฮ“โŠฅ๏ฟฝ๏ฟฝ๐ป

โŠฅ =[(๐‘ฐ๐‘€ โˆ’ ห†๐’‰ ห†๐’‰๐ป )๏ฟฝ๏ฟฝ(๐‘ฐ๐‘€ โˆ’ ห†๐’‰ ห†๐’‰๐ป )

]โ€ Original parameters: ฮฉ = {ฮฃ,๐’‰} ML estimates of original parameters

๐’‰ = (โˆš

๐œŒ/๐›พโˆฅ/โˆฃ๏ฟฝ๏ฟฝ๐ปโˆฅ ๏ฟฝ๏ฟฝโˆฃ) โ‹… ๏ฟฝ๏ฟฝ ๏ฟฝ๏ฟฝ =

(โˆš๐œŒ/๐›พโˆฅ/โˆฃ ห†๐’–๐ป

โˆฅห†๐’‰โˆฃ)โ‹… ห†๐’‰

ฮฃโˆ’1 = ๐‘ผโŠฅฮ“โŠฅ๐‘ผ๐ปโŠฅ + ๐›พโˆฅ๏ฟฝ๏ฟฝโˆฅ๏ฟฝ๏ฟฝ๐ป

โˆฅ ฮฃโˆ’1 = ๏ฟฝ๏ฟฝโŠฅฮ“โŠฅ๏ฟฝ๏ฟฝ๐ปโŠฅ + ๐›พโˆฅ ห†๐’–โˆฅ ห†๐’–๐ป

โˆฅฮฃ = ฮ (๐‘ผโŠฅฮ“โˆ’1

โŠฅ ๐‘ผ๐ปโŠฅ )ฮ ๐ป + (๐›พโˆฅโˆฃ๏ฟฝ๏ฟฝ๐ป

โˆฅ ๏ฟฝ๏ฟฝโˆฃ2)โˆ’1๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ๐ป ฮฃ = ฮ ๏ฟฝ๏ฟฝฮ ๐ป + ๐œŒโˆ’1๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ๐ป

ฮ .= ๐‘ฐ๐‘€ โˆ’ (๏ฟฝ๏ฟฝ๐ป

โˆฅ ๏ฟฝ๏ฟฝ)โˆ’1๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ๐ปโˆฅ ฮ  = ๐‘ฐ๐‘€ โˆ’ (ห†๐’–๐ป

โˆฅห†๐’‰)โˆ’1 ห†๐’‰ ห†๐’–๐ป

โˆฅ

TABLE ITRUE PARAMETERS (LEFT) AND ML ESTIMATES (RIGHT) UNDER โ„‹1 , WITH ๏ฟฝ๏ฟฝ = 1

๐‘๐’€ ๐’€ ๐ป , ๐‘ฎ(๐œŒ)

.= (๐‘ฐ๐‘ + ๐œŒ๐‘ช)โˆ’1 .

that a closed-form expression for the detector exists in thiscase, valid for all SNR values. Remarkably, the closed-formdetectors obtained in both cases are the same.

A. GLR test for asymptotically low SNR

From (8), the GLR statistic is ๐‘‡ = max๐œŒโ‰ฅ0 ๐‘ก(๐œŒ). For small๐œŒ, the following first-order approximations

(๐‘ฐ๐‘+๐œŒ๐‘ชโˆ—)โˆ’1 โ‰ˆ ๐‘ฐ๐‘โˆ’๐œŒ๐‘ชโˆ—, det(๐‘ฐ๐‘+๐œŒ๐‘ชโˆ—) โ‰ˆ 1+๐œŒ tr๐‘ชโˆ—

(36)hold. Therefore,

๐‘ก(๐œŒ) โ‰ˆ ๐œ†โˆ’๐‘min [๐‘ฝ

๐ป(๐‘ฐ๐‘ โˆ’ ๐œŒ๐‘ชโˆ—)๐‘ฝ ]

1 + ๐œŒ tr๐‘ชโˆ— =(1 โˆ’ ๐œŒ๐œ†max[๐‘ฝ

๐ป๐‘ชโˆ—๐‘ฝ ])โˆ’๐‘

1 + ๐œŒ tr๐‘ชโˆ— .

(37)Taking logarithms and using the approximation log(1+๐‘ฅ) โ‰ˆ ๐‘ฅfor small โˆฃ๐‘ฅโˆฃ, one has

log ๐‘ก(๐œŒ) โ‰ˆ ๐œŒ๐‘

(๐œ†max[๐‘ฝ

๐ป๐‘ชโˆ—๐‘ฝ ]โˆ’ 1

๐‘tr๐‘ชโˆ—

). (38)

Suppose now that ๐œŒ were known; in that case, the GLRstatistic is directly ๐‘ก(๐œŒ), and (38) shows that, in low SNR, theGLR test is equivalent to comparing ๐œ†max[๐‘ฝ

๐ป๐‘ชโˆ—๐‘ฝ ] againsta threshold. This amounts to saying that for vanishingly smallSNR, knowledge of ๐œŒ becomes irrelevant, and the GLR test canbe rephrased as in (9) since tr๐‘ชโˆ— is a constant independentof the data, thus establishing Lemma 1.

B. Binary-valued eigenspectrum

We now set to prove Lemma 2. Consider the EVD ๐‘ช =๐‘ญฮ›๐‘ญ๐ป , and assume that the only eigenvalues of ๐‘ช are ๐œ†and zero, with multiplicities ๐ฟ and ๐‘ โˆ’๐ฟ respectively. Sinceit is assumed that ๐‘ช has ones on its diagonal, it follows thattr๐‘ช = ๐ฟ๐œ† = ๐‘ . Letting ๐‘

.= ๐ฟ/๐‘ < 1 denote the fraction

of nonzero eigenvalues, then ๐œ† = ๐‘โˆ’1. The GLR statistic ๐‘‡from (8) can be written as

๐‘‡ = max๐œŒโ‰ฅ0

{๐œ†โˆ’๐‘

min

[(๐‘ญ โˆ—๐‘ฝ )๐ป(๐‘ฐ๐‘ + ๐œŒฮ›)โˆ’1๐‘ญ โˆ—๐‘ฝ

]det(๐‘ฐ๐‘ + ๐œŒฮ›)

}(39)

It is readily checked that (๐‘ฐ๐‘ + ๐œŒฮ›)โˆ’1 = ๐‘ฐ๐‘ โˆ’ ๐œŒ1+๐‘โˆ’1๐œŒฮ› and

det(๐‘ฐ๐‘ + ๐œŒฮ›) = (1 + ๐‘โˆ’1๐œŒ)๐‘๐‘ . Therefore,

๐‘‡ 1/๐‘ (40)

= max๐œŒโ‰ฅ0

โŽงโŽจโŽฉ๐œ†โˆ’1

min

[(๐‘ญ โˆ—๐‘ฝ )๐ป(๐‘ฐ๐‘ โˆ’ ๐œŒ

1+๐‘โˆ’1๐œŒฮ›)๐‘ญ โˆ—๐‘ฝ]

(1 + ๐‘โˆ’1๐œŒ)๐‘

โŽซโŽฌโŽญ

=

{min๐œŒโ‰ฅ0

[(1โˆ’ ๐œŒ๏ฟฝ๏ฟฝ

1 + ๐‘โˆ’1๐œŒ

)(1 + ๐‘โˆ’1๐œŒ)๐‘

]}โˆ’1

, (41)

where ๏ฟฝ๏ฟฝ.= ๐œ†max[๐‘ฝ

๐ป๐‘ชโˆ—๐‘ฝ ]. It is straightforward to show thatthe minimum in (41) is attained at

๐œŒ = max

{0,

๏ฟฝ๏ฟฝโˆ’ 1

๐‘โˆ’1 โˆ’ ๏ฟฝ๏ฟฝ

}, if 0 โ‰ค ๏ฟฝ๏ฟฝ โ‰ค ๐‘โˆ’1, (42)

whereas no minimum exists if ๏ฟฝ๏ฟฝ > ๐‘โˆ’1. However, thiscase need not be considered: since ๐‘ฝ ๐ป๐‘ฝ = ๐‘ฐ๐‘€ , by thePoincare separation theorem [38] one has ๐œ†max[๐‘ฝ

๐ป๐‘ชโˆ—๐‘ฝ ] โ‰ค๐œ†max[๐‘ช

โˆ—] = ๐‘โˆ’1. Thus, (41) becomes

๐‘‡ 1/๐‘ =

{1, if 0 โ‰ค ๏ฟฝ๏ฟฝ โ‰ค 1,

๏ฟฝ๏ฟฝโˆ’๐‘(

1โˆ’๐‘1โˆ’๐‘๏ฟฝ๏ฟฝ

)1โˆ’๐‘

, if 1 < ๏ฟฝ๏ฟฝ โ‰ค ๐‘โˆ’1,(43)

which is a non-decreasing function of ๏ฟฝ๏ฟฝ โˆˆ [0, ๐‘โˆ’1]. Hence, theGLR test for binary-valued eigenspectrum is equivalent to thetest (9), as was to be shown. It has a nice geometrical inter-pretation: if we collect in ๐‘ญ the columns of ๐‘ญ correspondingto the nonzero eigenvalue ๐œ† = ๐‘โˆ’1, then ๐‘ช = ๐‘โˆ’1๐‘ญ๐‘ญ๐ป , and

๐œ†max[๐‘ฝ๐ป๐‘ชโˆ—๐‘ฝ ] = ๐‘โˆ’1๐œ†max[(๐‘ญ

๐ป๐‘ฝ โˆ—)๐ป(๐‘ญ๐ป๐‘ฝ โˆ—)]= ๐‘โˆ’1 cos2 ๐œƒ, (44)

where ๐œƒ is the minimum principal angle [39] between thesubspaces spanned by ๐‘ฝ โˆ— and ๐‘ญ . The GLR statistic is thusa measure of the largest achievable projection of a vector inthe range space of ๐‘ฝ โˆ— (data subspace) onto the range spaceof ๐‘ญ (reference subspace). Note that cos2 ๐œƒ > ๐‘ in order tohave ๐‘‡ > 1.

7

V. PERFORMANCE ANALYSIS IN LOW SNR

The following result, whose proof is given in AppendixF, provides the asymptotic distributions of the GLR statisticunder each hypothesis, assuming that the SNR is small. Thus,it enables us to compute the probabilities of detection ๐‘ƒD andfalse alarm ๐‘ƒFA for a given threshold.

Theorem 2: As ๐‘ โ†’ โˆž and for sufficiently small SNR,the GLR statistic (8) is asymptotically distributed as

2 log๐‘‡ โˆผ{

๐œ’22๐‘€โˆ’1, under โ„‹0,

๐œ’โ€ฒ22๐‘€โˆ’1(๐›ผsph(๐œŒ)), under โ„‹1,

(45)

๐›ผsph(๐œŒ).= 2๐‘ log

1๐‘ tr(๐‘ฐ๐‘ + ๐œŒ๐‘ช)

[det(๐‘ฐ๐‘ + ๐œŒ๐‘ช)]1/๐‘, (46)

where ๐œŒ.= ๐’‰๐ปฮฃโˆ’1๐’‰, and ๐œ’2

๐‘‘, ๐œ’โ€ฒ2๐‘‘ (๐›ผ) denote respectively

central and non-central chi-square distributions with ๐‘‘ degreesof freedom and non-centrality parameter ๐›ผ.

The argument of the log in (46) is the ratio of the arithmeticmean (AM) to geometric mean (GM) of the eigenvalues of๐‘ฐ๐‘ + ๐œŒ๐‘ช , which is the temporal covariance matrix at theoutput of the Capon beamformer. By the AM-GM inequality,this ratio is no less than one, and it is equal to one iffall eigenvalues are equal, i.e., iff ๐‘ช = ๐‘ฐ๐‘ . In that case๐›ผsph(๐œŒ) = 0 for all ๐œŒ and the asymptotic distributions under โ„‹1

and โ„‹0 coincide, which is consistent with the fact that whitesignals are not detectable under this signal model. ApplyingSzegoโ€™s theorem [40], the asymptotic value of the AM-GMratio can be written in terms of the signal PSD:

lim๐‘โ†’โˆž

1๐‘ tr(๐‘ฐ๐‘ + ๐œŒ๐‘ช)

[det(๐‘ฐ๐‘ + ๐œŒ๐‘ช)]1/๐‘(47)

=12๐œ‹

โˆซ ๐œ‹

โˆ’๐œ‹[1 + ๐œŒ๐‘†๐‘ ๐‘ (๐‘’๐‘—๐œ”)]d๐œ”

exp{

12๐œ‹

โˆซ ๐œ‹

โˆ’๐œ‹ log[1 + ๐œŒ๐‘†๐‘ ๐‘ (๐‘’๐‘—๐œ”)]d๐œ”} ,

which is the inverse of the Spectral Flatness Measure (SFM)[41] associated with the power spectrum 1 + ๐œŒ๐‘†๐‘ ๐‘ (๐‘’

๐‘—๐œ”). Itsminimum value is 1 for ๐œŒ = 0, increasing monotonically with๐œŒ toward the inverse of the SFM associated to ๐‘†๐‘ ๐‘ (๐‘’

๐‘—๐œ”) [42].We could ask for the signal eigenvalue distribution maxi-

mizing the performance of the GLR detector at a given SNR.In view of Theorem 2, this amounts to maximizing ๐›ผsph(๐œŒ).Recalling that tr๐‘ช = ๐‘ due to signal power normalization,and with {๐œ†0, . . . , ๐œ†๐‘โˆ’1} the eigenvalues of ๐‘ช , the problembecomes

min{๐œ†๐‘›}

๐‘โˆ’1โˆ๐‘›=0

(1 + ๐œŒ๐œ†๐‘›) (48)

subject to๐‘โˆ’1โˆ‘๐‘›=0

๐œ†๐‘› = ๐‘, ๐œ†๐‘– โ‰ฅ 0, ๐‘– = 0, . . . , ๐‘ โˆ’ 1,

whose solutions are ๐œ†๐‘— = ๐‘ for some ๐‘— โˆˆ {0, 1, . . . , ๐‘ โˆ’ 1}and ๐œ†๐‘› = 0 for ๐‘› โˆ•= ๐‘— (independently of ๐œŒ), as can be easilyshown following the same reasoning as in [21, Sec. IV]. For๐‘ โ†’ โˆž, this implies that the optimum PSD concentrates allits power at a single frequency. This is not surprising, as thiskind of peaky spectra minimize the SFM and are easier todetect in the presence of noise.

0 5 10 15 20 2510

โˆ’4

10โˆ’2

100

Statistic 2 logT

pdf

Simulation (ฯ = 0)Analytical

(a)

0 10 20 30 40 50 6010

โˆ’4

10โˆ’2

100

Statistic 2 logT

pdf

Simulation (ฯ = 0.05)Simulation (ฯ = 0.1)Simulation (ฯ = 0.2)Analytical

(b)

Fig. 1. Distribution of the statistic 2 log ๐‘‡ for ๐‘€ = 4 and ๐‘ = 128. (a)Under โ„‹0 (b) Under โ„‹1.

Finally, note from Theorem 2 that, as desired, the asymptoticperformance of the GLR detector does not depend on thespecific spatial correlation profile of the noise, but only onthe operational SNR ๐œŒ.

VI. NUMERICAL RESULTS

We proceed to examine the performance of the proposed de-tectors via Monte Carlo simulations and check the accuracy ofthe analytical approximations. In all experiments, the signaturevector ๐’‰ and the noise spatial covariance matrix ฮฃ = ๐‘ฏ๐‘ฏ๐ป

are randomly generated at each Monte Carlo run, with ๐’‰ scaledin order to obtain a given SNR value ๐œŒ = ๐’‰๐ปฮฃโˆ’1๐’‰. Theentries of ๐’‰ and ๐‘ฏ โˆˆ โ„‚๐‘€ร—๐‘€ are independent zero-meancircular Gaussian. We refer to the two proposed detectors as:

1) Iterative GLRT: the exact GLR test from (8), in whichthe optimization w.r.t. ๐œŒ is performed using a gradientdescent algorithm detailed in Appendix G.

2) Asymptotic GLRT: the closed-form detector from (9),which has been shown to coincide with the GLR detectorfor vanishing SNR or for a binary-valued eigenspectrum(PSD) of the primary signal.

For each signal type used in the simulations, the autocorre-lation of the signal is estimated from a register of 106 noise-free samples, and the Toeplitz matrix ๐‘ช is built from theseestimates.

In order to check the accuracy of the theoretical distributionsof the GLR statistic given in Theorem 2, these are compared inFig. 1 against the empirical histograms of the iterative GLRTscheme, for a setting with ๐‘€ = 4 antennas and ๐‘ = 128.The PSD of the signal is constant within its support, whichequals half the Nyquist bandwidth. A very good agreement isobserved, even for this moderate value of the sample size ๐‘ .

8

To establish suitable benchmarks for the proposed schemes,we additionally consider the three following detectors, all ofwhich incorporate knowledge of ๐‘ช .

1) A direct generalization of the single-antenna detectorof Quan et al. [21], in which the following statistic iscompared against a threshold:

๐‘‡Quan et al. = tr(๐’€ ๐‘ช๐’€ ๐ป). (49)

2) An energy ratio (ER) detector in which (49) is normal-ized by the total observed energy, in order to cope withthe noise uncertainty issue. The corresponding statisticis

๐‘‡ER =tr(๐’€ ๐‘ช๐’€ ๐ป)

tr(๐’€ ๐’€ ๐ป), (50)

which is a direct generalization of (10) to the multi-antenna setting.

3) The โ€œrule Tโ€ detector proposed by Stoica & Cedervallin [29], which exploits that the temporal correlationlength of the noise is much shorter than that of thesignal of interest. Its specification is too lengthy and thereader is referred to [29] for more information. Using thenotation from [29], our implementation assumes ๏ฟฝ๏ฟฝ = 1(signal spatial rank), ๐‘€ = 16 (truncation point) and๐พ = 1 (number of correlation lags).

A. Detection performance in the low SNR regime

The empirical ROC curves of the different detectors con-sidered are shown in Fig. 2, together with the analyticalapproximation from Theorem 2, for a setting with ๐‘€ = 4,๐‘ = 512 and ๐œŒ = 0.2. Two different kinds of signals areconsidered. The first one is an OFDM-modulated digital TVbaseband signal with a bandwidth of 3.805 MHz, sampled at16 Msps and quantized to 9-bit precision. The signal samplesare approximately Gaussian distributed, with a PSD that isalmost constant within its support (about 48% of the Nyquistbandwidth). The second signal uses a 16-QAM constellationand square-root raised cosine pulses with roll-off factor 1,and it is sampled at twice its baud rate with random timingoffset. In this case, the corresponding samples do not followa Gaussian distribution. In accordance with the signal model,frequency-flat channels are assumed; the impact of frequencyselectivity will be considered in Sec. VI-C.

As can be seen in Fig. 2, the proposed schemes signifi-cantly outperform the detectors of Quan et al. and Stoica &Cedervall, as well as the Energy Ratio detector. The SNR issufficiently low for the asymptotic GLRT to achieve the sameperformance as the iterative GLRT detector. The agreement ofthe empirical and theoretical ROC curves for the GLR test isagain quite good. Also note that the GLR detectors performnoticeably better with the OFDM signal than with the QAMsignal. This is explained by the shape of the respective PSDs:in this setting, the inverse SFM given by (47) equals 1.01525and 1.00705 respectively for the OFDM and QAM signals.

B. Detection performance vs. SNR

The exact GLR detector (8) and its asymptotic version (9)were tested with the same OFDM and QAM signals of the

10โˆ’2

10โˆ’1

100

10โˆ’2

10โˆ’1

100

PFA

1โˆ’

PD

Analytical (ฮปsph)iterative-GLRTasymptotic-GLRTQuan et al.ER detectorStoica-Cedervall

(a)

10โˆ’2

10โˆ’1

100

10โˆ’2

10โˆ’1

100

PFA

1โˆ’

PD

Analytical (ฮปsph)iterative-GLRTasymptotic-GLRTQuan et al.ER detectorStoica-Cedervall

(b)

Fig. 2. ROC curves for ๐‘€ = 4, ๐‘ = 512 and ๐œŒ = 0.2: (a) OFDM and (b)square root raised cosine signals.

โˆ’14 โˆ’12 โˆ’10 โˆ’8 โˆ’6 โˆ’4 โˆ’210

โˆ’4

10โˆ’3

10โˆ’2

10โˆ’1

100

Average SNR [dB]

1โˆ’

PD

iterative-GLRTasymptotic-GLRTAnalytical (ฮปsph)

Sqrt Raised Cosine(rolloff = 1)

OFDM

Fig. 3. Missed detection probability vs. SNR for fixed ๐‘ƒFA = 0.05, ๐‘€ = 4and ๐‘ = 128.

previous section, but now varying the operational SNR in orderto check the performance loss incurred by the asymptotic GLRscheme. Fig. 3 shows the probability of missed detection interms of the average SNR per antenna, ๐œŒ/๐‘€ , for ๐‘€ = 4, ๐‘ =128 and fixed ๐‘ƒFA = 0.05. Notice the good match between

9

โˆ’17 โˆ’16 โˆ’15 โˆ’14 โˆ’13 โˆ’12 โˆ’11 โˆ’10 โˆ’9 โˆ’810

โˆ’3

10โˆ’2

10โˆ’1

100

Average SNR [dBs]

1โˆ’

PD

iterative-GLRTasymptotic-GLRT

โˆ’1 0 1โˆ’40

โˆ’30

โˆ’20

โˆ’10

0

10

Normalized frequency

psd

[dB

]

Fig. 4. Detection performance vs. SNR for fixed ๐‘ƒFA = 0.05, with ๐‘€ = 4and ๐‘ = 256. Lines: frequency-flat channels. Markers: Frequency-selectiveWINNER II channel model.

the empirical and analytical (asymptotic) results over the SNRrange considered, which is not limited to just the very lowSNR region. The better results observed for the OFDM signalare again explained by the fact that the corresponding PSDis โ€less flatโ€. Also note that the two versions of the GLRdetector yield the same result with the OFDM signal; this isas expected, since the PSD of the OFDM signal is very close tothe binary-valued eigenspectrum case for which both detectorsare equivalent (cf. Sec IV-B). On the other hand, for the raisedcosine PSD the asymptotic form of the GLR detector presentsa performance loss that increases with the SNR. Hence, withnon-binary valued power spectra, this detector offers a tradeoffbetween complexity and performance.

C. Effect of frequency selective channels

To close this section, we consider a realistic scenario basedon the parameters of the GSM system. The separation betweenGSM channels is 200 kHz, although in a given geographicalarea two adjacent channels cannot be active in order to avoidinterference. Thus, the effective channel separation is 400 kHz,which is the same as the approximate bandwidth of the GSMsignal, based on Gaussian Minimum Shift Keying (GMSK)[43]. We thus synthesized samples of a baseband GMSKwaveform based on GSM parameters at a sampling rate of400 ksps.

Frequency-selective channels were generated according tothe WINNER Phase II model [44] with Profile C1 (suburban),and Non-line-of-sight (NLOS). The central frequency is 1.8GHz and the channel bandwidth is 400 kHz. Transmitter andreceiver are randomly distributed on a 10 km-side square. Forthe receiver we take a linear array with ๐‘€ = 4 antennaswith inter-element separation of 10 cm. This channel is re-scaled to obtain a fixed operational SNR and is applied tothe GSM waveform. Fig. 4 shows the probability of detectionof the two proposed schemes vs. average per antenna SNRfor fixed ๐‘ƒFA = 0.05. The inset also shows the PSD ofthe GMSK waveform. It is observed that both the exact

and asymptotic versions of the GLR detector are robust tofrequency selectivity effects, and in fact the results obtainedvirtually match those of a frequency-flat scenario.

We note that some small degree of non-whiteness in thecombined spectral content of noise plus interfering broadbandsources should be expected within the detectorโ€™s bandwidthdue to random frequency selectivity affecting the secondaryusers (assuming a uniform PSD at the transmitter over thedetectorโ€™s bandwidth). Nonetheless, for the detection of acomparatively narrow-band primary signal in the simulatedWINNER Phase II channel model, this effect (fluctuations inthe tenths of dB range over a span of 400 kHz) is less criticalthan the mitigation of spatially correlated interference. Weshould remark as reported in [45], that under all real-worldconditions, unavoidable model uncertainties appear in termsof an SNR wall that establishes a trade-off between primaryuserโ€™s SNR and detector robustness. That is, below a minimumSNR, any detector ceases to operate reliably regardless ofthe duration of the observation window. Although worthmentioning, a detailed statistical analysis of these degradingeffects5 falls out of the scope of this paper and has not beenincluded for reasons of space.

VII. CONCLUSIONS

The multiantenna GLR detector for a Gaussian signal ofknown (up to a scaling) PSD and spatial rank one in tempo-rally white Gaussian noise-plus-interference of arbitrary andunknown spatial correlation has been derived in terms of aunivariate optimization problem over an SNR parameter. Ananalytical expression for the statistical distributions of thetest in the low SNR regime has been obtained, resulting inclose agreement with empirical results for realistic scenar-ios featuring practical (non-Gaussian) communication signals.This is attributed to the fact that, as the detector statistic isbased on sample correlations, the underlying distribution ofindividual samples becomes asymptotically irrelevant for largedata records. The spectral flatness measure at the output ofthe minimum variance beamformer was found to determinethe performance of the GLR detector, such that less spectrallyflat signals enjoy improved detectability. At the other extreme,temporally white signals are not detectable under the modelconsidered. The asymptotic version of the GLR detector forlow SNR was also derived, resulting in a closed-form testwith much lower computational cost to which the exact GLRdetector also reduces if the signal PSD is binary-valued.

APPENDIX

A. Proof of (17)

Letting ๐’’.= ฮฃโˆ’1/2๐’‰, one has

๐‘นโˆ’11 (51)

= [๐‘ช โŠ— ๐’‰๐’‰๐ป + ๐‘ฐ๐‘ โŠ—ฮฃ]โˆ’1

= (๐‘ฐ๐‘ โŠ—ฮฃโˆ’1/2)[๐‘ช โŠ— ๐’’๐’’๐ป + ๐‘ฐ๐‘€๐‘ ]โˆ’1(๐‘ฐ๐‘ โŠ—ฮฃโˆ’1/2).

5For example, detection within the steep spectral roll-off region of thebroadband interferer(s).

10

Recalling the definition (14) of ๐‘ฎ(๐œŒ), we now claim that

[๐‘ช โŠ— ๐’’๐’’๐ป + ๐‘ฐ๐‘€๐‘ ]โˆ’1 = ๐‘ฐ๐‘€๐‘ โˆ’๐‘ช๐‘ฎ(๐œŒ)โŠ— ๐’’๐’’๐ป , (52)

which can be checked by directly multiplying the right-handside of (52) by ๐‘ชโŠ—๐’’๐’’๐ป+๐‘ฐ๐‘€๐‘ to see that it yields the identitymatrix (using that ๐‘ช and ๐‘ฎ(๐œŒ) share the same eigenvectors).Substituting (52) back into (51) and noting from (14) thatฮฃโˆ’1/2๐’’ = ๐’–โˆฅ, the desired result is obtained.

B. Proof of (18)

From (17), and using the property tr(๐‘จ๐‘‡1 ๐‘จ

๐ป2 ๐‘จ3๐‘จ4) =

vec(๐‘จ2)๐ป(๐‘จ1 โŠ—๐‘จ3)vec(๐‘จ4) [47], one has

๐’š๐ป๐‘นโˆ’11 ๐’š (53)

= vec(๐’€ )๐ป(๐‘ฐ๐‘ โŠ—ฮฃโˆ’1 โˆ’๐‘ช๐‘ฎ(๐œŒ)โŠ— ๐’–โˆฅ๐’–๐ปโˆฅ )vec(๐’€ )

= tr(ฮฃโˆ’1๐’€ ๐’€ ๐ป)โˆ’ tr(๐’–โˆฅ๐’–๐ปโˆฅ ๐’€ (๐‘ช๐‘ฎ(๐œŒ))๐‘‡๐’€ ๐ป). (54)

Applying now the identity ๐‘ช๐‘ฎ(๐œŒ) = ๐œŒโˆ’1[๐‘ฐ๐‘ โˆ’๐‘ฎ(๐œŒ)], whichfollows from the definition (14) of ๐‘ฎ(๐œŒ), noting that ๐‘ฎ๐‘‡ (๐œŒ) =๐‘ฎโˆ—(๐œŒ), and then rearranging terms yields the desired result.

C. Proof of Lemma 3

First, note that ๐‘พโŠฅ = [๐‘ผโŠฅ ๏ฟฝ๏ฟฝ ]ฮ“โŠฅ[๐‘ผโŠฅ ๏ฟฝ๏ฟฝ ]๐ป constitutes afull EVD of ๐‘พโŠฅ, where

ฮ“โŠฅ.=

[ฮ“โŠฅ

0

]. (55)

Then the determinant of ฮฃโˆ’1 can be written as

det(ฮฃโˆ’1)

= det(๐‘พโˆฅ +๐‘พโŠฅ)

= det(๐›พโˆฅ๏ฟฝ๏ฟฝโˆฅ๏ฟฝ๏ฟฝ๐ป

โˆฅ + [๐‘ผโŠฅ ๏ฟฝ๏ฟฝ ]ฮ“โŠฅ[๐‘ผโŠฅ ๏ฟฝ๏ฟฝ ]๐ป)

(56)

= det(๐›พโˆฅ[๐‘ผโŠฅ ๏ฟฝ๏ฟฝ ]๐ป ๏ฟฝ๏ฟฝโˆฅ๏ฟฝ๏ฟฝ๐ป

โˆฅ [๐‘ผโŠฅ ๏ฟฝ๏ฟฝ ] + ฮ“โŠฅ)

(57)

= det(ฮ“โŠฅ) + ๐›พโˆฅ๏ฟฝ๏ฟฝ๐ปโˆฅ [๐‘ผโŠฅ ๏ฟฝ๏ฟฝ ] adj(ฮ“โŠฅ)[๐‘ผโŠฅ ๏ฟฝ๏ฟฝ ]๐ป ๏ฟฝ๏ฟฝโˆฅ,(58)

where in the last step we used the fact that det(๐‘จ+ ๐’‘๐’’๐ป) =det๐‘จ+๐’’๐ป adj(๐‘จ)๐’‘ [46]. Note now from (55) that det(ฮ“โŠฅ) =0, whereas adj(ฮ“โŠฅ) = det(ฮ“โŠฅ)๐’†๐‘€๐’†๐ป๐‘€ . The desired resultthen follows.

D. Proof of Lemma 4

Let ๐’€โŠฅ.= (๐‘ฐ๐‘€ โˆ’ ๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ๐ป)๐’€ be the orthogonal projection

of the data matrix onto the subspace orthogonal to ๐’‰. Let๏ฟฝ๏ฟฝโŠฅ

.= 1

๐‘๐’€โŠฅ๐’€ ๐ปโŠฅ be the corresponding sample covariance

matrix, with (economy-size) EVD ๏ฟฝ๏ฟฝโŠฅ = ๐‘ธ๐‘ซ๐‘ธ๐ป (i.e., ๐‘ซ โˆˆโ„(๐‘€โˆ’1)ร—(๐‘€โˆ’1) is positive diagonal and ๐‘ธ โˆˆ โ„‚๐‘€ร—(๐‘€โˆ’1)

has orthonormal columns). Note that since ๐‘ผ๐ปโŠฅ ๏ฟฝ๏ฟฝ = 0, then

๐‘ผ๐ปโŠฅ ๏ฟฝ๏ฟฝ๐‘ผโŠฅ = ๐‘ผ๐ป

โŠฅ ๏ฟฝ๏ฟฝโŠฅ๐‘ผโŠฅ. Therefore, by virtue of Hadamardโ€™sinequality [46], the cost ๐ฝ satisfies

๐ฝ(๐‘ผโŠฅ) = det[diag(๐‘ผ๐ป

โŠฅ ๐‘ธ๐‘ซ๐‘ธ๐ป๐‘ผโŠฅ)]

โ‰ฅ det(๐‘ผ๐ปโŠฅ ๐‘ธ๐‘ซ๐‘ธ๐ป๐‘ผโŠฅ), (59)

with equality holding in (59) iff the matrix between parenthe-ses is diagonal. This happens if ๐‘ผโŠฅ = ๐‘ธ, which is feasible

since its columns are orthonormal and ๐‘ธ๐ป ๏ฟฝ๏ฟฝ = 0. Thus๐ฝmin = det๐‘ซ.

Let now ๏ฟฝ๏ฟฝ.= [๐‘ธ ๏ฟฝ๏ฟฝ ], which is unitary. Write the inverse

of ๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ in terms of the adjoint matrix as

(๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ)โˆ’1 =adj(๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ)

det(๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ), (60)

which can be rewritten as

๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝโˆ’1๏ฟฝ๏ฟฝ =adj(๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ)

det ๏ฟฝ๏ฟฝ. (61)

The (๐‘€,๐‘€) element of this matrix is therefore

๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝโˆ’1๏ฟฝ๏ฟฝ =๐’†๐ป๐‘€ adj(๏ฟฝ๏ฟฝ๐ป๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝ)๐’†๐‘€

det ๏ฟฝ๏ฟฝ=

det(๐‘ธ๐ป๏ฟฝ๏ฟฝ๐‘ธ)

det ๏ฟฝ๏ฟฝ. (62)

Finally, observe that ๐‘ซ = ๐‘ธ๐ป๏ฟฝ๏ฟฝโŠฅ๐‘ธ = ๐‘ธ๐ป๏ฟฝ๏ฟฝ๐‘ธ because๐‘ธ๐ป ๏ฟฝ๏ฟฝ = 0. This, together with (62), yields the desired result.

Remark 1: Note from (29) that the ML estimate of ฮ“โŠฅ isgiven by

ฮ“โŠฅ =[diag

(๏ฟฝ๏ฟฝ๐ป

โŠฅ ๏ฟฝ๏ฟฝ๏ฟฝ๏ฟฝโŠฅ)]โˆ’1

= ๐‘ซโˆ’1, (63)

since the ML estimate of ๐‘ผโŠฅ is ๏ฟฝ๏ฟฝโŠฅ = ๐‘ธ. Hence, theML estimate of ๐‘พโŠฅ = ๐‘ผโŠฅฮ“โŠฅ๐‘ผ๐ป

โŠฅ turns out to be ๏ฟฝ๏ฟฝโŠฅ =๏ฟฝ๏ฟฝโŠฅฮ“โŠฅ๏ฟฝ๏ฟฝ๐ป

โŠฅ = ๐‘ธ๐‘ซโˆ’1๐‘ธ๐ป = ๏ฟฝ๏ฟฝโ€ โŠฅ.

Remark 2: We note that the Moore-Penrose pseudoinverse๐‘พ โ€ 

โŠฅ = ๐‘ผโŠฅฮ“โˆ’1โŠฅ ๐‘ผ๐ป

โŠฅ fulfils ๐‘พ โ€ โŠฅ๐’‰ = 0, with ๐‘พโŠฅ๐‘พ

โ€ โŠฅ the

projector onto the subspace orthogonal to ๐’‰ and ๐‘พโŠฅ๐‘พโ€ โŠฅ +

๐’‰๐’‰๐ป

= I๐‘€ . Then, to verify (24), we premultiply by ฮฃโˆ’1:ฮฃโˆ’1(ฮ ๐‘พ โ€ 

โŠฅฮ ๐ป + ๐œŒโˆ’1๐’‰๐’‰H) = ๐‘พโŠฅ๐‘พ

โ€ โŠฅฮ 

๐ป + I๐‘€ โˆ’ ฮ ๐ป =

I๐‘€ โˆ’ ๐’‰๐’‰Hฮ ๐ป . As ฮ ๐’‰ = 0, the previous result is I๐‘€ ,

verifying that (24) is a valid expression for ฮฃ.

E. Proof of Lemma 5

Since ๐น (๐’–1,๐’–2) is invariant to scalings in ๐’–1 and ๐’–2, itsminimization is equivalent to the problem

min๐’–1,๐’–2

(๐’–๐ป1 ๐‘จ1๐’–1)(๐’–

๐ป2 ๐‘จ2๐’–2) subject to โˆฃ๐’–๐ป

1 ๐’–2โˆฃ2 = ๐‘2,

(64)with ๐‘2 > 0 an arbitrary positive constant. Thus, consider theLangrangian

โ„’ = (๐’–๐ป1 ๐‘จ1๐’–1)(๐’–

๐ป2 ๐‘จ2๐’–2)โˆ’ ๐œ’(โˆฃ๐’–๐ป

1 ๐’–2โˆฃ2 โˆ’ ๐‘2), (65)

with ๐œ’ the Lagrange multiplier. Equating the gradient of โ„’w.r.t. ๐’–1, ๐’–2 to zero, we respectively obtain

(๐’–๐ป2 ๐‘จ2๐’–2)๐‘จ1๐’–1 = ๐œ’(๐’–๐ป

2 ๐’–1)๐’–2, (66)

(๐’–๐ป1 ๐‘จ1๐’–1)๐‘จ2๐’–2 = ๐œ’(๐’–๐ป

1 ๐’–2)๐’–1. (67)

To discard the possibility of having ๐’–๐ป1 ๐’–2 = 0, note that

this would mean that the left-hand sides of (66)-(67) are bothzero, which can only happen if ๐’–1 = ๐’–2 = 0 since ๐‘จ1, ๐‘จ2

are positive definite. Thus, assuming ๐’–1 โˆ•= 0, ๐’–2 โˆ•= 0, solvingfor ๐œ’ in (66)-(67) yields

๐œ’ =(๐’–๐ป

1 ๐‘จ1๐’–1)(๐’–๐ป2 ๐‘จ2๐’–2)

โˆฃ๐’–๐ป1 ๐’–2โˆฃ2 , (68)

11

that is, the Lagrange multiplier takes the value of the attainedcost. Substituting now the value of ๐’–2 from (66) and of ๐œ’from (68) into (67), one has ๐‘จ2๐‘จ1๐’–1 = ๐œ’๐’–1. Similarly, italso holds that ๐‘จ1๐‘จ2๐’–2 = ๐œ’๐’–2. Thus ๐œ’ is an eigenvalue of๐‘จ2๐‘จ1 (resp. ๐‘จ1๐‘จ2) with associated eigenvector ๐’–1 (resp.๐’–2). Hence the cost is minimized when ๐œ’ is the smallesteigenvalue of these matrices. Equivalently, ๐’–1 (resp. ๐’–2) isa generalized eigenvector [39] of the pair (๐‘จ1,๐‘จ

โˆ’12 ) [resp.

(๐‘จ2,๐‘จโˆ’11 )].

F. Proof of Theorem 2

The asymptotic distributions of the GLR statistic ๐‘‡GLR inthe weak signal regime are given in [31, Sec. 6.5] for a testof the form

โ„‹0 : ๐œฝ๐‘Ÿ = ๐œฝ๐‘Ÿ0 , ๐œฝ๐‘  โ„‹1 : ๐œฝ๐‘Ÿ โˆ•= ๐œฝ๐‘Ÿ0 , ๐œฝ๐‘ , (69)

where ๐œฝ๐‘Ÿ โˆˆ โ„๐‘Ÿ, ๐œฝ๐‘  โˆˆ โ„๐‘ . One wishes to test whether ๐œฝ๐‘Ÿ =๐œฝ๐‘Ÿ0 as opposed to ๐œฝ๐‘Ÿ โˆ•= ๐œฝ๐‘Ÿ0 , and ๐œฝ๐‘  is a vector of nuisanceparameters which are unknown (but the same) under eitherhypothesis. The distributions are

2 log๐‘‡GLR โˆผ{

๐œ’2๐‘Ÿ , under โ„‹0,

๐œ’โ€ฒ2๐‘Ÿ (๐›ผ), under โ„‹1,

(70)

where the non-centrality parameter ๐›ผ is given in [31, eq.(6.24)]. In our case, the nuisance parameters are given by theelements of the noise spatial covariance ฮฃ. On the other hand,the parameter to be tested is ๐’‰ = 0 versus ๐’‰ โˆ•= 0. However,since the p.d.f. depends on ๐’‰ only through ๐’‰๐’‰๐ป , which isinvariant to multiplication of ๐’‰ by a unit-magnitude complexscalar, we can fix the imaginary part of the last element of ๐’‰(say) to zero. Thus the vector ๐œฝ๐‘Ÿ comprises the real parts ofthe elements of ๐’‰, plus the imaginary parts of the elements of๐’‰ save the last one. The size of ๐œฝ๐‘Ÿ is therefore ๐‘Ÿ = 2๐‘€ โˆ’ 1.

Direct application of [31, eq. (6.24)] to determine ๐›ผ isinvolved for the model at hand. We propose an alternativeapproach based on the following result, whose proof will bepresented in turn.

Lemma 6: Consider the GLR statistic ๐‘‡ from (8). Then onehas

lim๐‘โ†’โˆž

๐”ผ[โˆฃ๐‘‡ โˆ’ ๐‘‡ (๐œŒ0)โˆฃ2] = 0, (71)

where ๐œŒ0 = ๐’‰๐ปฮฃโˆ’1๐’‰ = ๐’–๐ปโˆฅ ๐’‰ is the true value of the SNR,

and

๐‘‡ (๐œŒ).=

[ 1๐‘ tr(๐‘ฐ๐‘ + ๐œŒ๐‘ช)

[det(๐‘ฐ๐‘ + ๐œŒ๐‘ช)]1/๐‘

]๐‘. (72)

From Lemma 6, it follows that

lim๐‘โ†’โˆž

๐”ผ[2 log๐‘‡ ] = lim๐‘โ†’โˆž

2 log๐‘‡ (๐œŒ0) (73)

= lim๐‘โ†’โˆž

2๐‘ log1๐‘ tr(๐‘ฐ๐‘ + ๐œŒ0๐‘ช)

[det(๐‘ฐ๐‘ + ๐œŒ0๐‘ช)]1/๐‘.

On the other hand, note that if ๐‘ฅ โˆผ ๐œ’โ€ฒ2๐‘Ÿ (๐›ผ), then ๐”ผ[๐‘ฅ] = ๐‘Ÿ+๐›ผ.

Hence,

lim๐‘โ†’โˆž

๐”ผ[2 log๐‘‡ ] = (2๐‘€ โˆ’ 1) + lim๐‘โ†’โˆž

๐›ผ. (74)

It follows from (73)-(74) that for sufficiently large ๐‘ , ๐›ผ โ‰ซ2๐‘€โˆ’1 and one may approximate ๐›ผ โ‰ˆ 2 log๐‘‡ (๐œŒ0) = ๐›ผsph(๐œŒ0).

Proof of Lemma 6: Write the data matrix as ๐’€ = ๐’‰๐’”๐‘‡ +ฮฃ1/2๐‘พ , where the entries of ๐‘พ

.= ฮฃโˆ’1/2[ ๐’›1 ๐’›2 โ‹… โ‹… โ‹… ๐’›๐‘ ]

are independent zero-mean complex circular Gaussian randomvariables with unit variance. It follows that for ๐‘จ โˆˆ โ„‚๐‘ร—๐‘ ,

๐”ผ[๐’€ ๐‘จ๐’€ ๐ป ] = tr(๐‘ชโˆ—๐‘จ)๐’‰๐’‰๐ป + tr(๐‘จ)ฮฃ. (75)

Now note that 1๐‘๐’€ ๐‘จ๐’€ ๐ป is a consistent estimator of its mean

for the matrices ๐‘จ in this paper, and hence

1

๐‘๐’€ ๐‘จ๐’€ ๐ป var

=tr(๐‘ชโˆ—๐‘จ)

๐‘๐’‰๐’‰๐ป +

tr(๐‘จ)

๐‘ฮฃ, (76)

wherevar= denotes stochastic convergence in variance [48],

i.e., ๐‘Ž๐‘var= ๐‘๐‘ iff lim๐‘โ†’โˆž ๐”ผ[โˆฃ๐‘Ž๐‘ โˆ’ ๐‘๐‘ โˆฃ2] = 0, applying

componentwise for matrices. Let us rewrite the GLR statistic๐‘‡ as

๐‘‡ = max๐œŒโ‰ฅ0

๐œ†โˆ’๐‘min

[( 1๐‘๐’€ ๐’€ ๐ป)โˆ’1( 1

๐‘๐’€ ๐‘ฎโˆ—(๐œŒ)๐’€ ๐ป)]

[det๐‘ฎโˆ—(๐œŒ)]โˆ’1. (77)

Then, taking into account that tr๐‘ชโˆ— = tr ๐‘ฐ๐‘ = ๐‘ , theasymptotic behavior of ๐‘‡ can be found:

๐‘‡var= (78)

max๐œŒโ‰ฅ0๐œ†โˆ’๐‘

min [1๐‘ tr๐‘ฎโˆ—(๐œŒ)โ‹…(ฮฃ+๐’‰๐’‰๐ป )โˆ’1(ฮฃ+๐‘Ž(๐œŒ)๐’‰๐’‰๐ป )]

[det๐‘ฎโˆ—(๐œŒ)]โˆ’1 =

max๐œŒโ‰ฅ0

[1๐‘ tr๐‘ฎโˆ—(๐œŒ)

[det๐‘ฎโˆ—(๐œŒ)]1/๐‘ โ‹… ๐œ†min

[๐‘ฐ๐‘€ โˆ’ 1โˆ’๐‘Ž(๐œŒ)

1+๐œŒ0๐’–โˆฅ๐’‰๐ป

]]โˆ’๐‘

(79)

where

๐‘Ž(๐œŒ).=

tr(๐‘ชโˆ—๐‘ฎโˆ—(๐œŒ))tr๐‘ฎโˆ—(๐œŒ)

. (80)

In order to obtain (79), the Matrix Inversion Lemma has beenapplied. Denote now the matrix in brackets in (79) by ๐‘ฉ. Then๐‘ฉ has an eigenvalue equal to one with multiplicity ๐‘€ โˆ’ 1,since ๐‘ฉ๐’— = ๐’— for any ๐’— such that ๐’—๐ป๐’‰ = 0. The remainingeigenvalue is associated to the eigenvector ๐’–โˆฅ and is given by

๐œ† =1 + ๐œŒ0๐‘Ž(๐œŒ)

1 + ๐œŒ0. (81)

We claim that ๐‘Ž(๐œŒ) โ‰ค 1 for all ๐œŒ โ‰ฅ 0, so that ๐œ† in (81) isless than one and hence it is the minimum eigenvalue of ๐‘ฉ.First, note that ๐‘Ž(0) = tr๐‘ชโˆ—/ tr ๐‘ฐ๐‘ = 1. On the other hand,for ๐œŒ > 0, since ๐‘ช and ๐‘ฎ(๐œŒ) share the same eigenvectors, wecan write ๐‘Ž(๐œŒ) in (80) in terms of the eigenvalues {๐œ†๐‘›}๐‘โˆ’1

๐‘›=0

of ๐‘ช as

๐‘Ž(๐œŒ) =

โˆ‘๐‘โˆ’1๐‘›=0

๐œ†๐‘›

1+๐œŒ๐œ†๐‘›โˆ‘๐‘โˆ’1๐‘›=0

11+๐œŒ๐œ†๐‘›

(82)

=1

๐œŒ

[๐‘โˆ‘๐‘โˆ’1

๐‘›=01

1+๐œŒ๐œ†๐‘›

โˆ’ 1

], (83)

where (83) follows from the fact that ๐‘ =โˆ‘

๐‘›1

1+๐œŒ๐œ†๐‘›+

๐œŒโˆ‘

๐‘›๐œ†๐‘›

1+๐œŒ๐œ†๐‘›. Note now that for positive numbers {๐‘ฅ๐‘›}๐‘โˆ’1

๐‘›=0 ,the Cauchy-Schwarz inequality can be invoked to show that(

1๐‘

โˆ‘๐‘› ๐‘ฅ๐‘›

) (1๐‘

โˆ‘๐‘›

1๐‘ฅ๐‘›

)โ‰ฅ 1. Hence, taking ๐‘ฅ๐‘› = 1 + ๐œŒ๐œ†๐‘›,

and noting thatโˆ‘

๐‘› ๐œ†๐‘› = ๐‘ , one has (1+๐œŒ)๐‘

โˆ‘๐‘›

11+๐œŒ๐œ†๐‘›

โ‰ฅ 1.

12

Applying this to (83) yields ๐‘Ž(๐œŒ) โ‰ค 1, as desired. Therefore,from (79), one has

๐‘‡var=

[min๐œŒโ‰ฅ0

[1๐‘ tr๐‘ฎโˆ—(๐œŒ)

det1/๐‘ ๐‘ฎโˆ—(๐œŒ)โ‹… 1 + ๐œŒ0๐‘Ž(๐œŒ)

1 + ๐œŒ0

]]โˆ’๐‘

. (84)

Note from (80) that 1 + ๐œŒ0๐‘Ž(๐œŒ) can be written as

1 + ๐œŒ0๐‘Ž(๐œŒ) =tr[(๐‘ฐ๐‘ + ๐œŒ0๐‘ช

โˆ—)๐‘ฎโˆ—(๐œŒ)]tr๐‘ฎโˆ—(๐œŒ)

. (85)

Substituting (85) in (84),

๐‘‡var=

(1

๐‘(1 + ๐œŒ0)min๐œŒโ‰ฅ0

๐ป(๐œŒ)

)โˆ’๐‘

, (86)

where we have introduced

๐ป(๐œŒ).=

tr[(๐‘ฐ๐‘ + ๐œŒ0๐‘ชโˆ—)๐‘ฎโˆ—(๐œŒ)]

det1/๐‘ ๐‘ฎโˆ—(๐œŒ). (87)

Since ๐‘ฎโˆ—(๐œŒ) = (๐‘ฐ๐‘ + ๐œŒ๐‘ชโˆ—)โˆ’1, applying Jensenโ€™s inequalityone has

log๐ป(๐œŒ)

= log๐‘โˆ’1โˆ‘๐‘›=0

1 + ๐œŒ0๐œ†๐‘›1 + ๐œŒ๐œ†๐‘›

+1

๐‘

๐‘โˆ’1โˆ‘๐‘›=0

log(1 + ๐œŒ๐œ†๐‘›) (88)

โ‰ฅ log๐‘ +1

๐‘

๐‘โˆ’1โˆ‘๐‘›=0

log1 + ๐œŒ0๐œ†๐‘›1 + ๐œŒ๐œ†๐‘›

+1

๐‘

๐‘โˆ’1โˆ‘๐‘›=0

log(1 + ๐œŒ๐œ†๐‘›)

(89)

= log๐‘ +1

๐‘

๐‘โˆ’1โˆ‘๐‘›=0

log(1 + ๐œŒ0๐œ†๐‘›) (90)

= log๐ป(๐œŒ0), (91)

showing that the minimum is attained at ๐œŒ = ๐œŒ0. Hence,

๐‘‡var=

โŽ›โŽœโŽ

[โˆ๐‘โˆ’1๐‘›=0 (1 + ๐œŒ0๐œ†๐‘›)

]1/๐‘1 + ๐œŒ0

โŽžโŽŸโŽ 

โˆ’๐‘

= ๐‘‡ (๐œŒ0), (92)

as was to be shown.Remark 3: Asymptotic unimodality in ๐œŒ. In addition to

being minimized at ๐œŒ = ๐œŒ0, the function ๐ป(๐œŒ) in (87)has no other local minimum in ๐œŒ โ‰ฅ 0 provided that thedetectability condition ๐‘ช โˆ•= ๐‘ฐ๐‘ holds, as we show next.Differentiating (88),

d log๐ป(๐œŒ)

d๐œŒ=

1

๐‘

๐‘โˆ’1โˆ‘๐‘›=0

๐œ†๐‘›1 + ๐œŒ๐œ†๐‘›

โˆ’[๐‘โˆ’1โˆ‘๐‘›=0

1 + ๐œŒ0๐œ†๐‘›1 + ๐œŒ๐œ†๐‘›

]โˆ’1 ๐‘โˆ’1โˆ‘๐‘›=0

๐œ†๐‘›(1 + ๐œŒ0๐œ†๐‘›)

(1 + ๐œŒ๐œ†๐‘›)2. (93)

Let now

๐›ฟ.= ๐œŒ0โˆ’๐œŒ, ๐‘ฅ๐‘›

.=

๐œ†๐‘›1 + ๐œŒ๐œ†๐‘›

โ‡’ 1 + ๐œŒ0๐œ†๐‘›1 + ๐œŒ๐œ†๐‘›

= 1+ ๐›ฟ๐‘ฅ๐‘›,

(94)

so that (93) becomes

d log๐ป(๐œŒ)

d๐œŒ(95)

=1

๐‘

โˆ‘๐‘›

๐‘ฅ๐‘› โˆ’[โˆ‘

๐‘›

(1 + ๐›ฟ๐‘ฅ๐‘›)

]โˆ’1 โˆ‘๐‘›

๐‘ฅ๐‘›(1 + ๐›ฟ๐‘ฅ๐‘›)

= โˆ’[โˆ‘

๐‘›

(1 + ๐›ฟ๐‘ฅ๐‘›)

]โˆ’1

๏ธธ ๏ธท๏ธท ๏ธธ.=๐‘1

โŽกโŽฃโˆ‘

๐‘›

๐‘ฅ2๐‘› โˆ’ 1

๐‘

(โˆ‘๐‘›

๐‘ฅ๐‘›

)2โŽคโŽฆ

๏ธธ ๏ธท๏ธท ๏ธธ.=๐‘2

โ‹…๐›ฟ.

(96)

Note from (94) that ๐‘1 > 0 for all ๐œŒ โ‰ฅ 0. On the other hand,from the Cauchy-Schwarz inequality one has ๐‘2 โ‰ฅ 0, withequality iff all ๐‘ฅ๐‘› are equal. But this would imply that alleigenvalues of ๐‘ช are equal, i.e., ๐‘ช = ๐‘ฐ๐‘ ; thus, ๐‘2 > 0.We conclude that the derivative (96) is negative if ๐›ฟ > 0and positive if ๐›ฟ < 0, which shows that ๐ป(๐œŒ) has a uniqueminimum at ๐œŒ = ๐œŒ0.

G. Gradient descent computations

The goal is to maximize ๐‘ก(๐œŒ) in (8) over ๐œŒ โ‰ฅ 0. Equivalently,we can minimize ๐‘ก0(๐œŒ)

.= ๐‘กโˆ’1/๐‘ (๐œŒ) by means of an iterative

gradient descent of the form ๐œŒ๐‘˜+1 = ๐œŒ๐‘˜ โˆ’ ๐œ‡๐‘˜๐‘กโ€ฒ0(๐œŒ๐‘˜), with

๐œ‡๐‘˜ > 0 a suitable stepsize sequence. In the simulations, weinitialized ๐œŒ0 = 1, ๐œ‡0 = 100. The stepsize is reduced as per๐œ‡๐‘˜ = 0.25๐œ‡๐‘˜โˆ’1 whenever there is a sign change in the descentdirection. The stopping criterion adopted is โˆฃ๐‘กโ€ฒ0(๐œŒ๐‘˜)โˆฃ < 10โˆ’5

with a maximum of 100 iterations.The required derivative is obtained as follows. Denote the

minimum unit-norm eigenvector of ๐‘ฝ ๐ป(๐‘ฐ๐‘ + ๐œŒ๐‘ชโˆ—)โˆ’1๐‘ฝ by๐’–0(๐œŒ). Then we can write

๐œ†0(๐œŒ).= ๐œ†min[๐‘ฝ

๐ป(๐‘ฐ๐‘ + ๐œŒ๐‘ชโˆ—)โˆ’1๐‘ฝ ] (97)

= ๐’–๐ป0 (๐œŒ)

[๐‘ฝ ๐ป(๐‘ฐ๐‘ + ๐œŒ๐‘ชโˆ—)โˆ’1๐‘ฝ

]๐’–0(๐œŒ), (98)

and therefore, using the expressions for eigenvalues derivativesin [47], one has

๐‘กโ€ฒ0(๐œŒ)

= [det(๐‘ฎโˆ—(๐œŒ))]โˆ’1/๐‘ โ‹… โˆ‚โˆ‚๐œŒ

(๐’–๐ป0 (๐œŒ)

[๐‘ฝ ๐ป๐‘ฎโˆ—(๐œŒ)๐‘ฝ

]๐’–0(๐œŒ)

)+ ๐œ†0(๐œŒ) โ‹… โˆ‚

โˆ‚๐œŒ[det(๐‘ฎโˆ—(๐œŒ))]โˆ’1/๐‘

= โˆ’ [det(๐‘ฎโˆ—(๐œŒ))]โˆ’1/๐‘ โ‹…โ‹… (

๐’–๐ป0 (๐œŒ)

[๐‘ฝ ๐ป๐‘ฎโˆ—(๐œŒ)๐‘ชโˆ—๐‘ฎโˆ—(๐œŒ)๐‘ฝ

]๐’–0(๐œŒ)

)+ ๐œ†0(๐œŒ) โ‹… [det(๐‘ฎโˆ—(๐œŒ))]โˆ’1/๐‘ 1

๐‘tr (๐‘ฎโˆ—(๐œŒ)๐‘ชโˆ—)

=

[โˆ’๐’–๐ป

0 (๐œŒ)๐‘จ(๐œŒ)๐’–0(๐œŒ) +1

๐‘๐œ†0(๐œŒ) โ‹… tr (๐‘ฎโˆ—(๐œŒ)๐‘ชโˆ—)

]โ‹…

โ‹… [det(๐‘ฎโˆ—(๐œŒ))]โˆ’1/๐‘,

where ๐‘จ(๐œŒ).= ๐‘ฝ ๐ป๐‘ฎโˆ—(๐œŒ)๐‘ชโˆ—๐‘ฎโˆ—(๐œŒ)๐‘ฝ . Using the EVD ๐‘ช =

๐‘ญฮ›๐‘ญ๐ป and the fact that ๐‘ญ๐ป approaches the orthonormal DFTmatrix for large ๐‘ , one can precompute ๐‘ญ๐ป๐‘ฝ โˆ— efficiently byapplying the FFT to the columns of ๐‘ฝ โˆ—. The elements of๐‘จ(๐œŒ) are then obtained as weighted crosscorrelations between

13

the columns of ๐‘ญ๐ป๐‘ฝ โˆ—, where the weights are of the form๐œ†๐‘›/(1 + ๐œŒ๐œ†๐‘›)

2. A similar approach can be used to compute๐‘ฝ ๐ป๐‘ฎโˆ—(๐œŒ)๐‘ฝ efficiently.

REFERENCES

[1] M. Wax, T. Kailath, โ€Detection of signals by information-theoreticcriteria,โ€ IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-33,pp. 387โ€“392, Apr. 1985.

[2] M. Viberg, B. Ottersten, T. Kailath, โ€Detection and estimation in sensorarrays using weighted subspace fitting,โ€ IEEE Trans. Signal Process.,vol. 39, pp. 2436โ€“2449, Nov. 1991.

[3] L. L. Scharf, B. Friedlander, โ€Matched subspace detectors,โ€ IEEE Trans.Signal Process., vol. 42, pp. 2146โ€“2157, Aug. 1994.

[4] O. Besson, S. Kraut, L. L. Scharf, โ€Detection of an unknown rank-onecomponent in white noise,โ€ IEEE Trans. Signal Process., vol. 54, pp.2835โ€“2839, Jul. 2006.

[5] A. Leshem, A.-J. van der Veen, โ€Multichannel detection of Gaussiansignals with uncalibrated receivers,โ€ IEEE Signal Process. Lett., vol. 8,pp. 120โ€“122, Apr. 2001.

[6] S. A. Vorobyov, A. Gershman, Kon Max Wong, โ€Maximum likelihooddirection-of-arrival estimation in unknown noise fields using sparsesensor arrays,โ€ IEEE Trans. Signal Process., vol. 53, pp. 34โ€“43, Jan.2005.

[7] M. Agrawal, S. Prasad, โ€A modified likelihood function approach toDOA estimation in the presence of unknown spatially correlated Gaus-sian noise using a uniform linear array,โ€ IEEE Trans. Signal Process.,vol. 48, pp. 2743โ€“2749, Oct. 2000.

[8] Yuehua Wu, Kwok-Wai Tam, Fu Li, โ€Determination of number ofsources with multiple arrays in correlated noise fields,โ€ IEEE Trans.Signal Process., vol. 50, pp. 1257โ€“1260, Jun. 2002.

[9] P. Stoica, Kon Max Wong, Qiang Wu, โ€On a nonparametric detectionmethod for array signal processing in correlated noise fields,โ€ IEEETrans. Signal Process., vol. 44, pp. 1030โ€“1032, Apr. 1996.

[10] M. Viberg, P. Stoica, B. Ottersten, โ€Maximum likelihood array pro-cessing in spatially correlated noise fields using parameterized signals,โ€IEEE Trans. Signal Process., vol. 45, pp. 996โ€“1004, Apr. 1997.

[11] I. Akyildiz, Won-Yeol Lee, M. Vuran, S. Mohanty, โ€A survey onspectrum management in cognitive radio networks,โ€ IEEE Commun.Mag., vol. 46, pp. 40โ€“48, Apr. 2008.

[12] J. M. Peha, โ€Sharing spectrum through spectrum policy reform andcognitive radio,โ€ Proc. IEEE, vol. 97, pp. 708โ€“719, Apr. 2009.

[13] J. Unnikrishnan, V. Veeravalli, โ€Cooperative sensing for primary detec-tion in cognitive radio,โ€ IEEE J. Sel. Topics Signal Process., vol. 2, pp.18โ€“27, Feb. 2008.

[14] A. Taherpour, M. Nasiri-Kenari, S. Gazor, โ€Multiple antenna spectrumsensing in cognitive radios,โ€ IEEE Trans. Wireless Commun., vol. 9, pp.814โ€“823, Feb. 2010.

[15] Rui Zhang, Teng Lim, Ying-Chang Liang, Yonghong Zeng, โ€Multi-antenna based spectrum sensing for cognitive radios: A GLRT ap-proach,โ€ IEEE Trans. Commun., vol. 58, pp. 84โ€“88, Jan. 2010.

[16] Pu Wang, Jun Fang, Ning Han, Hongbin Li, โ€Multiantenna-assistedspectrum sensing for cognitive radio,โ€ IEEE Trans. Veh. Technol., vol.59, pp. 1791โ€“1800, May 2010.

[17] D. Ramฤฑrez, G. Vazquez-Vilar, R. Lopez-Valcarce, J. Vฤฑa, I. Santamarฤฑa,โ€Detection of rank-P signals in cognitive radio networks with uncal-ibrated multiple antennas,โ€ IEEE Trans. Signal Process., vol. 59, pp.3764โ€“3774, Aug. 2011.

[18] IEEE 802.22 WRAN Working Group, IEEE 802.22-2011 Standard forCognitive Wireless Regional Area Networks (RAN) for Operation in TVBands. Piscataway, NJ: IEEE Standards Association, Jul. 2011.

[19] National Telecommunications and Information Administration, Report04-413: Potential interference from broadband over power line (BPL)systems to Federal Government radiocommunications at 1.7-80 MHz.Washington, D.C.: NTIA, Apr. 2004.

[20] J. P. Le Cadre, โ€Parametric methods for spatial signal processing in thepresence of unknown colored noise fields,โ€ IEEE Trans. Acoust., Speech,Signal Process., vol. ASSP-37, pp. 965โ€“983, Jul. 1989.

[21] Zhi Quan, Wenyi Zhang, S. J. Shellhammer, A. H. Sayed, โ€Optimalspectral feature detection for spectrum sensing at very low SNR,โ€ IEEETrans. Commun., vol. 59, pp. 201โ€“212, Jan. 2011.

[22] Wenyi Zhang, H. Poor, Zhi Quan, โ€Frequency-domain correlation: anasymptotically optimum approximation of quadratic likelihood ratiodetectors,โ€ IEEE Trans. Signal Process., vol. 58, pp. 969โ€“979, Mar.2010.

[23] G. Vazquez-Vilar, R. Lopez-Valcace, J. Sala, โ€Multiantenna spectrumsensing exploiting spectral a priori information,โ€ IEEE Trans. WirelessComm.,, Vol.10, No.12, pp. 4345โ€“4355, Dec. 2011.

[24] J. Tellado, Multicarrier modulation with low PAR: Applications to DSLand wireless. Norwell, MA: Kluwer Academic, 2000.

[25] A. Dogandzic, A. Nehorai, โ€Generalized multivariate analysis of vari-ance - A unified framework for signal processing in correlated noise,โ€IEEE Signal Process. Mag., vol. 20, pp. 39โ€“54, Sep. 2003.

[26] S. Kraut, L. L. Scharf, L. T. McWhorter, โ€Adaptive subspace detectors,โ€IEEE Trans. Signal Process., vol. 49, pp. 1โ€“16, Jan. 2001.

[27] Yuanwei Jin, B. Friedlander, โ€A CFAR adaptive subspace detector forsecond-order Gaussian signals,โ€ IEEE Trans. Signal Process., vol. 53,pp. 871โ€“884, Mar. 2005.

[28] A. Paulraj, T. Kailath, โ€Eigenstructure methods for direction-of-arrivalestimation in the presence of unknown noise fields,โ€ IEEE Trans.Acoust., Speech, Signal Process., vol. ASSP-34, pp. 13โ€“20, Jan. 1986.

[29] P. Stoica, M. Cedervall, โ€Detection tests for array processing in unknowncorrelated noise fields,โ€ IEEE Trans. Signal Process., vol. 45, pp. 2351โ€“2362, Sep. 1997.

[30] B. C. Levy, Principles of Signal Detection and Parameter Estimation,Springer, 2010.

[31] S. M. Kay, Fundamentals of Statistical Signal Processing: DetectionTheory. Englewood Cliffs, NJ: Prentice-Hall, 1998.

[32] R. M. Gray, Toeplitz and Circulant Matrices: A Review. NY:Hanover/Now, 2006.

[33] J. Capon, โ€High-resolution frequency-wavenumber spectrum analysis,โ€Proc. IEEE, vol. 57, pp. 1408โ€“1418, Aug. 1969.

[34] A. J. Laub, Matrix Analysis for Scientists and Engineers, Soc. forIndustrial and Applied Math., Philadelphia, 2005.

[35] H. L. van Trees, Optimum Array Processing (Detection, Estimation andModulation Theory), Wiley, 2002.

[36] S. M. Kay, Modern Spectral Estimation: Theory and Applications,Prentice-Hall, 1987.

[37] D. E. Bernstein, Matrix Mathematics: Theory, Facts and Formulas, 2nded. Princeton University Press, 2009.

[38] R. A. Horn, C. R. Johnson, Matrix Analysis. Cambridge University Press,1986.

[39] G. H. Golub, C. F. van Loan, Matrix Computations, 2nd ed. JohnHopkins University Press, 1989.

[40] U. Grenander, G. Szego, Toeplitz Forms and Their Applications. NewYork, Chelsea Pub. Co., 1984.

[41] A. J. Gray, J. Markel, โ€A spectral flatness measure for studying theautocorrelation method of linear prediction of speech analysis,โ€ IEEETrans. Acoust., Speech, Signal Process., vol. ASSP-22, pp. 207โ€“217,Jun. 1974

[42] J. P. Dugre, L. L. Scharf, A. Beex, โ€A note on the measurement ofspectral flatness and the calculation of prediction error variances,โ€ Proc.IEEE Int. Conf. Acoust., Speech, Signal Process., 1980, vol. 5, pp. 607โ€“611.

[43] EN 300 910, โ€Digital cellular telecommunications system (Phase 2+);Radio transmission and reception (GSM 05.05 version 8.5.1 Release1999),โ€ ETSI, Nov. 2000.

[44] L. Hentila, P. Kyosti, M. Kaske, M. Narandzic, M. Alatossava, โ€Matlabimplementation of the WINNER Phase II Channel Model v1.1โ€, Online:https://www.ist-winner.org/phase 2 model.html, 2007.

[45] R. Tandra, A. Sahai, โ€SNR Walls for Signal Detectionโ€, IEEE Journal ofSelected Topics in Signal Processing, Vol.2, No.1, pp. 3-17, Feb. 2008.

[46] P. Lancaster, M. Tismenetsky, The Theory of Matrices, 2nd ed. SanDiego, CA: Academic Press, 1985.

[47] J. R.Magnus, H. Neudecker, Matrix Differential Calculus with Applica-tions to Statistics and Econometrics, revised ed. Chichester, England:Wiley, 1999.

[48] R. J. Serfling, Approximation Theorems of Mathematical Statistics.Chichester, England: Wiley, 1980.