Beamforming Techniques for mmWave Hybrid Analog-Digital ...
-
Upload
khangminh22 -
Category
Documents
-
view
0 -
download
0
Transcript of Beamforming Techniques for mmWave Hybrid Analog-Digital ...
HAL Id: tel-03595055https://tel.archives-ouvertes.fr/tel-03595055
Submitted on 3 Mar 2022
HAL is a multi-disciplinary open accessarchive for the deposit and dissemination of sci-entific research documents, whether they are pub-lished or not. The documents may come fromteaching and research institutions in France orabroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, estdestinée au dépôt et à la diffusion de documentsscientifiques de niveau recherche, publiés ou non,émanant des établissements d’enseignement et derecherche français ou étrangers, des laboratoirespublics ou privés.
Beamforming Techniques for mmWave HybridAnalog-Digital Transceivers
Nabil Akdim
To cite this version:Nabil Akdim. Beamforming Techniques for mmWave Hybrid Analog-Digital Transceivers. Signal andImage processing. Université Paris-Saclay, 2021. English. NNT : 2021UPASG092. tel-03595055
Beamforming Techniques for mmWave Hybrid Analog-Digital Transceivers
Thèse de doctorat de l'université Paris-Saclay
École doctorale n°580 : sciences et technologies de l'information et de la communication (STIC)
Spécialité de doctorat : Réseaux, information et communications Unité de recherche : Université Paris-Saclay, CNRS, CentraleSupélec, Laboratoire
des signaux et systèmes, 91190, Gif-sur-Yvette, France. Référent : CentraleSupélec
Thèse présentée et soutenue à Paris-Saclay,
le 13/12/2021, par
Nabil AKDIM
Composition du Jury
Didier le Ruyet Professeur, Conservatoire National des Arts et Métiers
Président & Rapporteur
Ghaya Rekaya Professeure, Télécom Paris Rapporteure & Examinatrice
Direction de la thèse Pierre Duhamel Professeur, CentraleSupélec Directeur de thèse
Yang Sheng Professeur, CentraleSupélec Co-Directeur de thèse
Mustapha Benjillali Professeur, Institut National des Postes et Télécommunications
Invité
Carles Navarro Manchon Professeur associé, Aalborg University Invité
Thès
e de
doc
tora
t N
NT
: 202
1UPA
SG09
2
Maison du doctorat de l’Université Paris-Saclay 2ème étage aile ouest, Ecole normale supérieure Paris-Saclay 4 avenue des Sciences, 91190 Gif sur Yvette, France
Titre : Techniques de formation de beamformers pour les émetteurs-récepteurs hybrides analogiques- numériques mmWave
Mots clés : beamformers, émetteurs-récepteurs hybrides, mmWave
Résumé : La forte augmentation des applications gourmandes en bande passante ces dernières années et la pénurie mondiale des bandes cellulaire dans le spec- tre traditionnel des bandes basses ont rendu le spectre vacant dans les bandes de fréquences à ondes millimétriques (mmWave) d’une importance primordiale pour tous les acteurs clés de l’industrie cellulaire. Cela étant dit, communi- quer sans fil sur les fréquences mmWave serait néanmoins une tâche difficile, principalement en rai-son de la faible réflectivité, de l’absorption élevée et des pertes de prop- agation en espace libre impor-tantes sur des bandes aussi élevées.
L’une des solutions très populaires pour surmonter les problèmes de propagation susmentionnés con-siste à utiliser des réseaux d’antennes massifs, une technique rendue possible grace a la proportionna-lité entre la taille physique du réseau et la longueur d’onde de la porteuse (les fréquences mmWave sont caractérisées par une petite longueur d’onde, ce qui se traduit par réseaux d’antennes compacts avec un grand nombre d’éléments). Cependant, le coût élevé, la consommation d’énergie et la comple-xité du matériel de signal mixte chez mmWave ren-dent impossible l’utilisation de grands réseaux d’an-tennes avec des éléments à commande numérique.
Le cloisonnement des traitements de signal liés aux émetteurs-récepteurs mmWave a rendu possible une mise en œuvre économique et énergétique-ment effi cace de ces derniers. Ces nouvelles archi-tectures sont connues sous le nom d’émetteurs-récepteurs hybrides analogiques/numériques de for-mation de faisceaux.
Ces architectures hybrides divisent le traitement de précodage/combinaison entre les domaines analogique et numérique, ce qui réduit considéra-blement le nombre requis de chaînes RF.
Les structures des réseaux hybrides soulevent néanmoins leurs propres défis, le faible rapport signal sur bruit (SNR) résultant des pertes de pro-pagation élevées, la grande dimensionnalité de la matrice de canaux sans fil Multiple-Input-Multiple-Output (MIMO) et la présence de traitement ana-logique compliquent l’acquisition des informations d’état du canal (CSI) et le calcul des précodeurs et combineurs MIMO.
Relever les défis susmentionnés est essentiel pour activer le cellulaire basé sur mmWave. Avec cette motivation à l’esprit, cette thèse propose de nouvelles solutions algorithmiques qui les abor-dent. Nous proposons des solutions rapides et de faible complexité qui offrent des performances ef-ficaces tout en respectant les contraintes matériel-les. Les principales contributions de la présente thèse consistent à concevoir des algorithmes rapi-des et de faible complexité qui permettent de construire des précodeurs et des combineurs ro-bustes pendant la phase d’apprentissage du be-amformer et sans avoir besoin d’estimer explicite-ment le CSI puis de l’utiliser pour dériver ces be-amformers.
Maison du doctorat de l’Université Paris-Saclay 2ème étage aile ouest, Ecole normale supérieure Paris-Saclay 4 avenue des Sciences, 91190 Gif sur Yvette, France
Title : Beamforming Techniques for mmWave Hybrid Analog-Digital Transceivers.
Keywords : Beamforming, mmWave, Hybrid Analog-Digital Transceivers.
Abstract : The high increase of bandwidth greedy applications in recent years, and the global wireless band- width shortage in the traditional low band spectra has made vacant spectrum in millimeter-wave (mmWave) frequency bands of a paramount importance for all key players in the cellular indus-try. This being said, wireless communications over mmWave frequencies will nevertheless be a chal-lenging task, mainly due to the diffraction capability, high absorption and large free space propagation losses on such high bands.
One of the very popular solutions to overcome the aforementioned propagation issues is to use mas-sive an- tenna arrays, a technique that is made pos-sible because if the proportionality between the ar-ray’s physical size and the carrier wavelength (mmWave frequencies are characterized by small wavelength, which translates to compact antenna arrays with high number of elements). However, the high cost, power consumption and complexity of the mixed signal hardware at mmWave make having large an- tenna arrays with digitally controlled ele-ments infeasible.
The partitioning of the signal processing operations related to the mmWave transceivers made having a cost and energy effective implementation of these latter possi- ble. These novel architectures are known as hybrid analog/digital beamforming trans-ceivers. These hybrid architectures divide the pre-coding/combining processing between the analog and digital domains, which reduces considerably the required number of RF chains.
Hybrid array structures entail nevertheless their own challenges, the low signal-to-noise ration (SNR) resulting from high propagation losses, the large dimensionality of the Multiple-Input-Multiple-Output (MIMO) wireless channel matrix and the presence of analog processing complicate the ac-quisition of the channel state information (CSI) and the computation of the MIMO precoders and combiners.
Addressing the aforementioned challenges is key to enabling mmWave based cellular. With this mo-tivation in mind, this dissertation proposes novel algorithmic solutions that tackle them. We pro-pose fast and low complexity solutions that yield efficient performance while respecting the hardware constraints. The main contributions of the present thesis consist of devising fast and low complexity algorithms that enable building robust precoders and combiners during the beam trai-ning phase and without the need of explicitly esti-mating the CSI and then using it to derive these beamformers.
Beamforming Techniques for mmWave Hybrid Analog-Digital
Transceivers
by
Nabil Akdim
DISSERTATION
Presented to the Faculty of the Graduate School of
CentraleSupélec - Paris-Saclay University
in Partial Fulfillment of the Requirements
for the Degree of
DOCTOR OF PHILOSOPHY
CentraleSupélec - Paris-Saclay University
France
September 2021
Beamforming Techniques for mmWave Hybrid Analog-Digital
Transceivers
Nabil Akdim
CentraleSupélec - Paris-Saclay University, 2021
Abstract
The high increase of bandwidth greedy applications in recent years, and the global wireless band-
width shortage in the traditional low band spectra has made vacant spectrum in millimeter-wave
(mmWave) frequency bands of a paramount importance for all key players in the cellular industry.
This being said, wireless communications over mmWave frequencies will nevertheless be a challenging
task, mainly due to the poor diffraction capability, high absorption and large free space propagation
losses on such high bands.
One of the very popular solutions to overcome the aforementioned propagation issues is to use
massive antenna arrays, a technique that is made possible because of the proportionality between
the array’s physical size and the carrier wavelength. However, the high cost, power consumption and
complexity of the mixed signal hardware at mmWave complicates to a great extent implementing large
antenna arrays with digitally controlled elements.
Nevertheless, the partitioning of the signal processing operations related to the mmWave transceivers
has allowed cost and energy effective implementation of these latter. These novel architectures are
known as hybrid analog/digital beamforming transceivers. These hybrid architectures divide the pre-
coding/combining processing between the analog and the digital domains, which reduces considerably
ii
iii
the required number of RF chains.
Hybrid array structures entail nevertheless their own challenges, the low signal-to-noise ratio (SNR)
resulting from high propagation losses, the large dimensionality of the multiple-input-multiple-output
(MIMO) wireless channel matrix and the presence of analog processing complicate the acquisition of
the channel state information (CSI) and the computation of the MIMO precoders and combiners.
Addressing the aforementioned challenges is key to enabling mmWave based cellular. With this
motivation in mind, this dissertation proposes novel algorithmic solutions that tackle these latter.
We propose fast and low complexity solutions that yield efficient performance while respecting the
hardware constraints. The main contributions of the present thesis are deriving fast and low complexity
algorithms that enable building robust precoders and combiners during the beam training phase and
without the need of explicitly estimating the CSI and then using it to derive these beamformers.
In the first contribution, we use advanced tools from Bayesian active learning and approximate in-
ference theories, to devise a robust and fast hierarchical beam search algorithm that reduces the amount
of time or resources needed for the beam search process while guaranteeing a low beam misalignment
probability. Our second contribution proposes a novel beam training strategy based on alternating
transmissions between two hybrid mmWave transceivers. The main idea behind our proposal is to
exploit the reciprocity of the mmWave MIMO channel between the two transceivers. With appropriate
processing at each device, the alternate transmissions implicitly implement an algebraic power itera-
tion that leads to approximating the top left and right singular vectors of that MIMO channel matrix.
Mathematical analysis as well as numerical simulations illustrate the promising performance of the
proposed solutions, making them as enabling technologies for mmWave hybrid transceiver systems.
Contents
Abstract vii
List of Figures ix
List of Tables xi
1 Introduction 1
1.1 Why do Cellular Communications Need mmWave Bands ? . . . . . . . . . . . . . . . . . 2
1.2 Hybrid Beamforming as an Enabler . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.3 Hybrid Beamforming Challenges . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.4 Overview of Contributions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.5 Organization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2 MmWave Channel Characteristics 9
2.1 MmWave vs. Sub-6 Ghz propagation environements . . . . . . . . . . . . . . . . . . . . 9
2.2 MmWave Channel Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2.2.1 Large Scale Fading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.2.2 Small Scale Fading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
2.3 MmWave Transceiver Design . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
2.3.1 MmWave Hybrid Digital-Analog Antenna Array Architectures . . . . . . . . . . 22
v
vi CONTENTS
3 mmWave Wireless Channels Variational Online Learning 31
3.1 Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
3.2 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
3.3 System Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
3.4 RF Codebook . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
3.5 Sequential Beam Pair Search via Variational HiePM . . . . . . . . . . . . . . . . . . . . 38
3.5.1 Sequential Active Learning via the HiePM Strategy . . . . . . . . . . . . . . . . 38
3.5.2 The Variational Expectation Maximization HiePM Scheme: VEM-HiePM . . . . 40
3.5.3 The Variational Model Comparison Based HiePM Scheme : VMC-HiePM . . . . 44
3.6 Numerical Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
3.7 Conclusion and Future Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
4 Mutli-Stream Beamforming with Hybrid Arrays 57
4.1 Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
4.2 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
4.3 System Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
4.4 Hybrid Ping Pong Multi Beam Training : Hybrid PPMBT . . . . . . . . . . . . . . . . . 62
4.4.1 Ping-Pong Multi Beam Training with Digital Antenna Arrays: Digital PPMBT . 63
4.4.2 Analog Precoder Multi Level Codebook . . . . . . . . . . . . . . . . . . . . . . . 64
4.4.3 Ping Pong Multi Beam Training with Hybrid Antenna Arrays : Hybrid PPMBT 65
4.5 Numerical Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
4.6 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
5 Concluding Remarks 77
5.1 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
5.2 Future Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78
CONTENTS vii
Appendices 81
A Variational Hierarchical Posterior Matching for mmWave Wireless Channels Online
Learning 83
B Ping Pong Beam Training for Multi Stream MIMO Communications with Hybrid
Antenna Arrays 89
Bibliography 97
List of Figures
1.1 Estimation of Global Mobile Subscriptions [1] . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2 Current Cellular Spectrum in the US [2] . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.3 Vehicle-To-Everything (V2X) Communication . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.4 Device to Device for Wearable Devices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.1 MmWave vs. Sub-6 Ghz Propagation Differences . . . . . . . . . . . . . . . . . . . . . . . . 10
2.2 MmWave Blockage Agents . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
2.3 Separate Modeling of LOS and NLOS Links . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
2.4 Two-state Markov Model for Blockage Events . . . . . . . . . . . . . . . . . . . . . . . . 15
2.5 Two-State Shadowing Event with 0dB Threshold Showing Unshadowed (Black Line)
and Shadowed (Red Line) Regions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
2.6 Time Domain Clusters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
2.7 Angular Domain Clusters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
2.8 28 GHz Time Domain Clusters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
2.9 28 GHz Angular Domain Clusters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
2.10 MIMO Transceiver Architecture at Frequencies below 6 GHz . . . . . . . . . . . . . . . . . . 21
2.11 MIMO Hybrid Digital-Analog MmWave Transceiver Architecture . . . . . . . . . . . . . . . . 23
2.12 Hybrid Precoding w/ Phase Shifters – Fully Connected . . . . . . . . . . . . . . . . . . . . . 24
2.13 Hybrid Precoding w/ Phase Shifters – Partially Connected . . . . . . . . . . . . . . . . . . . 24
ix
x LIST OF FIGURES
2.14 Hybrid Precoding with Lenses . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
3.1 Hybrid Transceiver Structure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
3.2 Structure of a Multi-resolution Codebook with a Resolution Parameter with N = 8 . . . . . . 37
3.3 Resulting Beam Patterns of the Beamforming Vectors in the First Three Codebook Levels of
a Hierarchical Codebook. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
3.4 Beamforming Loss of the Different search schemes in a Channel with L = 0 Scattered
Components. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
3.5 Beamforming Loss of the Different Search Schemes in a Channel with L = 0 Scattered
Components. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
3.6 VMC-HiePM Performance in Channels with Different Power Ratios Between the Dom-
inant Path and L = 3 Scattered Components. . . . . . . . . . . . . . . . . . . . . . . . . 55
4.1 Structure of the Hybrid Transceivers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60
4.2 Array Gains Obtained with the Analog Beamformers of the Proposed Multi-Level Code-
book, NA = 16, NRF = 4, LA = 3. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
4.3 Spectral Efficiency When NRFA =NRF
B = 8, NS=4, L=8, ρ=30dB. . . . . . . . . . . . . . 71
4.4 Spectral Efficiency for Different NS Values. NA=NB=128, NRFA =NRF
B =8, L=8, ρ=30dB. 72
4.5 Spectral Efficiency over Different NS Values. NA=NB=64, NRFA =NRF
B =8, L=7. . . . . 75
(a) Spectral Efficiency over SNR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
(b) Spectral Efficiency over PP Iterations . . . . . . . . . . . . . . . . . . . . . . . . . 75
4.6 Spectral Efficiency When Device A is Equipped with a Hybrid Array and Device B has
a Full Digital Architecture. NA=128, NB=4, NRFA =16, NS=4, L=40. . . . . . . . . . . . 76
List of Tables
2.1 Omnidirectional Path Loss Models in the Umi Scenario . . . . . . . . . . . . . . . . . . . . . 27
2.2 LOS Probability Models in the Umi Scenario . . . . . . . . . . . . . . . . . . . . . . . . . . 28
2.3 Range for the power Consumption for the different devices in a mmWave front-end . . . . . . 29
xi
2 CHAPTER 1. INTRODUCTION
1.1 Why do Cellular Communications Need mmWave Bands ?
The next couple of years are expected to see a surge in the number of mobile subscriptions. According
to a study conducted by the international telecommunication union (ITU) [1], and as can be seen in
Fig. 1.1, the total number of global mobile subscribers is estimated to keep increasing by about 10%
year over year and will attain more than 17 billion by the year 2030.
Figure 1.1: Estimation of Global Mobile Subscriptions [1]
Such a high number of connected wireless devices, together with the emergence of new mobile
applications that require very high data rate communications, emanating from these latter, as well as
the wireless network’s user’s expectation to have a satisfactory end-user experience even in crowded
areas [3], will pose a big challenge ahead of the wireless service providers, who will have to come up
with new and innovative technologies in order to be able to cope with this explosive increase in the
mobile data traffic.
The wireless service providers will also have to overcome the global bandwidth shortage which is
caused by the congestion and fragmentation of the traditional low band spectra [4]. Fig 1.2 [2] shows
how the below 3Ghz cellular spectrum currently in use in the united states shows is badly packed, it
also shows how scarce is the total remaining free bandwidth that can be used for new applications
(only about 600 Mhz) and also how high is the cost to acquire such small bandwidth.
The difficulty to achieve the aforementioned demands faced by wireless service providers, using the
1.1. WHY DO CELLULAR COMMUNICATIONS NEED MMWAVE BANDS ? 3
Figure 1.2: Current Cellular Spectrum in the US [2]
congested and fragmented traditional low spectral bands, has pushed them to start exploring vacant
spectrum at the millimeter-wave (mmWave) frequency bands [4].
Mmwave carrier frequencies span the spectrum range from 30 GHz to 300 GHz, whereas the ma-
jority of to date deployed wireless systems operate in the below 6 Ghz spectra. Going up to such
high frequencies opens up the door to accessing multi-Ghz unused spectral bandwidths, which would
then translate to higher data rates. An example of such use is the WiGig [5] wireless systems that is
deployed in the 60 GHz unlicensed mmWave band and that makes use of a 2 Ghz large bandwidth and
combines with the orthogonal frequency division multiplexing (OFDM) wavefrom modulation scheme
to reach data rates up to 6 Gbps.
Interest in mmWave frequencies for wireless communications dates back to the 19th century. Bose
and Lebedev [6] started experimenting in such high bands already in the 1890s. But, this interest
started to gain momentum and materialise when academia, industry as well as governments, with the
hope of solving the above-mentioned issues related to bandwidth scarcity and fragmentation in the
the sub 6 Ghz spectrum, began backing the idea of deploying a flavor of the 5th generation of cellular
systems, known as 5G new radio (NR) [7], on mmWave bands [8–10]. All these efforts resulted in two
types of 5G NR cellular deployments, the first known as frequency range 1 (FR1) NR option, deployed
on sub 6 Ghz frequencies, and the second known as frequency range 2 NR (FR2) option [11], deployed
over the mmWave frequency bands. This latter FR2 option benefited, in its initial deployment, from
4 CHAPTER 1. INTRODUCTION
large bandwidths (of up to 3 Ghz) in the 28 Ghz and 39 Ghz frequency band [12], which, combined with
advanced multi antenna beamforming techniques, allowed achieving the astonishingly high throughput
of up to 4.3 Gbps on a cellular hand-held device [13].
Aside from cellular applications, mmWave can have many other potential applications. Being
already heavily used in the well established automotive radar business [14], adding the communication
dimension to it would enable new mmWave vehicle-to-everything (V2X) applications like cloud assisted
or fully automated driving, as shown in Fig. 1.31, and would open new possibilities for the assisted
and autonomous driving industry [15].
MmWave is also of interest for high speed wearable networks that connect cell phone, smart watch,
augmented reality glasses and virtual reality headsets [16]. Examples of such use cases are shown in
Fig. 1.3 and Fig. 1.42.
Figure 1.3: Vehicle-To-Everything (V2X) Communication
This being said, wireless communications over mmWave frequencies will nevertheless be a challeng-
ing task, mainly due to the poor diffraction capability, high absorption and large free space propagation
losses on such high bands [17].
1Figure is taken from Professor Robert W. Heath Jr’s mmWave communications tutorial given in the signal processingsummer school that was held in Chalmers University in Gothenburg, Sweden, during summer 2017.
2Similar to Fig. 1.3, this figure is taken from Professor Robert W. Heath Jr’s mmWave communications tutorial givenin the signal processing summer school that was held in Chalmers University in Gothenburg, Sweden, during summer2017.
1.2. HYBRID BEAMFORMING AS AN ENABLER 5
Figure 1.4: Device to Device for Wearable Devices
1.2 Hybrid Beamforming as an Enabler
Fortunately, this frequency range will allow for the use of compact and small antenna arrays with high
number of elements, as the physical size of the array is proportional to the carrier wavelength. The
large beamforming gains that such large-scale arrays enable will be used to compensate for the above
limitations. However, the high cost, power consumption and complexity of the mixed signal hardware
at mm-wave make having large antenna arrays with digitally controlled elements infeasible [18]. A
direct implication of these new hardware constraints is the renewed interest in partitioning the related
signal processing operations between analog and digital domains to reduce the number of required
mixed signal hardware components, like analog-to-digital converters (ADCs), or their resolution, thus
reducing their power consumption and die size footprints. This has led to the development of new and
novel transceiver architectures dubbed hybrid analog/digital beamforming architectures. These hybrid
architectures divide the precoding/combining processing between the analog and digital domains, which
reduces the required number of RF chains. reducing thus hardware implementation complexity and
related power consumption, which enables in turn having portable device being able to communicate
wirelessly over mmWave frequencies.
6 CHAPTER 1. INTRODUCTION
1.3 Hybrid Beamforming Challenges
The hardware constraints associated with the hybrid architectures such as the limitations on the RF
components and the coupling between analog and digital precoders, however, impose new constraints
on the precoding/combining and the wireless channel estimation design problems.
Accurate Channel State information is critical for efficient operation in wireless communication
systems. The task of obtaining such information at hybrid beamforming mmWave systems represents
a major challenge. In addition to the large training overhead associated with the large arrays and
the SNR that is typically low before beamforming design, the hardware constraints, that results from
RF/hybrid precoding, makes the channels at the baseband seen only through the RF lens [18]. This
has renewed the interest in beam training techniques. These techniques make use of multi-stage radio
frequency (RF) codebooks together with adaptive beamwidth beamforning algorithms to jointly design
the transmitter and receiver beamforming vectors with the goal of maximizing the effective receive
gain of the wireless link being used [18, 19]. Despite the reduced complexity of the aforementioned
algorithms, they do entail a large search overhead and are prone to errors in noisy channels. This
has motivated devising novel beam search techniques that are robust both to inter-beam interference
leakage caused by the RF codebook imperfections and to the additive and multiplicative noise that is
inherent to any wireless communication system.
1.4 Overview of Contributions
Addressing the aforementioned challenges is the key to enabling mmWave based cellular. With this
motivation, this dissertation proposes novel algorithmic solutions that tackle them. We propose low-
complexity and fast solutions that yield efficient performance while respecting the hardware constraints.
The primary contributions of this dissertation can be summarized as follows.
• Our first contribution proposes efficient single stream sequential noisy beam search techniques for
mmWave systems with hybrid architectures. We use a combination of bayesian active learning
1.5. ORGANIZATION 7
and of advanced inference techniques to benefit from the reduced search time of the classical
hierarchical beam-search technique, while at the same time reduce this latter’s inherent high
probability of beam misalignment, especially on noisy channels [20].
• Our second contribution proposes a novel beam training strategy based on alternating transmis-
sions between two hybrid mmWave transceivers. The main idea behind our proposal is to exploit
the reciprocity of the mmWave MIMO channel between the two transceivers. With appropriate
processing at each device, the alternate transmissions implicitly implement an algebraic power it-
eration that leads to approximating the top left and right singular vectors of that MIMO channel
matrix [21].
1.5 Organization
The rest of this thesis is organized as follows. In Chapter 2, we introduce mmWave channel character-
istics as well as mmWave hybrid transceiver design challenges. We then detail our first contribution,
dubbed "Variational Hierarchical Posterior Matching for mmWave Wireless Channels Online Learn-
ing" in Chapter 3. We show how the interplay of bayesian active learning and of advanced inference
techniques can devise fast and efficient beam search algorithms for single stream mmWave communi-
cation systems. In Chapter 4, we detail our second contribution, entitled "Ping Pong Beam Training
for Multi Stream MIMO Communications with Hybrid Antenna Arrays". We show how our proposal
approximates singular value decomposition (SVD) precoding with hybrid transceivers, enabling thus
robust multi-stream mmWave wireless communication systems. Concluding remarks and future work
are finally presented in Chapter 5.
Chapter 2
MmWave Channel Characteristics
Understanding the mmWave signal propagation characteristics and properties is fundamental to be
able to come up with accurate mmWave channel models. These latter are needed to help assess the
usability and also compare the different mmWave wireless communication systems. We will discuss in
this chapter the details of such propagation mechanisms.
2.1 MmWave vs. Sub-6 Ghz propagation environements
Spectral wave propagation at mmWave frequencies differ in many aspects from that of the low frequency
bands, mainly due to the very small wavelength compared to the size of most of the objects in the
environment. On one hand, diffraction effects, one of the main propagation mechanisms in sub 6 Ghz
bands [22], contributes much less to the overall mmWave signal propagation due to the reduced Fresnel
zone. Scattering, on the other hand, tends to be higher due to the increased effective roughness of
materials, but remains limited still and not as rich as in lower frequency bands. mmWave propagation
is also characterized by higher absorption and larger free space propagation losses [17,18].
All these differences affect heavily all mmWave channel’s properties. Multi-paths in mmWave tend
to be more clustered and exhibit far fewer paths than on lower frequency channels, leading to more
9
10 CHAPTER 2. MMWAVE CHANNEL CHARACTERISTICS
sparsity in the delay, angular and spatial domains. Doppler shift will have a bigger noisy effect on the
signal due to the large carrier frequency and large bandwidths on mmWave bands. Angular spread
will be smaller because of the high sensitivity to blockages (buildings, human body or even user’s
own fingers) and strong differences between line-of-sight and non-line-of-sight propagation conditions.
Fig.2.11 summarizes all these differences.
Figure 2.1: MmWave vs. Sub-6 Ghz Propagation Differences
2.2 MmWave Channel Models
mmWave Channel models are required for simulating the wireless mmWave signal propagation mecha-
nisms in a reproducible and cost-effective way. This is needed to accurately design and compare radio
air interfaces, system deployment and develop adequate signal processing algorithms for mmWave
transmitters and receivers.
MmWave channel model parameters can be split into two classes :
• The large scale parameter class, which encompasses characteristics like path loss, shadowing and
blockage (this latter translates to line of sight and non line of sight (LOS/NLOS) probability1Similar to Fig. 1.3, this figure is taken from Professor Robert W. Heath Jr’s mmWave communications tutorial given
in the signal processing summer school that was held in Chalmers University in Gothenburg, Sweden, during summer2017.
2.2. MMWAVE CHANNEL MODELS 11
models).
• The small scale parameter class, which encompasses characteristics like delay spread, Doppler
spread, angular spread and number of multi-path component clusters.
Let us discuss next the details of each of these two classes.
2.2.1 Large Scale Fading
In this section, we will detail the main large scale parameters of mmWave channel models. These
include path loss and shadowing parameters as well as large scale blockage parameters and related
modelling.
Path Loss and Shadowing Models
Path loss and Shadowing models for mmWave channels are inspired by Friis Law [19] and follow an
additive white noise linear log-distance parametric model [22,23] as shown in equation 2.1 below :
PL(d)[dB] = α+ 10β log10(d) + ξ, ξ ∼ N(0, σ2) (2.1)
d is the distance separating the transmitter and the receiver, α and β are parameter models that
depend on the wavelength λ being used, on the omnidirectional gains of the transmit and receive
antennas, Gt and Gr and on penetration losses of the material that the spectral waves might penetrate
in the surrounding environment. ξ is the log-normal term that accounts for variances in shadowing,
and which is also partially affected by the penetration losses.
Large scale Blockage Models
Blockage is a major impairment at mmWave. As shown in Fig.2.22, it can be caused by surrounding
objects like buildings, by surrounding people, or even by the user’s own body.2Similar to Fig. 1.3, this figure is taken from Professor Robert W. Heath Jr’s mmWave communications tutorial given
in the signal processing summer school that was held in Chalmers University in Gothenburg, Sweden, during summer2017.
12 CHAPTER 2. MMWAVE CHANNEL CHARACTERISTICS
Figure 2.2: MmWave Blockage Agents
On one hand, penetration losses that are caused by building walls can introduce attenuation up to
80 dB [24], and those that are caused by the human body can result for up to 35 dB loss [25]. On the
other hand, reflective capabilities of all these blockers allow them to be important scatterers to enable
coverage via NLOS paths for mmWave cellular systems [26]. Measurements conducted by New York
University (NYU) confirm that even in extremely dense urban environments, coverage is possible up
to 200 m from a potential cell site [4].
Blockage can be modeled in different ways. Random shape theory [27] and stochastic geometry
theory [28] are mathematical tools that can used to evaluate coverage and capacity in mmWave cellular
networks analytically. Data driven methods can also be used to quantify the effect of blockage, an
example of such methods is to model the mmwave wireless link states using a two-state model (LOS
and NLOS) or a three state model (LOS, NLOS, and signal outage), where both model’s states are
chosen to be parametric statistical functions of the distance between transmitters and receivers, and
where each state’s parameters are fit using the field sounding measurements [29], then the resulting
fitted functions are used to calculate the probability of the link being in each of these states. Fig.2.33
shows this separate modeling of LOS and NLOS links.
A widely used two state model [29–32] is described below:
3Similar to Fig. 1.3, this figure is taken from Professor Robert W. Heath Jr’s mmWave communications tutorial givenin the signal processing summer school that was held in Chalmers University in Gothenburg, Sweden, during summer2017.
2.2. MMWAVE CHANNEL MODELS 13
Figure 2.3: Separate Modeling of LOS and NLOS Links
• LOS model i.e not blocked :
PLLOS(d)[dB] = αLOS + 10βLOS log10(d) + ξLOS (2.2)
• NLOS model i.e blocked :
PLNLOS(d)[dB] = αNLOS + 10βNLOS log10(d) + ξNLOS (2.3)
• Choice of LOS or NLOS is determined by a Bernoulli random variable p(d):
PL(d)[dB] = p(d)PLLOS(d)[dB] + (1− p(d))PLNLOS(d)[dB] (2.4)
• The stochastic blocking function p(d) is modeled as:
P(p(d) = 1) = exp−λd (2.5)
Many organizations have conducted extensive mmWave channel field sounding measurements to
help collect data and fit the above statistical model parameters (λ, αLOS, βLOS, αNLOS, βNLOS and
the shadowing variances). The four major ones are :
• The 3rd Generation Partnership Project (3GPP TR 38.901 [32]), which provides channel models
from 0.5–100 GHz based on a modification of 3GPP’s extensive effort to develop models from 6
to 100 GHz in TR 38.900 [33]. 3GPP TR documents are a continual work in progress and serve
as the international industry standard for 5G cellular.
14 CHAPTER 2. MMWAVE CHANNEL CHARACTERISTICS
• 5G Channel Model (5GCM) [34], an ad-hoc group of 15 companies and universities that developed
models based on extensive measurement campaigns and helped seed 3GPP understanding for TR
38.900 [33].
• Mobile and wireless communications Enablers for the Twenty–twenty Information Society (METIS) [35],
a large research project sponsored by European Union.
• Millimeter-Wave Based Mobile Radio Access Network for 5G Integrated Communications (mm-
MAGIC) [36], another large research project sponsored by the European Union.
An example of the results of the measurement campaigns conducted by the above bodies are sum-
marized, for the urban micro-cellular (UMi) propagation scenario, in Table 2.1 for the Omnidirectional
path loss model and Table 2.2 for the LOS probability model.
Blockage introduces not only LOS/NLOS large scale fading effects on the mmWave signal prop-
agation, but also small scale rapid signal variations, mainly caused by people walking between the
transmitter and the receiver. These small scale effects can be by modeled a multi-state Markov model
where transition probability rates can be determined from the field measurements [37]. An example
of such a model is the simple two-state Markov model that is used to characterize unshadowed and
shadowed states for a wireless link in the presence of pedestrian induced variations in received signal
strength [38, 39]. Fig.2.4 shows a diagram of a two-state Markov model where Punshad and Pshad in-
dicate the transition probabilities of going from a shadowed to unshadowed state and an unshadowed
to shadowed state, respectively, and to shadowed state, respectively, and Fig.2.5 depicts the charac-
terization of a typical blockage event with two-states when applying a 0 dB threshold relative to the
zero-crossings for the beginning and end of a shadowing event.
2.2.2 Small Scale Fading
In this section, we will detail the main small scale parameters of mmWave channel models. We fist
motivate the need for using large antenna arrays for mmWave communications. We then explore the
2.2. MMWAVE CHANNEL MODELS 15
Figure 2.4: Two-state Markov Model for Blockage Events
Figure 2.5: Two-State Shadowing Event with 0dB Threshold Showing Unshadowed (Black Line) andShadowed (Red Line) Regions
clustered nature of the such high frequency channels. We discuss finally the impact of large antenna
arrays and the clustering characteristics on the mmwave channel mathematical model formulation.
Motivating large arrays for mmWave
As discussed so far, large free space propagation losses and high sensitivity to blockage make wireless
communications over mmwave channels a very challenging task. Fortunately, such high frequencies
will allow the use of compact and small antenna arrays with high number of elements, as the physical
16 CHAPTER 2. MMWAVE CHANNEL CHARACTERISTICS
size of the array is proportional to the carrier wavelength. The large beamforming gains and adaptive
directionality that such large-scale arrays enable will be used to compensate for the above limitations.
In fact, according to Friis Law, the far field receive power Pr and the transmit power Pt are related
by equation 2.6 in free space propagation :
Pr =Pt
4πR2
λ2
4πGtGr (2.6)
where R is the distance separating the transmitter and the receiver, λ is the wavelength and Gt
and Gr are the transmit and receive antenna gains.
Equation 2.6 can be dissected into three components :
• A first component that depends mainly on the transmit power and on the separation distance
between the transmitter and the receiver. This component is called the receive power spectral
density and is defined by Pt4πR2 .
• A second component that depends mainly on the wavelength used and on the receive antenna
characteristics. This component is called the effective receive aperture and is defined by λ2
4πGr.
• A third component that depends mainly on the transmit antenna characteristics. This component
equals the transmit antenna gain defined by Gt.
The receive antenna gain Gr increases with the number of antennas used at the receiver (effectively,
the more antennas we use for reception, the higher is the power will receive). So knowing that the
antenna size scales inversely to the wavelength (we refer to antenna size relative to wavelength. For
example : a 1/2 wave dipole antenna is approximately half a wavelength long), we can see that the
higher the frequency is (i.e the lower the wavelength), the higher is the number of antennas we can
accommodate in a given physical area, i.e the more effective power we can actually receive receive.
Therefore, the scaling of the antenna gains increases the effective receive aperture λ2
4πGr and more
than compensates for the increased free-space path-loss at mmWave frequencies.
2.2. MMWAVE CHANNEL MODELS 17
Compensating for path loss in this manner will require then directional transmissions with high-
dimensional antenna arrays (32 antennas and above). This explains why large arrays is a defining
characteristic of mmWave communication.
A Clustered Channel Model
Extensive field measurements [40] have shown that mmwave channels usually assume clustered spatio-
temporal models, where the channel’s multipath components are clustered both in time and angular
domains, as shown in Fig.2.6 and Fig.2.74. The time cluster–spatial lobe (TCSL) [41] approach is
then used to develop statistical spatial channel models (SSCM) for mmwave channels, as the TCSL
framework is shown to faithfully reproduce the first- and second-order time and angular statistics of
these types of wireless channels [42].
The mmWave channel’s SSCM small scale parameters are similar to those of low frequency SSCM
channel models, these are the per cluster parametric distributions of delay, power, central angles of
departure (AoD) and angles of arrival (AoA), together with the angle and delay spreads within each
cluster. Field Sounding campaigns show that these small scale characteristics exhibit spatio-temporal
sparsity due to the small angular spread and the low number of clusters caused by the limited scattering
in mmWave bands. Typically, measurements in the 28 GHz band [40] show the existence of two main
clusters in the time domain (Fig.2.8) and 5 main clusters in the angular domain (Fig.2.9)
Small Scale Fading Mathematical Model
The clustered nature of the mmwwave channel models, as well as their inherent need for large antenna
arrays makes many of the statistical fading distributions used for the traditional multiple input multiple
output (MIMO) low frequency channels inaccurate for mmWave channel modeling. For this reason,
we adopt the extended Saleh-Valenzuela [43] based clustered model representation, which allows us to
accurately capture the mathematical structure present in mmWave channels.4Similar to Fig. 1.3, both of these two figures are taken from Professor Robert W. Heath Jr’s mmWave communications
tutorial given in the signal processing summer school that was held in Chalmers University in Gothenburg, Sweden, duringsummer 2017.
18 CHAPTER 2. MMWAVE CHANNEL CHARACTERISTICS
Figure 2.6: Time Domain Clusters
Figure 2.7: Angular Domain Clusters
Consider a mmWave MIMO system with Nt transmit and Nr receive antennas. Assuming a static
2D narrow-band mmWave channels where uniform linear arrays (ULA) are used (extensions to dynamic
3D wide-band models with different array structures are straightforward [18, 40]), then our channel
model can be described by a Nr ×Nt complex matrix H, as set by equation 2.7.
H =
L∑
l=1
αlar(φr,l)aHt (φt,l) (2.7)
We have:
• L : the number of multi-path components.
• The elements on the ULAs are separated by a distance d. Typically d = λ/2, where λ is the the
mmWave wavelength of interest.
• αl is the complex fading channel gain of the l-th multi-path component. αl is typically assumed
2.2. MMWAVE CHANNEL MODELS 19
Figure 2.8: 28 GHz Time Domain Clusters
Figure 2.9: 28 GHz Angular Domain Clusters
20 CHAPTER 2. MMWAVE CHANNEL CHARACTERISTICS
to be complex Gaussian distributed.
• at(φt,l) and ar(φr,l) are the ULA array response vectors of the l-th multi-path component, at
the transmitter and receiver respectively.
• φt,l and φr,l are the incidence angles of the lth multi-path component, at the transmitter
and receiver respectively. These are modeled as at (φt,l) =[1, e−jωt,l , . . . , e−j(Nt−1)ωt,l
]Tand
ar (φr,l) =[1, e−jωr,l , . . . , e−j(Nr−1)ωr,l
]T, with ωt,l(φt,l) = 2π
λ d cos (φt,l) and ωr,l(φr,l) = 2πλ d cos (φr,l)
being the directional cosine angles of the lth multi-path component. The incidence angles φA
and φB are assumed to be sampled from the ranges [θt,1, θt,2] and [θr,1, θr,2] respectively.
2.3 MmWave Transceiver Design
As we saw above, reliable wireless communication over mmWave channels cannot be achieved without
large arrays. We will detail then in this section the challenges that come into picture when dealing
with large array transceiver design.
Signal processing for multiple antenna (MIMO) transceivers at low frequencies happens completely
in the digital baseband domain. This is made possible because in such systems, all antennas can be
digitally controlled through dedicated radio frequency (RF) chains as the number of antennas used in
low bands tend to be small (4 antennas typically), and because the mixed signal and RF components
(digital-to-analog and analog-to-digital converters, power amplifiers and low noise amplifiers) needed
for these RF chains do not consume very high power and are easy to integrate into a single system on
chip (SoC) subsystem. An example of such a transceiver system is shown in Fig. 2.105.
This is not the case for the mmWave communication systems as large antenna arrays are a defining
characteristic of these setups. This will have a big architectural impact on the mmWave transceiver
design.
5Similar to Fig. 1.3, this figure is taken from Professor Robert W. Heath Jr’s mmWave communications tutorial givenin the signal processing summer school that was held in Chalmers University in Gothenburg, Sweden, during summer2017.
2.3. MMWAVE TRANSCEIVER DESIGN 21
Figure 2.10: MIMO Transceiver Architecture at Frequencies below 6 GHz
Current hardware technology makes it very challenging to tie a separate RF chain (and all the
related baseband circuitry) for each antenna at the mmWave frequencies [44]. The array’s antenna
elements should be placed very close to each other to avoid granting lobes, this space limitation makes
it difficult to pack the RF chain needed complicated mixed signal and baseband circuitry behind each
antenna.
Controlling digitally every antenna of the array separately will drive the overall mmWave transceiver
power consumption very high : (i) mixed signal devices like PA and ADCs/DACs are power hungry
at such high frequencies [45]; (ii) benefiting from the large available bandwidth and the MIMO capa-
bilities of the massive arrays used in mmwave would require processing many parallel high throughput
data streams. This will strain the baseband digital signal processing chain and will drive the overall
transceiver’s power consumption excessively high [46].
Other aspects to take into account are the architectural challenges imposed by the mmWave analog
front end domain, where key power greedy hardware components include power amplifiers, phase
shifters, and switches.
A tremendous effort has been spent on building low power amplifiers, as these latter are an es-
sential component in the radio frequency chain, in integrated circuit (IC) design. In contrast, phase
22 CHAPTER 2. MMWAVE CHANNEL CHARACTERISTICS
shifters, originally utilized in radar systems, are the newly-introduced hardware components in hybrid
beamforming systems. Neverthless, the exact power consumption depends on the specifications and
technology used to implement these components. Table 2.3 shows the range of the power consumed by
different devices included in a mmWave front-end. Data were taken from a number of recent papers
proposing protoype devices for PAs [47–49], LNAs [50–53], phase shifters [54–57], VCOs [58–60] and
ADCs [61–64] at mmWave frequencies. Lt(Lr) is the number of RF chains at the TX(RX). A detailed
treatment of mmWave RF and analog devices and multi-gbps digital baseband circuits can be found
in [23].
All these hardware and power consumption constraints have motivated the wireless communication
research community to look into alternative mmWave specific MIMO transceiver architectures where
the required signal processing is split between the analog and digital domains, which are known as
hybrid digital-analog antenna array architectures [65], or where different design trade-offs are made
with respect to number of antennas or resolution of the RF chain’s components (DAC/ADCs, PAs and
phase shifters for example), these are known as low resolution transceivers [66], or some mix of both
of these solutions.
We will review in this section one of the main MIMO architectures for mmWave systems, namely
the hybrid digital-analog antenna array architectures. The reader can refer to [18] for an overview
about all such architectures.
2.3.1 MmWave Hybrid Digital-Analog Antenna Array Architectures
The hybrid digital-analog antenna array architectures, or hybrid beamformers for short, are composed
by large antenna arrays that are steered using analog phase shifters and only a few digitally modulated
radio-frequency (RF) chains. An illustration of such an architecture is shown in Fig.2.116.
The architecture shown in Fig.2.11 divides the mmWave MIMO transceiver between the digital and
6Similar to Fig. 1.3, this figure is taken from Professor Robert W. Heath Jr’s mmWave communications tutorial givenin the signal processing summer school that was held in Chalmers University in Gothenburg, Sweden, during summer2017.
2.3. MMWAVE TRANSCEIVER DESIGN 23
Figure 2.11: MIMO Hybrid Digital-Analog MmWave Transceiver Architecture
the analog domains. The digital component in hybrid architectures can handle each of the baseband
data streams separately, similar to the conventional fully digital MIMO transceivers, allowing thus spa-
tial multiplexing and multiuser MIMO when Ns > 1. The signal processing of the analog components
is however different. Since the transmitted signals for all baseband data streams are mixed together
through digital precoding, and the number of physical antennas is bigger than the number of streams
and RF chains, then the analog network FRF ∈ CNt×lt will be a common component shared by all
these baseband streams. This would impact greatly the algorithm design for such architectures and
would make reusing traditional low frequency channel estimation and precoding/combining techniques
very challenging.
The RF precoding/combining stage can be implemented using different analog approaches like
phase shifters [67], switches [68] or lenses [69]. Two hybrid structures are possible. In the first one
(see Fig.2.12), all the antennas can connect to each RF chain. In the second one (see Fig.2.13), the
array can be divided into subarrays, where each subarray connects to its own individual transceiver.
Having multiple subarrays reduces hardware complexity at the expense of less overall array flexibility.
A complete analysis of the energy efficiency and spectrum-efficieny of both architectures is provided
in [67].
Fig.2.12 and Fig.2.137 show the example of hybrid precoding structures with fully and partially7Similar to Fig. 1.3, both of these two figures are taken from Professor Robert W. Heath Jr’s mmWave communications
tutorial given in the signal processing summer school that was held in Chalmers University in Gothenburg, Sweden, during
24 CHAPTER 2. MMWAVE CHANNEL CHARACTERISTICS
connected phase shifters respectively. Such structures are enabled through digitally controlled phased
shifters with quantized phases, where the digital precoder/combiner can correct for lack of precision in
the analog, for example to cancel residual multi-stream interference, allowing thus hybrid precoding to
approach the performance of the unconstrained solutions [43]. The multi-subarray partially connected
structure allows for a great reduction in hardware complexity and power consumption [67].
Figure 2.12: Hybrid Precoding w/ Phase Shifters – Fully Connected
Figure 2.13: Hybrid Precoding w/ Phase Shifters – Partially Connected
To further reduce the overall implementation and power consumption complexity of the hybrid
architecture, an alternative mmWave hybrid architecture that makes use of switching networks has
been recently proposed [68]. This architecture, illustrated in Fig.2.12, exploits the sparse nature of the
mmwave channel by implementing a compressed spatial sampling of the received signal. The analog
combiner design is performed by a subset antenna selection algorithm instead of an optimization over
summer 2017.
2.3. MMWAVE TRANSCEIVER DESIGN 25
all quantized phase values. Every switch can be connected to all the antennas if the array size is small
or to a subset of antennas for larger arrays.
Hybrid architecture can also be realized using a lens antenna at the front-end, using the funda-
mental fact that lenses compute a spatial Fourier transform thereby enabling direct channel access in
beamspace [69]. This continuous aperture phased (CAP) MIMO transceiver architecture is illustrated
in Fig.2.148 and suggests a practical pathway for realizing high dimensional MIMO transceivers at
mmWave frequencies with significantly low hardware complexity compared to conventional approaches
based on digital beamforming. The antennas and RF pre- coder/combiner in Fig.2.12 are replaced by
the continuous- aperture lens antenna and mmWave beam selector in Fig.2.14. CAP-MIMO directly
samples in beamspace via an array of feed antennas arranged on the focal surface of the lens antenna.
Figure 2.14: Hybrid Precoding with Lenses
The number of ADC/DAC modules and transmit/receive chains tracks the number of data streams,
as in the phase-array-based hybrid transceiver. However, the mapping of the digitally pre-coded data
streams into corresponding beams is accomplished via the mmWave beam selector that maps the
mmWave signal for a particular data stream into a feed antenna representing the corresponding beam.
The wideband lens can be designed in a number of efficient ways, including a discrete lens array (DLA)
for lower frequencies or a dielectric lens at higher frequencies [69].
There are many implications of using a hybrid architecture for mmWave MIMO. Given channel
8Similar to Fig. 1.3, this figure is taken from Professor Robert W. Heath Jr’s mmWave communications tutorial givenin the signal processing summer school that was held in Chalmers University in Gothenburg, Sweden, during summer2017.
26 CHAPTER 2. MMWAVE CHANNEL CHARACTERISTICS
state information, new algorithms are needed to design the separate precoders/combiners since they
decompose into products of matrices with different constraints (analog and digital matrices as we saw
above). Learning the channel state is also harder, since training data is sent through analog precoders
and combiners. More challenges are found when going to broadband channels as the analog processing
is frequency flat while the digital processing can be frequency selective. There are many opportunities
for future research into designing cellular networks that support hybrid architectures.
2.3. MMWAVE TRANSCEIVER DESIGN 27
PL[dB], fc is in GHz and d3D is in meters meters Shadow fading std [dB] Applicability range and Parameters
5GCM
5GCM UMi-Street CI Model with 1 m reference distance: σSF = 3.76 6 < fc < 100 GHz
Canyon LOS PL = 32.4 + 21log10(d) + 20log10(fc)
5GCM UMi-Street CI Model with 1 m reference distance:
Canyon NLOS PL = 32.4 + 31.7log10(d) + 20log10(fc) σSF = 8.09 6 < fc < 100 GHz
ABG model:
PL = 22.4 + 35.3log10(d) + 21.3log10(fc) σSF = 7.82
5GCM UMi-Open CI Model with 1 m reference distance: σSF = 4.2 6 < fc < 100 GHz
Square LOS PL = 32.4 + 18.5log10(d) + 20log10(fc)
5GCM UMi-Open CI Model with 1 m reference distance:
Square NLOS PL = 32.4 + 28.9log10(d) + 20log10(fc) σSF = 7.1 6 < fc < 100 GHz
ABG model:
PL =3.66 + 41.4log10(d) + 24.3 log10(fc) σSF = 7.0
3GPP TR 38.901
3GPP UMi-Street PLUMi−LOS =
= 32.4 + 21 log10(d) + 20log10(fc) , 10m < d < dm
= 32.4 + 40 log10(d) + 20log10(fc)− 9.5 log10(d2m + (hBS − hUE)2) , dm < d < 5 km
σSF = 4.0 0.5 < fc < 100 GHz, 1.5 m < hUE < 22.5 m
Canyon LOS , hBS = 10 m, dm is specified in 3GPP TR 38.901
3GPP UMi-Street PL = max(PLUMi−LOS(d), PLUMi−NLOS(d)) σSF = 7.52 0.5 < fc < 100 GHz, 1.5 m < hUE < 22.5 m
Canyon NLOS PLUMi−NLOS = 22.4 + 35.3 log10(d) + 21.310(fc)− 0.3(hUE − 1.5) 10 m < d < 5000 m, hBS = 10 m
METIS
METIS UMi-Street PLUMi−LOS =
= 28.0 + 22 log10(d) + 20log10(fc) + PL0 , 10m < d < dm
= 35.8 + 40 log10(d) + 22 log10(dm) + 22log10(fc)− 18 log10(hBShUE) + PL0 , dm < d < 500 mσSF = 3.1 0.5 < fc < 60 GHz, 1.5 m < hUE < 22.5 m
Canyon LOS , hBS = 10 m, dm is specified in METIS specifications
METIS UMi-Street PL = max(PLUMi−LOS(d), PLUMi−NLOS(d)) σSF = 4.0 0.45 < fc < 6 GHz, 1.5 m < hUE < 22.5 m
Canyon NLOS PLUMi−NLOS = 23.15 + 36.7 log10(d) + 2610(fc)− 0.3(hUE) 10 m < d < 2000 m, hBS = 10 m
mmMAGIC
mmMAGIC UMi-Street PL = 32.9 + 19.2log10(d) + 20.8 log 10(fc) σSF = 2.0 65 < fc < 100 GHz
Canyon LOS
mmMAGIC UMi-Street PL = 31.0 + 45.0log10(d) + 20.0 log 10(fc) σSF = 7.82 65 < fc < 100 GHz
Canyon NLOS
Note : PL is the path loss, d is the T-R Euclidean distance
Table 2.1: Omnidirectional Path Loss Models in the Umi Scenario
28 CHAPTER 2. MMWAVE CHANNEL CHARACTERISTICS
LOS Probability models (distances in meters) Parameters
3GPP TR 38.901 PLLOS(d) = min(d1/d, 1)(1− exp(−d/d2)) + exp(−d/d2) d1 = 18 m, d2 = 36 m
5GCM d1/d2 model: d1/d2 model:
PLLOS(d) = min(d1/d, 1)(1− exp(−d/d2)) + exp(−d/d2) d1 = 20 m, d2 = 39 m
NYU squared model: NYU squared model:
PLLOS(d) = (min(d1/d, 1)(1− exp(−d/d2)) + exp(−d/d2))2 d1 = 22 m, d2 = 100 m
METIS PLLOS(d) = min(d1/d, 1)(1− exp(−d/d2)) + exp(−d/d2) d1 = 18 m, d2 = 36 m
d >= 10 m
mmMAGIC PLLOS(d) = min(d1/d, 1)(1− exp(−d/d2)) + exp(−d/d2) d1 = 18 m, d2 = 36 m
Table 2.2: LOS Probability Models in the Umi Scenario
2.3. MMWAVE TRANSCEIVER DESIGN 29
Device devices Power (mW)
(Per device)
PA Nt(Nr) 40-250
LNA Nt(Nr) 4-86
Phase shifter Nt(Nr)× Lt(Lr) 15-110
ADC Lt(Lr) 15-795
VCO Lt(Lr) 4-25
Table 2.3: Range for the power Consumption for the different devices in a mmWave front-end
Chapter 3
mmWave Wireless Channels
Variational Online Learning
3.1 Overview
We propose in this chapter1 two variational Bayesian acftive learning schemes that enable initial
access for hybrid digital-analog enabled devices operating in mmWave wireless channels. The proposed
schemes are devised with the goal to balance the beam search time and achieving higher beamforming
gain, while accounting for uncertainties on the unknown channel (gain and noise variance).
3.2 Introduction
As we discussed in earlier chapters, mmWave frequency bands (30− 300Ghz) is one of the most promis-
ing technologies that will make 5G and beyond cellular networks able to serve a large number of wireless
terminals with high data rates [4]. We saw how the free space propagation losses, poor diffraction ca-
1This chapter is based on the work published in the conference paper : N. Akdim, C. N. Manchón, M. Benjillali andP. Duhamel, "Variational Hierarchical Posterior Matching for mmWave Wireless Channels Online Learning," 2020 IEEE21st International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), 2020, pp. 1-5, doi:10.1109/SPAWC48557.2020.9154340.
31
32 CHAPTER 3. MMWAVE WIRELESS CHANNELS VARIATIONAL ONLINE LEARNING
pability and high absorption of such high frequencies make building wireless transceivers operating
on such high frequencies a challenging task [17]. We introduced the concept of large antenna arrays
and we saw that the antenna array’s physical size proportionality to the carrier wavelength makes it
feasible to engineer small and compact antenna arrays with large number of elements at mmwave. We
also discussed how the high beamforming gains and the spatial steering capabilities of these arrays can
help greatly compensate the aforementioned limitations on these high bands.
When it comes to implementation, we discussed how the complexity of the mixed signal hardware at
mmWave as well as its high cost and power consumption make having element wise digitally controlled
large antenna arrays operating on such high frequencies infeasible [18]. We argued how such limitations
have motivated the wireless communication community to adopt a novel transceiver architecture termed
the hybrid digital-analog antenna array architecture [65]. This architecture helps bring down the cost
and power consumption of the mmwave transceivers by allowing to steer their large antenna arrays
using only few digitally modulated radio-frequency (RF) chains as shown in Fig. 4.1.
Baseband
Precoder
Baseband
Combiner
RF Chain
RF Chain
RF Chain
RF Chain
RF Precoder RF Combiner
NA N
B
NA
RF
NB
RF
H
TRANSCEIVER A TRANSCEIVER B
Figure 3.1: Hybrid Transceiver Structure
The hybrid digital-analog transceiver architecture brings its own challenges though. As already dis-
cussed, sensing the mmwave wireless channel using hybrid structure allows to access only a compressed
3.2. INTRODUCTION 33
version of it and requires running an exhaustive and time consuming search in the angular domain
to be able to estimate the CSI accurately. Also, using the acquired CSI to build the needed MIMO
precoders and combiners for such large MIMO channel, under the hybrid architecture constraint, is
not an easy task, it requires splitting the MIMO processing into two components, one digital and one
analog, a split that is challenging to properly perform due to the interconnection and interplay of the
analog domain design constraints with those of the digital domain [4, 18].
To overcome the long search time issue, the scientific community explored using the sparsity friendly
techniques for the CSI acquisition and precoding/combining algorithm design. These techniques were
believed, at least theoretically, to help bring down the number of channel measurements needed to
estimate CSI and build robust precoders and combiners. An example of such techniques are the
compressed sensing based approaches [18, 43, 70]. These latter have been shown, however, not only to
feature high computational complexity but also require long search time in general [70].
Other sparsity friendly techniques that are good alternative schemes to alleviate the aforementioned
issues are the hierarchical beam-search algorithms [21, 70]. The beam search mechanism in these
algorithms is designed based on the bisection concept. In particular, these algorithms start initially
by dividing the angular space into a number of partitions, which equivalently divides the AoAs/AoDs
range into a number of intervals, and design the multi-stage training precoding and combining set of
vectors, this group of vectors is known as a hierarchical beamforning codebook. The codebook’s design
is done in a way to let the combined angular spread of each stage, i.e the union of is vectors angular
spreads, cover entirely the AoAs/AoDs range of interest. Vectors of the first stage are used to sense the
angular space partitions, the received signal is then used to determine the partition(s) that are highly
likely to have non-zero element(s) which are further divided into smaller partitions in the later stages
until detecting the non-zero elements, the AoAs/AoDs, with the required resolution. If the number of
precoding vectors used in each stage equals K , where K is a design parameter, then the number of
adaptive stages needed to detect the AoAs/AoDs with a resolution of 2π/N is S = logK N . This shows
that this family of algorithms does reduce the beam search time. Nevertheless, it was demonstrated
34 CHAPTER 3. MMWAVE WIRELESS CHANNELS VARIATIONAL ONLINE LEARNING
that this reduction in the channel measurement overhead comes at the expense of entailing a high
probability of beam misalignment, especially on noisy channels [71]. To overcome the long beam search
overhead while still benefiting from the reduced search time of the hierarchical beam-search algorithms,
Chiu et al. proposed in [72] to use Bayesian active learning. They designed an algorithm which they
dubbed HierPM. This newly proposed scheme augments the hierarchical beam-search schemes with a
technique called posterior matching [73].
HierPM builds upon the connection between unit norm constrained hybrid beafmorning [70] and
noisy Bayesian active learning [74, 75] to, based on the wireless channel’s incidence angles posterior
distribution, sequentially choose the pair of precoder/combiner to use in subsequent measurements;
this choice of the precoder/combiner is performed in a way that is guaranteed to reduce both the
search time and the beam misalignment probability. HierPM as proposed in [72] presents two main
limitations: first, it is derived for systems in which only one of the communicating devices is equipped
with hybrid digital-analog arrays; this makes it not practical for cases when both devices require
beam steering as is common for mmWave communications; second, it requires knowledge of mmwave
wireless channel parameters such as the complex gain of the channel’s line-of-sight (LOS) component
as well as the noise variance. These parameters are assumed to be known or estimated in [72], but no
practical estimation algorithm is proposed to obtain these estimates. This absence of a good estimate
of the wireless channel CSI (channel gain and noise variance) makes the incidence angles posterior
distribution calculation intractable and hinders the proposed algorithm use in practical scenarios.
We will detail here the first contribution of the present thesis. We will discuss two novel sequential
noisy beam search techniques that build on HierPM principle but solve its above mentioned limitations.
Our proposed strategies extend HierPM to bi-directional beam alignemnt, in which both partici-
pating devices need to coordinate to find the correct transmission and reception directions. In addi-
tion, building upon the variational inference concept, namely the variational expectation-maximization
based inference framework [76] and the variational model comparison based inference framework [76],
our newly proposed schemes naturally account for the uncertainty about the channel’s gain and noise
3.3. SYSTEM MODEL 35
variance at the two communicating hybrid array enabled devices. The proposed estimation process
used together with both proposed strategies is gracefully embedded in the HierPM algorithm, and
enables its use in the usual situation in which the channel parameters are unknown. Numerical sim-
ulation results show that the proposed methods are able to effectively handle the uncertainty in the
channel parameters, resulting in beamforming gains close to these of an exhaustive search algorithm
while requiring an amount of pilot measurements comparable to those of hierarchical search algorithms.
We start by describing the system model and the RF codebook used. We next discuss the technical
details of each of our proposed schemes the rest. We finally show through numerical simulations how
effective these are in terms of their beamforming gains.
3.3 System Model
Our system is composed of two hybrid digital analog antenna array devices A and B, equipped with
uniform linear arrays (ULA) of NA and NB antenna elements respectively. The elements on the ULAs
are separated by a distance d = λ/2, where λ is the the mm-Wave wavelength of interest. Device
A (Device B respectively) digitally control its ULA with NRFA (NRF
B respectively) RF chains each.
The two devices communicate over a reciprocal LOS wireless MIMO channel. This is considered to be
static and narrowband, and is modeled according to the finite scatterer channel model with one single
dominant path [21,77] as:
H = α(φB)H(φA) (3.1)
where H ∈ CNB×NA is the wireless channel MIMO matrix, and α is the complex fading channel
gain, modeled as a standard complex Gaussian variable. (φA) and (φB) are the ULA array re-
sponse vectors at devices A and B with incidence angles φA and φB respectively, modeled as (ωA) =
[1, e−jωA , . . . , e−j(NA−1)ωA
]Tand (ωB) =
[1, e−jωB , . . . , e−j(NB−1)ωB
]T, with ωA(φA) = 2π
λ d cos (φA)
and ωB(φB) = 2πλ d cos (φB). The incidence angles φA and φB are modeled as uniformly distributed in
the range [θA,1, θA,2] and [θB,1, θB,2] respectively.
36 CHAPTER 3. MMWAVE WIRELESS CHANNELS VARIATIONAL ONLINE LEARNING
The two devices go through an initial access phase consisting of a pilot based beam alignment
procedure in order to establish the wireless link between them. We assume in this work that, during
this initial access phase, the CSI learning and beam search processes for the two devices are centralized,
i.e one of the devices, say B, is collecting measurements based on device A’s pilot transmission, uses
such measurements to learn the channel’s gain and noise variance and also devises the combiner it will
use for the next pilot reception occasion together with the precoder that device A should use in sending
that pilot. This decision is communicated to device A through an ideal, error-free control channel,
which can e.g. be established via a sub-6 GHz link in a non-stand-alone deployment. The extension
of our proposal to distributed CSI learning and beam search setups will not be discussed here and will
the object of a future work.
At time instant t, device A sends a pilot symbol to B, who after pilot removal observes the signal
yB,t =√PwH
B,tHfA,t +wHB,tnB,t (3.2)
where fA,t ∈ CNA and wB,t ∈ CNB denote the effective precoder and combiner used at time t by
transceivers A and B respectively. These effective precoder and combiner are obtained from hybrid
digital-analog codebooks detailed in the next section. In addition, nB,t ∈ CNB is a complex, circularly-
symmetric additive white Gaussian noise vector, obtained after training sequence removal and with
i.i.d elements, each with variance σ2B .√P is the average transmit power of the pilot signal.
3.4 RF Codebook
The adaptive beamforming strategy proposed herein utilizes the hierarchical beamforming codebook
in [70]. Such a codebook, noted CS hereafter, is designed to have S levels of beam patterns. The beams
in each level l (l = 1, . . . S) are optimized to leverages the digital-analog transceiver architecture of
the devices by properly setting digital and analog beamformers to approach the desired analog beams
shape. These desired beams should have the following ideal properties:
• They divide the angular region of interest, say [θ1, θ2] dyadically in a hierarchical manner,
3.4. RF CODEBOOK 37
• The angular coverage of any two different beams of them are disjoint,
• The union of all such beams is the whole region of interest.
We note Cl the collection of beams belonging to level l. Then, Cl will contain 2l beamforming vectors
that divide the sector [θ1, θ2] into 2l directions, each associated with a certain range of incidence angles
Rml , such that [θ1, θ2] = ∪2l
m=1Rml . We note each of such 2l vectors as either fA (Rml ) or wB (Rml ),
depending on the considered device.
Figure 3.2 shows the first three levels of an example codebook with N = 8, and figure 3.3 illustrates
the beam patterns of the beamforming vectors of each codebook level.
Figure 3.2: Structure of a Multi-resolution Codebook with a Resolution Parameter with N = 8
Figure 3.3: Resulting Beam Patterns of the Beamforming Vectors in the First Three Codebook Levels of aHierarchical Codebook.
38 CHAPTER 3. MMWAVE WIRELESS CHANNELS VARIATIONAL ONLINE LEARNING
3.5 Sequential Beam Pair Search via Variational HiePM
As described in the introduction, to allow devices A and B to run a fast and reliable beam alignment
process, an adaptive beamforming technique termed “Hierarchical Posterior Matching (HierPM)” [72]
is used hereafter. This technique uses the hierarchical beamforming codebook structure described
above to sequentially, i.e based on all available measurements at a certain point of time, choose the
next set of precoder/combiner pairs that shall be used to take a new measurement, so that both the
average best beam pair search time and the beam pair misalignment probability are optimally reduced.
In this section, we first review the details of this adaptive scheme and show that knowledge of the
channel gain α and the noise variance σ2 is necessary to make the strategy usable in practice. We then
detail our contribution. Thee newly proposed schemes overcome the aforementioned shortcomings
of the vanilla HierPM scheme, by either considering the channel’s parameters to be non random
latent unknowns and uses the variational expectation maximization scheme to estimate them, or by
by either considering the channel’s parameters to be random latent variables and resorts to using
a novel variational model comparison based inference framework [76] to account for that. We dub
our novel strategies as “Variational Expectation Maximization Based Hierarchical Posterior Matching
(VEM-HierPM)” and “Variational Model Comparison Based Hierarchical Posterior Matching (VMC-
HierPM)”.
3.5.1 Sequential Active Learning via the HiePM Strategy
We illustrate here the use of the vanilla HiePM scheme [72] for device A (an analogous strategy will
be used for device B). HiePM selects fA,t+1 based on the posterior at time t of the incidence angle
φA. We discretize the noisy beam search problem above by assuming that the beam search resolution
δA is an integer power of two and that the AoA φA is of the form:
φA ∈ φA,1, . . . , φA,δA, φA,i = θA,1 +(i− 1)
δA(θA,2 − θA,1) (3.3)
Note that Such discretization approaches the original problem of initial access as δA → 0 [72]. Note
3.5. SEQUENTIAL BEAM PAIR SEARCH VIA VARIATIONAL HIEPM 39
also that to support this level of resolution, the corresponding number of levels of the hierarchical
beamforming codebook at device A should be : SA = log2 (δA).
With the above setup, the posterior distribution of φA given all measurements up to time t (collected
in vector yB,1:t), can be written as a δA-dimensional vector πA (t) with entries
πA,i (t) := Pr (φA = φA,i|yB,1:t) , i = 1, . . . , δA. (3.4)
The posterior probability of φA being in a certain range, say Rmi , can be computed as
πA,Rmi (t) :=∑
φA,i∈Rmi
πA,i (t). (3.5)
The HiePM strategy examines the posterior probability πA,Rmi (t) for all i = 1, . . . , SA and m =
1, . . . , 2i and selects fA,t+1 ∈ CS to be the beamformer corresponding to the angular range that satisfies:
(i∗t+1,m∗t+1) = argmin
(i,m)
∣∣∣∣πA,Rmi (t)− 1
2
∣∣∣∣ (3.6)
Intuitively, HiePM chooses at time instant t+ 1 a narrower beam than the one being used at time
instant t only if the posterior of its parent beam is bigger than 12 (making this parent beam the most
suitable to choose from a Bayesian standpoint) and such a narrow beam itself has the highest posterior
among the children of its parent beam. Doing so, it is then guaranteed [72] that this scheme will
sequentially refine the width of the beamformer around the true incidence angle φA.
Next we describe how the posterior belief around φA is updated once a new measurement is taken
with the pair of beamformers chosen previously with HiePM. Based on the measurement model in (3.2),
the posterior update at time instant t+ 1 can be expressed using Bayes rule as
πA,i (t+ 1) ∝ πA,i (t) f (yB,t+1|φA = φA,i) ,
i = 1, . . . , δA
(3.7)
where f (yB,t+1|φA = φA,i) is the likelihood of φA from measurement yB,t+1. Unfortunately, the de-
pendency of likelihood term f (yB,t+1|φA = φA,i) on the latent parameters, namely the channel gain
α, the noise variance σ2B and the incidence angle φB , makes it difficult to be calculated in closed form,
thus hindering its practical use..
40 CHAPTER 3. MMWAVE WIRELESS CHANNELS VARIATIONAL ONLINE LEARNING
3.5.2 The Variational Expectation Maximization HiePM Scheme: VEM-
HiePM
Our first contribution is based on considering the channel gain α and the noise variance σ2B as classical
unknown parameters, i.e latent parameters that are not probabilistic. We explain first the Variational
Expectation Maximization approximate inference framework used in its most general form. We then
show how to apply it to our problem to account for the unknown parameters, α and σ2B , and to jointly
derive posterior updates of our incidence angles φA and φB .
Primer on the Expectation Maximization framework
We start by listing the different types of variables that the Expectation Maximization (EM) approxi-
mate inference framework builds upon:
• x is the observed data vector, which is in our case yB,1:t+1.
• z = (z1, z2, . . . ,zL) denotes the L-dim vector of latent unknown parameters that parameterize
the measurement model (3.2). In our case, these are the channel’s gain and the noise variance,
i.e z = (α, ν). ν is the inverse of σ2B .
• m ∈ 1, 2, . . . , δA×δB denotes the mth pair of angles (φA,im , φB,jm), with im ∈ 1, . . . , δA, and
jm ∈ 1, . . . , δB. Choosing a certain label m is equivalent to assuming that our measurement
model in (3.2) is parameterized by the the mth pair of angles. m will be our model’s latent
random variable.
The EM framework is used to find an estimate for the hidden unknown parameters of our model,
i.e z, that maximize the log likelihood L(z) = log(x; z). EM assumes that deriving a good maximum
log likelihood estimate of z is not easily solved directly, but that the corresponding problem in which
m is also observed is mathematically tractable and can be solved efficiently.
The EM algorithm starts with some initial guess for the maximum likelihood parameter z(0), and
3.5. SEQUENTIAL BEAM PAIR SEARCH VIA VARIATIONAL HIEPM 41
then proceeds to iteratively generate successive estimates, z(1), z(2), . . . by repeatedly applying the
following two steps, for t = 1, 2, . . . , until convergence is reached :
• E−Step : Compute a distribution q(t) over the range of m such that q(t)(m) = p(m|x; z(t−1)).
• M − Step : Set z(t) to z that maximizes Eq(t) [log(p(x,m; z)]
The recursive operation above solves the issue of having the random variable m latent, i.e unob-
served, by representing its corresponding value by a distribution of values in the E step, and then
performing a maximum likelihood estimation for the join data obtained by combining this with the
known value of the observed variable x.
The VEM-HiePM algorithm
We detail next the VEM-HiePM scheme. Suppose we made t measurements. We have from our
measurement model (3.2)
p(x,m;x) = p(yB,1:t|m; (α, ν))p(m) (3.8)
where
• ν = σ−2 is the noise precision at device B;
• p(y1:t|m; (α, ν)) =∏t+1i=1 CN(yi;
√PαwH
B,iAmfA,i, σ2) is the likelihood of our measurement model
(we assume here that the sequential noise samples are i.i.d)2, Am = (φB,jm)H(φA,im);
• p(m) = 1δAδB
is the prior belief overm, which is assumed to be uniform to make it non informative.
As already discussed, the EM algorithm starts with some initial guess for the maximum likelihood
parameters z(0)t = (α
(0)t , ν
(0)t ), and then proceeds to iteratively generate successive estimates, z(1)
t =
(α(1)t , ν
(1)t ), z(2)
t == (α(2)t , ν
(2)t ), . . . , by repeatedly applying the expectation and maximization steps,
for s = 0, 1, 2, . . . , until convergence is reached.
2In the above, CN(·;µ, λ) denotes the complex Gaussian pdf with mean µ and precision λ, Γ(·; a, b) denotes theGamma pdf with shape and rate parameters a and b.
42 CHAPTER 3. MMWAVE WIRELESS CHANNELS VARIATIONAL ONLINE LEARNING
Let q(s)t , α(s)
t and ν(s)t be the estimates of posterior distribution over m, the channel’s gain and
noise precision, respectively, at the measurement epoch t and after s > 0 iterations. We obtain after
some algebra:
αst =Eq(s−1)t
(∑td=1
(√PwH
B,tAmfA,t
)∗yB,d
)
Eq(s−1)t
(∑td=1
∣∣∣√PwH
B,tAmfA,t
∣∣∣2) (3.9)
νst =t
Eq(s−1)t
(∑td=1
∥∥∥yd −√PwH
B,tAmfA,tα(s)t
∥∥∥2) (3.10)
Now that we know how to derive estimate for α and ν if we are given an estimate for the posterior
distribution over m, let us see how to derive the posterior distribution over m given α and ν estimates.
Let q∞t and q∞t−1 be the estimates for the posterior distribution over m at measurement epochs t − 1
and t, and α(∞)t−1 and ν
(∞)t−1 be the estimates for α and ν respectively, where all these estimates are
considered after that the EM iterations converge. We have
q(∞)t (m)
(a)∼ CN(yt;√PαwH
B,tAmfA,t, ν∞t−1)q
(∞)t−1 (m)
(b)∼t∏
i=1
CN(yi;√PαwH
B,iAmfA,i, ν(∞)t−1 )p(m)
(c)∼ exp (−ν∞t−1
t∑
d=1
∥∥∥yd −√PwH
B,tAmfA,tα(∞)t
∥∥∥2
)p(m)
(3.11)
where (a) results from applying bayes rule on q(∞)t (m) and (b) and (c) are results of direct appli-
cation of our system model assumptions.
The posteriors over φA,im and φB,jm are obtained from the posterior q(∞)t (m) as
qA,t(i) =∑
m:im=iq
(∞)t (m), i = 1, . . . , δA (3.12)
qB,t(j) =∑
m:jm=jq
(∞)t (m), j = 1, . . . , δB (3.13)
The posterior probability of the incidence angles φA and φB to be in a certain range RnA,i and RpB,j
respectively, read as:
3.5. SEQUENTIAL BEAM PAIR SEARCH VIA VARIATIONAL HIEPM 43
qA,t(RnA,i) :=∑
φA,i∈RnA,i
qA,t(i), (3.14)
qB,t(RpB,j) :=∑
φB,j∈RpB,j
qB,t(i), (3.15)
The vanilla HierPM scheme is then applied separately to qA,t(i) and qB,t(j), to choose the pair of
beamformers to use for the next measurement occasion.
Algorithm 1 runs all above operations in a loop, until the measurement budget is exhausted:
device B decides which pair of beamformers devices A and B shall use to take the next measurement
by applying the HiePM scheme separately to the current posteriors qA,t and qB,t, it then takes a new
measurement yB,t+1 with those latter, and finally run variational EM inference to derive estimates of
νB = 1σ2B
and α as well as approximate of φA and φB .
Algorithm 1: VEM-HiePM1 Input : Antenna Array Size NA and NB , The search resolution δA and δB , the codebooksCSA and CSB , Search time τ
2 Output : Estimates of φA, φB ; for t = 1, 2, . . . , τ − 1 do3 #HiePM Based BF selection according to Eq.(3.6)
(fA,t+1,wB,t+1) =(fA
(RkA,t+1
A,lA,t+1
),wB
(RkB,t+1
B,lB,t+1
))
4 #Take next measurement5 yB,t+1 =
√PwH
B,t+1HfA,t+1 + wHB,t+1nB,t+1,
6 #Variational EM Posterior Update7 while (No convergence yet) do8 for m = 1 : δAδB do9 update qt(m) via (3.11)
10 end11 update αt via (3.9)12 update νt via (3.10)13 end14 #Angle and Angular Range Posterior Update15 update qA,t+1(i) via (3.21) and qB,t+1(j) via (3.22)16 update qA,t+1(RnA,i) via (3.23) and qB,t+1(RnB,j) via (3.24)
17 #Final Precoder/Combiner Vector design(lt+1,A, kt+1,A
)= (SA, arg maxk (qA,t+1(k)))
18(lt+1,B , kt+1,B
)= (SB , arg maxk (qB,t+1(k)))
19 end20 Output : φA = φA,kτ,A , φB = φB,kτ,B
44 CHAPTER 3. MMWAVE WIRELESS CHANNELS VARIATIONAL ONLINE LEARNING
3.5.3 The Variational Model Comparison Based HiePM Scheme : VMC-
HiePM
Our second contribution in this chapter is based on considering the channel gain α and the noise
variance σ2 as probabilistic latent variables. We show next, how “VMC-HiePM" is able, using the
variational model based approximate inference framework described in [76], to infer all above unknowns
and uses them efficiently to calculate the posterior update needed for HiePM, in a consistent and elegant
way. Such an inference framework lends itself naturally in the HiePM context: we make the best use
of the measurements by first estimating posteriors over the channel gain and noise variance and then
use those to robustly update the angle of incidence posterior, doing so allows VMC-HiePM to take
the channel’s gain and noise variance estimation uncertainties properly into account when deriving the
posterior of the incidence angles, thus making a robust HiePM based decision when choosing the next
precoder/combiner pair to use.
We explain first the variational model comparison based approximate inference framework used in
its most general form, then show how to apply it to our problem to derive posterior updates for our
parameters of interest.
Primer on the Variational Model Comparison based approximate inference framework
Variational Model Comparison based Posterior Update
As we did for the VEM-HiePM study, We start by listing the different types of variables that the
variational model comparison based approximate inference framework deals with:
• X is the observed data vector, in our case is yB,1:t+1.
• Z = (Z1,Z2, . . . ,ZL) denotes the L-dim vector of latent variables that parameterize the mea-
surement model (3.2). In our case, Z = (α, σ2).
• m ∈ 1, 2, . . . , δA×δB denotes the mth pair of angles (φA,im , φB,jm), with im ∈ 1, . . . , δA, and
3.5. SEQUENTIAL BEAM PAIR SEARCH VIA VARIATIONAL HIEPM 45
jm ∈ 1, . . . , δB. Choosing a certain label m is equivalent to assuming that our measurement
model in (3.2) is parameterized by the the mth pair of angles.
The framework performs joint inference on the hidden variables to find a set of distributions
q(Z|m), q(m)1:m that approximate the true posterior p(Z,m|X), by minimizing the Kullback-Leibler
(KL) divergence:
KL(q(Z|m)q(m), p(Z,m|X)). (3.16)
HiePM then uses the approximate incidence angle posterior q(m) to decide which is the best
measurement model candidate fitting the observed data vector X. Algorithm 2 [76, Chapter 10.4] lists
the steps required to perform such operations.
Algorithm 2: Variational Model Comparison based Posterior Update1 for m = 1 : δAδB do2 while (No convergence yet) do3 for j = 1, 2, . . . , L do4 q(Zj |m) ∝ Ei6=j(log(p(X|Z,m)))
5 Lm =∫Zq(Z|m) log(p(Z,m|X)
q(Z|m) )
6 q(m) ∝ p(m) exp(Lm)
Posterior Update for our measurement Model and the overall V-HiePM Algorithm
From our measurement model (3.2), we have
p(X,Z,m) = p(yB,1:t|α, ν,m)p(α)p(ν)p(m) (3.17)
where
• ν = σ−2 is the noise precision at device B
• p(X|Z,m) = p(yB,1:t|α, νB ,m) =∏t+1i=1 CN(yB,i;
√PαwH
B,iAmfA,i, σ2B) is the likelihood of
our measurement model (we assume here that the sequential noise samples are i.i.d); p(α) =
CN(α;α0, β0) is the prior belief over α, considered to be Gaussian with a known initial mean α0
46 CHAPTER 3. MMWAVE WIRELESS CHANNELS VARIATIONAL ONLINE LEARNING
and initial precision β03. p(νB) = Γ(νB ; a0, b0) is the non informative prior belief over νB , with
parameters a0 = 0 and b0 = 0. Am = (φB,jm)H(φA,im)
• p(m) = 1δAδB
is the prior belief overm, which is assumed to be uniform to make it non informative
as well4.
After some algebra, the obtained approximate posteriors for α and νB , up to the measurement
iteration t, are shown to keep the form of their respective priors, but with parameters that depend on
the measurement vector yB,1:t :
qt(α|m) has the form of complex Gaussian pdf with mean αt,m and precision βt,m reading
βt,m =at,mbt,m
t∑
d=1
∣∣∣√PwH
B,tAmfA,t
∣∣∣2
+ β0 (3.18a)
αt,m =at,m
bt,mβt,m
t∑
d=1
(√PwH
B,tAmfA,t
)∗yB,d +
α0β0
βt,m(3.18b)
qt(νB |m)5follows a Gamma pdf with parameters shape and rate parameters at,m and bt,m given by
at,m = a0 + t, (3.19a)
bt,m = b0 − 2Re(∑t
d=1
(√PwH
B,tAmfA,t
)∗yB,dα
∗t,m
)+
∑td=1
[|yB,d|2 +
(1
βt,m+ |αt,m|2
) ∣∣∣√PwH
B,tAmfA,t
∣∣∣2]
(3.19b)
Note that the choice of our prior distributions is not arbitrary, the priors chosen above correspond to
the maximum entropy distributions [78] that respect constraints that need to be put on their respective
parameters, namely α being a complex variable having a known initial mean and variance, νB being a
non negative variable and m being a discrete variable). Such a choice makes our proposal assume the
least information about our measurement model’s unknowns.3The first and second order moments of α are the only assumed known values in our model.4In the above, CN(·;µ, λ) denotes the complex Gaussian pdf with mean µ and precision λ, Γ(·; a, b) denotes the
Gamma pdf with shape and rate parameters a and b.5Note that (3.18) and (3.19) can be re-written, after performing some algebra, in a recursive format w.r.t their terms
involving summation over measurements epochs. This results in a significant reduction of the algorithm’s memory andcomputation complexity footprint.
3.5. SEQUENTIAL BEAM PAIR SEARCH VIA VARIATIONAL HIEPM 47
The posterior of the model, indexed by m, is then updated following Lines 5 and 6 in Algorithm 2,
where Lm reads
Lt,m = log(1
β2t,m
) + at,m (1− log(bt,m))
+ log(Γ(at,m))− b0at,mbt,m
−(
t∑
d=1
|yB,d|2at,mbt,m
− βt,m |αt,m|2)
(3.20)
The posteriors over φA,im and φB,jm are obtained from the posterior qt(m) as
qA,t(i) =∑
m:im=iqt(m), i = 1, . . . , δA (3.21)
qB,t(j) =∑
m:jm=jqt(m), j = 1, . . . , δB (3.22)
The posterior probability of the incidence angles φA and φB to be in a certain range RnA,i and RpB,j
resp, read as:
qA,t(RnA,i) :=∑
φA,i∈RnA,i
qA,t(i), (3.23)
qB,t(RpB,j) :=∑
φB,j∈RpB,j
qB,t(i), (3.24)
The vanilla HierPM scheme is then applied separately to qA,t(i) and qB,t(j), to choose the pair of
beamformers to use for the next measurement occasion.
Algorithm 3 runs all above operations in a loop, until the measurement budget is exhausted:
device B decides which pair of beamformers devices A and B shall use to take the next measurement
by applying the HiePM scheme separately to the current posteriors qA,t and qB,t, it then takes a
new measurement yB,t+1 with those latter, and finally run variational inference to derive approximate
posteriors of νB = 1σ2B, α as well as of φA and φB .
48 CHAPTER 3. MMWAVE WIRELESS CHANNELS VARIATIONAL ONLINE LEARNING
Algorithm 3: VMC-HiePM1 Input : Antenna Array Size NA and NB , The search resolution δA and δB , the codebooksCSA and CSB , Search time τ
2 Output : Estimates of φA, φB , α and νB3 for t = 1, 2, . . . , τ − 1 do4 #HierPM Based BF selection according to Eq.(3.6)
(fA,t+1,wB,t+1) =(fA
(RkA,t+1
A,lA,t+1
),wB
(RkB,t+1
B,lB,t+1
))
5 #Take next measurement6 yB,t+1 =
√PwH
B,t+1HfA,t+1 + wHB,t+1nB,t+1,
7 #Variational Model Comparison Posterior Update8 for m = 1 : δAδB do9 while (No convergence yet) do
10 update qt+1(α|m) via (3.18) then qt+1(νB |m) via (3.19)11 end12 update qt+1(m) via (3.20)13 end14 #Angle and Angular Range Posterior Update15 update qA,t+1(i) via (3.21) and qB,t+1(j) via (3.22)16 update qA,t+1(RnA,i) via (3.23) and qB,t+1(RnB,j) via (3.24)
17 #Final Precoder/Combiner Vector design(lt+1,A, kt+1,A
)= (SA, arg maxk (qA,t+1(k)))
18(lt+1,B , kt+1,B
)= (SB , arg maxk (qB,t+1(k)))
19 end20 Output : φA = φA,kτ,A , φB = φB,kτ,B
3.6 Numerical Results
To assess the effectiveness of the proposed algorithms, we run Monte Carlo simulations on a setup with
two hybrid digital-analog beamforming devices A and B. The channel matrix H ∈ CNB×NA reads
H = α(φB)H(φA) +
L∑
l=1
αl(φB,l)H(φA,l) (3.25)
and contains one dominant multipath component and L scattered components. All incidence angles
are independently drawn from a uniform distribution between 0 and π. The channel gains are inde-
pendently drawn from a a set of complex Gaussian distribution with mean 0 and variances fulfilling
Varα+∑lVarαl = 1, so that the average SNR ρ between the nth element of the array at A and the
mth element of the array at B equals E|Hnm|2/E|σB |2=1/σ2B
6. In all simulations below, the two
6Hnm is the channel coefficient between device B’s nth array element and device A’s mth array element, and E isthe expectation operator.
3.6. NUMERICAL RESULTS 49
devices are equipped with identical arrays made of NA = NB = 32 elements, digitally controlled with
NRFA = NRF
B = 8 RF chains. Device A uses a codebook CA with a depth of SA = log2(δA), δA = 128.
CA is built using the orthogonal matching pursuit as described in [70]. A similar codebook, CB , is used
for device B.7
We define beamforming gains, achieved after taking t measurements, for our proposed schemes as
follows :
Gx =∣∣∣wH(φB,kt,B )Hf(φA,kt,A)
∣∣∣2
(3.26)
where x can be either em for VEM-HiePM or vh for VMC-HiePM.
We benchmark these beamforming gains with different measurement budget sizes8 and under dif-
ferent channel assumptions, against that of the different state of the art schemes listed below:
• Gph of the vanilla HiePM scheme of [72]. Here, such a scheme assumes that all of the energy in
the channel is concentrated in the path corresponding to the known gain α and all other gains
αl are null, it also assumes that σ2B is known. In such case, the posterior update is done, simply
using Bayes rule as in equation (21) in [72], on the beam pair corresponding to that main path,
and then HiePM is applied to the marginals over those angles separately, similar to what our
schemes do.
• Gbs of the noisy binary search algorithm of [70], which is achieved by 4 log2(maxNA, NB) = 28
measurements.
• Gmax the best achievable beamforming gain of the used codebook, defined as:
Gmax = maxw∈CBSB ,f∈C
ASA
∣∣wHHf∣∣2 . (3.27)
We begin by assuming that the dominant component is the only component that is present in the
channel (i.e. L = 0).
7Note that the multi-RF chain setups are used solely to help build acceptable RF codebooks [70], and are not usedfor multi-stream MIMO operations.
8note the the exhaustive search needs NANB = 16384 measurements to settle.
50 CHAPTER 3. MMWAVE WIRELESS CHANNELS VARIATIONAL ONLINE LEARNING
Fig. 3.4 shows the beamforming losses of the benchmarked algorithms with respect to the optimum
pair of beamformers, defined as Lvh = Gvh/Gmax, Lem = Gem/Gmax, and Lbs = Gbs/Gmax.
3.6. NUMERICAL RESULTS 51
-10
-8-6
-4-2
02
46
81
0
SN
R i
n d
B
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.91
Beamforming Gain
Lv
h :
=
28
Le
m :
=
28
Lv
h :
=
56
Le
m :
=
56
Lv
h :
=
100
Le
m :
=
100
Lb
s :
=
28
Figure3.4:
Beamform
ingLo
ssof
theDifferentsearch
schemes
inaCha
nnel
withL
=0ScatteredCom
ponents.
52 CHAPTER 3. MMWAVE WIRELESS CHANNELS VARIATIONAL ONLINE LEARNING
It can be seen that both or our proposed schemes are clearly much better, in all SNR regimes, than
the binary search based beamforming scheme proposed in [70], we see that even with as little as 28
measurements, which is the measurement budget needed by this latter to settle, both of our algorithms
perform quite well in all SNR regimes, we see that even with such a small measurement budget, VMC-
HiePM can achieve at least 50% of the maximum beamforming gain that can be achieved with the
RF codebook being used across all SNR points. Also, We observe that with only 100 measurements
(compare this with the number of measurements needed for exhaustive search to achieve
We compare next the performance of our schemes to that of the vanilla HiePM with perfect CSI
and operating SNR knowledge scheme. Fig. 4.3 shows the beamforming losses of VMC-HiePM, VEM-
HiePM and of vanilla HiePM schemes with respect to the optimum pair of beamformers (these are
defined similar to the above : Lvh = Gvh/Gmax, Lem = Gem/Gmax and Lph = Gph/Gmax) under
the same simulation assumption of Fig. 3.4. The results show that VMC-HiePM can achieve similar or
even better performance compared to vanilla HiePM with perfect CSI and operating SNR knowledge,
we can see as well that VEM-HiePM is underperfoming when copared to the vanilla HiePM scheme.
Also, it can be observed that the vanilla HiePM scheme with perfect channel gain knowledge saturates
at high SNR: this is an effect of the algorithm assuming that the component’s incidence angle lies
on a discrete grid of values, whereas the actual angles are sampled from a continuous distribution.
VMC-HiePM is less sensitive to this model mismatch, due to the estimation of the channel gain and
inverse noise variance: in practice, these estimates partly account for the mismatch in the assumed
values of the angles and provide robustness to the overall procedure.
3.6. NUMERICAL RESULTS 53
-10
-8-6
-4-2
02
46
81
0
SN
R i
n d
B
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.91
Beamforming Gain
Lv
h :
=
28
Le
m :
=
28
Lp
h :
=
28
Lv
h :
=
56
Le
m :
=
56
Lp
h :
=
56
Lv
h :
=
100
Le
m :
=
100
Lp
h :
=
100
Figure3.5:
Beamform
ingLo
ssof
theDifferentSearch
Schemes
inaCha
nnel
withL
=0ScatteredCom
ponents.
54 CHAPTER 3. MMWAVE WIRELESS CHANNELS VARIATIONAL ONLINE LEARNING
Next, we explore the robustness of our best performing method, namely VMC-HiePM, against
channels containing more than one multipath component. For this, we consider a channel with L = 3
scattered components with gains of equal variance, and with the power ratio between the dominant
and scattered components being LOSR = Eα2/(Eα2 +∑l Eα2
l ). Fig. 4.5 shows beamforming
gains achieved by our algorithm after 100 measurements compared to the maximum gains achievable
Gmax.
As it can be observed, the maximum achievable beamforming gain decreases as the power is more
evenly distributed among the channel’s components. Although VMC-HiePM assumes the existence of
a single component, it shows remarkable resilience to the presence of other components. Even when
all components in the model have comparable power, our proposed method is able to perform within
2 dB of the optimum for sufficiently high SNR.
3.6. NUMERICAL RESULTS 55
-10
-8-6
-4-2
02
46
81
0
SN
R i
n d
B
20
21
22
23
24
25
26
27
28
29
30
Beamforming Gain in dB
Gv
h :
L=
3, L
OS
R =
0.2
Gm
ax :
L=
3, L
OS
R =
0.2
Gv
h :
L=
3, L
OS
R =
0.7
Gm
ax :
L=
3, L
OS
R =
0.7
Gv
h :
L=
0, L
OS
R =
1
Gm
ax :
L=
0, L
OS
R =
1
Figure3.6:
VMC-H
iePM
Perform
ance
inCha
nnelswithDifferentPow
erRatiosBetweentheDom
inan
tPathan
dL
=3ScatteredCom
ponents.
56 CHAPTER 3. MMWAVE WIRELESS CHANNELS VARIATIONAL ONLINE LEARNING
3.7 Conclusion and Future Work
We proposed in this work two variational Bayesian online learning schemes that enable initial access
for hybrid digital-analog enabled devices operating in mmWave wireless channels. When compared
to state of the art beam acquisition schemes, our methods shows superiority, in terms of balancing
the beam search time versus achieving higher beamforming gain, in being able to properly do so
while accounting for uncertainties on the unknown CSI (gain and noise variance) and in being very
resilient to the dominant single path assumption. Even though both of the schemes are derived based
on a discretized model of the angles of incidence of the channel’s main component, they showed great
robustness against off-grid angles as well as working with a realistic codebook implementation. Further
research will focus on adapting the proposed online learning algorithms to operating in time-varying
channels.
Chapter 4
Mutli-Stream Beamforming with
Hybrid Arrays
4.1 Overview
In this chapter1 we propose a method to derive precoders and combiners for multi-stream MIMO
transmission be- tween two devices equipped with hybrid digital-analog antenna arrays. The method
relies on a low- complexity “multi-beam split and drop with backtracking” procedure to update the
analog precoders, while digital precoders are computed with the QR-decomposition based method. We
show numerically that for sufficiently large SNRs, our proposal can approximate well the unconstrained
SVD-based precoder design and can thus enable high throughput mmWave communication systems.
1This chapter is based on the work published in the conference paper : N. Akdim, C. N. Manchón, M. Benjillali andE. de Carvalho, "Ping Pong Beam Training for Multi Stream MIMO Communications with Hybrid Antenna Arrays,"2018 IEEE Globecom Workshops (GC Wkshps), 2018, pp. 1-7, doi: 10.1109/GLOCOMW.2018.8644444.
57
58 CHAPTER 4. MUTLI-STREAM BEAMFORMING WITH HYBRID ARRAYS
4.2 Introduction
Upcoming wireless communication networks are expected to provide service to an unprecendently large
number of wireless devices with peak data rates in the order of tens of Gbps.
As we discussed in great details in earlier chapters, the low complexity and power friendly mmWave
Hybrid Antenna array transceivers are seen as a great trasnceiver design that would solve.........
the congestion and fragmentation of the traditional spectral bands below 6GHz has pushed wireless
service providers to explore vacant spectrum at the millimeter-wave (mmWave) frequency bands (30–
300 GHz) in order to fulfill that goal [4]. Having a poor diffraction capability and high absorption and
free space propagation losses, wireless communications over such high frequencies will be a challenging
task [17]. As detailed already, this frequency range will allow for the use of compact and small
antenna arrays with high number of elements, as the physical size of the array is proportional to the
carrier wavelength. The large beamforming gains that such large-scale arrays enable will be used to
compensate for the above limitations.
However, the high cost, power consumption and complexity of the mixed signal hardware at mm-
wave make having large antenna arrays with digitally controlled elements infeasible [18]. This has
motivated the wireless communication research community to look at the hybrid digital-analog antenna
array architectures [65]. In such architectures, the large antenna array is steered using analog phase
shifters and only a few digitally modulated radio-frequency (RF) chains. An illustration of such an
architecture is shown in Fig. 4.1.
Although Hybrid Antenna arrays help making packing large antenna arrays in small devices a
feasible task by reducing the implementation complexity as well as the power consumption of the
overall baseband and RF chains, they do bring their own challenges: the low SNR resulting from high
propagation losses, the large dimensionality of the MIMO channel matrix and the presence of analog
processing complicate the acquisition of the channel state information (CSI) and the computation of
the MIMO precoders and combiners [4,18]. Luckily, channel measurement campaigns [17] have shown
4.2. INTRODUCTION 59
that mm-wave channels are sparse in the angular domain, which enables the proposal of CSI acquisition
and precoding/combining algorithms that exploit such property. An example of these are compressed
sensing based approaches such as [18,43,70], which are generally computationally complex and require
a large amount of channel measurements. An alternative are exhaustive and hierarchical beam-search
techniques, which may entail significant latency and probability of miss detection [71].
In this work, we focus on a beam training strategy based on alternating transmissions between two
transceivers, which has been coined ping pong beam training (PPBT). The main idea behind PPBT
is to exploit the reciprocity of the MIMO channel. With appropriate processing at each device, the
alternate transmissions implicitly implement an algebraic power iteration that leads to approximating
the top left and right singular vectors of the MIMO channel matrix. This idea was first applied in
the digital arrays context for single stream wireless communications in [79, 80], and was extended to
multi stream setups in [81], to large antenna array and frequency selective systems in [82] and to
noisy MIMO channels in [83]. More recently, similar approaches have been proposed in the context of
mmWave communications with hybrid digital-analog antenna arrays, which we review next. In [77], the
basic ping pong beam training method for single-stream MIMO transmission was adapted to the hybrid
array architecture with the inclusion of a “beam split-and-drop” procedure for the setting of analog
precoders. The subspace estimation and decomposition method in [84] proposes a ping pong based
algorithm that iteratively estimates the channel’s right and left eigenvectors using a Krylov subspace
estimation method. This algorithm is based on exhaustive measurements with a large set of different
analog precoders, which are then linearly combined in order to cancel the effect of the analog precoders.
It therefore requires significant amount of transmissions, which imply large signalling overhead and
latency. Lastly, the power iteration based training method introduced in [85] is a technique that
extends the solution proposed in [81] to the multi stream case, where the digital precoders are set
based on an algebraic power iteration technique, while the analog precoders update is done based on
a compressed sensing technique called simultaneous orthogonal matching pursuit [86].
Compared to the above approaches, we propose in this article a strategy that sets the digital and
60 CHAPTER 4. MUTLI-STREAM BEAMFORMING WITH HYBRID ARRAYS
analog precoders of the devices in a way to approximate the top NS left and right singular vectors
of the channel matrix, with NS being the desired number of spatial streams. Our new technique,
which we dub “hybrid ping pong multi beam training“ (Hybrid PPMBT) extends the work done in [77]
to the multi stream case. It adapts the PPBT strategy to hybrid arrays by progressively choosing
the analog precoders at each device from a predefined hierarchical codebook. After one round-trip
transmission, a novel "multi beam split and drop strategy with backtracking" is applied to focus the
analog precoders towards the spatial directions that are most likely containing the channel’s top NS
multipath components. The digital precoders are updated via an orthogonal decomposition operation
on the received signal as described in [81]. In comparison to the approaches in [84] and [85], Hybrid
PPMBT is much simpler from a computational complexity aspect and has a low training overhead as it
requires significantly fewer transmissions. Simulation results show that our proposed scheme performs
very well in retrieving the wanted NS channel’s top eigenmodes for sufficiently large signal-to-noise
ratio, both in terms of accuracy and convergence speed.
Baseband
Precoder
Baseband
Combiner
RF Chain
RF Chain
RF Chain
RF Chain
RF Precoder RF Combiner
NA N
B
NA
RF
NB
RF
H
TRANSCEIVER A TRANSCEIVER B
Figure 4.1: Structure of the Hybrid Transceivers
4.3. SYSTEM MODEL 61
4.3 System Model
We consider a system in which two hybrid analog digital transceivers A and B, equipped with uniform
linear arrays (ULA) composed of NA and NB antenna elements. Such elements are separated with
a distance d = λ/2, where λ is the wavelength of interest. The two devices control digitally their
arrays with NRFA and NRF
B RF chains respectively and exchange data over a reciprocal wireless MIMO
channel using NS parallel data streams. The channel from device A to device B is considered to
be static and narrowband and is modeled according to the finite scatterer channel model with L
propagation paths [77,87], as
H =
√NANBL
L∑
l=1
αl(ΩB,l)H(ΩA,l), (4.1)
here, H ∈ CNB×NA , L is the number of multipath components (MPC), αl is the complex fading channel
gain for MPC l, ΩA,l = 2πλ d cosφA,l and ΩB,l = 2π
λ d cosφB,l are the directional cosines corresponding
to the lth MPC at arrays A and B respectively, where φA,l, φB,l are the angles of incidence of that same
path, and and are the array response vectors at device A and B respectively. The αl are modeled as
independent, standard complex gaussian variables, the φA,l and φB,l as uniformly distributed in the
range [0, 2π) radians and the array responses as (ΩA,l) = [1, e−jΩA,l , . . . , e−j(NA−1)ΩA,l ]T/√NA and
(ΩB,l) = [1, e−jΩB,l , . . . , e−j(NB−1)ΩB,l ]T/√NB .
In order to establish the wireless link with device B, device A (we assume, without loss of generality,
that device A is performing the first transmission) transmits T , an NS × NS orthogonal training
sequence i.e TTH = INS . Upon reception, device B cancels the training sequence effect by multiplying
its received digital signal by TH . The resulting signal can be expressed as:
YB = FHBHFAWA + FH
BNB , (4.2)
where FA ∈ CNA×NRFA and FB ∈ CNB×NRFB contain the states of the analog precoder and com-
biner of transceivers A and B, WA ∈ CNRFA ×NS denotes the digital precoder of transceiver A and
NB ∈ CNB×NS is a complex, circularly-symmetric additive white gaussian noise matrix, obtained
after training sequence removal and with i.i.d elements, each with variance σ2. Transmissions from
62 CHAPTER 4. MUTLI-STREAM BEAMFORMING WITH HYBRID ARRAYS
device B to device A are modeled analogously as
YA = FHAHHFBWB + FH
ANA, (4.3)
where WB ∈ CNRFB ×NS and NA ∈ CNA×NS are defined similar to the above.
4.4 Hybrid Ping Pong Multi Beam Training : Hybrid PPMBT
Given the signal model in (4.2) and (4.3), the beamforming task consists of selecting the set of analog
and digital precoders and combiners that maximize the spectral efficiency over a given channel matrix
H. For transmission from device A to B, and assuming unit transmit power equally allocated across
the NS streams, the spectral efficiency reads
R = log2 det(INS +
R−1NB
NSHeH
He
), (4.4)
where RNB= σ2WH
BFHBFBWB is the noise covariance matrix after receive combining at device B,
He = WHBFH
BHFAWA is the equivalent channel after precoding and combining at both devices. An
analogous expression applies for transmission from device B to device A.
The optimal precoders maximizing (4.4) are known to be the NS top right and left singular vectors
of H. However, the hybrid structure of the antenna array makes the computation of such precoders
challenging. On the one hand, as digital measurements of the channel are only obtained after analog
precoding and combining, estimating the full channel matrix H in order to obtain its singular value
decomposition requires a large number of measurements and hence large overhead and latency [84].
On the other hand, even if the channel matrix H can be estimated, the precoders have to be built
as the product of the analog precoding matrix FA and the digital precoding matrix WA. While the
elements of WA can take any complex value due to its digital implementation, the operation modeled
by FA is implemented via phase shifters and combiners, which restricts the values it can take. In this
work, we restrict the entries of FA to satisfy |(FA)l,i|2 ∈ 1
M(i)A
; 0, where M (i)A being the number of
activated array elements in the ith column of FA, and the option (FA)l,i = 0 accounts for the option of
2(FA)l,i is the entry of the matrix FA belonging to its lth row and ith column.
4.4. HYBRID PING PONG MULTI BEAM TRAINING : HYBRID PPMBT 63
leaving some elements of the array unused. In addition, a transmit power constraint is enforced such
that ‖FAWA‖F = 1.3 With these constraints, finding the combination of digital and analog precoders
to best approximate the channel’s singular vectors becomes a computationally intensive optimization
problem [43].
To overcome such difficulties, we propose an iterative multi beam training scheme based on alternate
transmissions between the two devices, this procedure estimates progressively and simultaneously the
top NS right and left singular vectors of H and sets the digital and analog precoders so that they
approach those singular vectors. It consists of two parts: 1. a "backtracking beam split and drop"
approach to select the analog precoders FA and FB from a predefined multi level codebook, 2. a
method to select the digital precoders WA and WB inspired by the QR decomposition algorithm
described in [81].
We will proceed by reviewing the beam training procedure for digital antenna arrays proposed
in [81], then briefly present the multi level codebook that is used for the analog precoder update, and
finally explain our multi beam training solution.
4.4.1 Ping-Pong Multi Beam Training with Digital Antenna Arrays: Dig-
ital PPMBT
We review the digital PPBT algorithm over a narrowband reciprocal channel H as described in [81].
We consider two devices A and B equipped with digitally controlled antenna arrays with NA and NB
elements respectively. At the initial (0th) iteration, the process starts with a random initialization
of the precoder at device A, W [0]A . A uses then this initial precoder to transmit a training sequence
to B. Upon reception and training sequence removal, device A gets an estimate of HW[0]A , makes a
QR-decomposition on it, and uses the Q part of that decomposition as its precoder W [0]B . It then uses
that precoder to transmit a training sequence back to device A, who will repeat the same operations.
This process is reiterated until convergence, at which, device A gets an estimate of the top NS right
3Obviously, the same constraints apply to the analog precoder of device B.
64 CHAPTER 4. MUTLI-STREAM BEAMFORMING WITH HYBRID ARRAYS
singular vectors of H and device B gets an estimate of the top NS right singular vectors of HH.
Further details on this procedure can be found in [81].
4.4.2 Analog Precoder Multi Level Codebook
We illustrate here the codebook definition for transceiver A (an analogous codebook is used for the
transceiver B). We consider a codebook CA which is composed of LA= log2(NA/NRFA )+1. levels.
Note that for the considered codebook design we constrain NA and NRFA to be both integer pow-
ers of two. For the kth level, we define a subcodebook C(k)A =ϕ(k)
A,i, i=0, 1, . . . ,M(k)A -1 consisting of
M(k)A =NRF
A 2k-1 column vectors, k=1, 2, . . . , LA. Each of the elements of the subcodebook is defined
as :
ϕ(k)A,i=
[1, e-jψ
(k)A,i , . . . , e-j(M
(k)A -1)ψ
(k)A,i ,0T
NA-M(k)A
]T/
√M
(k)A (4.5)
where ψ(k)A,i=π-π(2i+1)/M
(k)A is the directional cosine of the ith vector at the kth level (ϕ(k)
A,i steers the
array in the direction θ(k)A,i= arccosψ
(k)A,i/π, with a lobe whose width decreases with the codebook level
k), and 0N is the N -dimensional column zero vector. Further details about the codebook used here
can be found in [77].
Basically ϕ(j)A,i steers the array in the direction θ
(j)A,i = arccosψ
(j)A,i/π, with a lobe whose width
decreases with the codebook level j. Note that for a fixed level k, the directional cosines θ(k)A,i, i =
0, . . . ,M(k)A − 1 are set to uniformly sample the directional cosine range [−π, π] and as k increases this
range is sampled with larger resolution and more array elements are used for higher-level codebook
elements, resulting in more directive beamforming vectors (an illustration of this is shown in figure 4.2).
For more details about RF codebook optimization methods, we refer the reader to [88,89].
4.4. HYBRID PING PONG MULTI BEAM TRAINING : HYBRID PPMBT 65
Angle (radians)
0 0.5 1 1.5 2 2.5 3 3.5
Arr
ay G
ain
(d
B)
0
5
10
15Codebook Level 1
Angle (radians)
0 0.5 1 1.5 2 2.5 3 3.5
Arr
ay G
ain
(d
B)
0
5
10
15Codebook Level 2
Angle (radians)
0 0.5 1 1.5 2 2.5 3 3.5
Arr
ay G
ain
(d
B)
0
5
10
15Codebook Level 3
Figure 4.2: Array Gains Obtained with the Analog Beamformers of the Proposed Multi-Level Code-book, NA = 16, NRF = 4, LA = 3.
4.4.3 Ping Pong Multi Beam Training with Hybrid Antenna Arrays : Hy-
brid PPMBT
The proposed algorithm for beam training with hybrid arrays is described in pseudocode. Algorithm 4
presents the overall training scheme, while Algorithm 5 describes the subroutine used to update the
analog precoding matrices.
66 CHAPTER 4. MUTLI-STREAM BEAMFORMING WITH HYBRID ARRAYS
Initialization
First, the digital and analog precoders are initialized. FA and FB are initialized to C(1)A and C(1)
B
respectively, while WA is initialized to a random unitary matrix. The columns of WA are then
normalized to fulfill the transmit power constraint of the effective precoder FAWA. Finally, a set of
empty arrays, pB ,kB , iB, are created. Those arrays will store the needed information to perform
the analog precoder updates, as explained below.
Ping Pong Iterations
After the initialization phase, a sequence of alternate pilot transmissions starts between the two devices.
The baseband precoders are updated after each reception step by means of a QR-decomposition,
followed by a normalization step. These two operations are detailed in lines 3-7, 12-16 and 20-24 in
Algorithm. 4. Immediately after updating their baseband precoders upon reception of a transmission,
the devices transmit back with the updated digital precoders and the same analog precoder as used for
reception. Only after the transmission has been made will the transmitting device update its analog
precoders (using Algorithm 5), such that next reception is done with the updated setting. This allows
for the QR based iteration to converge, as each reception-transmission cycle is performed over a static
setting of the analog precoders.
Update of Analog Precoders
The devices invoke the routine outlined in Algorithm 5 to update their analog precoder state. This
routine bases its update on the current state of the device’s RF precoder , its codebook C and on the
update history of its baseband precoder . Using all previous updates of allows for backtracking–i.e,
correcting for wrong RF precoder updates. The routine works as follows:
a) Three sequences of values are generated: kn stores the level of the codebook of the nth column
of , pn stores the squared norm of the nth row of , i.e the aggregate energy received on it, and
4.4. HYBRID PING PONG MULTI BEAM TRAINING : HYBRID PPMBT 67
Algorithm 4: Ping Pong Multi Beam Training with Hybrid Arrays1 Input : Antenna Array Size NA and NB , the codebooks CSA and CSB , Search time τ
2 F[0]A ←
[ϕ
(1)A,0,ϕ
(1)A,1, . . . ,ϕ
(1)
A,M(1)A −1
]
3 F[0]B ←
[ϕ
(1)B,0,ϕ
(1)B,1, . . . ,ϕ
(1)
B,M(1)B −1
]
4 Initialize W[0]A to an orthogonal matrix of its size.
5 for s = 1, 2, . . . , NS − 1 do6 W
[0]A (:, s)←W
[0]A (:, s)/
√NS
∥∥∥F [0]A W
[0]A (:, s)
∥∥∥2
7 end8 pA,kA, iA ← [], [], []9 pB ,kB , iB ← [], [], []
10 A transmits, B receives:11 Y
[0]B =(F
[0]B )HHF
[0]A W
[0]A +(F
[0]B )HN
[0]B
12 [Q,R]← QR(Y[0]B )
13 for s = 1 : NS do14 W
[0]B (:, s)← Q(:, s)
15 W[0]B (:, s)←W
[0]B (:, s)/
√NS
∥∥∥F [0]B W
[0]B (:, s)
∥∥∥2
16 end17 for t = 1, 2, . . . , τ − 1 do18 B transmits, A receives:19 Y
[t]A =(F
[t−1]A )HHHF
[t−1]B W
[t−1]B +(F
[t−1]A )HN
[t]A
20 [Q,R]← qr(Y[t]A )
21 for s = 1 : NS do22 W
[t]A (:, s)← Q(:, s)
23 W[t]A (:, s)←W
[t]A (:, s)/
√NS
∥∥∥F [t−1]A W
[t]A (:, s)
∥∥∥2
24 end25 [F
[t]B , pB ,kB , iB]← Upd.An.Pr[F
[t−1]B ,W
[t−1]B , CB , pB ,kB , iB]
26 A transmits, B receives:27 Y
[t]B =(F
[t]B )HHF
[t−1]A W
[t]A +(F
[t]B )HN
[t]B
28 [Q,R]← qr(Y[t]B )
29 for s = 1 : NS do30 W
[t]B (:, s)← Q(:, s)
31 W[t]B (:, s)← W
[t]B (:,s)
√NS
∥∥∥F [t]B W
[t]B (:,s)
∥∥∥2
32 end33 [F
[t]A , pA,kA, iA]← Upd.An.Pr[F
[t−1]A ,W
[t]A , CA, pA,kA, iA]
34 end35 Output : [F
[τ ]A ],W
[τ ]A ,F
[τ ]B ],W
[τ ]B ]
in stores the index of the nth column of , out of the level of the codebook to which that column
belongs.
68 CHAPTER 4. MUTLI-STREAM BEAMFORMING WITH HYBRID ARRAYS
Algorithm 5: Analog Precoder Update : Upd.An.Pr1 Input : [F , W , C,p,k,i]2 Generate : kn, in, pn, n=1, . . . , NRF , where kn will store the codebook level to which the
′s
nth column belongs, in will store its index out of that level and pn will store the norm of thenth row of , i.e the aggregate energy received on it.
3 p← [p, p1, . . . , pNRF ],4 k← [k, k1, . . . , kNRF ],5 i← [i, i1, . . . , iNRF ]6 Sort p in a descending manner and store the result in pS , then store the arrangement of the
elements of p into pS in pI .7 bI ← []8 bL ← []
9 while n ≤ NRF do10 bI ← [bI , pI,n], pI,n is the nth element of pI , ppI,n ← 0, ppI,n is the pI,nth element of p,11 if kpI,n= log2( N
NRF)+1 or n=NRF -1 then
12 n← n+ 1, bL ← [bL, 1]13 else
14 n← n+ 2, bL ← [bL, 2]
15 for t = 1 : Length(bI) do16 m← bL,t17 if m = 2 then18 F ← [F,ϕ
(km+1)2im
,ϕ(km+1)2im+1 ]
19 else
20 F ← [F,ϕ(km)im
]
21 return F ; p,k, i
b) The sequences generated above will be used to update three vectors: kn will be appended to
k, with k being an array storing the codebook levels of the columns of used over consecutive
ping-pong iterations. pn will be appended to p, with p being an array storing the received energy
over the different spatial directions set by the analog beamformer . in will be appended to i in
a similar manner to the above.
c) Once p is updated, it will be sorted in a descending manner and the resulting sorted indices will
be stored in pI . This newly formed array will be used to find the entries of p that are most likely
to direct the analog precoders where the MPCs of H are.
d) Two new vectors are built: bI contains the K first indexes of pI i.e it identifies the beams that
are most aligned with the channel’s MPCs (K is the length of bI , which can be derived from
4.5. NUMERICAL RESULTS 69
lines 9-13). bL is a vector that is made of 1’s and 2’s. The ith entry of bL is set to 2 when
the precoder corresponding to the ith entry of bI is replaced with the two precoders belonging
to one step higher level of the codebook and that have their beams covering together its same
spatial area, otherwise it is set to 1. Deciding to append 1 or 2 to bL depends on whether we
already consumed all columns of and on whether the element of pI in question belong to the last
level of the codebook or not (see lines 17-21). Line 8 of the algorithm erases the measurement
stored over a beam that is selected to be included in the analog precoding matrix, either directly
or after splitting it into two beams of the immediately higher level. As new measurements
will be obtained over that beam in the next iteration, the old measurement is deleted to avoid
unnecessarily coming back to the previous configuration corresponding to that old measurement.
e) Finally, the RF precoder columns are updated (lines 15-22).
We dub the above procedure for updating the analog precoders “beam split and drop with backtracking”,
owing to the way it operates. At each iteration, a decision is made as to whether a given beam is
split –i.e replaced by two more directive beams– or dropped –i.e. removed from the precoding matrix.
The backtracking feature refers to the fact that, via the vector p, the measurements obtained in prior
ping-pong iterations are kept in memory, allowing for returning to lower level beams in the codebook
in the cases where noise leads to erroneous decisions in the the split-and-drop procedure.
4.5 Numerical Results
In order to assess the effectiveness of the proposed algorithm, we perform Monte Carlo simulations
for multiple configurations of the hybrid arrays at devices A and B. The channel matrix H follows
the model in (4.1). The average SNR for the sth stream link between the nth element of the array at
device B and the mth element of the array at A is defined as ρ=E|Hnm|2/E|Nns|2=1/σ2, where
Nns is the nth entry of the sth column of the noise matrix N . We assume here for simplicity that
the SNR ρ is the same for all streams. Hnm is the channel coefficient between device B’s nth array
70 CHAPTER 4. MUTLI-STREAM BEAMFORMING WITH HYBRID ARRAYS
element and device A’s mth array element, and E is the expectation operator.
We measure the performance with the average spectral efficiency per bits/s/Hz, expressed with :
log2
( ∣∣∣∣INS+R−1N
NSHeH
He
∣∣∣∣)
where He=(F[n−1]B W
[n−1]B )HHF
[n−1]A W
[n]A and RN=σ2(F
[n−1]B W
[n−1]B )HF
[n−1]B W
[n−1]B for integer it-
eration n, and He=(F[n]B W
[n]B )HHF
[n−1]A W
[n]A and RN=σ2(F
[n]B W
[n]B )HF
[n]B W
[n]B for half iteration
n+0.5.
We benchmark the performance of the hybrid PPMBT against the performance of the digital ping
pong multi stream beam training method (digital PPMBT) described in [81], for which the spectral
efficiency is calculated in a similar manner to that of the hybrid PPMBT. We compare it also with
the optimal unconstrained SVD based precoder, which maximize the spectral efficiency and which is
obtained by using the top NS left and right singular vectors of H. Note that the SVD based precoder
derivation requires full channel knowledge at both devices, an information that is hard to get when
hybrid architectures are used as explained earlier, and which our proposed scheme does not need at
the start, but rather learns while setting the precoders.
We start our evaluation in the high SNR regime (ρ=30dB), in which the training process will not
be impaired too much by noise. This allows assessing how the arrays size, the number RF chains and
the number of spatial streams really affect the algorithm’s performance.
Fig. 4.3 depicts the algorithm’s performance over a channel with 8 multipath components, when the
two devices are equipped with identical arrays made of NA=NB=32, 128 or 1024 elements and a fixed
number of RF chains NRFA =NRF
B =8 and try to establish an NS=4 streams MIMO communication.
We see that the algorithm reaches, in very few iterations, for small, medium and large sized arrays, to
about 1 to 2 bits/s/Hz of the digital PPMBT and SVD based precoding schemes performances. We also
see that the convergence speed (in terms of PP iterations) scales inversely to the codebook depth for
each of the topologies: the convergence is slower when large arrays are used because, in such cases, the
codebook has more levels and its few last levels contain a high number of beamformers with very narrow
beams. The selection of such directive beams is more prone to error than those in codebooks with lower
4.5. NUMERICAL RESULTS 71
0 2 4 6 8 10 12 14 16 18 20 22
Ping Pong Iterations
30
40
50
60
70
80
90
100S
pe
ctr
al
Eff
icie
nc
y
Optimal Precoding (SVD)
Digital PPMBT
Hybrid PPMBT
NA
=NB
=1024
NA
=NB
=128
NA
=NB
=32
Figure 4.3: Spectral Efficiency (bits/s/Hz) Attained by the Algorithm over PP Iterations. Devices Aand B are Equipped with a Hybrid Array : NRF
A =NRFB = 8, NS=4, L=8, ρ=30dB.
resolution. Although the backtracking mechanism can correct for the errors made, this comes at the
expense of a larger number of iterations needed for convergence. Note that the number of PP iterations
needed for convergence for all topologies is still much below what is needed for exhaustive search: this
latter requires as an example, for NA=NB=1024 with NRFA =NRF
B =8, NA/NRFA =NB/NRF
B =128 PP
iterations to find the best beams, compared with only 16 for our algorithm.
Fig. 4.4 shows the algorithm’s performance over the same channel as the one used in Fig. 4.3, but
with the number of MIMO streams, NS , taking different values (4, 6 and 8) and the two devices being
equipped with identical arrays made of NA=NB=128 elements and use 8 RF chains each. The purpose
of these simulations is to investigate how does the ratio of the number of MIMO streams to the number
of RF chains NS/NRF affect the algorithm’s performance. We can clearly see that the algorithm
72 CHAPTER 4. MUTLI-STREAM BEAMFORMING WITH HYBRID ARRAYS
behaves in three different ways depending on the aforementioned ratio: 1) When NS ≤ NRF /2,
the convergence is very quick and no backtracking is performed. 2) When NRF /2 < NS < NRF , the
algorithm’s convergence is slowed down and one observes some irregularity of the convergence behavior
over iterations which is due to the backtracking mechanism. 3) When NRF = NS , the algorithm fails
to provide acceptable performance.
0 5 10 15 20 25 30 35 40
Ping Pong Iterations
0
10
20
30
40
50
60
70
80
90
100
110
Sp
ectr
al E
ffic
ien
cy
Optimal Precoding (SVD)
Digital PPMBT
Hybrid PPMBT
NS=4
NS=6
NS=8
Figure 4.4: Spectral Efficiency (bits/s/Hz) Attained by the Algorithm over PP Iterations for DifferentNS Values. Devices A and B are Equipped with a Hybrid Array : NA=NB=128, NRF
A =NRFB =8, L=8,
ρ=30dB.
Next, in Fig. 4.5, we fix the size of antenna arrays and number of RF chains used at both devices to
NA=NB=64 and NRFA =NRF
B =8, and evaluate the algorithm’s performance over a channel with L = 7
multipath components at different SNR values. In Fig. 4.5a, we show the algorithm’s performance
against SNR after training convergence for different NS values. The results show that the number of
streams that the training algorithm can handle efficiently grows with SNR, as is to be expected. At
4.5. NUMERICAL RESULTS 73
low SNR regimes (−15 to 0 dB) the algorithm works better when it attempts to estimate only the
dominant singular vector of the channel; if more singular vectors are estimated, the large estimation
error degrades the overall spectral efficiency of the system. As the SNR grows, an increasing number
of singular vectors can be accurately estimated by the algorithm and, hence, increasing the number of
streams provides significant spectral efficiency gains. For all cases, we observe that the performance
of the Hybrid PPMBT is very close to that of the fully-digital counterpart, and approaches the SVD
precoding performance as the SNR grows. In Fig. 4.5b the convergence behavior of the training
algorithm with different NS settings is evaluated at their respective SNR values of interest. We observe
that the method’s spectral efficiency tends to saturate at around 10 ping pong iterations for moderate
and high SNR. Convergence for lower SNR values is, however, slower. We attribute this effect to the
fact that the large noise power induces numerous incorrect beam selection errors in the updates of the
analog precoder; although the backtracking feature of the algorithm can help correct some of them,
the price to pay is a longer training time. In any case, we remark that the algorithm reaches about
70% of the spectral efficiency obtained at convergence within the first 6 iterations, regardless of the
SNR value and number of spatial streams.
To conclude, we evaluate our proposed training procedure in a system in which only one of the
devices is equipped with a large, hybrid array (NA=256, NRFA =16), while the other has a digitally-
controlled array of moderate size (NB=4). Such topologies can be seen as massive MIMO systems
operating at microwave frequencies. In addition, to reflect the richer scattering experienced in such
frequency bands [90], we adopt the following channel model:
H=√NANBL
ABΛAHA (4.6)
where AA contains in its columns the steering vectors (ΩA,p), p=1, 2, . . . , P , AB is defined analogously,
L is again the number of multipath components, and Λ is a L×L matrix with i.i.d. standard complex
Gaussian entries. This setting will allow to test the validity of our approach in channels with richer
scattering and its robustness against the sparse assumption of the channel.
Fig. 4.6 shows the algorithm’s performance over a rich scattering channel (L=40) when NA=128,
74 CHAPTER 4. MUTLI-STREAM BEAMFORMING WITH HYBRID ARRAYS
NRFA =16, NB=NS=4. It can be seen that the proposed scheme performs remarkably well and very
close to the full digital and optimal SVD based precoder solutions at low, mid and high SNRs. These
results show that, although the algorithm was originally designed to exploit the sparse nature of
mmWave channels, it is robust to channels with richer scattering.
4.6 Conclusion
We proposed a method to derive precoders and combiners for multi-stream MIMO transmission be-
tween two devices equipped with hybrid digital-analog antenna arrays. The method relies on a low-
complexity “multi-beam split and drop with backtracking” procedure to update the analog precoders,
while digital precoders are computed with the QR-decomposition based method in [81]. For sufficiently
large SNR, the resulting precoders approximate well the unconstrained SVD-based precoders, as our
numerical assessment shows. We envision that the proposed algorithm can be especially useful in
mmWave communication systems.
Compared to the state-of-art methods, our approach offers the advantage of computational sim-
plicity while achieving high-spectral efficiency with moderate training overhead. The numerical results
show that the method achieves convergence within NRF (log2(N/NRF )+1) ping pong iterations in the
low SNR regime and log2(N/NRF )+1 iterations in the mid and high SNR regime, assuming both
transceivers are equipped with arrays made of N elements and NRF RF chains. Although the method
was developed with sparse channels in mind, the performance assessment shows that it is robust against
this assumption and also performs well in rich scattering channels.
Also, in order to further reduce the training overhead, the proposed scheme can be interleaved with
transmission of payload with increasing data-rate. This, the extension to multi-user environments and
to time varying channels will be the subject of our future work.
4.6. CONCLUSION 75
-20 -15 -10 -5 0 5 10 15
SNR(dB)
0
5
10
15
20
25
30
35
40
45
50
Sp
ectr
al E
ffic
ien
cy
Optimal Precoding (SVD)
Digital PPMBT
Hybrid PPMBT
NS=2
NS=1
NS=4
(a) Spectral Efficiency over SNR
0 5 10 15 20 25
Ping Pong Iterations
0
5
10
15
20
25
30
35
40
45
50
Sp
etc
ral E
ffic
ien
cy
Optimal Precoding (SVD)
Digital PPMBT
Hybrid PPMBT
ρ=0dB, NS=2
ρ=15dB, NS=4
ρ=-15dB, NS=1
(b) Spectral Efficiency over PP Iterations
Figure 4.5: Spectral Efficiency (bits/s/Hz) Attained by the Algorithm over PP Iterations for DifferentNS Values. Devices A and B are Equipped with a Hybrid Array. NA=NB=64, NRF
A =NRFB =8, L=7.
76 CHAPTER 4. MUTLI-STREAM BEAMFORMING WITH HYBRID ARRAYS
0 2 4 6 8 10 12 14 16
Ping Pong Iterations
0
10
20
30
40
50
60
70
80
Sp
ec
tra
l E
ffic
ien
cy
Optimal Precoding (SVD)
Digital PPMBT
Hybrid PPMBT
ρ=-15dB
ρ=30dB
ρ=0dB
Figure 4.6: Spectral Efficiency (bits/s/Hz) Attained by the Algorithm over PP Iterations. De-vice A is Equipped with a Hybrid Array and Device B has a Full Digital Architecture.NA=128, NB=4, NRF
A =16, NS=4, L=40.
Chapter 5
Concluding Remarks
5.1 Summary
The present thesis focused on developing beam search and beamforning solutions that address key
challenges in mmWave massive MIMO systems. The algorithms proposed herein tackles problems
related to the aforementioned system’s hardware constraints, channel acquisition overhead, and the
related beamforming design complexity.
We first proposed efficient Bayesian active beam search algorithms that suit best the noisy and
sparse nature of the mmWave massive MIMO line of sight channels. We showed how the proposed
techniques alleviate the need for separate channel estimation and beamforming design, while showing
high robustness against noise in low signal to noise ratio regimes. Then, we developed a novel multi
beam search algorithm that relaxes the channel’s line of sight assumption and helps establish multi-
stream mmWave wireless links. By leveraging the iterative power iteration technique, we showed that
our proposed scheme almost approximates singular value decomposition based multi stream beam-
forming.
For the line of sight mmWave massive MIMO systems, we proposed two variational Bayesian active
learning schemes that enable initial access for hybrid digital-analog enabled devices operating in these
77
78 CHAPTER 5. CONCLUDING REMARKS
highly sparse channels. The proposed schemes are devised with the goal to balance the beam search
time and to achieve high beamforming gain, while accounting for uncertainties on the unknown channel
(gain and noise variance). They build upon an active learning algorithm called hierarchical posterior
matching and extend it with tools from Bayesian inference to devise bi-directional beam alignment
algorithms that are numerically shown to effectively handle the uncertainty in the channel parameters,
thus resulting in beamforming gains close to these of exhaustive beam search algorithms, while requiring
an amount of pilot measurements comparable to that of hierarchical search algorithms.
For the multi beam alignment problem, we proposed an algorithm that derives precoders and
combiners for multi-steam MIMO wireless communications systems that are quipped with hybrid
digital-analog arrays. The proposed method uses a novel beam search mechanism called "multi-beam
split and drop with backtracking" to update the analog precoders and combiners, and uses the QR-
decomposition based update to devise the digital ones. Numerical simulations show that our algorithm
can approximate, with low pilot overhead, quite well the unconstrained SVD-based precoder/combiner
design for sufficiently large SNRs.
5.2 Future Work
There are several possible directions for future research.
In Chapter 3, we proposed two variational Bayesian online learning schemes that enable initial
access for hybrid digital-analog enabled devices operating in mmWave wireless channels, for point to
point hybrid array transceivers, communicating over static wireless channels. Both of our schemes
showed superiority, when compared to state of the art, in balancing the beam training overhead versus
attaining high beamforming gain. Further work is still required to adapt our proposals, and especially
the HiePM search mechanism, to both time varying channels as well as multi user environments.
In Chapter 4, we proposed a hybrid scheme that marries the novel multi-beam split and drop
with backtracking search mechanism with the QR-decomposition to devise the analog and digital
5.2. FUTURE WORK 79
precoders and combiners, for point to point hybrid array transceivers, communicating over static
wireless channels.
Theoretical guarantees for ’the multi-beam split and drop with backtracking’ scheme:
For fully digital transceivers, the QR-decomposition based precoding and combining is guaranteed
to converge to the wireless channel’s main singular value decomposition (SVD) components. One
important extension of this work is to consider investigating similar theoretical guarantees for our
hybrid algorithm and see under which conditions can the multi-beam split and drop with backtracking
search algorithm, when combined with QR-decomposition or any other baseband linear subspace search
algorithm, attain the performance of the unconstrained SVD based precoding and combining.
Training Overhead reduction for the proposed Ping Pong Multi Beam Training with
Hybrid Antenna Arrays scheme: Another interesting direction for future study is to investigate
how our proposed scheme can be interleaved with transmission of data payload with increasing data
rate. This would reduce the training overhead and make better use of the wireless channel resources.
Extension to multi-user environments and to time varying channels: One pre-requisite
to have our scheme considered for a concrete implementation in a cellular context is to make it usable
in multi-user environments and robust against the channel’s time variability. Future work in these two
directions is thus of great importance.
Appendix A
Variational Hierarchical Posterior
Matching for mmWave Wireless
Channels Online Learning
83
Variational Hierarchical Posterior Matching formmWave Wireless Channels Online Learning
Nabil Akdim1, Carles Navarro Manchon2, Mustapha Benjillali3 and Pierre Duhamel41 Apple, Munich, Germany
2 Department of Electronic Systems, Aalborg University, Denmark3 Communication Systems Department, INPT, Rabat, Morocco
4 Laboratoire des Signaux et Systemes (L2S), CNRS-CentraleSup, FranceEmails: [email protected], [email protected], [email protected], [email protected]
Abstract—We propose a beam alignment algorithm thatenables initial access establishment between two transceiversequipped with hybrid digital-analog antenna arrays operating inmillimeter wave wireless channels. The proposed method buildsupon an active channel learning method based on hierarchicalposterior matching that was originally proposed for single-sidedbeam alignment on single path dominant channels. We extend itto the double-sided alignment problem and propose an estimationframework based on variational Bayesian inference that accountsfor the uncertainties on the unknown channel complex gain andnoise variance. The proposed approach is numerically shownto be resilient to the single path assumption and reaches nearoptimal beamforming gains with a moderate training overhead,even at low signal-to-noise ratios.
I. INTRODUCTION
Low power consumption and implementation complexity ofhybrid digital-analog transceiver architectures have acceleratedtheir adoption as a beamforming solution that can enableefficient wireless communications over the harsh mmWave fre-quency limited-scattering and blockage-prone wireless chan-nels for 5G and beyond wireless cellular networks [1]. Also,angular sparsity of such channels [1], [2] allows for the useof adaptive sparsity-friendly techniques to ease the initialalignment and channel state information (CSI) acquisition onthem, when using such transceiver designs [3]–[6].
In this study, we focus on hierarchical posterior matching(HiePM), an initial access scheme introduced in [7] whichprovably enables fast and reliable initial access establishmentbetween two wireless transceivers over wireless mmWavechannels with a single dominant path. Chiu et al. have shownin that contribution that using posterior matching [8] togetherwith hierarchical beam search [4] can significantly reducethe initial access acquisition time while keeping the corre-sponding misalignment probability relatively low, providedthat the channel’s complex gain and operating signal-to-noiseration (SNR) are fully known to the communicating devices.These limiting constraints were relaxed in [9] by proposingto augment HiePM with extra simplifying assumptions onthe statistical properties of the channel’s CSI and then touse either a sampling scheme or a linear filtering scheme(Kalman filter) to learn it in parallel to running HiePM.Although this latter extension of the vanilla HiePM makesit robust with respect to uncertainties on the channel’s CSI,
it still presents some limitations. First, it assumes perfectknowledge of the operating SNR. Second, the assumptionsmade on the statistical distribution of the channel’s complexgain (needed to make HiePM able to run as we will seelater), are simplistic and not justified from a theoretical orpractical view. Third, the proposed methods that build onsuch statistical assumption to overcome the CSI uncertaintyissue are either very restrictive and computationally heavy(in the case of the sampling method) or show relativelymoderate to low performance (in the case of the Kalmanfilter method). Finally, the overall tweaked setup assumes thatone of the communicating transceivers has a single antenna,and the extension to the case where both communicatingdevices use the hybrid digital-analog transceiver structure isnot straightforward.
Our contribution, in this work, which we dub “VariationalHiePM (V-HiePM)” will address shortcomings of proposalsin both of the aforementioned works [7], [9]. Specifically, wewill augment HiePM with a variational approximate inferencemodel [10] that will:
• make it robust against uncertainties of both CSI andoperating SNR,
• allow for a natural and theoretically grounded parametri-sation of the statistical properties of the CSI and operatingSNR, in a way that will make HiePM run smoothly,
• make the overall setup performing very close to thevanilla HiePM scheme with perfect SNR/CSI knowledge,
• allow for both communicating devices to be equippedwith hybrid digital-analog arrays.
II. SYSTEM MODEL
Our system is composed of two hybrid digital-analog an-tenna array devices A and B, equipped with uniform lineararrays (ULAs) of NA and NB antenna elements respectively.The elements on the ULAs are separated by a distanced = /2, where is the the mmWave wavelength ofinterest. Device A (B respectively) digitally controls its ULAwith NRF
A (NRFB respectively) RF chains. The two devices
communicate over a reciprocal static and narrowband wirelessmmWave MIMO channel, that is modeled as a NB NA
complex matrix H , sampled from the finite scatterer channelmodel [5] with a single dominant path as1:
H = ↵aB(B)aHA(A) (1)
where ↵ is the complex fading channel gain. aA(A) andaB(B) are the ULA array response vectors at devices A andB with incidence angles A and B respectively, modeledas aA (!A) =
1, ej!A , . . . , ej(NA1)!A
Tand aB (!B) =
1, ej!B , . . . , ej(NB1)!BT
, with !A(A) = 2 d cos (A)
and !B(B) = 2 d cos (B). The incidence angles A and
B are assumed to be sampled from the ranges [A,1, A,2]and [B,1, B,2] respectively. 2
The two devices go through an initial access phase con-sisting of a pilot based beam alignment procedure in order toestablish the wireless link between them. We assume in thiswork that, during this initial access phase, the CSI learningand beam search processes for the two devices are centralized,i.e one of the devices, say B, is collecting measurementsbased on device A’s pilot transmission, uses them to learnthe channel’s statistics and devises the beamformer it willuse for the next pilot reception occasion together with thebeamformer that device A should use in sending that pilot,then communicates such information to device A through anideal, error-free control channel3. At time instant t, device Asends a pilot symbol to B, which observes, after pilot removal:
yB,t =p
PwHB,tHfA,t + wH
B,tnB,t (2)where fA,t 2 CNA and wB,t 2 CNB are the effectivebeamformer and combiner used at time t by transceiversA and B respectively. These are chosen from the hybriddigital-analog codebooks detailed next. nB,t 2 CNB is acomplex circularly-symmetric additive white Gaussian noisevector with i.i.d elements with an unknown variance 2
B ,obtained after training sequence removal.
pP is the average
transmit power of the pilot signal.The adaptive beamforming strategy proposed herein utilizes
the hierarchical beamforming codebook of [4]. Such a code-book, noted CS hereafter, is designed to have S levels of beampatterns. We note Cl the collection of beams belonging to levell. Then, Cl contains 2l beamforming vectors that divide thesector [1, 2] into 2l directions, each associated with a certainrange of incidence angles Rm
l , such that [1, 2] = [2l
m=1Rml .
We note each of such 2l vectors as either fA (Rml ) or
wB (Rml ), depending on the considered device.
III. SEQUENTIAL BEAM PAIR SEARCH VIA VARIATIONALHIERARCHICAL POSTERIOR MATCHING
We start this section by first reviewing the details of thevanilla HiePM scheme [7], showing that knowledge of the
1This assumption is used only for the analytical derivation of our scheme.In Section IV, we show numerically that the method is resilient to such alimitation.
2No statistical assumptions on the distribution of the gain and the anglesare made here, we will later justify the choice of distributions we will useduring the inference process.
3Such channel can e.g. be established via a sub-6 GHz link in a non-stand-alone deployment. Control feedback channel design details will not bediscussed here due to space constraints.
channel gain ↵ as well as the noise variance 2 is necessary tomake such a search strategy usable in practice. We then detailour main contribution, which consists of augmenting HiePMwith a novel variational model comparison based approximateinference framework [10] to account for the uncertainties about↵ and 2
B and thus overcome the shortcomings, as detailed inthe introduction, of the vanilla HiePM and modified HiePMschemes proposed in [7] and [9] respectively.
A. Sequential Active Learning via the HiePM Strategy
We illustrate here the use of vanilla HiePM scheme [7]for device A (an analogous strategy will be used for deviceB). HiePM selects fA,t+1 based on the posterior at time t ofthe incidence angle A. We discretize4 the noisy beam searchproblem above by assuming that the beam search resolutionA
5 is an integer power of two and that the AoA A is of theform:A 2 A,1, . . . ,A,A,A,i = A,1+
(i 1)
A(A,2 A,1) (3)
With the above setup, the posterior distribution of A givenall measurements up to time t (collected in vector yB,1:t), canbe written as a A-dimensional vector A (t) with entries
A,i (t) := Pr (A = A,i|yB,1:t) , i = 1, . . . , A. (4)
The posterior probability of A being in a certain range,say Rm
i , can be computed asA,Rm
i(t) :=
X
A,i2Rmi
A,i (t). (5)
The HiePM strategy examines the posterior probabilityA,Rm
i(t) for all i = 1, . . . , SA and m = 1, . . . , 2i and
selects fA,t+1 2 CS to be the beamformer corresponding tothe angular range that satisfies:
(it+1, mt+1) = arg min
(i,m)
A,Rmi
(t) 1
2
(6)
Doing so, it is guaranteed [7] to sequentially refine the widthof the beamformer around the true incidence angle A.
Next we describe how the posterior bielief around A isupdated once a new measurement is taken with the pair ofbeamformers chosen previously with HiePM. Based on themeasurement model in (2), the posterior update at time instantt + 1 can be expressed using Bayes rule as
A,i (t + 1) / A,i (t) f (yB,t+1|A = A,i) ,
i = 1, . . . , A(7)
where f (yB,t+1|A = A,i) is the likelihood of A frommeasurement yB,t+1. Unfortunately, the likelihood term abovecannot be calculated in closed form due to the unknownchannel gain ↵, noise variance 2
B and incidence angle B .We will show next, how “V-HiePM” is able, using the
variational model based approximate inference frameworkdescribed in [10], to infer all above unknowns and uses them
4Such discretization approaches the original problem of initial access asA ! 0 [7].
5To support this level of resolution, the corresponding number of levelsof the hierarchical beamforming codebook at device A should be : SA =log2 (A).
efficiently to calculate the posterior update needed for HiePM,in a consistent and elegant way6.
B. The V-HiePM Scheme
We explain first the variational model based approximateinference framework used in its most general form, then showhow to apply it to our problem to derive posterior updates forour parameters of interest.
1) Variational Model Comparison based Posterior Up-date: We start by listing the different types of variables thatthe variational model comparison based approximate inferenceframework deals with:
• X is the observed data vector, in our case is yB,1:t+1.• Z = (Z1, Z2, . . . , ZL) denotes the L-dim vector of latent
variables that parameterize the measurement model (2).In our case, Z = (↵,2
B).• m 2 1, 2, . . . , A B denotes the mth pair of
angles (A,im,B,jm
), with im 2 1, . . . , A, andjm 2 1, . . . , B. Choosing a certain label m is equiv-alent to assuming that our measurement model in (2) isparameterized by the the mth pair of angles.
The framework performs joint inference on the hiddenvariables to find a set of distributions q(Z|m), q(m)1:m thatapproximate the true posterior p(Z, m|X), by minimizing theKullback-Leibler (KL) divergence:
KL(q(Z|m)q(m), p(Z, m|X)). (8)HiePM then uses the approximate incidence angle posterior
q(m) to decide which is the best measurement model can-didate fitting the observed data vector X . Algorithm 1 liststhe steps required to perform such operations. We omit themathematical derivation because of space constrains and referto [10, Chapter 10.4] for such details.
Algorithm 1: Variational Model Comparison basedPosterior Update
1 for m = 1 : AB do2 while (No convergence yet) do3 for j = 1, 2, . . . , L do4 q(Zj |m) / Ei 6=j(log(p(X|Z, m)))
5 Lm =R
Zq(Z|m) log(
p(Z,m|X)q(Z|m)
)
6 q(m) / p(m) exp(Lm)
2) Posterior Update for our measurement Model andthe overall V-HiePM Algorithm: From our measurementmodel (2), we have
p(X, Z, m) = p(yB,1:t|↵, B , m)p(↵)p(B)p(m) (9)where B = 2
B is the noise precision atdevice B; p(X|Z, m) = p(yB,1:t|↵, B , m) =Qt+1
i=1 CN(yB,i;p
P↵wHB,iAmfA,i,
2B) is the likelihood
6As it will be detailed below, such an inference framework lends itselfnaturally in the HiePM context: we make the best use of the measurementsby first estimating posteriors over the channel gain and noise varianceand then use those to robustly update the angle of incidence posterior,doing so allows V-HiePM to take the channel’s gain and noise varianceestimation uncertainties properly into account when deriving the posteriorof the incidence angles, thus making a robust HiePM based decision whenchoosing the next precoder/combiner pair to use.
of our measurement model (we assume here that thesequential noise samples are i.i.d); p(↵) = CN(↵;↵0,0)is the prior belief over ↵, considered to be Gaussianwith a known initial mean ↵0 and initial precision 0
7;p(B) = (B ; a0, b0) is the non informative prior beliefover B , with parameters a0 = 0 and b0 = 0; finally,p(m) = 1
ABis the prior belief over m, which is assumed to
be uniform to make it non informative as well8. In addition,Am = aB(B,jm
)aHA(A,im
) is the assumed unfaded channelmatrix under the mth pair of incidence angles.
The obtained approximate posteriors for ↵ and B , up tothe measurement iteration t, can be shown to keep the form oftheir respective priors, but with parameters that depend on themeasurement vector yB,1:t: qt(↵|m) has the form of complexGaussian pdf with mean ↵t,m and precision t,m reading
t,m =at,m
bt,m
tX
d=1
p
PwHB,tAmfA,t
2
+ 0 (10a)
↵t,m =at,m
bt,mt,m
tX
d=1
pPwH
B,tAmfA,t
yB,d +
↵00
t,m
(10b)and qt(B |m)9follows a Gamma pdf with parameters shapeand rate parameters at,m and bt,m given by
at,m = a0 + t, (11a)
bt,m = b02RePt
d=1
pPwH
B,tAmfA,t
yB,d↵
t,m
+
Ptd=1
|yB,d|2 +
1
t,m+ |↵t,m|2
p
PwHB,tAmfA,t
2
(11b)Note that the choice of our prior distributions is not ar-
bitrary, the priors chosen above correspond to the maximumentropy distributions [11] that respect constraints that needto be put on their respective parameters, namely ↵ being acomplex variable having a known initial mean and variance,B being a non negative variable and m being a discretevariable). Such a choice makes our proposal assume the leastinformation about our measurement model’s unknowns.
The posterior of the model, indexed by m, is then updatedfollowing Lines 5 and 6 in Algorithm 1, where Lm reads
Lt,m = log(1
2t,m
) + at,m (1 log(bt,m)) + log((at,m))
b0at,m
bt,m
tX
d=1
|yB,d|2at,m
bt,m t,m |↵t,m|2
!(12)
7The first and second order moments of ↵ are the only assumed knownvalues in our model.
8In the above, CN(·; µ,) denotes the complex Gaussian pdf with meanµ and precision , (·; a, b) denotes the Gamma pdf with shape and rateparameters a and b.
9Note that (10) and (11) can be re-written, after performing some algebra,in a recursive format w.r.t their terms involving summation over measurementsepochs. This results in a significant reduction of the algorithm’s memory andcomputation complexity footprint.
The posteriors over A,imand B,jm
are obtained from theposterior qt(m) as
qA,t(i) =X
m:im=iqt(m), i = 1, . . . , A (13)
qB,t(j) =X
m:jm=jqt(m), j = 1, . . . , B (14)
The posterior probability of the incidence angles A andB to be in a certain range Rn
A,i and RpB,j resp, read as:
qA,t(RnA,i) :=
X
A,i2RnA,i
qA,t(i), (15)
qB,t(RpB,j) :=
X
B,j2RpB,j
qB,t(i), (16)
The vanilla HierPM scheme is then applied separately toqA,t(i) and qB,t(j), to choose the pair of beamformers to usefor the next measurement occasion.
The modes ↵t and Bt of the approximate posteriorsqt(↵|m
t ) and qt(B |mt ), with m
t = arg maxm(qt(m)), canbe seen as approximations of the MMSE estimates of ↵ andB respectively. These estimates are given by:
↵t = ↵t,mt, Bt = at,m
t/bt,m
t. (17)
Algorithm 2 runs all above operations in a loop, until themeasurement budget is exhausted: device B decides whichpair of beamformers devices A and B shall use to take thenext measurement by applying the HiePM scheme separatelyto the current posteriors qA,t and qB,t, it then takes a newmeasurement yB,t+1 with those latter, and finally run varia-tional inference to derive approximate posteriors of B = 1
2B
,↵ as well as of A and B .
IV. NUMERICAL RESULTS
To assess the effectiveness of V-HiePM, we run Monte Carlosimulations on a setup with two hybrid digital-analog beam-forming devices A and B. The channel matrix H 2 CNBNA
reads
H = ↵aB(B)aHA(A) +
LX
l=1
↵laB(B,l)aHA(A,l) (18)
and contains one dominant multipath component and L scat-tered components. All incidence angles are independentlydrawn from a uniform distribution between 0 and . Thechannel gains are independently drawn from a a set of complexGaussian distribution with mean 0 and variances fulfillingVar↵+
Pl Var↵l = 1, so that the average SNR between
the nth element of the array at A and the mth element ofthe array at B equals E|Hnm|2/E|B |2=1/2
B10. In all
simulations below, the two devices are equipped with identicalarrays made of NA = NB = 32 elements, digitally controlledwith NRF
A = NRFB = 8 RF chains. Device A uses a codebook
CA with a depth of SA = log2(A), A = 128. CA is built
10Hnm is the channel coefficient between device B’s nth array elementand device A’s mth array element, and E is the expectation operator.
using the orthogonal matching pursuit as described in [4]. Asimilar codebook, CB , is used for device B.11
We benchmark our algorithm’s beamforming gain after tmeasurements, defined as:
Gvh =wH(B,kt,B
)Hf(A,kt,A)2
(19)with different measurement budget sizes and under differentchannel assumptions (note the the exhaustive search needsNANB = 16384 measurements to settle), against that of thedifferent state of the art schemes listed below:
• Gph of the vanilla HiePM scheme of [7]. Here, such ascheme assumes that most of the energy in the channelis concentrated in the path corresponding to the knowngain ↵ and all other gains ↵l are null, it also assumesthat 2
B is known. In such case, the posterior update isdone, simply using Bayes rule as in equation (21) in [7],on the beam pair corresponding to that main path, andthen HiePM is applied to the marginals over those anglesseparately, similar to what V-HiePM does.
• Gbs of the noisy binary search algorithm of [4], which isachieved by 4 log2(maxNA, NB) = 28 measurements.
As a reference, we consider as well the best achievablebeamforming gain of the codebook, defined as
Gmax = maxw2CB
SB,f2CA
SA
wHHf2 . (20)
11Note that the multi-RF chain setups are used solely to help buildacceptable RF codebooks [4], and are not used for multi-stream MIMOoperations.
Algorithm 2: V-HiePM1 Input : Antenna Array Size NA and NB , The search resolution A and B ,
the codebooks CSAand CSB
, Search time 2 Output : Estimates of A, B , ↵ and B
3 for t = 1, 2, . . . , 1 do4 #HierPM Based BF selection according to Eq.(6)
(fA,t+1, wB,t+1) =
fA
RkA,t+1
A,lA,t+1
, wB
RkB,t+1
B,lB,t+1
5 #Take next measurement6 yB,t+1 =
pPwH
B,t+1HfA,t+1 + wHB,t+1nB,t+1,
7 #Variational Model Comparison Posterior Update8 for m = 1 : AB do9 while (No convergence yet) do
10 update qt+1(↵|m) via (10) then qt+1(B |m) via (11)11 end12 update qt+1(m) via (12)13 end
14 #Angle and Angular Range Posterior Update15 update qA,t+1(i) via (13) and qB,t+1(j) via (14)16 update qA,t+1(Rn
A,i) via (15) and qB,t+1(RnB,j) via (16)
17 #Final Precoder/Combiner Vector designlt+1,A, kt+1,A
= (SA, arg maxk (qA,t+1(k)))
18
lt+1,B , kt+1,B
= (SB , arg maxk (qB,t+1(k)))
19 #Channel’s Gain and Noise Precision MAP Estimates update ↵t+1 andBt+1 via (17)
20 end21 Output : A = A,k,A
, B = B,k,B, ↵ = ↵ , B = B
-10 -5 0 5 10
SNR in dB
-25
-20
-15
-10
-5
0
Be
am
form
ing
Lo
ss
in
dB
Lbs
: = 28
Lvh
: = 28
Lph
: =28
Lvh
: = 56
Lph
: =56
Lvh
: = 128
Lph
: =128
Figure 1: Beamforming loss of the different search schemes in achannel with L = 0 scattered components.
We begin by assuming that only the dominant componentis present (i.e. L = 0). Fig. 1 shows the beamforming lossesof the benchmarked algorithms with respect to the optimumpair of beamformers, defined as Lvh = Gvh/Gmax, Lph =Gph/Gmax, and Lbs = Gbs/Gmax. The results show thesuperiority of our scheme compared to the binary search of[4], and that it achieves similar or even better performancecompared to vanilla HiePM with perfect CSI and operatingSNR knowledge. It can be observed that the vanilla HiePMscheme with perfect channel gain knowledge saturates at highSNR: this is an effect of the algorithm assuming that thecomponent’s incidence angle lies on a discrete grid of values,whereas the actual angles are sampled from a continuousdistribution. Our proposed method is less sensitive to thismodel mismatch, due to the estimation of the channel gainand inverse noise variance: in practice, these estimates partlyaccount for the mismatch in the assumed values of the anglesand provide robustness to the overall procedure.
Next, we explore the robustness of the proposed methodagainst channels containing more than one multipath compo-nent. For this, we consider a channel with L = 3 scatteredcomponents with gains of equal variance, and with the powerratio between the dominant and scattered components beingLOSR = E↵2/(E↵2 +
Pl E↵2
l ). Fig. 2 shows beam-forming gains achieved by our algorithm after 100 measure-ments compared to the maximum gains achievable Gmax.
As it can be observed, the maximum achievable beamform-ing gain decreases as the power is more evenly distributedamong the channel’s components. Although V-HiePM assumesthe existence of a single component, it shows remarkableresilience to the presence of other components. Even whenall components in the model have comparable power, ourproposed method is able to perform within 2 dB of theoptimum for sufficiently high SNR.
V. CONCLUSION AND FUTURE WORK
We proposed in this work a variational Bayesian onlinelearning scheme that enables initial access for hybrid digital-analog enabled devices operating in mmWave wireless chan-nels. When compared to state of the art beam acquisitionschemes, our method shows superiority, in terms of balancing
-10 -5 0 5 10
SNR in dB
20
22
24
26
28
30
Be
am
form
ing
Ga
in i
n d
B
Gvh
: L=3, LOSR
= 0.2
Gmax
: L=3, LOSR
= 0.2
Gvh
: L=3, LOSR
= 0.7
Gmax
: L=3, LOSR
= 0.7
Gvh
: L=0, LOSR
= 1
Gmax
: L=0, LOSR
= 1
Figure 2: V-HiePM performance in channels with different powerratios between the dominant path and L = 3 scattered components.
the beam search time versus achieving higher beamforminggain, in being able to properly do so while accounting foruncertainties on the unknown CSI (gain and noise variance)and in being very resilient to the dominant single path assump-tion. Even though the scheme is derived based on a discretizedmodel of the angles of incidence of the channel’s maincomponent, it showed great robustness against off-grid anglesas well as working with a realistic codebook implementation.Further research will focus on adapting the proposed onlinelearning algorithm to operating in time-varying channels.
REFERENCES
[1] R. W. Heath, N. Gonzalez-Prelcic, S. Rangan, W. Roh, and A. M.Sayeed, “An overview of signal processing techniques for millimeterwave MIMO systems,” IEEE Journal of Selected Topics in SignalProcessing, vol. 10, no. 3, pp. 436–453, Apr. 2016.
[2] S. Hur, S. Baek, B. Kim, Y. Chang, A. F. Molisch, T. S. Rappaport,K. Haneda, and J. Park, “Proposal on millimeter-wave channel modelingfor 5G cellular systems,” IEEE Journal of Selected Topics in SignalProcessing, vol. 10, no. 3, pp. 454–469, Apr. 2016.
[3] O. E. Ayach, S. Rajagopal, S. Abu-Surra, Z. Pi, and R. W. Heath,“Spatially sparse precoding in millimeter wave MIMO systems,” IEEETransactions on Wireless Communications, vol. 13, no. 3, pp. 1499–1513, Mar. 2014.
[4] A. Alkhateeb, O. E. Ayach, G. Leus, and R. W. Heath, “Channelestimation and hybrid precoding for millimeter wave cellular systems,”IEEE Journal of Selected Topics in Signal Processing, vol. 8, no. 5, pp.831–846, Oct. 2014.
[5] N. Akdim, C. N. Manchon, M. Benjillali, and E. de Carvalho, “Pingpong beam training for multi stream MIMO communications with hybridantenna arrays,” in 2018 IEEE Globecom Workshops (GC Wkshps), Dec.2018, pp. 1–7.
[6] Y. Ding, S. Chiu, and B. D. Rao, “Bayesian channel estimation algo-rithms for massive MIMO systems with hybrid analog-digital processingand low-resolution ADCs,” IEEE Journal of Selected Topics in SignalProcessing, vol. 12, no. 3, pp. 499–513, Jun. 2018.
[7] S. Chiu, N. Ronquillo, and T. Javidi, “Active learning and CSI acquisi-tion for mmwave initial alignment,” IEEE Journal on Selected Areas inCommunications, pp. 1–1, 2019.
[8] O. Shayevitz and M. Feder, “Optimal feedback communication viaposterior matching,” IEEE Transactions on Information Theory, vol. 57,no. 3, pp. 1186–1222, Mar. 2011.
[9] N. Ronquillo, S. Chiu, and T. Javidi, “Sequential learning of CSIfor mmwave initial alignment,” CoRR, vol. abs/1912.12738, 2019.[Online]. Available: http://arxiv.org/abs/1912.12738
[10] C. M. Bishop, Pattern Recognition and Machine Learning (InformationScience and Statistics). Berlin, Heidelberg: Springer-Verlag, 2006.
[11] M. U. Thomas, “A generalized maximum entropy principle,” OperationsResearch, vol. 27, no. 6, pp. 1188–1196, 1979.
Appendix B
Ping Pong Beam Training for Multi
Stream MIMO Communications with
Hybrid Antenna Arrays
89
Ping Pong Beam Training forMulti Stream MIMO Communications
with Hybrid Antenna ArraysNabil Akdim1, Carles Navarro Manchon2, Mustapha Benjillali3, and Elisabeth de Carvalho2
1 Intel Deutschland, Munich, Germany2 Department of Electronic Systems, Aalborg University, Denmark
3 Communication Systems Department, INPT, Rabat, MoroccoEmails: [email protected], [email protected], [email protected], [email protected]
Abstract—We propose an iterative training procedure that ap-proximates multi-stream MIMO eigenmode transmission betweentwo transceivers equipped with hybrid digital analog antennaarrays. The procedure is based on a series of alternate (pingpong) transmissions between the two devices in order to exploitthe reciprocity of the wireless channel. During the ping pongiterations, the update of the devices’ digital precoders/combinersis performed based on a QR decomposition of the receivedsignal matrix. Concurrently, their analog precoders/combinersare progressively updated by a novel “multi-beam split anddrop with backtracking” mechanism that tracks the channel’smain spatial components. As shown throughout the paper, theproposed algorithm converges with only few iterations, hasminimal computational complexity, and performs very closelyto optimal singular value decomposition based precoding withsufficiently large signal-to-noise ratio.
I. INTRODUCTION
Upcoming wireless communication networks are expectedto provide service to an unprecendently large number of wire-less devices with peak data rates in the order of tens of Gbps.The congestion and fragmentation of the traditional spectralbands below 6GHz has pushed wireless service providers toexplore vacant spectrum at the millimeter-wave (mmWave)frequency bands (30–300 GHz) in order to fulfill that goal [1].Nevertheless, the poor reflectivity and high absorption and freespace propagation losses make communicating wirelessly oversuch high frequencies a challenging task [2]. Fortunately, thisfrequency range will allow for the use of compact and smallantenna arrays with high number of elements, as the physicalsize of the array is proportional to the carrier wavelength. Thelarge beamforming gains that such large-scale arrays enablewill be used to compensate for the above limitations.
However, the high cost, power consumption and complexityof the mixed signal hardware at mm-wave make having largeantenna arrays with digitally controlled elements infeasible [3].This has motivated the wireless communication research com-munity to look at the hybrid digital-analog antenna arrayarchitectures [4]. In such architectures, the large antenna arrayis steered using analog phase shifters and only a few digitallymodulated radio-frequency (RF) chains. An illustration of suchan architecture is shown in Fig. 1.
In addition to their cost and implementation advantages,hybrid array structures entail their own challenges: the lowSNR resulting from high propagation losses, the large dimen-sionality of the MIMO channel matrix and the presence ofanalog processing complicate the acquisition of the channelstate information (CSI) and the computation of the MIMOprecoders and combiners [1], [3]. Luckily, channel measure-ment campaigns [2] have shown that mm-wave channels aresparse in the angular domain, which enables the proposalof CSI acquisition and precoding/combining algorithms thatexploit such property. An example of these are compressedsensing based approaches such as [3], [5], [6], which aregenerally computationally complex and require a large amountof channel measurements. An alternative are exhaustive andhierarchical beam-search techniques, which may entail signif-icant latency and probability of miss detection [7].
In this work, we focus on a beam training strategy basedon alternating transmissions between two transceivers, whichhas been coined ping pong beam training (PPBT). The mainidea behind PPBT is to exploit the reciprocity of the MIMOchannel. With appropriate processing at each device, the alter-nate transmissions implicitly implement an algebraic poweriteration that leads to approximating the top left and rightsingular vectors of the MIMO channel matrix. This idea wasfirst applied in the digital arrays context for single streamwireless communications in [8], [9], and was extended tomulti stream setups in [10], to large antenna array andfrequency selective systems in [11] and to noisy MIMOchannels in [12]. More recently, similar approaches have beenproposed in the context of mmWave communications withhybrid digital-analog antenna arrays, which we review next.In [13], the basic ping pong beam training method for single-stream MIMO transmission was adapted to the hybrid arrayarchitecture with the inclusion of a “beam split-and-drop”procedure for the setting of analog precoders. The subspaceestimation and decomposition method in [14] proposes a pingpong based algorithm that iteratively estimates the channel’sright and left eigenvectors using a Krylov subspace estimationmethod. This algorithm is based on exhaustive measurementswith a large set of different analog precoders, which are then
Baseband
Precoder
Baseband
Combiner
RF Chain
RF Chain
RF Chain
RF Chain
RF Precoder RF Combiner
NA NB
NA
RF
NB
RF
H
WA
FATRANSCEIVER A TRANSCEIVER B
FB
WB
YB[1]
YB[ ]NB
RF
Fig. 1: Structure of the transceivers
linearly combined in order to cancel the effect of the analogprecoders. It therefore requires significant amount of trans-missions, which imply large signalling overhead and latency.Lastly, the power iteration based training method introducedin [15] is a technique that extends the solution proposedin [10] to the multi stream case, where the digital precodersare set based on an algebraic power iteration technique, whilethe analog precoders update is done based on a compressedsensing technique called simultaneous orthogonal matchingpursuit [16].
Compared to the above approaches, we propose in thisarticle a strategy that sets the digital and analog precodersof the devices in a way to approximate the top NS left andright singular vectors of the channel matrix, with NS beingthe desired number of spatial streams. Our new technique,which we dub “hybrid ping pong multi beam training“(Hybrid PPMBT) extends the work done in [13] to the multistream case. It adapts the PPBT strategy to hybrid arrays byprogressively choosing the analog precoders at each devicefrom a predefined hierarchical codebook. After one round-triptransmission, a novel ”multi beam split and drop strategy withbacktracking” is applied to focus the analog precoders towardsthe spatial directions that are most likely containing thechannel’s top NS multipath components. The digital precodersare updated via an orthogonal decomposition operation on thereceived signal as described in [10]. In comparison to theapproaches in [14] and [15], Hybrid PPMBT is much simplerfrom a computational complexity aspect and has a low trainingoverhead as it requires significantly fewer transmissions. Sim-ulation results show that our proposed scheme performs verywell in retrieving the wanted NS channel’s top eigenmodesfor sufficiently large signal-to-noise ratio, both in terms ofaccuracy and convergence speed.
II. SYSTEM MODEL
We consider a system in which two hybrid analog digitaltransceivers A and B, equipped with uniform linear arrays(ULA) composed of NA and NB antenna elements. Suchelements are separated with a distance d = /2, where isthe wavelength of interest. The two devices control digitallytheir arrays with NRF
A and NRFB RF chains respectively and
exchange data over a reciprocal wireless MIMO channel usingNS parallel data streams. The channel from device A to
device B is considered to be static and narrowband and ismodeled according to the finite scatterer channel model withL propagation paths [13], [17], as
H =
rNANB
L
LX
l=1
↵laB(B,l)aHA(A,l), (1)
here, H 2 CNBNA , L is the number of multipath com-ponents (MPC), ↵l is the complex fading channel gain forMPC l, A,l = 2
d cosA,l and B,l = 2 d cosB,l
are the directional cosines corresponding to the lth MPCat arrays A and B respectively, where A,l, B,l are theangles of incidence of that same path, and aA and aB
are the array response vectors at device A and B respec-tively. The ↵l are modeled as independent, standard com-plex gaussian variables, the A,l and B,l as uniformly dis-tributed in the range [0, 2) radians and the array responsesas aA(A,l) = [1, ejA,l , . . . , ej(NA1)A,l ]T/
pNA and
aB(B,l) = [1, ejB,l , . . . , ej(NB1)B,l ]T/p
NB .In order to establish the wireless link with device B, device
A (we assume, without loss of generality, that device A isperforming the first transmission) transmits T , an NS NS
orthogonal training sequence i.e TT H = INS. Upon reception,
device B cancels the training sequence effect by multiplyingits received digital signal by T H . The resulting signal can beexpressed as:
YB = F HBHFAWA + F H
BNB , (2)where FA 2 CNANRF
A and FB 2 CNBNRFB contain the
states of the analog precoder and combiner of transceiversA and B, WA 2 CNRF
A NS denotes the digital precoder oftransceiver A and NB 2 CNBNS is a complex, circularly-symmetric additive white gaussian noise matrix, obtained aftertraining sequence removal and with i.i.d elements, each withvariance 2. Transmissions from device B to device A aremodeled analogously as
YA = F HAHHFBWB + F H
ANA, (3)where WB 2 CNRF
B NS and NA 2 CNANS are definedsimilar to the above.
III. HYBRID PING PONG MULTI BEAM TRAINING :HYBRID PPMBT
Given the signal model in (2) and (3), the beamforming taskconsists of selecting the set of analog and digital precoders andcombiners that maximize the spectral efficiency over a givenchannel matrix H . For transmission from device A to B, andassuming unit transmit power equally allocated across the NS
streams, the spectral efficiency reads
R = log2 detINS
+R1
NB
NSHeH
He
, (4)
where RNB= 2W H
BF HBFBWB is the noise covari-
ance matrix after receive combining at device B, He =W H
BF HBHFAWA is the equivalent channel after precoding
and combining at both devices. An analogous expressionapplies for transmission from device B to device A.
The optimal precoders maximizing (4) are known to be theNS top right and left singular vectors of H . However, thehybrid structure of the antenna array makes the computation
of such precoders challenging. On the one hand, as digitalmeasurements of the channel are only obtained after analogprecoding and combining, estimating the full channel matrixH in order to obtain its singular value decomposition requiresa large number of measurements and hence large overheadand latency [14]. On the other hand, even if the channelmatrix H can be estimated, the precoders have to be builtas the product of the analog precoding matrix FA and thedigital precoding matrix WA. While the elements of WA cantake any complex value due to its digital implementation, theoperation modeled by FA is implemented via phase shiftersand combiners, which restricts the values it can take. Inthis work, we restrict the entries of FA to satisfy |(FA)l,i|12 1
M(i)A
; 0, where M(i)A being the number of activated
array elements in the ith column of FA, and the option(FA)l,i = 0 accounts for the option of leaving some elementsof the array unused. In addition, a transmit power constraint isenforced such that kFAWAkF = 1.2 With these constraints,finding the combination of digital and analog precoders tobest approximate the channel’s singular vectors becomes acomputationally intensive optimization problem [5].
To overcome such difficulties, we propose an iterative multibeam training scheme based on alternate transmissions be-tween the two devices, this procedure estimates progressivelyand simultaneously the top NS right and left singular vectorsof H and sets the digital and analog precoders so that theyapproach those singular vectors. It consists of two parts: 1) a”backtracking beam split and drop” approach to select theanalog precoders FA and FB from a predefined multi levelcodebook, 2) a method to select the digital precoders WA andWB inspired by the QR decomposition algorithm describedin [10].
We will proceed by reviewing the beam training procedurefor digital antenna arrays proposed in [10], then briefly presentthe multi level codebook that is used for the analog precoderupdate, and finally explain our multi beam training solution.
A. Ping-Pong Multi Beam Training with Digital AntennaArrays: Digital PPMBT
We review the digital PPBT algorithm over a narrowbandreciprocal channel H as described in [10]. We consider twodevices A and B equipped with digitally controlled antennaarrays with NA and NB elements respectively. At the initial(0th) iteration, the process starts with a random initializationof the precoder at device A, W
[0]A . A uses then this initial
precoder to transmit a training sequence to B. Upon receptionand training sequence removal, device A gets an estimate ofHW
[0]A , makes a QR-decomposition on it, and uses the Q
part of that decomposition as its precoder W[0]B . It then uses
that precoder to transmit a training sequence back to device A,who will repeat the same operations. This process is reiterateduntil convergence, at which, device A gets an estimate of the
1(FA)l,i is the entry of the matrix FA belonging to its lth row and ithcolumn.
2Obviously, the same constraints apply to the analog precoder of device B.
top NS right singular vectors of H and device B gets anestimate of the top NS right singular vectors of HH. Furtherdetails on this procedure can be found in [10].
Algorithm 1 Ping Pong Multi Beam Training with HybridArrays
1: Initialize:
F[0]A
'
(1)A,0,'
(1)A,1, . . . ,'
(1)
A,M(1)A
1
,
F[0]B
'
(1)B,0,'
(1)B,1, . . . ,'
(1)
B,M(1)B
1
,
Initialize W[0]A to an orthogonal matrix of its size.
for s = 1 : NS do
W[0]A (:, s) W
[0]A
(:,s)pNS
F[0]A
W[0]A
(:,s)2
end forpA, kA, iA [], [], [], pB , kB , iB [], [], [].
2: A transmits, B receives: Y[0]
B =(F[0]B )HHF
[0]A W
[0]A +(F
[0]B )HN
[0]B
3: [Q, R] qr(Y[0]
B )4: for s = 1 : NS do5: W
[0]B (:, s) Q(:, s)
6: W[0]B (:, s) W
[0]B
(:,s)pNS
F[0]B
W[0]B
(:,s)
27: end for8: t 19: loop
10: B transmits,11: A receives Y
[t]A =(F
[t1]A )HHHF
[t1]B W
[t1]B +(F
[t1]A )HN
[t]A
12: [Q, R] qr(Y[t]
A )13: for s = 1 : NS do14: W
[t]A (:, s) Q(:, s)
15: W[t]A (:, s) W
[t]A
(:,s)pNS
F[t1]A
W[t]A
(:,s)
216: end for17: [F
[t]B , pB , kB , iB]
UPD.AN.PR(F[t1]B , W
[t1]B , CB , pB , kB , iB)
18: A transmits,19: B receives: Y
[t]B =(F
[t]B )HHF
[t1]A W
[t]A +(F
[t]B )HN
[t]B
20: [Q, R] qr(Y[t]
B )21: for s = 1 : NS do22: W
[t]B (:, s) Q(:, s)
23: W[t]B (:, s) W
[t]B
(:,s)pNS
F[t]B
W[t]B
(:,s)
224: end for25: [F
[t]A , pA, kA, iA]
UPD.AN.PR(F[t1]A , W
[t]A , CA, pA, kA, iA)
26: t t + 127: end loop
B. Analog Precoder Multi Level Codebook
We illustrate here the codebook definition for transceiverA (an analogous codebook is used for the transceiverB). We consider a codebook CA which is composedof LA= log2 (NA/NRF
A )+13 levels. For the kth level,we define a subcodebook C(k)
A ='(k)A,i, i=0, 1, . . . , M
(k)A -1
consisting of M(k)A =NRF
A 2k-1 column vectors, k=1, 2, . . . , LA.Each of the elements of the subcodebook is defined as
3For the considered codebook design we constrain NA and NRFA to be
both integer powers of two.
'(k)A,i=
1, e-j (k)
A,i , . . . , e-j(M(k)A -1) (k)
A,i ,0TNA-M(k)
A
T
/
qM
(k)A ,
where (k)A,i=-(2i+1)/M
(k)A is the directional cosine of the
ith vector at the kth level ('(k)A,i steers the array in the direction
(k)A,i= arccos
(k)A,i/, with a lobe whose width decreases with
the codebook level k), and 0N is the N -dimensional columnzero vector. Further details about the codebook used here canbe found in [13].
C. Ping Pong Multi Beam Training with Hybrid AntennaArrays : Hybrid PPMBT
The proposed algorithm for beam training with hybridarrays is described in pseudocode. Algorithm 1 presents theoverall training scheme, while Algorithm 2 describes thesubroutine used to update the analog precoding matrices.
1) Initialization: First, the digital and analog precodersare initialized. FA and FB are initialized to C(1)
A and C(1)B
respectively, while WA is initialized to a random unitarymatrix. The columns of WA are then normalized to fulfill thetransmit power constraint of the effective precoder FAWA.Finally, a set of empty arrays, pB , kB , iB, are created.Those arrays will store the needed information to perform theanalog precoder updates, as explained below.
2) Ping Pong Iterations: After the initialization phase,a sequence of alternate pilot transmissions starts between thetwo devices. The baseband precoders are updated after eachreception step by means of a QR-decomposition, followedby a normalization step. These two operations are detailedin lines 3-7, 12-16 and 20-24 in Algorithm. 1. Immediatelyafter updating their baseband precoders upon reception ofa transmission, the devices transmit back with the updateddigital precoders and the same analog precoder as used forreception. Only after the transmission has been made willthe transmitting device update its analog precoders (usingAlgorithm 2), such that next reception is done with the updatedsetting. This allows for the QR based iteration to converge,as each reception-transmission cycle is performed over a staticsetting of the analog precoders.
3) Update of Analog Precoders: The devices invokethe routine outlined in Algorithm 2 to update their analogprecoder state. This routine bases its update on the currentstate of the device’s RF precoder F , its codebook C and on theupdate history of its baseband precoder W . Using all previousupdates of W allows for backtracking–i.e, correcting forwrong RF precoder updates. The routine works as follows:
a) Three sequences of values are generated: kn stores thelevel of the codebook of the nth column of F , pn storesthe squared norm of the nth row of W , i.e the aggregateenergy received on it, and in stores the index of the nthcolumn of F , out of the level of the codebook to whichthat column belongs.
b) The sequences generated above will be used to updatethree vectors: kn will be appended to k, with k beingan array storing the codebook levels of the columns ofF used over consecutive ping-pong iterations. pn will
be appended to p, with p being an array storing thereceived energy over the different spatial directions setby the analog beamformer F . in will be appended to iin a similar manner to the above.
c) Once p is updated, it will be sorted in a descendingmanner and the resulting sorted indices will be storedin pI . This newly formed array will be used to findthe entries of p that are most likely to direct the analogprecoders where the MPCs of H are.
d) Two new vectors are built: bI contains the K first indexesof pI i.e it identifies the beams that are most alignedwith the channel’s MPCs (K is the length of bI , whichcan be derived from lines 9-13). bL is a vector that ismade of 1’s and 2’s. The ith entry of bL is set to 2when the precoder corresponding to the ith entry of bI
is replaced with the two precoders belonging to one stephigher level of the codebook and that have their beamscovering together its same spatial area, otherwise it isset to 1. Deciding to append 1 or 2 to bL depends onwhether we already consumed all columns of F and onwhether the element of pI in question belong to the lastlevel of the codebook or not (see lines 17-21). Line 8 ofthe algorithm erases the measurement stored over a beamthat is selected to be included in the analog precodingmatrix, either directly or after splitting it into two beamsof the immediately higher level. As new measurementswill be obtained over that beam in the next iteration, theold measurement is deleted to avoid unnecessarily comingback to the previous configuration corresponding to that
Algorithm 2 Analog Precoder Update
1: function UPD.AN.PR(F , W , C, p,k,i)2: Generate : kn, in, pn, n=1, . . . , NRF , where kn will store
the codebook level to which the F0s nth column belongs, in
will store its index out of that level and pn will store the normof the nth row of W , i.e the aggregate energy received on it.
3: p [p, p1, . . . , pNRF ], k [k, k1, . . . , kNRF ], i [i, i1, . . . , iNRF ]
4: Sort p in a descending manner and store the result in pS ,then store the arrangement of the elements of p into pS in pI .
5: bI [], bL []6: while n NRF do7: bI [bI , pI,n], pI,n is the nth element of pI ,8: ppI,n 0, ppI,n is the pI,nth element of p,9: if kpI,n = log2(
NNRF )+1 or n=NRF -1 then
10: n n + 1, bL [bL, 1]11: else12: n n + 2, bL [bL, 2]13: end if14: end while15: for t = 1 : Length(bI) do16: m bL,t
17: if m = 2 then18: F [F ,'
(km+1)2im
,'(km+1)2im+1 ]
19: else20: F [F ,'
(km)im
]21: end if22: end for23: return F ; p, k, i24: end function
old measurement.e) Finally, the RF precoder columns are updated (lines
15-22).We dub the above procedure for updating the analog precoders“beam split and drop with backtracking”, owing to the way itoperates. At each iteration, a decision is made as to whethera given beam is split –i.e replaced by two more directivebeams– or dropped –i.e. removed from the precoding matrix.The backtracking feature refers to the fact that, via the vectorp, the measurements obtained in prior ping-pong iterations arekept in memory, allowing for returning to lower level beamsin the codebook in the cases where noise leads to erroneousdecisions in the the split-and-drop procedure.
IV. NUMERICAL RESULTS
In order to assess the effectiveness of the proposed al-gorithm, we perform Monte Carlo simulations for multipleconfigurations of the hybrid arrays at devices A and B. Thechannel matrix H follows the model in (1). The average SNRfor the sth stream link between the nth element of the array atdevice B and the mth element of the array at A is defined as=E|Hnm|2/E|Nns|2=1/2, where Nns is the nth entryof the sth column of the noise matrix N . We assume here forsimplicity that the SNR is the same for all streams. Hnm isthe channel coefficient between device B’s nth array elementand device A’s mth array element, and E is the expectationoperator.
We measure the performance with the av-erage spectral efficiency per bits/s/Hz, ex-
pressed with log2
INS+R1
N
NSHeH
He
, where
He=(F[n1]B W
[n1]B )HHF
[n1]A W
[n]A and
RN =2(F[n1]B W
[n1]B )HF
[n1]B W
[n1]B for integer it-
eration n, and He=(F[n]B W
[n]B )HHF
[n1]A W
[n]A and
RN =2(F[n]B W
[n]B )HF
[n]B W
[n]B for half iteration n+0.5.
We benchmark the performance of the hybrid PPMBTagainst the performance of the digital ping pong multi streambeam training method (digital PPMBT) described in [10], forwhich the spectral efficiency is calculated in a similar mannerto that of the hybrid PPMBT. We compare it also with theoptimal unconstrained SVD based precoder, which maximizethe spectral efficiency and which is obtained by using the topNS left and right singular vectors of H . Note that the SVDbased precoder derivation requires full channel knowledge atboth devices, an information that is hard to get when hybridarchitectures are used as explained earlier, and which ourproposed scheme does not need at the start, but rather learnswhile setting the precoders.
We start our evaluation in the high SNR regime (=30dB),in which the training process will not be impaired too much bynoise. This allows assessing how the arrays size, the numberRF chains and the number of spatial streams really affect thealgorithm’s performance.
Fig. 2 depicts the algorithm’s performance over a channelwith 8 multipath components, when the two devices are
0 2 4 6 8 10 12 14 16 18 20 22
Ping Pong Iterations
30
40
50
60
70
80
90
100
Sp
ec
tra
l E
ffic
ien
cy
Optimal Precoding (SVD)
Digital PPMBT
Hybrid PPMBT
NA
=NB
=1024
NA
=NB
=128
NA
=NB
=32
Fig. 2: Spectral Efficiency (bits/s/Hz) attained by the algorithmover PP Iterations. Devices A and B are equipped with ahybrid array. NRF
A =NRFB = 8, NS=4, L=8, =30dB.
equipped with identical arrays made of NA=NB=32, 128 or1024 elements and a fixed number of RF chains NRF
A =NRFB =8
and try to establish an NS=4 streams MIMO communication.We see that the algorithm reaches, in very few iterations, forsmall, medium and large sized arrays, to about 1 to 2 bits/s/Hzof the digital PPMBT and SVD based precoding schemesperformances. We also see that the convergence speed (interms of PP iterations) scales inversely to the codebook depthfor each of the topologies: the convergence is slower whenlarge arrays are used because, in such cases, the codebook hasmore levels and its few last levels contain a high number ofbeamformers with very narrow beams. The selection of suchdirective beams is more prone to error than those in codebookswith lower resolution. Although the backtracking mechanismcan correct for the errors made, this comes at the expense of alarger number of iterations needed for convergence. Note thatthe number of PP iterations needed for convergence for alltopologies is still much below what is needed for exhaustivesearch: this latter requires as an example, for NA=NB=1024with NRF
A =NRFB =8, NA/NRF
A =NB/NRFB =128 PP iterations
to find the best beams, compared with only 16 for ouralgorithm.
Fig. 3 shows the algorithm’s performance over the samechannel as the one used in Fig. 2, but with the number ofMIMO streams, NS , taking different values (4, 6 and 8) andthe two devices being equipped with identical arrays made ofNA=NB=128 elements and use 8 RF chains each. The purposeof these simulations is to investigate how does the ratio ofthe number of MIMO streams to the number of RF chainsNS/NRF affect the algorithm’s performance. We can clearlysee that the algorithm behaves in three different ways depend-ing on the aforementioned ratio: 1) When NS NRF /2, theconvergence is very quick and no backtracking is performed.2) When NRF /2 < NS < NRF , the algorithm’s convergenceis slowed down and one observes some irregularity of theconvergence behavior over iterations which is due to the
0 5 10 15 20 25 30 35 40
Ping Pong Iterations
0
10
20
30
40
50
60
70
80
90
100
110
Sp
ectr
al E
ffic
ien
cy
Optimal Precoding (SVD)
Digital PPMBT
Hybrid PPMBT
NS=4
NS=6
NS=8
Fig. 3: Spectral Efficiency (bits/s/Hz) attained by the al-gorithm over PP Iterations for different NS values. De-vices A and B are equipped with a hybrid array.NA=NB=128, NRF
A =NRFB =8, L=8, =30dB.
backtracking mechanism. 3) When NRF = NS , the algorithmfails to provide acceptable performance.
Next, in Fig. 4, we fix the size of antenna arrays andnumber of RF chains used at both devices to NA=NB=64and NRF
A =NRFB =8, and evaluate the algorithm’s performance
over a channel with L = 7 multipath components at differentSNR values. In Fig. 4a, we show the algorithm’s performanceagainst SNR after training convergence for different NS val-ues. The results show that the number of streams that thetraining algorithm can handle efficiently grows with SNR,as is to be expected. At low SNR regimes (15 to 0 dB)the algorithm works better when it attempts to estimate onlythe dominant singular vector of the channel; if more singularvectors are estimated, the large estimation error degrades theoverall spectral efficiency of the system. As the SNR grows,an increasing number of singular vectors can be accuratelyestimated by the algorithm and, hence, increasing the numberof streams provides significant spectral efficiency gains. Forall cases, we observe that the performance of the HybridPPMBT is very close to that of the fully-digital counterpart,and approaches the SVD precoding performance as the SNRgrows. In Fig. 4b the convergence behavior of the trainingalgorithm with different NS settings is evaluated at their re-spective SNR values of interest. We observe that the method’sspectral efficiency tends to saturate at around 10 ping pongiterations for moderate and high SNR. Convergence for lowerSNR values is, however, slower. We attribute this effect tothe fact that the large noise power induces numerous incorrectbeam selection errors in the updates of the analog precoder;although the backtracking feature of the algorithm can helpcorrect some of them, the price to pay is a longer trainingtime. In any case, we remark that the algorithm reaches about70% of the spectral efficiency obtained at convergence withinthe first 6 iterations, regardless of the SNR value and numberof spatial streams.
-20 -15 -10 -5 0 5 10 15
SNR(dB)
0
5
10
15
20
25
30
35
40
45
50
Sp
ectr
al E
ffic
ien
cy
Optimal Precoding (SVD)
Digital PPMBT
Hybrid PPMBT
NS=2
NS=1
NS=4
(a)
0 5 10 15 20 25
Ping Pong Iterations
0
5
10
15
20
25
30
35
40
45
50
Sp
etc
ral E
ffic
ien
cy
Optimal Precoding (SVD)
Digital PPMBT
Hybrid PPMBT
!=0dB, NS=2
!=15dB, NS=4
!=-15dB, NS=1
(b)Fig. 4: Spectral Efficiency (bits/s/Hz) attained by the al-gorithm over PP Iterations for different NS values. De-vices A and B are equipped with a hybrid array.NA=NB=64, NRF
A =NRFB =8, L=7. (a):Spectral Efficiency over
SNR, (b): Spectral Efficiency over PP Iterations
To conclude, we evaluate our proposed training procedure ina system in which only one of the devices is equipped with alarge, hybrid array (NA=256, NRF
A =16), while the other hasa digitally-controlled array of moderate size (NB=4). Suchtopologies can be seen as massive MIMO systems operatingat microwave frequencies. In addition, to reflect the richerscattering experienced in such frequency bands [18], we adoptthe following channel model:
H=p
NANB
LABAH
A (5)where AA contains in its columns the steering vectorsaA(A,p), p=1, 2, . . . , P , AB is defined analogously, L isagain the number of multipath components, and is a LLmatrix with i.i.d. standard complex Gaussian entries. Thissetting will allow to test the validity of our approach inchannels with richer scattering and its robustness against thesparse assumption of the channel.
0 2 4 6 8 10 12 14 16
Ping Pong Iterations
0
10
20
30
40
50
60
70
80
Sp
ectr
al E
ffic
ien
cy
Optimal Precoding (SVD)
Digital PPMBT
Hybrid PPMBT
!=-15dB
!=30dB
!=0dB
Fig. 5: Spectral Efficiency (bits/s/Hz) attained by the al-gorithm over PP Iterations. Device A is equipped with ahybrid array and device B has a full digital architecture.NA=128, NB=4, NRF
A =16, NS=4, L=40.
Fig. 5 shows the algorithm’s performance over a rich scat-tering channel (L=40) when NA=128, NRF
A =16, NB=NS=4.It can be seen that the proposed scheme performs remarkablywell and very close to the full digital and optimal SVD basedprecoder solutions at low, mid and high SNRs. These resultsshow that, although the algorithm was originally designed toexploit the sparse nature of mmWave channels, it is robust tochannels with richer scattering.
V. CONCLUSION
We proposed a method to derive precoders and combin-ers for multi-stream MIMO transmission between two de-vices equipped with hybrid digital-analog antenna arrays.The method relies on a low-complexity “multi-beam splitand drop with backtracking” procedure to update the analogprecoders, while digital precoders are computed with the QR-decomposition based method in [10]. For sufficiently largeSNR, the resulting precoders approximate well the uncon-strained SVD-based precoders, as our numerical assessmentshows. We envision that the proposed algorithm can be espe-cially useful in mmWave communication systems.
Compared to the state-of-art methods, our approach offersthe advantage of computational simplicity while achievinghigh-spectral efficiency with moderate training overhead. Thenumerical results show that the method achieves conver-gence within NRF (log2(N/NRF )+1) ping pong iterationsin the low SNR regime and log2(N/NRF )+1 iterations inthe mid and high SNR regime, assuming both transceiversare equipped with arrays made of N elements and NRF
RF chains. Although the method was developed with sparsechannels in mind, the performance assessment shows that it isrobust against this assumption and also performs well in richscattering channels.
Also, in order to further reduce the training overhead, theproposed scheme can be interleaved with transmission of
payload with increasing data-rate. This, the extension to multi-user environments and to time varying channels will be thesubject of our future work.
REFERENCES
[1] T. S. Rappaport, S. Sun, R. Mayzus, H. Zhao, Y. Azar, K. Wang, G. N.Wong, J. K. Schulz, M. Samimi, and F. Gutierrez, “Millimeter wavemobile communications for 5G cellular: It will work!” IEEE Access,vol. 1, pp. 335–349, 2013.
[2] S. Hur, S. Baek, B. Kim, Y. Chang, A. F. Molisch, T. S. Rappaport,K. Haneda, and J. Park, “Proposal on millimeter-wave channel modelingfor 5G cellular systems,” IEEE Journal of Selected Topics in SignalProcessing, vol. 10, no. 3, pp. 454–469, Apr. 2016.
[3] R. W. Heath, N. Gonzalez-Prelcic, S. Rangan, W. Roh, and A. M.Sayeed, “An overview of signal processing techniques for millimeterwave MIMO systems,” IEEE Journal of Selected Topics in SignalProcessing, vol. 10, no. 3, pp. 436–453, Apr. 2016.
[4] X. Zhang, A. F. Molisch, and S.-Y. Kung, “Variable-phase-shift-basedRF-baseband codesign for MIMO antenna selection,” IEEE Transactionson Signal Processing, vol. 53, no. 11, pp. 4091–4103, Nov. 2005.
[5] O. E. Ayach, S. Rajagopal, S. Abu-Surra, Z. Pi, and R. W. Heath,“Spatially sparse precoding in millimeter wave MIMO systems,” IEEETransactions on Wireless Communications, vol. 13, no. 3, pp. 1499–1513, Mar. 2014.
[6] A. Alkhateeb, O. E. Ayach, G. Leus, and R. W. Heath, “Channelestimation and hybrid precoding for millimeter wave cellular systems,”IEEE Journal of Selected Topics in Signal Processing, vol. 8, no. 5, pp.831–846, Oct. 2014.
[7] M. Giordani, M. Mezzavilla, and M. Zorzi, “Initial access in 5Gmmwave cellular networks,” IEEE Communications Magazine, vol. 54,no. 11, pp. 40–47, November 2016.
[8] J. B. Andersen, “Intelligent antennas in a scattering environment – anoverview,” in IEEE Global Communications Conference (GLOBECOM),Sydney, Australia, Nov. 1998, pp. 3199–3203 (vol. 6).
[9] ——, “Array gain and capacity for known random channels withmultiple element arrays at both ends,” IEEE Journal on Selected Areasin Communications, vol. 18, no. 11, pp. 2172–2178, Nov. 2000.
[10] T. Dahl, N. Christophersen, and D. Gesbert, “Blind MIMO eigenmodetransmission based on the algebraic power method,” IEEE Transactionson Signal Processing, vol. 52, no. 9, pp. 2424–2431, Sep. 2004.
[11] E. de Carvalho and J. B. Andersen, “Ping-pong beam training forreciprocal channels with delay spread,” in 49th Asilomar Conference onSignals, Systems and Computers, Pacific Grove, CA, USA, Nov. 2015,pp. 1752–1756.
[12] D. Ogbe, D. J. Love, and V. Raghavan, “Noisy beam alignment tech-niques for reciprocal MIMO channels,” IEEE Transactions on SignalProcessing, vol. 65, no. 19, pp. 5092–5107, Oct. 2017.
[13] C. N. Manchon, E. de Carvalho, and J. B. Andersen, “Ping-pongbeam training with hybrid digital-analog antenna arrays,” in IEEEInternational Conference on Communications (ICC), Paris, France, May2017, pp. 1–7.
[14] H. Ghauch, T. Kim, M. Bengtsson, and M. Skoglund, “Subspace esti-mation and decomposition for large millimeter-wave MIMO systems,”IEEE Journal of Selected Topics in Signal Processing, vol. 10, no. 3,pp. 528–542, Apr. 2016.
[15] X. Cheng, N. Lou, and S. Li, “Spatially sparse beamforming trainingfor millimeter wave MIMO systems,” IEEE Transactions on WirelessCommunications, vol. 16, no. 5, pp. 3385–3400, May 2017.
[16] M. F. Duarte and Y. C. Eldar, “Structured compressed sensing: Fromtheory to applications,” IEEE Transactions on Signal Processing, vol. 59,no. 9, pp. 4053–4085, Sept 2011.
[17] A. G. Burr, “Capacity bounds and estimates for the finite scatterersMIMO wireless channel,” IEEE Journal on Selected Areas in Commu-nications, vol. 21, no. 5, pp. 812–818, Jun. 2003.
[18] J. Andersen, “Propagation aspects of MIMO channel modeling,” inSpace-Time Wireless Systems, H. Bolcskei, C. Papadias, D. Gesbert, andA.-J. van der Veen, Eds. Cambridge University Press, 2006, ch. 1.
Bibliography
[1] ITU. (2015) ITU-M.2370-0 : IMT traffic estimates for the years 2020 to 2030. [Online]. Available:
https://www.itu.int/pub/R-REP-M.2370-2015
[2] FCC. (2015) Cellular spectrum in the US. [Online]. Available: https://www.fcc.gov/auctions
[3] ITU. (2015) M.2083 : IMT vision - "Framework and overall objectives of the future development
of IMT for 2020 and beyond". [Online]. Available: https://www.itu.int/rec/R-REC-M.2083/en
[4] T. S. Rappaport, S. Sun, R. Mayzus, H. Zhao, Y. Azar, K. Wang, G. N. Wong, J. K. Schulz,
M. Samimi, and F. Gutierrez, “Millimeter wave mobile communications for 5G cellular: It will
work!” IEEE Access, vol. 1, pp. 335–349, 2013.
[5] IEEE Standards Association. (2014) Wireless LAN Medium Access Control (MAC) and
Physical Layer (PHY) Specifications Amendment 3: Enhancements for Very High Throughput
in the 60 GHz Band. [Online]. Available: https://standards.ieee.org/standard/8802-11-2012-
Amd3 −−2014.html
[6] D. Emerson, “The work of jagadis chandra bose: 100 years of millimeter-wave research,” IEEE
Transactions on Microwave Theory and Techniques, vol. 45, no. 12, pp. 2267–2273, 1997.
[7] 3GPP. (2019) New Radio (NR) Release 15. [Online]. Available: https://www.3gpp.org/release-15
97
98 BIBLIOGRAPHY
[8] W. Roh, J.-Y. Seol, J. Park, B. Lee, J. Lee, Y. Kim, J. Cho, K. Cheun, and F. Aryanfar, “Millimeter-
wave beamforming as an enabling technology for 5g cellular communications: theoretical feasibility
and prototype results,” IEEE Communications Magazine, vol. 52, no. 2, pp. 106–113, 2014.
[9] T. Bai and R. W. Heath, “Coverage and rate analysis for millimeter-wave cellular networks,” IEEE
Transactions on Wireless Communications, vol. 14, no. 2, pp. 1100–1114, 2015.
[10] Federal Communications Commission. (2015) FCC 15-138 notice of proposed rule making. [Online].
Available: https://apps.fcc.gov/edocspublic/attachmatch/FCC − 15− 138A1.pdf
[11] M. Giordani, M. Polese, A. Roy, D. Castor, and M. Zorzi, “A tutorial on beam management for 3gpp
nr at mmwave frequencies,” IEEE Communications Surveys Tutorials, vol. 21, no. 1, pp. 173–196,
2019.
[12] 3GPP. NR Frequency, ARFCN, and operating bands. [Online]. Available:
http://nrexplained.com/band
[13] Ericsson. (2020) Ericsson researchers top 4.3Gbps downlink on 5G millimeter Wave. [Online].
Available: https://www.ericsson.com/en/news/2020/2/ericsson-achieves-record-5g-mmwave-speed
[14] H. Meinel and J. Dickmann, “Automotive radar: From its origins to future directions, microwave
journal,” MICROWAVE JOURNAL, vol. 56, no. 9, pp. 24–57.
[15] C. Sturm and W. Wiesbeck, “Waveform design and signal processing aspects for fusion of wireless
communications and radar sensing,” Proceedings of the IEEE, vol. 99, no. 7, pp. 1236–1259, 2011.
[16] A. Pyattaev, K. Johnsson, S. Andreev, and Y. Koucheryavy, “Communication challenges in high-
density deployments of wearable wireless devices,” IEEE Wireless Communications, vol. 22, no. 1, pp.
12–18, 2015.
BIBLIOGRAPHY 99
[17] S. Hur, S. Baek, B. Kim, Y. Chang, A. F. Molisch, T. S. Rappaport, K. Haneda, and J. Park,
“Proposal on millimeter-wave channel modeling for 5G cellular systems,” IEEE Journal of Selected
Topics in Signal Processing, vol. 10, no. 3, pp. 454–469, Apr. 2016.
[18] R. W. Heath, N. González-Prelcic, S. Rangan, W. Roh, and A. M. Sayeed, “An overview of signal
processing techniques for millimeter wave MIMO systems,” IEEE Journal of Selected Topics in Signal
Processing, vol. 10, no. 3, pp. 436–453, Apr. 2016.
[19] T. Rappaport, Wireless communications: Principles and practice, 2nd ed., ser. Prentice Hall commu-
nications engineering and emerging technologies series. Prentice Hall, 2002, includes bibliographical
references and index.
[20] N. Akdim, C. N. Manchón, M. Benjillali, and P. Duhamel, “Variational hierarchical posterior matching
for mmwave wireless channels online learning,” in 2020 IEEE 21st International Workshop on Signal
Processing Advances in Wireless Communications (SPAWC), 2020, pp. 1–5.
[21] N. Akdim, C. N. Manchón, M. Benjillali, and E. de Carvalho, “Ping pong beam training for multi
stream mimo communications with hybrid antenna arrays,” in 2018 IEEE Globecom Workshops (GC
Wkshps), 2018, pp. 1–7.
[22] T. S. Rappaport, G. R. MacCartney, S. Sun, H. Yan, and S. Deng, “Small-scale, local area, and
transitional millimeter wave propagation for 5g communications,” IEEE Transactions on Antennas
and Propagation, vol. 65, no. 12, pp. 6474–6490, 2017.
[23] T. Rappaport, R. Heath, R. Daniels, and J. Murdock, Millimeter wave wireless communications.
Prentice Hall, 2015, includes bibliographical references (pages 585-651) and index.
[24] H. Zhao, R. Mayzus, S. Sun, M. Samimi, J. K. Schulz, Y. Azar, K. Wang, G. N. Wong, F. Gutierrez,
and T. S. Rappaport, “28 ghz millimeter wave cellular communication measurements for reflection and
penetration loss in and around buildings in new york city,” in 2013 IEEE International Conference on
Communications (ICC), 2013, pp. 5163–5167.
100 BIBLIOGRAPHY
[25] Jonathan, Lu, Daniel, Steinbach, Patrick, Cabrol, Philip, and Pietraski, “Modeling human blockers in
millimeter wave radio links,” 2012.
[26] E. Ben-Dor, T. S. Rappaport, Y. Qiao, and S. J. Lauffenburger, “Millimeter-wave 60 ghz outdoor
and vehicle aoa propagation measurements using a broadband channel sounder,” in 2011 IEEE Global
Telecommunications Conference - GLOBECOM 2011, 2011, pp. 1–6.
[27] T. Bai, R. Vaze, and R. W. Heath, “Analysis of blockage effects on urban cellular networks,” IEEE
Transactions on Wireless Communications, vol. 13, no. 9, pp. 5070–5083, 2014.
[28] M. N. Kulkarni, S. Singh, and J. G. Andrews, “Coverage and rate trends in dense urban mmwave
cellular networks,” in 2014 IEEE Global Communications Conference, 2014, pp. 3809–3814.
[29] T. S. Rappaport, Y. Xing, G. R. MacCartney, A. F. Molisch, E. Mellios, and J. Zhang, “Overview
of millimeter wave communications for fifth-generation (5g) wireless networks—with a focus on prop-
agation models,” IEEE Transactions on Antennas and Propagation, vol. 65, no. 12, pp. 6213–6230,
2017.
[30] M. R. Akdeniz, Y. Liu, M. K. Samimi, S. Sun, S. Rangan, T. S. Rappaport, and E. Erkip, “Mil-
limeter wave channel modeling and cellular capacity evaluation,” IEEE Journal on Selected Areas in
Communications, vol. 32, no. 6, pp. 1164–1179, 2014.
[31] T. S. Rappaport, S. Sun, and M. Shafi, “Investigation and comparison of 3gpp and nyusim channel
models for 5g wireless communications,” in 2017 IEEE 86th Vehicular Technology Conference (VTC-
Fall), 2017, pp. 1–5.
[32] 3GPP. 3GPP, “Study on channel model for frequencies from 0.5 to 100 GHz,”
3rd Generation Partnership Project (3GPP), Tech. Rep. TR 38.901. [Online]. Available:
http://www.3gpp.org/DynaReport/38901.htm
BIBLIOGRAPHY 101
[33] ——. 3GPP, “Study on channel model for frequency spectrum above 6 GHz,” 3rd
Generation Partnership Project (3GPP), Tech. Rep. TR 38.900. [Online]. Available:
https://portal.3gpp.org/desktopmodules/Specifications/SpecificationDetails.aspx?specificationId=2991
[34] 5GCM : 5G Channel Model for bands up to 100 GHz. [Online]. Available:
http://www.5gworkshops.com/2015/5GChannelModelforbandsupto100GHz(2015− 12− 6).pdf
[35] METIS. (Jul.2015) METIS Channel Model METIS2020, Deliverable D1.4 v3. [Online]. Available:
https://www.metis2020.com/wp-content/uploads/deliverables/METISD1.4v1.0.pdf
[36] mmMAGIC. (May 2017) Measurement results and final mmMAGIC channel models, H2020-ICT-
671650-mmMAGIC/D2.2 v1.0. [Online]. Available: https://5g-mmmagic.eu/results/
[37] G. R. MacCartney, T. S. Rappaport, and S. Rangan, “Rapid fading due to human blockage in pedestrian
crowds at 5g millimeter-wave frequencies,” in GLOBECOM 2017 - 2017 IEEE Global Communications
Conference, 2017, pp. 1–7.
[38] I. Kashiwagi, T. Taga, and T. Imai, “Time-varying path-shadowing model for indoor populated envi-
ronments,” IEEE Transactions on Vehicular Technology, vol. 59, no. 1, pp. 16–28, 2010.
[39] S. Dehnie, “Markov chain approximation of rayeleigh fading channel,” in 2007 IEEE International
Conference on Signal Processing and Communications, 2007, pp. 1311–1314.
[40] M. K. Samimi and T. S. Rappaport, “3-d millimeter-wave statistical channel model for 5g wireless
system design,” IEEE Transactions on Microwave Theory and Techniques, vol. 64, no. 7, pp. 2207–
2225, 2016.
[41] ——, “Local multipath model parameters for generating 5g millimeter-wave 3gpp-like channel impulse
response,” in 2016 10th European Conference on Antennas and Propagation (EuCAP), 2016, pp. 1–5.
[42] ——, “Statistical channel model with multi-frequency and arbitrary antenna beamwidth for millimeter-
wave outdoor communications,” in 2015 IEEE Globecom Workshops (GC Wkshps), 2015, pp. 1–7.
102 BIBLIOGRAPHY
[43] O. E. Ayach, S. Rajagopal, S. Abu-Surra, Z. Pi, and R. W. Heath, “Spatially sparse precoding in
millimeter wave MIMO systems,” IEEE Transactions on Wireless Communications, vol. 13, no. 3, pp.
1499–1513, Mar. 2014.
[44] J. A. Zhang, X. Huang, V. Dyadyuk, and Y. J. Guo, “Massive hybrid antenna array for millimeter-wave
cellular communications,” IEEE Wireless Communications, vol. 22, no. 1, pp. 79–87, 2015.
[45] T. S. Rappaport, J. N. Murdock, and F. Gutierrez, “State of the art in 60-ghz integrated circuits and
systems for wireless communications,” Proceedings of the IEEE, vol. 99, no. 8, pp. 1390–1436, 2011.
[46] Tuan-Do-Hong and P. Russer, “Signal processing for wideband smart antenna array applications,”
IEEE Microwave Magazine, vol. 5, no. 1, pp. 57–67, 2004.
[47] B. Floyd, S. Reynolds, U. Pfeiffer, T. Zwick, T. Beukema, and B. Gaucher, “Sige bipolar transceiver
circuits operating at 60 ghz,” IEEE Journal of Solid-State Circuits, vol. 40, no. 1, pp. 156–167, 2005.
[48] T. LaRocca, J. Y.-C. Liu, and M.-C. F. Chang, “60 ghz cmos amplifiers using transformer-coupling
and artificial dielectric differential transmission lines for compact design,” IEEE Journal of Solid-State
Circuits, vol. 44, no. 5, pp. 1425–1435, 2009.
[49] T. Yao, M. Gordon, K. Yau, M. Yang, and S. Voinigescu, “60-ghz pa and lna in 90-nm rf-cmos,” in
IEEE Radio Frequency Integrated Circuits (RFIC) Symposium, 2006, 2006, pp. 4 pp.–.
[50] M. Kraemer, D. Dragomirescu, and R. Plana, “A low-power high-gain lna for the 60 ghz band in a 65
nm cmos technology,” in 2009 Asia Pacific Microwave Conference, 2009, pp. 1156–1159.
[51] A. Fonte, S. Saponara, G. Pinto, L. Fanucci, and B. Neri, “60-ghz single-chip integrated antenna and
low noise amplifier in 65-nm cmos soi technology for short-range wireless gbits/s applications,” in 2011
International Conference on Applied Electronics, 2011, pp. 1–6.
BIBLIOGRAPHY 103
[52] L. Liu, A. R. Alt, H. Benedickter, and C. Bolognesi, “Low power consumption millimeter-wave ampli-
fiers using inp hemt technology,” in 2011 IEEE MTT-S International Microwave Workshop Series on
Millimeter Wave Integration Technologies, 2011, pp. 9–12.
[53] P.-Y. Chang, S.-H. Su, S. S. H. Hsu, W.-H. Cho, and J.-D. Jin, “An ultra-low-power transformer-
feedback 60 ghz low-noise amplifier in 90 nm cmos,” IEEE Microwave and Wireless Components
Letters, vol. 22, no. 4, pp. 197–199, 2012.
[54] K.-J. Kim, K. H. Ahn, T. H. Lim, H. C. Park, and J.-W. Yu, “A 60 ghz wideband phased-array lna
with short-stub passive vector generator,” IEEE Microwave and Wireless Components Letters, vol. 20,
no. 11, pp. 628–630, 2010.
[55] Y. Yu, P. G. M. Baltus, A. de Graauw, E. van der Heijden, C. S. Vaucher, and A. H. M. van Roermund,
“A 60 ghz phase shifter integrated with lna and pa in 65 nm cmos for phased array systems,” IEEE
Journal of Solid-State Circuits, vol. 45, no. 9, pp. 1697–1709, 2010.
[56] A. Natarajan, S. K. Reynolds, M.-D. Tsai, S. T. Nicolson, J.-H. C. Zhan, D. G. Kam, D. Liu, Y.-
L. O. Huang, A. Valdes-Garcia, and B. A. Floyd, “A fully-integrated 16-element phased-array receiver
in sige bicmos for 60-ghz communications,” IEEE Journal of Solid-State Circuits, vol. 46, no. 5, pp.
1059–1075, 2011.
[57] S. Y. Kim, D.-W. Kang, K.-J. Koh, and G. M. Rebeiz, “An improved wideband all-pass i/q network
for millimeter-wave phase shifters,” IEEE Transactions on Microwave Theory and Techniques, vol. 60,
no. 11, pp. 3431–3439, 2012.
[58] J. Borremans, M. Dehan, K. Scheir, M. Kuijk, and P. Wambacq, “Vco design for 60 ghz applications
using differential shielded inductors in 0.13 m cmos,” in 2008 IEEE Radio Frequency Integrated Circuits
Symposium, 2008, pp. 135–138.
104 BIBLIOGRAPHY
[59] Z. Liu, E. Skafidas, and R. J. Evans, “A 60 ghz vco with 6ghz tuning range in 130 nm bulk cmos,”
in 2008 International Conference on Microwave and Millimeter Wave Technology, vol. 1, 2008, pp.
209–211.
[60] C.-C. Chen, C.-C. Li, B.-J. Huang, K.-Y. Lin, H.-W. Tsao, and H. Wang, “Ring-based triple-push
vcos with wide continuous tuning ranges,” IEEE Transactions on Microwave Theory and Techniques,
vol. 57, no. 9, pp. 2173–2183, 2009.
[61] P. Shettigar and S. Pavan, “A 15mw 3.6gs/s ct- adc with 36mhz bandwidth and 83db dr in 90nm
cmos,” in 2012 IEEE International Solid-State Circuits Conference, 2012, pp. 156–158.
[62] S. Lee, A. P. Chandrakasan, and H.-S. Lee, “22.4 a 1gs/s 10b 18.9mw time-interleaved sar adc with
background timing-skew calibration,” in 2014 IEEE International Solid-State Circuits Conference Di-
gest of Technical Papers (ISSCC), 2014, pp. 384–385.
[63] N. Le Dortz, J.-P. Blanc, T. Simon, S. Verhaeren, E. Rouat, P. Urard, S. Le Tual, D. Goguet,
C. Lelandais-Perrault, and P. Benabes, “22.5 a 1.62gs/s time-interleaved sar adc with digital back-
ground mismatch calibration achieving interleaving spurs below 70dbfs,” in 2014 IEEE International
Solid-State Circuits Conference Digest of Technical Papers (ISSCC), 2014, pp. 386–388.
[64] M. Miyahara, I. Mano, M. Nakayama, K. Okada, and A. Matsuzawa, “22.6 a 2.2gs/s 7b 27.4mw
time-based folding-flash adc with resistively averaged voltage-to-time amplifiers,” in 2014 IEEE Inter-
national Solid-State Circuits Conference Digest of Technical Papers (ISSCC), 2014, pp. 388–389.
[65] X. Zhang, A. F. Molisch, and S.-Y. Kung, “Variable-phase-shift-based RF-baseband codesign for MIMO
antenna selection,” IEEE Transactions on Signal Processing, vol. 53, no. 11, pp. 4091–4103, Nov. 2005.
[66] J. Singh, O. Dabeer, and U. Madhow, “On the limits of communication with low-precision analog-
to-digital conversion at the receiver,” IEEE Transactions on Communications, vol. 57, no. 12, pp.
3629–3639, 2009.
BIBLIOGRAPHY 105
[67] S. Han, C.-l. I, Z. Xu, and C. Rowell, “Large-scale antenna systems with hybrid analog and digital
beamforming for millimeter wave 5g,” IEEE Communications Magazine, vol. 53, no. 1, pp. 186–194,
2015.
[68] R. Méndez-Rial, C. Rusu, A. Alkhateeb, N. González-Prelcic, and R. W. Heath, “Channel estimation
and hybrid combining for mmwave: Phase shifters or switches?” in 2015 Information Theory and
Applications Workshop (ITA), 2015, pp. 90–97.
[69] J. Brady, N. Behdad, and A. M. Sayeed, “Beamspace mimo for millimeter-wave communications:
System architecture, modeling, analysis, and measurements,” IEEE Transactions on Antennas and
Propagation, vol. 61, no. 7, pp. 3814–3827, 2013.
[70] A. Alkhateeb, O. E. Ayach, G. Leus, and R. W. Heath, “Channel estimation and hybrid precoding for
millimeter wave cellular systems,” IEEE Journal of Selected Topics in Signal Processing, vol. 8, no. 5,
pp. 831–846, Oct. 2014.
[71] M. Giordani, M. Mezzavilla, and M. Zorzi, “Initial access in 5G mmwave cellular networks,” IEEE
Communications Magazine, vol. 54, no. 11, pp. 40–47, Nov. 2016.
[72] S. Chiu, N. Ronquillo, and T. Javidi, “Active learning and CSI acquisition for mmwave initial align-
ment,” IEEE Journal on Selected Areas in Communications, pp. 1–1, 2019.
[73] O. Shayevitz and M. Feder, “Optimal feedback communication via posterior matching,” IEEE Trans-
actions on Information Theory, vol. 57, no. 3, pp. 1186–1222, Mar. 2011.
[74] Sung-En Chiu and T. Javidi, “Sequential measurement-dependent noisy search,” in 2016 IEEE Infor-
mation Theory Workshop (ITW), Sep. 2016, pp. 221–225.
[75] Y. Ding, S. Chiu, and B. D. Rao, “Bayesian channel estimation algorithms for massive MIMO systems
with hybrid analog-digital processing and low-resolution ADCs,” IEEE Journal of Selected Topics in
Signal Processing, vol. 12, no. 3, pp. 499–513, Jun. 2018.
106 BIBLIOGRAPHY
[76] C. M. Bishop, Pattern Recognition and Machine Learning (Information Science and Statistics). Berlin,
Heidelberg: Springer-Verlag, 2006.
[77] C. N. Manchón, E. de Carvalho, and J. B. Andersen, “Ping-pong beam training with hybrid digital-
analog antenna arrays,” in IEEE International Conference on Communications (ICC), Paris, France,
May 2017, pp. 1–7.
[78] M. U. Thomas, “A generalized maximum entropy principle,” Operations Research, vol. 27, no. 6, pp.
1188–1196, 1979.
[79] J. B. Andersen, “Intelligent antennas in a scattering environment – an overview,” in IEEE Global
Communications Conference (GLOBECOM), Sydney, Australia, Nov. 1998, pp. 3199–3203 (vol. 6).
[80] ——, “Array gain and capacity for known random channels with multiple element arrays at both ends,”
IEEE Journal on Selected Areas in Communications, vol. 18, no. 11, pp. 2172–2178, Nov. 2000.
[81] T. Dahl, N. Christophersen, and D. Gesbert, “Blind MIMO eigenmode transmission based on the
algebraic power method,” IEEE Transactions on Signal Processing, vol. 52, no. 9, pp. 2424–2431, Sep.
2004.
[82] E. de Carvalho and J. B. Andersen, “Ping-pong beam training for reciprocal channels with delay
spread,” in 49th Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, USA,
Nov. 2015, pp. 1752–1756.
[83] D. Ogbe, D. J. Love, and V. Raghavan, “Noisy beam alignment techniques for reciprocal MIMO
channels,” IEEE Transactions on Signal Processing, vol. 65, no. 19, pp. 5092–5107, Oct. 2017.
[84] H. Ghauch, T. Kim, M. Bengtsson, and M. Skoglund, “Subspace estimation and decomposition for
large millimeter-wave MIMO systems,” IEEE Journal of Selected Topics in Signal Processing, vol. 10,
no. 3, pp. 528–542, Apr. 2016.
BIBLIOGRAPHY 107
[85] X. Cheng, N. Lou, and S. Li, “Spatially sparse beamforming training for millimeter wave MIMO
systems,” IEEE Transactions on Wireless Communications, vol. 16, no. 5, pp. 3385–3400, May 2017.
[86] M. F. Duarte and Y. C. Eldar, “Structured compressed sensing: From theory to applications,” IEEE
Transactions on Signal Processing, vol. 59, no. 9, pp. 4053–4085, Sept 2011.
[87] A. G. Burr, “Capacity bounds and estimates for the finite scatterers MIMO wireless channel,” IEEE
Journal on Selected Areas in Communications, vol. 21, no. 5, pp. 812–818, Jun. 2003.
[88] S. Payami, M. Shariat, M. Ghoraishi, and M. Dianati, “Effective RF codebook design and channel
estimation for millimeter wave communication systems,” in IEEE International Conference on Com-
munication Workshop (ICCW), London, UK, Jun. 2015, pp. 1226–1231.
[89] K. Roth and J. A. Nossek, “Arbitrary beam synthesis of hybrid beamforming systems for beam train-
ing,” IEEE Wireless Communications Letters, vol. 6, no. 6, pp. 714–717, Dec. 2017.
[90] J. Andersen, “Propagation aspects of MIMO channel modeling,” in Space-Time Wireless Systems,
H. Bolcskei, C. Papadias, D. Gesbert, and A.-J. van der Veen, Eds. Cambridge University Press,
2006, ch. 1.