Author: Chuang, Ya-Yi
Thesis title: Codebook Design of All Index Modulation with Deep Reinforcement Learning
Advisor: Wu, Jen-Ming
Oral defense committee: Chien, Feng-Tsun; Chung, Wei-Ho; Sang, Tzu-Hsien
Keywords: Deep Reinforcement Learning; All Index Modulation; Codebook Design
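This thesis proposes a codebook design based on deep reinforcement learning, applied to orthogonal frequency-division multiplexing with All Index Modulation. All Index Modulation (AIM) is an improved form of Index Modulation (IM). Unlike conventional IM, it removes the modulation [...] of conventional IM [...]. We use a deep recurrent Q-network (DRQN) model so that it can be applied to codeword lengths [...] for a given number of subcarriers, the relationship between the Hamming distance and the Euclidean distance of the ideal optimal codebook, and derive an upper bound on the error rate (BER).
The density of the codebook decreases as the number of subcarriers increases. Therefore, under the same spectral efficiency, the error performance can improve. However, previous codebook design methods are difficult to implement because of their high complexity. With deep reinforcement learning [...]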
In this thesis, we present a codebook design with deep reinforcement learning for OFDM with All Index Modulation (OFDM-AIM). AIM is an improved form of Index Modulation (IM): it removes the modulation symbols of IM and encodes only the index symbols, mapping them to a codebook to achieve a higher diversity gain.
The error performance of AIM depends on the design of the codebook. However, the previous codebook design method yields sub-optimal error performance, and finding the optimal codebook by exhaustive search is prohibitively complex. In this work, we propose using deep reinforcement learning (DRL) to reduce the complexity and achieve better error performance.
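To give a feel for why exhaustive search becomes impractical, the sketch below counts candidate codebooks under an assumed setup that is not taken from the thesis: codewords are length-n sequences over a Q-point alphabet, and a codebook is any set of M distinct codewords. The names `q`, `n`, and `m` and the sizes used are illustrative only.

```python
from math import comb


def num_candidate_codebooks(q: int, n: int, m: int) -> int:
    """Count the ways to choose m distinct codewords out of all q**n
    length-n sequences (assumed model, not the thesis's exact setup)."""
    return comb(q ** n, m)


# Hypothetical sizes: 4-point alphabet, one information bit per subcarrier,
# so a block of n subcarriers needs m = 2**n codewords.
for n in (2, 3, 4):
    count = num_candidate_codebooks(q=4, n=n, m=2 ** n)
    print(f"n={n}: {count} candidate codebooks")
```

Even at these small sizes the count grows from thousands to billions and beyond, which is the kind of growth that motivates a learned search instead of enumeration.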
We reformulate the codebook design problem as a DRL problem and design the reward function. We analyze the relationship between the Hamming distance and the Euclidean distance of the optimal codebook to derive an upper bound on the SER.
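As a rough illustration of how the two distances relate, the sketch below computes the minimum Hamming distance and the minimum squared Euclidean distance of a small codebook. It is not the thesis's derivation. It assumes each codeword entry indexes a point of a unit-energy QPSK alphabet, and the codebook itself is a made-up example. Under that assumption, the squared Euclidean distance between two codewords is at least their Hamming distance times the smallest squared distance between distinct alphabet points.

```python
import itertools

import numpy as np

# Assumed alphabet: four unit-energy QPSK points (illustrative choice).
ALPHABET = np.exp(1j * (np.pi / 4 + np.pi / 2 * np.arange(4)))


def hamming_distance(a, b):
    """Number of positions in which two codewords differ."""
    return sum(x != y for x, y in zip(a, b))


def squared_euclidean_distance(a, b):
    """Squared distance between the transmitted symbol vectors."""
    diff = ALPHABET[list(a)] - ALPHABET[list(b)]
    return float(np.sum(np.abs(diff) ** 2))


def min_distances(codebook):
    """Minimum Hamming and squared Euclidean distance over all pairs."""
    pairs = list(itertools.combinations(codebook, 2))
    d_h = min(hamming_distance(a, b) for a, b in pairs)
    d_e2 = min(squared_euclidean_distance(a, b) for a, b in pairs)
    return d_h, d_e2


# Smallest squared distance between two distinct alphabet points.
delta2 = min(
    abs(ALPHABET[i] - ALPHABET[j]) ** 2
    for i, j in itertools.combinations(range(len(ALPHABET)), 2)
)

# Hypothetical codebook: four codewords over four subcarriers.
codebook = [(0, 1, 2, 3), (1, 0, 3, 2), (2, 3, 0, 1), (3, 2, 1, 0)]
d_h, d_e2 = min_distances(codebook)
print(d_h, d_e2, d_h * delta2)
```

A codebook with a larger minimum Hamming distance therefore guarantees a larger minimum Euclidean distance, which is what drives a union-bound style error estimate.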
According to our simulation results, the BER performance is close to the theoretical upper bound and better than that of classic AIM. The model we designed is less complex than other codebook design methods. The density of the codebook decreases as the number of subcarriers increases, so the error performance improves under the same spectral efficiency. However, previous codebook design methods are difficult to implement because of their high complexity. Codebook design with DRL can be applied to any number of subcarriers, which makes the DRL approach very attractive.
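The density argument can be made concrete with a small calculation. The definition used here is an assumption, since the abstract does not define density: it is taken to be the fraction of all Q^n possible length-n sequences that are used as codewords, with the number of codewords fixed by the spectral efficiency. The function name and parameters are hypothetical.

```python
def codebook_density(q: int, n: int, bits_per_subcarrier: int) -> float:
    """Fraction of the q**n possible sequences used as codewords
    (assumed definition of density, not taken from the thesis)."""
    # Fixed spectral efficiency: the block of n subcarriers carries
    # bits_per_subcarrier * n bits, so it needs 2**(bits * n) codewords.
    num_codewords = 2 ** (bits_per_subcarrier * n)
    if num_codewords > q ** n:
        raise ValueError("more codewords requested than sequences available")
    return num_codewords / q ** n


# Hypothetical setting: 4-point alphabet, 1 bit per subcarrier.
densities = {n: codebook_density(q=4, n=n, bits_per_subcarrier=1) for n in (2, 4, 8)}
for n, density in densities.items():
    print(f"n={n}: density={density}")
```

Under this assumed definition the density shrinks geometrically with n, leaving more room to pick codewords that are far apart.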
Chinese Abstract
English Abstract
Contents
1.1 Foreword
1.2 Related Work
1.3 Research Motivation and Objective
1.4 Proposed Method
1.5 Contribution and Achievement
1.6 Thesis Organization
2.1 OFDM with Index Modulation
2.1.1 OFDM with Index Modulation
2.1.2 OFDM with Multi-Mode Index Modulation (MM-OFDM-IM)
2.1.3 OFDM with Q-Ary Multi-Mode Index Modulation (Q-MM-OFDM-IM)
2.2 OFDM with All Index Modulation
2.3 OFDM with Improved All Index Modulation
2.4 Reinforcement Learning
2.4.1 Reinforcement Learning
2.4.2 Deep Q-network (DQN)
2.4.3 Deep Recurrent Q-Network (DRQN)
3 Codebook Design with Deep Reinforcement Learning for AIM
3.1 System Model
3.2 Problem Setup and Performance Analysis
3.2.1 Problem Statement
3.2.2 Complexity Analysis
3.2.3 Performance Analysis
3.2.4 Union Bound of Symbol Error Rate
3.3 The Design of Codebook with DRL
3.3.1 States, Actions, Rewards
3.3.2 Deep Recurrent Q-Network
3.3.3 Codebook Model Design
4.1 Reward
4.2 Complexity
4.3 SER Theoretical Upper Bound of OFDM-AIM
4.4 BER Performance of OFDM-AIM
5 Conclusion
Bibliography
