MA Song,LI Li,LI Wei,et al.Intelligent anti-jamming decision algorithm based on proximal policy optimization[J].Journal on Communications,2024,45(08):249-257.
MA Song,LI Li,LI Wei,et al.Intelligent anti-jamming decision algorithm based on proximal policy optimization[J].Journal on Communications,2024,45(08):249-257. DOI: 10.11959/j.issn.1000-436x.2024137.
Intelligent anti-jamming decision algorithm based on proximal policy optimization
The existing intelligent anti-jamming methods based on deep reinforcement learning are applied to space-ground TT&C and communication links
in which the deep neural network used for decision-making has a complex structure
and the resources of satellites and other vehicles are limited
making it difficult to independently complete the timely training of complex neural network under the constraints of limited complexity
and the decision-making of anti-jamming cannot converge. Aiming at the above problems
an intelligent anti-jamming decision algorithm based on proximal policy optimization was proposed
which deployed the decision-making neural network and the training neural network in the vehicles and the ground station
respectively. The ground station conducted the optimal offline training based on the empirical information feedback from the vehicles
and assisted the decision-making neural network in parameter updating
thereby achieving the effective selection of anti-jamming strategies while satisfying the resource constraints of the vehicles. The simulation results demonstrate that the convergence speed of the proposed algorithm is increased by 37%
and the system capacity after convergence is increased by 25%
compared with the decision algorithms of policy gradient and deep Q-learning.
LIU T H , WANG H Q . Thought on application of space-ground integrated information network in domestic civil aviation [J ] . Telecommunication Engineering , 2018 , 58 ( 6 ): 738 - 744 .
NIEPHAUS C , KRETSCHMER M , GHINEA G . QoS provisioning in converged satellite and terrestrial networks: a survey of the state-of-the-art [J ] . IEEE Communications Surveys & Tutorials , 2016 , 18 ( 4 ): 2415 - 2441 .
ZHANG H J , CHEN A Q , LI Y B , et al . Key technologies of 6G mobile network [J ] . Journal on Communications , 2022 , 43 ( 7 ): 189 - 202 .
GUIDOTTI A , VANELLI-CORALLI A , CONTI M , et al . Architectures and key technical challenges for 5G systems incorporating satellites [J ] . IEEE Transactions on Vehicular Technology , 2019 , 68 ( 3 ): 2624 - 2639 .
ZHANG L C , XU Y B , LI F H , et al . Dynamic security-empowering architecture for space-ground integration information network [J ] . Journal on Communications , 2021 , 42 ( 9 ): 87 - 95 .
ZHU Y G , SUN Y F , YAO F Q , et al . Channel-space endogenous anti-jamming method based on multi-reconfigurable intelligent surface [J ] . Journal on Communications , 2023 , 44 ( 10 ): 13 - 22 .
BRYAN C , MARK G , JESSE S . Winning in the gray zone: using electromagnetic warfare to regain escalation dominance [R ] . 2017 .
YAO H P , WANG L Y , WANG X D , et al . The space-terrestrial integrated network: an overview [J ] . IEEE Communications Magazine , 2018 , 56 ( 9 ): 178 - 185 .
FENG Z B , XU Y H , DU Z Y , et al . Active defense technology against intelligent jammer [J ] . Journal on Communications , 2022 , 43 ( 10 ): 42 - 54 .
LI S Q , CHENG Y F , DONG B H , et al . Research on intelligent anti-jam communication techniques [J ] . Radio Communications Technology , 2012 , 38 ( 1 ): 1 - 4 .
ZHANG M J , ZHAO R , WANG P C , et al . Anti-jamming algorithm with reinforcement learning in UAV-aided Internet of things [J ] . Journal of Signal Processing , 2021 , 37 ( 1 ): 11 - 18 .
WANG R D , ZHANG Y L , WEI P , et al . Intelligent anti-jamming strategy for tactical frequency-hopping system [J ] . Journal of Signal Processing , 2023 , 39 ( 1 ): 84 - 95 .
HAN G A , XIAO L , POOR H V . Two-dimensional anti-jamming communication based on deep reinforcement learning [C ] // Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) . Piscataway : IEEE Press , 2017 : 2087 - 2091 .
LIU X , XU Y H , JIA L L , et al . Anti-jamming communications using spectrum waterfall: a deep reinforcement learning approach [J ] . IEEE Communications Letters , 2018 , 22 ( 5 ): 998 - 1001 .
LI W , WANG J , LI L , et al . Intelligent anti-jamming communication with continuous action decision for ultra-dense network [C ] // Proceedings of the ICC 2019-2019 IEEE International Conference on Communications (ICC) . Piscataway : IEEE Press , 2019 : 1 - 7 .
ZHANG M Y , DOU Y J , CHEN Z Y , et al . Review of deep reinforcement learning and its applications in military field [J ] . Systems Engineering and Electronics , 2024 , 46 ( 4 ): 1297 - 1308 .
TANG S Q , PAN Z S , HU G Y , et al . Application of deep reinforcement learning in space information network—status quo and prospects [J ] . Systems Engineering and Electronics , 2023 , 45 ( 3 ): 886 - 901 .
STRASSER M , PÖPPER C , ČAPKUN S . Efficient uncoordinated FHSS anti-jamming communication [C ] // Proceedings of the Tenth ACM International Symposium on Mobile Ad Hoc Networking and Computing . New York : ACM Press , 2009 : 207 - 218 .
WILHELM M , MARTINOVIC I , SCHMITT J B , et al . Short paper: reactive jamming in wireless networks: how realistic is the threat? [C ] // Proceedings of the Fourth ACM Conference on Wireless Network Security . New York : ACM Press , 2011 : 47 - 52 .
HE X F , DAI H Y , NING P . Faster learning and adaptation in security games by exploiting information asymmetry [J ] . IEEE Transactions on Signal Processing , 2016 , 64 ( 13 ): 3429 - 3443 .
SUTTON R S , BARTO A G . Reinforcement learning: an introduction [M ] . Cambridge : MIT Press , 2018 .
HE K M , SUN J . Convolutional neural networks at constrained time cost [C ] // Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) . Piscataway : IEEE Press , 2015 : 5353 - 5360 .
ZOU R , LIU Y , ZANG Q , et al . Overview of development of foreign space-based space target surveillance system [J ] . Spacecraft Engineering , 2023 , 32 ( 5 ): 110 - 118 .