收藏切换
Attitude Control of High-speed Vehicles Based on Improved TD3 Reinforcement Learning
收藏切换
PDF
Weili WANG, Wanwei HUANG, Xiaodong LIU, Kunfeng LU, Chenhui JIA
Missiles and Space Vehicles | 2025, 48(6) : 1 - 9
Less
收藏切换
Missiles and Space Vehicles | 2025, 48(6): 1-9
Launch Vehicle and Missile
Attitude Control of High-speed Vehicles Based on Improved TD3 Reinforcement Learning
Full
Weili WANG, Wanwei HUANG, Xiaodong LIU, Kunfeng LU, Chenhui JIA
Affiliations
  • National Key Laboratory of Science and Technology on Aerospace Intelligent Control, Beijing AerospaceAutomatic Control Institute, Beijing, 100854
Published: 2025-12-25 doi: 10.7654/j.issn.2097-1974.20250601
Outline
收藏切换

To address the challenges of strong nonlinearity, high uncertainty, and rapid time-varying parameters during the reentry phase of high-speed vehicles, this study proposes an end-to-end intelligent attitude control method based on an improved Twin Delayed Deep Deterministic Policy Gradient algorithm, aligned with the demands of intelligent spacecraft development. To overcome the issues of training instability and convergence difficulties in TD3-based attitude control learning, two key innovations are introduced: a hybrid reward mechanism combining continuous tracking error penalties and sparse task-completion rewards is designed within the Markov Decision Process framework to synergistically guide agent convergence. Prior knowledge constraints derived from modern control theory are incorporated into the training process, proposing a behavior cloning-based optimization strategy for the Actor network to balance expert experience imitation and cumulative reward maximization. Simulation results show that the proposed method can accurately track the three-channel attitude commands under 14 combinations of parameter deviations.

high-speed vehicles  /  attitude control  /  deep reinforcement learning  /  behavior cloning  /  strongly adaptive control
Weili WANG, Wanwei HUANG, Xiaodong LIU, Kunfeng LU, Chenhui JIA. Attitude Control of High-speed Vehicles Based on Improved TD3 Reinforcement Learning[J]. Missiles and Space Vehicles, 2025 , 48 (6) : 1 -9 . DOI: 10.7654/j.issn.2097-1974.20250601
Year 2025 volume 48 Issue 6
PDF
335
158
Cite this Article
BibTeX
Article Info
doi: 10.7654/j.issn.2097-1974.20250601
  • Receive Date:2025-07-05
  • Online Date:2026-01-20
  • Published:2025-12-25
Article Data
Affiliations
History
  • Received:2025-07-05
  • Revised:2025-09-15
Funding
Affiliations
    National Key Laboratory of Science and Technology on Aerospace Intelligent Control, Beijing AerospaceAutomatic Control Institute, Beijing, 100854
References
Share
https://castjournals.cast.org.cn/joweb/ddyht/EN/10.7654/j.issn.2097-1974.20250601
Share to
QR

Scan QR to access full text

Cite this article
BibTeX
Citations
表12种不同金属材料的力学参数

Family
属数
Number of
genus
种数
Number of
species
占总种数比例
Percentage of
total species (%)

Genus
种数
Number of
species
占总种数比例
Percentage of total
species (%)
鹅膏菌科Amanitaceae 2 11 5.26 鹅膏菌属 Amanita 10 4.78
小菇科 Mycenaceae 2 12 5.74 丝盖伞属 Inocybe 5 2.39
多孔菌科 Polyporaceae 8 14 6.70 蜡蘑属 Laccaria 5 2.39
红菇科 Russulaceae 3 23 11.00 小皮伞属 Marasmius 6 2.87
小菇属 Mycena 11 5.26
光柄菇属 Pluteus 5 2.39
红菇属 Russula 17 8.13
栓菌属 Trametes 5 2.39
关闭全屏
  • BibTeX
  • EndNote
  • RefWorks
  • TxT