Attitude Control of High-speed Vehicles Based on Improved TD3 Reinforcement Learning

Attitude Control of High-speed Vehicles Based on Improved TD3 Reinforcement Learning

PDF

Weili WANG, Wanwei HUANG, Xiaodong LIU, Kunfeng LU, Chenhui JIA

Missiles and Space Vehicles | 2025, (6) : 1 - 9

Less

Missiles and Space Vehicles | 2025, (6): 1-9

• Launch Vehicle and Missile •

Attitude Control of High-speed Vehicles Based on Improved TD3 Reinforcement Learning

Full

Weili WANG, Wanwei HUANG, Xiaodong LIU, Kunfeng LU, Chenhui JIA

Affiliations

National Key Laboratory of Science and Technology on Aerospace Intelligent Control, Beijing AerospaceAutomatic Control Institute, Beijing, 100854

Published: 2025-12-25 doi: 10.7654/j.issn.2097-1974.20250601

Outline

Abstract

Less

To address the challenges of strong nonlinearity, high uncertainty, and rapid time-varying parameters during the reentry phase of high-speed vehicles, this study proposes an end-to-end intelligent attitude control method based on an improved Twin Delayed Deep Deterministic Policy Gradient algorithm, aligned with the demands of intelligent spacecraft development. To overcome the issues of training instability and convergence difficulties in TD3-based attitude control learning, two key innovations are introduced: a hybrid reward mechanism combining continuous tracking error penalties and sparse task-completion rewards is designed within the Markov Decision Process framework to synergistically guide agent convergence. Prior knowledge constraints derived from modern control theory are incorporated into the training process, proposing a behavior cloning-based optimization strategy for the Actor network to balance expert experience imitation and cumulative reward maximization. Simulation results show that the proposed method can accurately track the three-channel attitude commands under 14 combinations of parameter deviations.

Key words

high-speed vehicles / attitude control / deep reinforcement learning / behavior cloning / strongly adaptive control

Cite this Article

Weili WANG, Wanwei HUANG, Xiaodong LIU, Kunfeng LU, Chenhui JIA. Attitude Control of High-speed Vehicles Based on Improved TD3 Reinforcement Learning[J]. Missiles and Space Vehicles, 2025 , (6) : 1 -9 . DOI: 10.7654/j.issn.2097-1974.20250601

Appendix

Less

Year 2025 volume Issue 6

PDF

588

282

Cite this Article

BibTeX

Article Info

doi: 10.7654/j.issn.2097-1974.20250601

Receive Date：2025-07-05
Online Date：2026-01-20
Published：2025-12-25

Article Data

Affiliations

History

Received：2025-07-05
Revised：2025-09-15

Funding

Affiliations

National Key Laboratory of Science and Technology on Aerospace Intelligent Control, Beijing AerospaceAutomatic Control Institute, Beijing, 100854

References

Share

https://castjournals.cast.org.cn/joweb/ddyht/EN/10.7654/j.issn.2097-1974.20250601

Share to

Scan QR to access full text

Cite this article

BibTeX

Citations

表12种不同金属材料的力学参数

科 Family	属数 Number of genus	种数 Number of species	占总种数比例 Percentage of total species (%)	属 Genus	种数 Number of species	占总种数比例 Percentage of total species (%)
鹅膏菌科Amanitaceae	2	11	5.26	鹅膏菌属 Amanita	10	4.78
小菇科 Mycenaceae	2	12	5.74	丝盖伞属 Inocybe	5	2.39
多孔菌科 Polyporaceae	8	14	6.70	蜡蘑属 Laccaria	5	2.39
红菇科 Russulaceae	3	23	11.00	小皮伞属 Marasmius	6	2.87
				小菇属 Mycena	11	5.26
				光柄菇属 Pluteus	5	2.39
				红菇属 Russula	17	8.13
				栓菌属 Trametes	5	2.39

关闭全屏

BibTeX
EndNote
RefWorks
TxT

Articles: Latest Articles; Most Read; Collections

Updates: Events; News; Multimedia

About: About Us

Contact

No. 86 Xueyuan South Road, Haidian District, Beijing

100081

010-62199257

qkjq@cast.org.cn

Copyright © 2025 China Association for Science and Technology. All rights reserved. For all open access content, the relevant licensing terms apply.
Sponsored by the Office of the Leading Group for Cybersecurity and Informatization of CAST, and supported by Science and Technology Review Publishing House