Activities per year
Abstract
This work presents an approach to deriving a controller for spacecraft powered descent using reinforcement learning. To assist in the learning process, our approach uses optimal control demonstrations which provide open-loop control for optimal trajectories. Combining these approaches to use the optimal trajectories as demonstrations helps to overcome issues with convergence on desirable policies in the reinforcement learning problem. We demonstrate the applicability of this approach on a simulated 3-DOF Mars lander. The results show that the learned controller is capable of achieving a pinpoint soft landing from a range of initial conditions. Compared to the open-loop optimal trajectories alone, this controller generalises to more initial conditions and can cope with environmental uncertainties.
Original language | English |
---|---|
Number of pages | 15 |
Publication status | E-pub ahead of print - 25 Jun 2021 |
Event | 8th International Conference on Astrodynamics Tools and Techniques - Virtual Duration: 22 Jun 2021 → 25 Jun 2021 https://atpi.eventsair.com/QuickEventWebsitePortal/20a05-gnc-2020/website |
Conference
Conference | 8th International Conference on Astrodynamics Tools and Techniques |
---|---|
Abbreviated title | ICATT 2021 |
Period | 22/06/21 → 25/06/21 |
Internet address |
Keywords
- reinforcement learning (RL)
- intelligent control
Fingerprint
Dive into the research topics of 'Leveraging optimal control demonstrations in reinforcement learning for powered descent'. Together they form a unique fingerprint.-
8th International Conference on Astrodynamics Tools and Techniques (ICATT 2021)
Wilson, C. (Participant)
23 Jun 2021 → 25 Jun 2021Activity: Participating in or organising an event types › Participation in conference
-
Learning powered descent trajectories with optimal control demonstrations
Wilson, C. (Speaker)
25 Jun 2021Activity: Talk or presentation types › Oral presentation