Multiple sliding surface guidance for planetary landing

Tuning and optimization via reinforcement learning

Daniel R. Wibben, Brian Gaudet, Roberto Furfaro, Jules Simo

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

The problem of achieving pinpoint landing accuracy in future space missions to extra-terrestrial bodies such as the Moon or Mars presents many challenges, including the requirements of higher accuracy and more flexibility. These new challenges may require the development of novel and more advanced guidance algorithms. Conventional guidance schemes, which generally require a combination of off-line trajectory generation and real-time, trajectory tracking algorithms, have worked well in the past but may not satisfy the more stringent and difficult landing requirements imposed by future mission architectures to bring landers very near to specified locations. In this paper, a novel non-linear guidance algorithm for planetary landing is proposed and analyzed. Based on Higher-Order Sliding Control (HOSC) theory, the Multiple Sliding Surface Guidance (MSSG) algorithms has been specifically designed to take advantage of the ability of the system to reach the sliding surface in a finite time. The high control activity seen in typical sliding controllers is avoided in this formulation, resulting in a guidance law that is both globally stable and robust against unknown, but bounded perturbations. The proposed MSSG does not require any off-line trajectory generation and therefore it is flexible enough to target a large variety of point on the planet's surface without the need for calculation of multiple reference trajectories. However, after initial analysis, it has been seen that the performance of MSSG is very sensitive to the choice in guidance gains. MSSG generated trajectories have been compared to an optimal solution to begin an investigation of the relationship between the optimality and performance of MSSG and the selection of the guidance parameters. A full study has been performed to investigate and tune the parameters of MSSG utilizing reinforcement learning in order to truly optimize the performance of the MSSG algorithm. Results show that the MSSG algorithm can indeed generate trajectories that come very close to the optimal solution in terms of fuel usage. A full comparison of the trajectories is included, as well as a further study examining the capability of the MSSG algorithm under perturbed conditions using the optimized set of parameters.

Original languageEnglish (US)
Title of host publicationAdvances in the Astronautical Sciences
PublisherUnivelt Inc.
Pages1881-1900
Number of pages20
Volume148
StatePublished - 2013
Event23rd AAS/AIAA Space Flight Mechanics Meeting, Spaceflight Mechanics 2013 - Kauai, HI, United States
Duration: Feb 10 2013Feb 14 2013

Other

Other23rd AAS/AIAA Space Flight Mechanics Meeting, Spaceflight Mechanics 2013
CountryUnited States
CityKauai, HI
Period2/10/132/14/13

Fingerprint

planetary landing
Electronic guidance systems
Reinforcement learning
reinforcement
Landing
learning
sliding
Tuning
tuning
optimization
Trajectories
trajectory
trajectories
landing
Moon
Planets
generation time
Control theory

ASJC Scopus subject areas

  • Aerospace Engineering
  • Space and Planetary Science

Cite this

Wibben, D. R., Gaudet, B., Furfaro, R., & Simo, J. (2013). Multiple sliding surface guidance for planetary landing: Tuning and optimization via reinforcement learning. In Advances in the Astronautical Sciences (Vol. 148, pp. 1881-1900). Univelt Inc..

Multiple sliding surface guidance for planetary landing : Tuning and optimization via reinforcement learning. / Wibben, Daniel R.; Gaudet, Brian; Furfaro, Roberto; Simo, Jules.

Advances in the Astronautical Sciences. Vol. 148 Univelt Inc., 2013. p. 1881-1900.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Wibben, DR, Gaudet, B, Furfaro, R & Simo, J 2013, Multiple sliding surface guidance for planetary landing: Tuning and optimization via reinforcement learning. in Advances in the Astronautical Sciences. vol. 148, Univelt Inc., pp. 1881-1900, 23rd AAS/AIAA Space Flight Mechanics Meeting, Spaceflight Mechanics 2013, Kauai, HI, United States, 2/10/13.
Wibben DR, Gaudet B, Furfaro R, Simo J. Multiple sliding surface guidance for planetary landing: Tuning and optimization via reinforcement learning. In Advances in the Astronautical Sciences. Vol. 148. Univelt Inc. 2013. p. 1881-1900
Wibben, Daniel R. ; Gaudet, Brian ; Furfaro, Roberto ; Simo, Jules. / Multiple sliding surface guidance for planetary landing : Tuning and optimization via reinforcement learning. Advances in the Astronautical Sciences. Vol. 148 Univelt Inc., 2013. pp. 1881-1900
@inproceedings{c1568a2ecfe044459a3edbb49792fff0,
title = "Multiple sliding surface guidance for planetary landing: Tuning and optimization via reinforcement learning",
abstract = "The problem of achieving pinpoint landing accuracy in future space missions to extra-terrestrial bodies such as the Moon or Mars presents many challenges, including the requirements of higher accuracy and more flexibility. These new challenges may require the development of novel and more advanced guidance algorithms. Conventional guidance schemes, which generally require a combination of off-line trajectory generation and real-time, trajectory tracking algorithms, have worked well in the past but may not satisfy the more stringent and difficult landing requirements imposed by future mission architectures to bring landers very near to specified locations. In this paper, a novel non-linear guidance algorithm for planetary landing is proposed and analyzed. Based on Higher-Order Sliding Control (HOSC) theory, the Multiple Sliding Surface Guidance (MSSG) algorithms has been specifically designed to take advantage of the ability of the system to reach the sliding surface in a finite time. The high control activity seen in typical sliding controllers is avoided in this formulation, resulting in a guidance law that is both globally stable and robust against unknown, but bounded perturbations. The proposed MSSG does not require any off-line trajectory generation and therefore it is flexible enough to target a large variety of point on the planet's surface without the need for calculation of multiple reference trajectories. However, after initial analysis, it has been seen that the performance of MSSG is very sensitive to the choice in guidance gains. MSSG generated trajectories have been compared to an optimal solution to begin an investigation of the relationship between the optimality and performance of MSSG and the selection of the guidance parameters. A full study has been performed to investigate and tune the parameters of MSSG utilizing reinforcement learning in order to truly optimize the performance of the MSSG algorithm. Results show that the MSSG algorithm can indeed generate trajectories that come very close to the optimal solution in terms of fuel usage. A full comparison of the trajectories is included, as well as a further study examining the capability of the MSSG algorithm under perturbed conditions using the optimized set of parameters.",
author = "Wibben, {Daniel R.} and Brian Gaudet and Roberto Furfaro and Jules Simo",
year = "2013",
language = "English (US)",
volume = "148",
pages = "1881--1900",
booktitle = "Advances in the Astronautical Sciences",
publisher = "Univelt Inc.",

}

TY - GEN

T1 - Multiple sliding surface guidance for planetary landing

T2 - Tuning and optimization via reinforcement learning

AU - Wibben, Daniel R.

AU - Gaudet, Brian

AU - Furfaro, Roberto

AU - Simo, Jules

PY - 2013

Y1 - 2013

N2 - The problem of achieving pinpoint landing accuracy in future space missions to extra-terrestrial bodies such as the Moon or Mars presents many challenges, including the requirements of higher accuracy and more flexibility. These new challenges may require the development of novel and more advanced guidance algorithms. Conventional guidance schemes, which generally require a combination of off-line trajectory generation and real-time, trajectory tracking algorithms, have worked well in the past but may not satisfy the more stringent and difficult landing requirements imposed by future mission architectures to bring landers very near to specified locations. In this paper, a novel non-linear guidance algorithm for planetary landing is proposed and analyzed. Based on Higher-Order Sliding Control (HOSC) theory, the Multiple Sliding Surface Guidance (MSSG) algorithms has been specifically designed to take advantage of the ability of the system to reach the sliding surface in a finite time. The high control activity seen in typical sliding controllers is avoided in this formulation, resulting in a guidance law that is both globally stable and robust against unknown, but bounded perturbations. The proposed MSSG does not require any off-line trajectory generation and therefore it is flexible enough to target a large variety of point on the planet's surface without the need for calculation of multiple reference trajectories. However, after initial analysis, it has been seen that the performance of MSSG is very sensitive to the choice in guidance gains. MSSG generated trajectories have been compared to an optimal solution to begin an investigation of the relationship between the optimality and performance of MSSG and the selection of the guidance parameters. A full study has been performed to investigate and tune the parameters of MSSG utilizing reinforcement learning in order to truly optimize the performance of the MSSG algorithm. Results show that the MSSG algorithm can indeed generate trajectories that come very close to the optimal solution in terms of fuel usage. A full comparison of the trajectories is included, as well as a further study examining the capability of the MSSG algorithm under perturbed conditions using the optimized set of parameters.

AB - The problem of achieving pinpoint landing accuracy in future space missions to extra-terrestrial bodies such as the Moon or Mars presents many challenges, including the requirements of higher accuracy and more flexibility. These new challenges may require the development of novel and more advanced guidance algorithms. Conventional guidance schemes, which generally require a combination of off-line trajectory generation and real-time, trajectory tracking algorithms, have worked well in the past but may not satisfy the more stringent and difficult landing requirements imposed by future mission architectures to bring landers very near to specified locations. In this paper, a novel non-linear guidance algorithm for planetary landing is proposed and analyzed. Based on Higher-Order Sliding Control (HOSC) theory, the Multiple Sliding Surface Guidance (MSSG) algorithms has been specifically designed to take advantage of the ability of the system to reach the sliding surface in a finite time. The high control activity seen in typical sliding controllers is avoided in this formulation, resulting in a guidance law that is both globally stable and robust against unknown, but bounded perturbations. The proposed MSSG does not require any off-line trajectory generation and therefore it is flexible enough to target a large variety of point on the planet's surface without the need for calculation of multiple reference trajectories. However, after initial analysis, it has been seen that the performance of MSSG is very sensitive to the choice in guidance gains. MSSG generated trajectories have been compared to an optimal solution to begin an investigation of the relationship between the optimality and performance of MSSG and the selection of the guidance parameters. A full study has been performed to investigate and tune the parameters of MSSG utilizing reinforcement learning in order to truly optimize the performance of the MSSG algorithm. Results show that the MSSG algorithm can indeed generate trajectories that come very close to the optimal solution in terms of fuel usage. A full comparison of the trajectories is included, as well as a further study examining the capability of the MSSG algorithm under perturbed conditions using the optimized set of parameters.

UR - http://www.scopus.com/inward/record.url?scp=84898805913&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84898805913&partnerID=8YFLogxK

M3 - Conference contribution

VL - 148

SP - 1881

EP - 1900

BT - Advances in the Astronautical Sciences

PB - Univelt Inc.

ER -