Online Reinforcement Learning Control for the Personalization of a Robotic Knee Prosthesis

IEEE Trans Cybern. 2020 Jun;50(6):2346-2356. doi: 10.1109/TCYB.2019.2890974. Epub 2019 Jan 16.

Abstract

Robotic prostheses deliver greater function than passive prostheses, but we face the challenge of tuning a large number of control parameters in order to personalize the device for individual amputee users. This problem is not easily solved by traditional control designs or the latest robotic technology. Reinforcement learning (RL) is naturally appealing. The recent, unprecedented success of AlphaZero demonstrated RL as a feasible, large-scale problem solver. However, the prosthesis-tuning problem is associated with several unaddressed issues such as that it does not have a known and stable model, the continuous states and controls of the problem may result in a curse of dimensionality, and the human-prosthesis system is constantly subject to measurement noise, environmental change and human-body-caused variations. In this paper, we demonstrated the feasibility of direct heuristic dynamic programming, an approximate dynamic programming (ADP) approach, to automatically tune the 12 robotic knee prosthesis parameters to meet individual human users' needs. We tested the ADP-tuner on two subjects (one able-bodied subject and one amputee subject) walking at a fixed speed on a treadmill. The ADP-tuner learned to reach target gait kinematics in an average of 300 gait cycles or 10 min of walking. We observed improved ADP tuning performance when we transferred a previously learned ADP controller to a new learning session with the same subject. To the best of our knowledge, our approach to personalize robotic prostheses is the first implementation of online ADP learning control to a clinical problem involving human subjects.

MeSH terms

  • Adult
  • Algorithms
  • Amputees / rehabilitation
  • Biomechanical Phenomena / physiology
  • Exoskeleton Device*
  • Gait / physiology
  • Humans
  • Knee Prosthesis*
  • Male
  • Reinforcement, Psychology*
  • Signal Processing, Computer-Assisted
  • Young Adult