Learning Failure Prevention Skills for Safe Robot Manipulation
2023 (English)In: IEEE Robotics and Automation Letters, E-ISSN 2377-3766, Vol. 8, no 12, p. 7994-8001Article in journal (Refereed) Published
Abstract [en]
Robots are more capable of achieving manipulation tasks for everyday activities than before. However, the safety of manipulation skills that robots employ is still an open problem. Considering all possible failures during skill learning increases the complexity of the process and restrains learning an optimal policy. Nonetheless, safety-focused modularity in the acquisition of skills has not been adequately addressed in previous works. For that purpose, we reformulate skills as base and failure prevention skills, where base skills aim at completing tasks and failure prevention skills aim at reducing the risk of failures to occur. Then, we propose a modular and hierarchical method for safe robot manipulation by augmenting base skills by learning failure prevention skills with reinforcement learning and forming a skill library to address different safety risks. Furthermore, a skill selection policy that considers estimated risks is used for the robot to select the best control policy for safe manipulation. Our experiments show that the proposed method achieves the given goal while ensuring safety by preventing failures. We also show that with the proposed method, skill learning is feasible and our safe manipulation tools can be transferred to the real environment © 2023 IEEE
Place, publisher, year, edition, pages
Piscataway, NJ: IEEE, 2023. Vol. 8, no 12, p. 7994-8001
Keywords [en]
Estimation, Failure Prevention, Libraries, Mathematical models, Reinforcement Learning, Reinforcement learning, Robot Safety, Robots, Robust/Adaptive Control, Safe Robot Manipulation, Safety, Task analysis
National Category
Control Engineering
Identifiers
URN: urn:nbn:se:hh:diva-51939DOI: 10.1109/LRA.2023.3324587ISI: 001089241600003Scopus ID: 2-s2.0-85174857924OAI: oai:DiVA.org:hh-51939DiVA, id: diva2:1812638
Note
This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) under Grant 119E-436.
2023-11-162023-11-162024-01-17Bibliographically approved