ADAPT: A Model-Free Adaptive Optimal Control for Continuum Robots

Haiyang Fang; Sishen Yuan; Hongliang Ren; Shuping He; Shing Shin Cheng

doi:10.1109/JAS.2025.125183

Volume 13 Issue 1

Jan. 2026

IEEE/CAA Journal of Automatica Sinica

JCR Impact Factor: 19.2, Top 1 (SCI Q1)

CiteScore: 28.2, Top 1% (Q1)
Google Scholar h5-index: 95, Top 5


H. Fang, S. Yuan, H. Ren, S. He, and S. S. Cheng, “ADAPT: A model-free adaptive optimal control for continuum robots,” IEEE/CAA J. Autom. Sinica, vol. 13, no. 1, pp. 205–217, Jan. 2026. doi: 10.1109/JAS.2025.125183


Funds: This work was supported in part by the Innovation and Technology Commission of Hong Kong, China (ITS/136/20, ITS/234/21, MHP/096/22, ITS/235/22), Multi-Scale Medical Robotics Center, InnoHK, China (8312051), Research Grants Council (RGC) of Hong Kong, China (CUHK 14217822, CUHK 14207823, AoE/E-407/24-N), and The Chinese University of Hong Kong (CUHK) Direct Grant.


Author Bio:
Haiyang Fang received the B.Eng. degree in automation from Anhui University, in 2018. He is currently a Ph.D. degree candidate in mechanical and automation engineering at the Department of Mechanical and Automation Engineering, The Chinese University of Hong Kong, China. His current research interests include optimal control, reinforcement learning and their applications on surgical robots

Sishen Yuan received the B.S. degree in mechanical design manufacture and automation from the Harbin Institute of Technology (HIT) at Weihai, in 2018, and the M.S. degree in mechatronic engineering from the Harbin Institute of Technology (HIT) at Shenzhen, in 2021. He is currently a Ph.D. degree candidate in electronic engineering at the Department of Electronic Engineering, The Chinese University of Hong Kong (CUHK), China. His current research interests include magnetic medical robots involving interventional flexible robotics, capsule robotics, and the design and actuation of magnetic origami robots

Hongliang Ren (Senior Member, IEEE) received the Ph.D. degree in electronic engineering (specialized in biomedical engineering) from The Chinese University of Hong Kong, China, in 2008. He has been navigating his academic journey through The Chinese University of Hong Kong, UC Berkeley, Johns Hopkins University, Children’s Hospital Boston, Harvard Medical School, Children’s National Medical Center, USA, and the National University of Singapore, Singapore. He is currently a Full Professor with the Department of Electronic Engineering, The Chinese University of Hong Kong, China. His research interests include robotics, mechatronics, artificial intelligence, actuators and sensors. Dr. Ren serves as an Associate Editor for the IEEE Transactions on Automation Science and Engineering and Medical and Biological Engineering and Computing. He has served as an Active Organizer and Contributor on the committees of numerous robotics conferences, including a variety of roles in the flagship IEEE International Conference on Robotics and Automation, IEEE/RSJ International Conference on Intelligent Robots and Systems, as well as other domain conferences such as ROBIO/BIOROB/ICIA. He is the recipient of IFMBE/IAMBE Early Career Award 2018, Interstellar Early Career Investigator Award 2018, and ICBHI Young Investigator Award 2019

Shuping He (Senior Member, IEEE) received the B.S. degree in automation and the Ph.D. degree in control theory and control engineering from Jiangnan University, in 2005 and 2011, respectively. From 2010 to 2011, he was a Visiting Scholar with the Control Systems Centre, the School of Electrical and Electronic Engineering, The University of Manchester, UK. He is now a Full Professor at Anhui University. His current research interests include stochastic systems control, reinforcement learning, system modeling with applications, signal processing and artificial intelligence methods. He has authored or co-authored more than 100 papers in professional journals, conference proceedings, and technical reports in the above areas and has published three books about stochastic systems. He serves as an Associate Editor, Youth Editor or Guest Editor for Chinese Journal of Intelligent Science and Technology, IEEE/CAA Journal of Automatica Sinica, IEEE Transactions on Emerging Topics in Computing, IET Control Theory and Applications, etc. He was also named a Highly Cited Researcher (Clarivate) and a Most Cited Chinese Researcher (Elsevier) in 2023.

Shing Shin Cheng (Member, IEEE) received the B.S. degree in mechanical engineering from Johns Hopkins University, USA, in 2013, and the Ph.D. degree in robotics from the Georgia Institute of Technology, USA, in 2018. He is currently an Associate Professor with the Department of Mechanical and Automation Engineering, The Chinese University of Hong Kong, China. His research interests include flexible surgical robotics, image-guided surgical systems, and robot modeling and control
Corresponding author: Shing Shin Cheng, e-mail: sscheng@cuhk.edu.hk
Received Date: 2024-11-14
Accepted Date: 2025-01-10

Available Online: 2025-04-25

Abstract

Achieving optimal control performance for continuum robots (CRs) poses significant challenges for traditional model-based optimal control approaches, because their high degrees of freedom, complex nonlinear dynamics, and soft continuum morphologies are difficult to model explicitly. This paper proposes a model-free adaptive optimal control algorithm (ADAPT) for CRs. In our strategy, we treat CRs as a class of nonlinear continuous-time dynamical systems in state space, in which the position of the end-effector is taken as the state and the input torque as the control input. The optimized Hamilton-Jacobi-Bellman (HJB) equation is then derived from optimal control principles and subsequently solved by the proposed ADAPT algorithm without requiring knowledge of the original system dynamics. Under some mild assumptions, the global stability and convergence of the closed-loop control approach are guaranteed. Several simulation experiments are conducted on a magnetic CR (MCR) to demonstrate the practicality and effectiveness of the ADAPT algorithm.

Keywords:
- Adaptive optimal control
- Continuum robots (CRs)
- Hamilton-Jacobi-Bellman (HJB) equation
- Model-free approach
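
For readers unfamiliar with the formulation the abstract refers to, the following is a generic textbook sketch of a continuous-time optimal control problem and its HJB equation. It is not taken from the paper: the control-affine form and the symbols f, g, Q, R, and V* are assumptions introduced only for illustration, with x standing for the end-effector position and u for the input torque as described in the abstract.

```latex
% Illustrative only: assumed control-affine dynamics and quadratic-in-control cost.
% f, g, Q, R, V^* are hypothetical symbols, not the paper's notation.
\begin{align}
\dot{x}(t) &= f(x(t)) + g(x(t))\,u(t), \\
J(x_0, u) &= \int_0^{\infty} \bigl( Q(x(t)) + u(t)^{\top} R\, u(t) \bigr)\, dt,
  \qquad Q(x) \ge 0,\; R = R^{\top} \succ 0, \\
0 &= \min_{u} \Bigl[ Q(x) + u^{\top} R\, u
  + \nabla V^{*}(x)^{\top} \bigl( f(x) + g(x)\,u \bigr) \Bigr], \\
u^{*}(x) &= -\tfrac{1}{2}\, R^{-1} g(x)^{\top} \nabla V^{*}(x).
\end{align}
```

Under these assumptions, the last line follows from setting the gradient of the bracketed term with respect to u to zero. A model-free method in the sense of the abstract would approximate V* and u* from measured trajectories rather than from explicit knowledge of the dynamics.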

