A PD-Type State-Dependent Riccati Equation With Iterative Learning Augmentation for Mechanical Systems

Saeed Rafee Nekoo; José Ángel Acosta; Guillermo Heredia; Anibal Ollero

doi:10.1109/JAS.2022.105533

Volume 9 Issue 8

Aug. 2022

IEEE/CAA Journal of Automatica Sinica

JCR Impact Factor: 15.3, Top 1 (SCI Q1)

CiteScore: 23.5, Top 2% (Q1)
Google Scholar h5-index: 77， TOP 5

Turn off MathJax

Article Contents

Article Navigation > IEEE/CAA Journal of Automatica Sinica > 2022 > 9(8): 1499-1511

S. R. Nekoo, J. Á. Acosta, G. Heredia, and A. Ollero, “A PD-type state-dependent Riccati equation with iterative learning augmentation for mechanical systems,” IEEE/CAA J. Autom. Sinica, vol. 9, no. 8, pp. 1499–1511, Aug. 2022. doi: 10.1109/JAS.2022.105533

Citation:

S. R. Nekoo, J. Á. Acosta, G. Heredia, and A. Ollero, “A PD-type state-dependent Riccati equation with iterative learning augmentation for mechanical systems,” IEEE/CAA J. Autom. Sinica, vol. 9, no. 8, pp. 1499–1511, Aug. 2022. doi: 10.1109/JAS.2022.105533

Citation:

PDF( 2031 KB)

A PD-Type State-Dependent Riccati Equation With Iterative Learning Augmentation for Mechanical Systems

doi: 10.1109/JAS.2022.105533

GRVC Robotics Lab., Depto de Ingeniería de Sistemas y Automática, Escuela Técnica Superior de Ingeniería, Universidad de Sevilla, Seville 41092, Spain

Funds: This work was supported by the European Commission H2020 Programme under HYFLIERS project contract 779411, AERIAL-CORE project contract number 871479 and the ARTIC (RTI2018-102224-B-I00) project, funded by the Spanish Agencia Estatal de Investigación

More Information

Author Bio:
Saeed Rafee Nekoo is a Senior Postdoc Researcher at the GRVC Robotics Lab., Depto de Ingeniería de Sistemas y Automática, Escuela Técnica Superior de Ingeniería, Universidad de Sevilla, Spain. He currently serves as a Researcher in aerial robotics and control engineering and is engaged with HYFLIERS (HYbrid FLying-rollIng with-snakE-aRm robot for contact inSpection) H2020 European Research Council project, focusing on design and development of variable-pitch-rotor drones for inspection: design, prototyping, control implementation, and experimentation. His research interests include robotics, nonlinear and optimal control, control engineering, manufacturing, cooperative robotics, flexible joint manipulators, observer, and estimator design, path planning, wheeled mobile robots, control of autonomous underwater vehicles, free-floating space manipulator design, and control, digital implementation of continuous-time nonlinear controllers, and design, manufacturing and control of mechatronics systems, aerial robotics, multirotor UAV and variable-pitch-rotor quadcopter control

José Ángel Acosta received both the servo-electrical and mechanical engineering degrees at the University of Huelva, Spain, and the electrical engineering degree at the University of Seville, Spain, respectively. He was Marie Curie Control Training Site Fellow as Member of the Centre National de la Recherche Scientifique (CNRS, France) in the Laboratoire des Signaux et Systèmes, France, in 2003 and 2005. He received the Ph.D. degree in 2004 at the Automatic Control and Systems Engineering Department at the University of Seville and the Ph.D. European Award in 2005. He was nominated for the George S. Axelby Outstanding Paper Award in the IEEE Transactions on Automatic Control journal in 2006. In 1999 he joined that department as Research Assistant, where he is currently Professor and Research Member of the Automatic Control and Robotics Institute, and is with the GRVC Robotics Lab., Depto de Ingeniería de Sistemas y Automática, Escuela Técnica Superior de Ingeniería, Universidad de Sevilla, Spain. He has also been a Visitor at the Laboratoire des Signaux & Systèmes (CNRS, France) repeatedly since 2005 and Academic Visitor researching in the Electrical & Electronic Engineering Department as a Member of the Control & Power Group at Imperial College London, UK, in 2008-09-10 and 2011. His research interests are in the field of nonlinear control of dynamical systems with emphasis on electromechanical and robotic systems

Guillermo Heredia (Member, IEEE) is Full Professor at University of Seville (Spain). He was a Visiting Researcher at the Field Robotics Centre, Carnegie Mellon University (USA), and worked for an international automobile manu-facturer (General Motors). He participated as Senior Researcher in 65 R&D projects (EU, NASA and national projects), leading or co-leading 12 of them, including the FP7 ECSAFEMOBIL and the H2020 AEROBI and HYFLIERS projects. He is author or co-author of more than 100 publications on aerial robotics, aerial manipulation, autonomous vehicles and fault detection and reconfiguration

Anibal Ollero (Fellow, IEEE) is Full Professor and Head of GRVC at University Seville, and Scientific Advisor of the Center for Aerospace Technologies (CATEC) also in Seville. He has been Full Professor at the Universities of Santiago and Malaga (Spain) and Researcher at the Robotics Institute of Carnegie Mellon University (USA) and LAAS-CNRS (France). He authored more than 750 publications, including 9 books and 200 papers in journals and has been editor of 15 books. He has delivered plenaries and keynotes in more than 100 events including IEEE ICRA 2016 and IEEE IROS 2018. He has been supervisor or co-supervisor of 45 Ph.D. Thesis that have received many awards. He led more than 160 research projects, participating in more than 25 projects of the European Research Programmes being coordinator of 7 and associated or deputy coordinator of 3, all of them dealing with unmanned aerial systems and aerial robots. From November 2018 he is running the GRIFFIN ERC-Advanced Grant with the objective of developing a new generation of aerial robots that will be able to glide, flapping the wings, perch and manipulate by maintaining the equilibrium, and from December 2019 he is the coordinator of the H2020-AERiAL-CORE project with the participation of 15 universities, research centers and companies dealing with aerial robotic manipulators and applications to inspection and maintenance. He has transferred technologies to more than 20 companies and has been awarded with 25 international research and innovation awards, including the recent Rei Jaume I in New Technologies (Spain), the Challenge 3 of the MBZIRC 2020 International Robotics Competition, the Overall Information and Communication Technologies Innovation Radar Prize 2017 of the European Commission, and has been also elected between the three European innovators of the year being candidate to the European personalities of the year 2017. He is IEEE Fellow “for contributions to the development and deployment of aerial robots”. Currently Co-Chair of the “IEEE Technical Committee on Aerial Robotics and Unmanned Aerial Vehicles”, Coordinator of the “Aerial Robotics Topic Group” of euRobotics and has been Member of the “Board of Directors” of euRobotics until March 2019. He has been also Founder and President of the Spanish Society for the Research and Development in Robotics (SEIDROB) until November 2017
Corresponding author: Saeed Rafee Nekoo, e-mail: saerafee@yahoo.com
Received Date: 2021-10-27
Revised Date: 2021-12-21
Accepted Date: 2022-01-13

Available Online: 2022-03-19

Abstract

Abstract

This work proposes a novel proportional-derivative (PD)-type state-dependent Riccati equation (SDRE) approach with iterative learning control (ILC) augmentation. On the one hand, the PD-type control gains could adopt many useful available criteria and tools of conventional PD controllers. On the other hand, the SDRE adds nonlinear and optimality characteristics to the controller, i.e., increasing the stability margins. These advantages with the ILC correction part deliver a precise control law with the capability of error reduction by learning. The SDRE provides a symmetric-positive-definite distributed nonlinear suboptimal gain K(x) for the control input law u = –R^–1(x)B^T(x)K(x)x. The sub-blocks of the overall gain R^–1(x)B^T(x)K(x), are not necessarily symmetric positive definite. A new design is proposed to transform the optimal gain into two symmetric-positive-definite gains like PD-type controllers as u = –K_SP(x)e–K_SD(x)ė. The new form allows us to analytically prove the stability of the proposed learning-based controller for mechanical systems; and presents guaranteed uniform boundedness in finite-time between learning loops. The symmetric PD-type controller is also developed for the state-dependent differential Riccati equation (SDDRE) to manipulate the final time. The SDDRE expresses a differential equation with a final boundary condition, which imposes a constraint on time that could be used for finite-time control. So, the availability of PD-type finite-time control is an asset for enhancing the conventional classical linear controllers with this tool. The learning rules benefit from the gradient descent method for both regulation and tracking cases. One of the advantages of this approach is a guaranteed-stability even from the first loop of learning. A mechanical manipulator, as an illustrative example, was simulated for both regulation and tracking problems. Successful experimental validation was done to show the capability of the system in practice by the implementation of the proposed method on a variable-pitch rotor benchmark.
- Closed-loop,
- iterative learning control (ILC),
- PD-type,
- SDRE,
- SDDRE,
- symmetric

FullText(HTML)

References(50)

References

[1]	T. Cimen, “Survey of state-dependent Riccati equation in nonlinear optimal feedback control synthesis,” Journal of Guidance,Control,and Dynamics, vol. 35, pp. 1025–1047, 2012. doi: 10.2514/1.55821
[2]	T. Cimen and S. P. Banks, “Global optimal feedback control for general nonlinear systems with nonquadratic performance criteria,” Systems &Control Letters, vol. 53, pp. 327–346, 2004.
[3]	L. Felicetti and G. B. Palmerini, “A comparison among classical and SDRE techniques in formation flying orbital control,” in Proc. IEEE Aerospace Conf., Big Sky, Montana, 2013, pp. 1–12.
[4]	A. Heydari and S. N. Balakrishnan, “Fixed-final-time optimal tracking control of input-affine nonlinear systems,” Neurocomputing, vol. 129, pp. 528–539, 2014. doi: 10.1016/j.neucom.2013.09.006
[5]	A. Hamdache, S. Saadi, and I. Elmouki, “Nominal and neighboring-optimal control approaches to the adoptive immunotherapy for cancer,” Int. Journal of Dynamics and Control, vol. 4, pp. 346–361, 2016. doi: 10.1007/s40435-015-0205-y
[6]	S.-R. Oh, Z. Bien, and I. H. Suh, “An iterative learning control method with application to robot manipulators,” IEEE Journal on Robotics and Automation, vol. 4, pp. 508–514, 1988. doi: 10.1109/56.20435
[7]	H.-S. Ahn, Y.-Q. Chen, and K. L. Moore, “Iterative learning control: Brief survey and categorization,” IEEE Trans. Systems,Man,and Cybernetics,Part C (Applications and Reviews), vol. 37, pp. 1099–1121, 2007.
[8]	D. A. Bristow, M. Tharayil, and A. G. Alleyne, “A survey of iterative learning control,” IEEE Control Systems Magazine, vol. 26, pp. 96–114, 2006. doi: 10.1109/MCS.2006.1636313
[9]	Q. Zhu, F. Song, J.-X. Xu, and Y. Liu, “An internal model based iterative learning control for wafer scanner systems,” IEEE/ASME Trans. Mechatronics, vol. 24, pp. 2073–2084, 2019. doi: 10.1109/TMECH.2019.2929565
[10]	D. Shen, “Iterative learning control with incomplete information: A survey,” IEEE/CAA J. Autom. Sinica, vol. 5, pp. 885–901, 2018. doi: 10.1109/JAS.2018.7511123
[11]	J. Zhang, B. Cui, X. Dai, and Z. Jiang, “Iterative learning control for distributed parameter systems based on non-collocated sensors and actuators,” IEEE/CAA J. Autom. Sinica, vol. 7, pp. 865–871, 2019.
[12]	J. Wei, Y. Zhang, and H. Bao, “An exploration on adaptive iterative learning control for a class of commensurate high-order uncertain nonlinear fractional order systems,” IEEE/CAA J. Autom. Sinica, vol. 5, pp. 618–627, 2017.
[13]	W. He, T. Meng, X. He, and C. Sun, “Iterative learning control for a flapping wing micro aerial vehicle under distributed disturbances,” IEEE Trans. Cybernetics, vol. 49, pp. 1524–1535, 2018.
[14]	Y. Jian, D. Huang, J. Liu, and D. Min, “High-precision tracking of piezoelectric actuator using iterative learning control and direct inverse compensation of hysteresis,” IEEE Trans. Industrial Electronics, vol. 66, pp. 368–377, 2018.
[15]	D. Shen and J.-X. Xu, “Adaptive learning control for nonlinear systems with randomly varying iteration lengths,” IEEE Trans. Neural Networks and Learning Systems, vol. 30, pp. 1119–1132, 2018.
[16]	L. Roveda, G. Pallucca, N. Pedrocchi, F. Braghin, and L. M. Tosatti, “Iterative learning procedure with reinforcement for high-accuracy force tracking in robotized tasks,” IEEE Trans. Industrial Informatics, vol. 14, pp. 1753–1763, 2018. doi: 10.1109/TII.2017.2748236
[17]	S.-K. Oh, B. J. Park, and J. M. Lee, “Point-to-point iterative learning model predictive control,” Automatica, vol. 89, pp. 135–143, 2018. doi: 10.1016/j.automatica.2017.11.010
[18]	Y. Pan and H. Yu, “Composite learning robot control with guaranteed parameter convergence,” Automatica, vol. 89, pp. 398–406, 2018. doi: 10.1016/j.automatica.2017.11.032
[19]	A. Schöllig and R. D’Andrea, “Optimization-based iterative learning control for trajectory tracking,” in Proc. European Control Conf., Budapest, Hungary, 2009, pp. 1505–1510.
[20]	M. Ohnishi, L. Wang, G. Notomista, and M. Egerstedt, “Barrier-certified adaptive reinforcement learning with applications to brushbot navigation,” IEEE Trans. Robotics, vol. 35, pp. 1186–1205, 2019. doi: 10.1109/TRO.2019.2920206
[21]	A. Loquercio, E. Kaufmann, R. Ranftl, A. Dosovitskiy, V. Koltun, and D. Scaramuzza, “Deep drone racing: From simulation to reality with domain randomization,” IEEE Trans. Robotics, vol. 36, pp. 1–14, 2020. doi: 10.1109/TRO.2019.2942989
[22]	H. Pan, X. Niu, R.-C. Li, Y. Dou, and H. Jiang, “Annealed gradient descent for deep learning,” Neurocomputing, vol. 380, pp. 201–211, 2020. doi: 10.1016/j.neucom.2019.11.021
[23]	T.-Y. Kuc, K. Nam, and J. S. Lee, “An iterative learning control of robot manipulators,” IEEE Trans. Robotics and Automation, vol. 7, pp. 835–842, 1991. doi: 10.1109/70.105392
[24]	M. Sun and D. Wang, “Closed-loop iterative learning control for non-linear systems with initial shifts,” Int. Journal of Adaptive Control and Signal Processing, vol. 16, pp. 515–538, 2002. doi: 10.1002/acs.707
[25]	C.-W. Chen, S. Rai, and T.-C. Tsao, “Iterative learning of dynamic inverse filters for feedforward tracking control,” IEEE/ASME Trans. Mechatronics, vol. 25, pp. 349–359, 2019.
[26]	A. P. Schoellig, F. L. Mueller, and R. D’Andrea, “Optimization-based iterative learning for precise quadrocopter trajectory tracking,” Autonomous Robots, vol. 33, pp. 103–127, 2012. doi: 10.1007/s10514-012-9283-2
[27]	D. Meng and J. Zhang, “Robust optimization-based iterative learning control for nonlinear systems with nonrepetitive uncertainties,” IEEE/CAA J. Autom. Sinica, vol. 8, pp. 1001–1014, 2021. doi: 10.1109/JAS.2021.1003973
[28]	D. Meng and J. Zhang, “Design and analysis of data-driven learning control: An optimization-based approach,” IEEE Trans. Neural Networks and Learning Systems, 2021. DOI: 10.1109/TNNLS.2021.3070920
[29]	D. Meng and J. Zhang, “Robust tracking of nonrepetitive learning control systems with iteration-dependent references,” IEEE Trans. Systems,Man,and Cybernetics: Systems, 2018. DOI: 10.1109/TSMC.2018.2883383
[30]	R. Kelly and R. Carelli, “A class of nonlinear PD-type controllers for robot manipulators,” Journal of Robotic Systems, vol. 13, pp. 793–802, 1996. doi: 10.1002/(SICI)1097-4563(199612)13:12<793::AID-ROB2>3.0.CO;2-Q
[31]	J. Alvarez-Ramirez, R. Kelly, and I. Cervantes, “Semiglobal stability of saturated linear PID control for robot manipulators,” Automatica, vol. 39, pp. 989–995, 2003. doi: 10.1016/S0005-1098(03)00035-9
[32]	S. R. Nekoo, J. Á. Acosta, G. Heredia, and A. Ollero, “A benchmark mechatronics platform to assess the inspection around pipes with variable pitch quadrotor for industrial sites,” Mechatronics, vol. 79, p. 102641, 2021.
[33]	M. H. Korayem and S. R. Nekoo, “Finite-time state-dependent Riccati equation for time-varying nonaffine systems: Rigid and flexible joint manipulator control,” ISA Transactions, vol. 54, pp. 125–144, 2015. doi: 10.1016/j.isatra.2014.06.006
[34]	T. Cimen, “State-dependent Riccati equation (SDRE) control: A survey,” IFAC Proceedings Volumes, vol. 41, pp. 3761–3775, Jul. 2008. doi: 10.3182/20080706-5-KR-1001.00635
[35]	S. R. Nekoo, “Tutorial and review on the state-dependent Riccati equation,” Journal of Applied Nonlinear Dynamics, vol. 8, pp. 109–166, 2019. doi: 10.5890/JAND.2019.06.001
[36]	Y. Batmani, M. Davoodi, and N. Meskin, “On design of suboptimal tracking controller for a class of nonlinear systems,” in Proc. American Control Conf., Boston, MA, USA, 2016, pp. 1094–1098.
[37]	A. Ghaffari, M. Nazari, and F. Arab, “Suboptimal mixed vaccine and chemotherapy in finite duration cancer treatment: State-dependent Riccati equation control,” Journal of the Brazilian Society of Mechanical Sciences and Engineering, vol. 37, pp. 45–56, 2015. doi: 10.1007/s40430-014-0172-9
[38]	A. Wernli and G. Cook, “Suboptimal control for the nonlinear quadratic regulator problem,” Automatica, vol. 11, pp. 75–84, 1975. doi: 10.1016/0005-1098(75)90010-2
[39]	S. R. Nekoo, J. Á. Acosta, and A. Ollero, “Gravity compensation and optimal control of actuated multibody system dynamics,” IET Control Theory &Applications, vol. 16, pp. 79–93, 2021.
[40]	M. H. Korayem, A. Nikoobin, and V. Azimirad, “Maximum load carrying capacity of mobile manipulators: Optimal control approach,” Robotica, vol. 27, pp. 147–159, 2009. doi: 10.1017/S0263574708004578
[41]	D. E. Kirk, Optimal Control Theory: An Introduction: Courier Corporation, 2012.
[42]	T. Çimen and S. P. Banks, “Nonlinear optimal tracking control with application to super-tankers for autopilot design,” Automatica, vol. 40, pp. 1845–1863, 2004. doi: 10.1016/j.automatica.2004.05.015
[43]	A. Prach, O. Tekinalp, and D. Bernstein, “Nonlinear aircraft flight control using the forward propagating Riccati equation,” in Proc. AIAA Guidance, Navigation, and Control Conf., San Diego, California, USA, 2016, pp. 1383–1396.
[44]	A. Khamis, H. M. Nguyen, and D. S. Naidu, “Nonlinear, optimal control of wind energy conversion systems using differential SDRE,” in Resilience Week, Philadelphia, PA, 2015, pp. 1–6.
[45]	B. Geranmehr, E. Khanmirza, and S. Kazemi, “Trajectory control of aggressive maneuver by agile autonomous helicopter,” Proc. the Institution of Mechanical Engineers, Part G: Journal of Aerospace Engineering, p. 0954410018755807, 2018.
[46]	S. R. Nekoo and B. Geranmehr, “Nonlinear observer-based optimal control using the state-dependent Riccati equation for a class of non-affine control systems,” Journal of Control Engineering and Applied Informatics, vol. 16, pp. 5–13, 2014.
[47]	S. R. Nekoo, “Nonlinear closed loop optimal control: A modified state-dependent Riccati equation,” ISA Transactions, vol. 52, pp. 285–290, 2013. doi: 10.1016/j.isatra.2012.10.005
[48]	S. R. Nekoo, “Digital implementation of a continuous-time nonlinear optimal controller: An experimental study with real-time computations,” ISA Transactions, vol. 101, pp. 346–357, 2020. doi: 10.1016/j.isatra.2020.01.020
[49]	M. Xin, S. N. Balakrishnan, and Z. Huang, “Robust state dependent Riccati equation based robot manipulator control,” in Proc. IEEE Int. Conf. Control Applications, Mexico City, Mexico, 2001, pp. 369–374.
[50]	[Online]. Available: https://www.oulu.fi/hyfliers/.

Supplements(0)

Cited By

Proportional views

Proportional views

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Figures(11) / Tables(3)

Get Citation

PDF

XML

Article Metrics

Article views (988) PDF downloads(136)

Highlights

A nonlinear finite-time PD-like controller is presented based on SDDRE augmented with ILC
A convex objective function is introduced for regulation training rule of gradient descent
Uniform boundedness in finite time is guaranteed, suitable for unstable mechanical systems
VP propeller pendulum is controlled experimentally with SDRE + ILC approach

A PD-Type State-Dependent Riccati Equation With Iterative Learning Augmentation for Mechanical Systems

doi: 10.1109/JAS.2022.105533

Abstract

References

Proportional views

Catalog

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Highlights

Export File

Citation

Format

Content