Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/93639
Citations
Scopus Web of Science® Altmetric
?
?
Full metadata record
DC FieldValueLanguage
dc.contributor.authorTang, D.-
dc.contributor.authorChen, L.-
dc.contributor.authorTian, Z.-
dc.date.issued2015-
dc.identifier.citationProceedings of the 2015 IEEE China Summit & International Conference on Signal and Information processing, 2015, pp.792-796-
dc.identifier.isbn9781479919475-
dc.identifier.urihttp://hdl.handle.net/2440/93639-
dc.descriptionIEEE Catalog Number: CFP15SIP-USB-
dc.description.abstractA new policy-iteration algorithm based on neural networks (NNs) is proposed in this paper to synthesize optimal control laws online for continuous-time nonlinear systems. Latest advances in this field have enabled synchronous policy iteration but require an additional tuning loop or a logic switch mechanism to maintain system stability. A new algorithm is thus derived in this paper to address this limitation. The optimal control law is found by solving the Hamilton-Jacobi- Bellman (HJB) equation for the associated value function via synchronous policy iteration in a critic-actor configuration. As a major contribution, a new form of NN approximation for the value function is proposed, offering the closed-loop system asymptotic stability without additional tuning scheme or logic switch mechanism. As a second contribution, an extended Kalman filter is introduced to estimate the critic NN parameters for fast convergence. The efficacy of the new algorithm is verified by simulations.-
dc.description.statementofresponsibilityDifan Tang, Lei Chen, and Zhao Feng Tian-
dc.language.isoen-
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)-
dc.rights© 2015 IEEE-
dc.source.urihttp://dx.doi.org/10.1109/chinasip.2015.7230513-
dc.subjectmachine learning; neural network; policy iteration; optimal control; nonlinear system-
dc.titleNeural-network based online policy iteration for continuous-time infinite-horizon optimal control of nonlinear systems-
dc.typeConference paper-
dc.contributor.conference3rd IEEE China Summit & International Conference on Signal and Information processing (ChinaSIP 2015) (12 Jul 2015 - 15 Jul 2015 : Chengdu, China)-
dc.identifier.doi10.1109/ChinaSIP.2015.7230513-
pubs.publication-statusPublished-
dc.identifier.orcidTang, D. [0000-0002-7143-0441]-
dc.identifier.orcidChen, L. [0000-0002-2269-2912]-
dc.identifier.orcidTian, Z. [0000-0001-9847-6004]-
Appears in Collections:Aurora harvest 7
Mechanical Engineering publications

Files in This Item:
File Description SizeFormat 
hdl_93639.pdfAccepted version457.16 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.