Subject: optimal value function - OPAC collection catalog

Title: Convergence of finite-dimensional approximations for mixed-integer optimization with differential equations
Authors: Hante, Falk M.
Schmidt, Martin
Links: https://bibliotekanauki.pl/articles/1839150.pdf
Publication date: 2019
Publisher: Polska Akademia Nauk. Instytut Badań Systemowych PAN
Subjects: optimization
differential equations
optimal value function
Lipschitz continuity
parametric optimization
mixed integer nonlinear programming
Description: We consider a direct approach to solving mixed-integer nonlinear optimization problems with constraints depending on initial and terminal conditions of an ordinary differential equation. In order to obtain a finite-dimensional problem, the dynamics are approximated using discretization methods. In the framework of general one-step methods, we provide sufficient conditions for the convergence of this approach in the sense of the corresponding optimal values. The results are obtained by considering the discretized problem as a parametric mixed-integer nonlinear optimization problem in finite dimensions, where the step size for discretization of the dynamics is the parameter. In this setting, we prove the continuity of the optimal value function under a stability assumption for the integer feasible set and second-order conditions from nonlinear optimization. We address the necessity of these conditions using the example of pipe sizing problems for gas networks.
Source: Control and Cybernetics; 2019, 48, 2; 209-226
0324-8569
Appears in: Control and Cybernetics
Content provider: Biblioteka Nauki

Article

Title: Online learning algorithm for zero-sum games with integral reinforcement learning
Authors: Vamvoudakis, K. G.
Vrabie, D.
Lewis, F. L.
Links: https://bibliotekanauki.pl/articles/91780.pdf
Publication date: 2011
Publisher: Społeczna Akademia Nauk w Łodzi. Polskie Towarzystwo Sieci Neuronowych
Subjects: learning
online algorithm
zero-sum game
game
infinite horizon
Hamilton-Jacobi-Isaacs equation
approximation network
optimal value function
adaptive control tuning algorithm
Nash solution
Description: In this paper we introduce an online algorithm that uses integral reinforcement knowledge for learning the continuous-time zero-sum game solution for nonlinear systems with infinite horizon costs and partial knowledge of the system dynamics. This algorithm is a data-based approach to the solution of the Hamilton-Jacobi-Isaacs equation, and it does not require explicit knowledge of the system’s drift dynamics. A novel adaptive control algorithm is given that is based on policy iteration and implemented using an actor/disturbance/critic structure having three adaptive approximator structures. All three approximation networks are adapted simultaneously. A persistence of excitation condition is required to guarantee convergence of the critic to the actual optimal value function. Novel adaptive control tuning algorithms are given for the critic, disturbance and actor networks. Convergence to the Nash solution of the game is proven, and stability of the system is also guaranteed. Simulation examples support the theoretical result.
Source: Journal of Artificial Intelligence and Soft Computing Research; 2011, 1, 4; 315-332
2083-2567
2449-6499
Appears in: Journal of Artificial Intelligence and Soft Computing Research
Content provider: Biblioteka Nauki

Article

Title: Time-parametric control: uniform convergence of the optimal value functions of discretized problems
Authors: Gugat, M.
Links: https://bibliotekanauki.pl/articles/206773.pdf
Publication date: 1999
Publisher: Polska Akademia Nauk. Instytut Badań Systemowych PAN
Subjects: sequence
objective function
moment profile
time-minimal control systems
continuity
discretization
Hoelder condition
Lipschitz condition
moment problems
optimal value function
parametric optimization
time-minimal control
uniform convergence
Description: The problem of time-optimal control of linear hyperbolic systems is equivalent to the computation of the root of the optimal value function of a time-parametric program, whose feasible set is described by a countable system of moment equations. To compute this root, discretized problems with a finite number of equality constraints can be used. In this paper, we show that on a certain time interval, the optimal value functions of the discretized problems converge uniformly to the optimal value function of the original problem. We also give sufficient conditions for Lipschitz and Hoelder continuity of the optimal value function of the original problem.
Source: Control and Cybernetics; 1999, 28, 1; 7-33
0324-8569
Appears in: Control and Cybernetics
Content provider: Biblioteka Nauki

Article
