Optimal stationary policies inrisk-sensitive dynamic programs with finite state spaceand nonnegative rewards

Szczegóły
Opis

Tytuł:: Optimal stationary policies inrisk-sensitive dynamic programs with finite state spaceand nonnegative rewards
Autorzy:: Cavazos-Cadena, Rolando
Montes-de-Oca, Raúl
Powiązania:: https://bibliotekanauki.pl/articles/1208177.pdf
Data publikacji:: 2000
Wydawca:: Polska Akademia Nauk. Instytut Matematyczny PAN
Tematy:: unichain property
Markov decision processes
risk-sensitive optimality equation
risk-sensitive expected total- reward criterion
Źródło:: Applicationes Mathematicae; 2000, 27, 2; 167-185
1233-7234
Język:: angielski
Prawa:: Wszystkie prawa zastrzeżone. Swoboda użytkownika ograniczona do ustawowego zakresu dozwolonego użytku
Dostawca treści:: Biblioteka Nauki
: Artykuł

Przejdź do źródła

This work concerns controlled Markov chains with finite state space and nonnegative rewards; it is assumed that the controller has a constant risk-sensitivity, and that the performance ofa control policy is measured by a risk-sensitive expected total-reward criterion. The existence of optimal stationary policies isstudied within this context, and the main resultestablishes the optimalityof a stationary policy achieving the supremum in the correspondingoptimality equation, whenever the associated Markov chain hasa unique positive recurrent class. Two explicit examples are providedto show that, if such an additional condition fails, an optimal stationarypolicy cannot be generally guaranteed. The results of this note, which consider both the risk-seeking and the risk-averse cases, answer an extended version of a question recently posed in Puterman (1994).

Informacja

Powiązane pozycje