A stochastic approximation for parameterized Markov decision processes
-
Abstract
A stochastic gradient algorithm is proposed for average-reward Markov decision processes (MDPs) that depend on a parameter vector. A new expression for the gradient of the objective function is derived, and a stochastic approximation algorithm based on a single sample path is presented. Finally, convergence of the gradient estimates with probability 1 is established.
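To illustrate the kind of method the abstract describes, the following is a minimal, hedged sketch of single-sample-path stochastic gradient ascent on the average reward of a parameterized MDP. It is not the paper's exact algorithm: the toy two-state MDP, the softmax policy parameterization, and all constants (`alpha`, `beta`, `kappa`, step counts) are illustrative assumptions, in the spirit of likelihood-ratio (score-function) policy-gradient methods with an eligibility trace and a running average-reward estimate.

```python
import numpy as np

# Illustrative sketch only: a single-sample-path stochastic gradient
# method for an average-reward MDP with a softmax policy parameterized
# by theta. The MDP below is a hypothetical toy example, not from the paper.

rng = np.random.default_rng(0)

N_STATES, N_ACTIONS = 2, 2

def step(s, a):
    """Toy MDP: action a moves to state a with prob 0.9; reward 1 in state 1."""
    s_next = a if rng.random() < 0.9 else 1 - a
    return s_next, float(s == 1)

def policy(theta, s):
    """Softmax policy over actions in state s."""
    z = theta[s] - theta[s].max()
    p = np.exp(z)
    return p / p.sum()

def train(steps=50_000, alpha=0.01, beta=0.9, kappa=0.01):
    theta = np.zeros((N_STATES, N_ACTIONS))
    rho = 0.0                      # running estimate of the average reward
    trace = np.zeros_like(theta)   # eligibility trace of score functions
    s = 0
    for _ in range(steps):
        p = policy(theta, s)
        a = rng.choice(N_ACTIONS, p=p)
        s_next, r = step(s, a)
        grad_log = -p              # gradient of log softmax w.r.t. theta[s]
        grad_log[a] += 1.0
        trace *= beta              # geometrically discounted trace
        trace[s] += grad_log
        theta += alpha * (r - rho) * trace   # stochastic gradient ascent step
        rho += kappa * (r - rho)             # track the average reward
        s = s_next
    return theta, rho

theta, rho = train()
```

On this toy problem, action 1 steers the chain toward the rewarding state, so after training the learned policy should prefer action 1 in both states and `rho` should approach the optimal average reward (about 0.9 here).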