Computational comparison of value iteration algorithms for discounted Markov decision processes
Thomas, L. C.
This note describes the results of a computational comparison of value iteration algorithms suggested for solving finite-state discounted Markov decision processes. Such a process visits a set of states S = {1, 2, ..., M}. Section two describes the schemes examined and the various bounds that can be used for stopping them. Section three concentrates on one scheme that did well in the comparison, ordinary value iteration, and looks at various methods for eliminating non-optimal actions, both permanently and temporarily.
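As a rough illustration of what "ordinary value iteration" with a bound-based stopping rule can look like, the sketch below uses the well-known MacQueen-type bounds on the optimal value function for a reward-maximizing process. It is not taken from the report: the function name `value_iteration`, the data layout `P[a][i][j]` and `r[a][i]`, the discount factor `beta`, and the tolerance `eps` are all assumptions made for this example, and the report compares several schemes and bounds beyond this one.

```python
def value_iteration(P, r, beta, eps=1e-6, max_iter=10_000):
    """Ordinary value iteration for a finite discounted MDP (maximizing reward).

    Assumed layout (hypothetical, for illustration only):
      P[a][i][j] : probability of moving from state i to state j under action a
      r[a][i]    : immediate reward for taking action a in state i
      beta       : discount factor, 0 < beta < 1
    Stops when the gap between the MacQueen-type upper and lower bounds
    on the optimal value function falls below eps.
    """
    n_actions = len(P)
    n_states = len(P[0])
    v = [0.0] * n_states

    for _ in range(max_iter):
        new_v, policy = [], []
        for i in range(n_states):
            # One-step lookahead value of each action in state i.
            q = [
                r[a][i] + beta * sum(P[a][i][j] * v[j] for j in range(n_states))
                for a in range(n_actions)
            ]
            best = max(range(n_actions), key=lambda a: q[a])
            new_v.append(q[best])
            policy.append(best)

        # Bounds: new_v + lo <= v* <= new_v + hi, componentwise.
        diffs = [nv - ov for nv, ov in zip(new_v, v)]
        lo = beta / (1.0 - beta) * min(diffs)
        hi = beta / (1.0 - beta) * max(diffs)
        v = new_v

        if hi - lo < eps:
            # The midpoint of the bounds is within eps / 2 of v*.
            shift = 0.5 * (lo + hi)
            return [x + shift for x in v], policy

    raise RuntimeError("bounds did not close within max_iter iterations")


# Toy two-state example: action 0 stays put (reward 1 in state 0, 2 in state 1),
# action 1 moves to the other state with no reward.
P = [
    [[1.0, 0.0], [0.0, 1.0]],  # action 0: stay
    [[0.0, 1.0], [1.0, 0.0]],  # action 1: switch
]
r = [
    [1.0, 2.0],  # action 0
    [0.0, 0.0],  # action 1
]
values, policy = value_iteration(P, r, beta=0.9)
```

In this toy problem the optimal values can be worked out by hand: staying in state 1 forever is worth 2 / (1 - 0.9) = 20, and moving from state 0 to state 1 is worth 0.9 × 20 = 18, which beats the value 10 of staying in state 0. The bounds close once the successive differences become equal across states, which can happen well before the iterates themselves have converged.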
NPS Report Number: NPS55-82-024