{"id":14603,"date":"2022-05-10T22:29:10","date_gmt":"2022-05-11T02:29:10","guid":{"rendered":"https:\/\/carleton.ca\/scs\/?page_id=14603"},"modified":"2026-06-02T14:59:22","modified_gmt":"2026-06-02T18:59:22","slug":"tr-24-learning-automata-possessing-ergodicity-of-the-mean-the-two-action-case","status":"publish","type":"page","link":"https:\/\/carleton.ca\/scs\/research\/scs-technical-reports\/technical-reports-1983\/tr-24-learning-automata-possessing-ergodicity-of-the-mean-the-two-action-case\/","title":{"rendered":"TR-24: Learning Automata Possessing Ergodicity of the Mean : The Two Action Case"},"content":{"rendered":"\n<section class=\"w-screen px-6 cu-section cu-section--white ml-offset-center md:px-8 lg:px-14\">\n    <div class=\"space-y-6 cu-max-w-child-5xl  md:space-y-10 cu-prose-first-last\">\n\n            <div class=\"cu-textmedia flex flex-col lg:flex-row mx-auto gap-6 md:gap-10 my-6 md:my-12 first:mt-0 max-w-5xl\">\n        <div class=\"justify-start cu-textmedia-content cu-prose-first-last\" style=\"flex: 0 0 100%;\">\n            <header class=\"font-light prose-xl cu-pageheader md:prose-2xl cu-component-updated cu-prose-first-last\">\n                                    <h1 class=\"cu-prose-first-last font-semibold !mt-2 mb-4 md:mb-6 relative after:absolute after:h-px after:bottom-0 after:bg-cu-red after:left-px text-3xl md:text-4xl lg:text-5xl lg:leading-[3.5rem] pb-5 after:w-10 text-cu-black-700 not-prose\">\n                        TR-24: Learning Automata Possessing Ergodicity of the Mean : The Two Action Case\n                    <\/h1>\n                \n                                \n                            <\/header>\n\n                    <\/div>\n\n            <\/div>\n\n    <\/div>\n<\/section>\n\n<p>Carleton University<br>\n<a href=\"https:\/\/carleton.ca\/scs\/research\/scs-technical-reports\/technical-reports-1983\/\">Technical Report<\/a> <strong>TR-24<\/strong><br>\nMay 1983<\/p>\n\n\n\n<h2 
id=\"learning-automata-possessing-ergodicity-of-the-mean-the-two-action-case\" class=\"wp-block-heading tr_t1\">Learning Automata Possessing Ergodicity of the Mean : The Two Action Case<\/h2>\n\n\n\n<p>M.A.L. Thathachar &amp; B.J. Oommen<\/p>\n\n\n\n<h3 id=\"abstract\" class=\"wp-block-heading\">Abstract<\/h3>\n\n\n\n<p>Learning automata which update their action probabilities on the basis of the responses they get from an environment are considered in this paper. The automata update the probabilities whether the environment responds with a reward or a penalty. An automaton is said to possess Ergodicity of the Mean (EM) if the mean action probability is the total state probability of an ergodic Markov chain. The only known algorithm which is Ergodic in the Mean (EM) is the Linear Reward-Penalty (LRp) scheme. For the 2-action case, necessary and sufficient conditions have been derived for nonlinear updating schemes to be Ergodic in the Mean (EM). The method of controlling the rate of convergence of this scheme has been presented. In particular, a generalized linear algorithm has been proposed which is superior to the Linear Reward-Penalty (LRp) scheme. The expression for the variance of the limiting action probabilities of this scheme has been derived. The technique of designing the optimal linear automaton in this family has also been considered. Methods to decrease the variance for the general nonlinear scheme have been discussed. It has been shown that the set of absolutely expedient schemes and the set of schemes which possess ergodicity of the mean are mutually disjoint.<\/p>\n\n\n\n<h3 id=\"download\" class=\"wp-block-heading\">Download<\/h3>\n\n\n\n<p><a href=\"https:\/\/carleton.ca\/scs\/wp-content\/uploads\/sites\/260\/TR-24.pdf\">TR-24.pdf<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Carleton University Technical Report TR-24 May 1983 Learning Automata Possessing Ergodicity of the Mean : The Two Action Case M.A.L. Thathachar &amp; B.J. 
Oommen Abstract Learning automata which update their action probabilities on the basis of the responses they get from an environment are considered in this paper. The automata update the probabilities whether the [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"parent":11785,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_acf_changed":false,"_cu_dining_location_slug":"","footnotes":"","_links_to":"","_links_to_target":""},"cu_page_type":[88],"class_list":["post-14603","page","type-page","status-publish","hentry","cu_page_type-technical-report"],"acf":{"cu_post_thumbnail":false},"_links":{"self":[{"href":"https:\/\/carleton.ca\/scs\/wp-json\/wp\/v2\/pages\/14603","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/carleton.ca\/scs\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/carleton.ca\/scs\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/carleton.ca\/scs\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/carleton.ca\/scs\/wp-json\/wp\/v2\/comments?post=14603"}],"version-history":[{"count":1,"href":"https:\/\/carleton.ca\/scs\/wp-json\/wp\/v2\/pages\/14603\/revisions"}],"predecessor-version":[{"id":14604,"href":"https:\/\/carleton.ca\/scs\/wp-json\/wp\/v2\/pages\/14603\/revisions\/14604"}],"up":[{"embeddable":true,"href":"https:\/\/carleton.ca\/scs\/wp-json\/wp\/v2\/pages\/11785"}],"wp:attachment":[{"href":"https:\/\/carleton.ca\/scs\/wp-json\/wp\/v2\/media?parent=14603"}],"wp:term":[{"taxonomy":"cu_page_type","embeddable":true,"href":"https:\/\/carleton.ca\/scs\/wp-json\/wp\/v2\/cu_page_type?post=14603"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}