{"id":1982,"date":"2019-08-07T11:02:28","date_gmt":"2019-08-07T15:02:28","guid":{"rendered":"https:\/\/carleton.ca\/sce\/?p=1982"},"modified":"2026-01-23T11:20:10","modified_gmt":"2026-01-23T16:20:10","slug":"seminar-a-reinforcement-learning-algorithm-for-coordination-in-stochastic-games","status":"publish","type":"post","link":"https:\/\/carleton.ca\/sce\/2019\/seminar-a-reinforcement-learning-algorithm-for-coordination-in-stochastic-games\/","title":{"rendered":"Seminar: A REINFORCEMENT LEARNING ALGORITHM FOR COORDINATION IN STOCHASTIC GAMES"},"content":{"rendered":"\n<section class=\"w-screen px-6 cu-section cu-section--white ml-offset-center md:px-8 lg:px-14\">\n    <div class=\"space-y-6 cu-max-w-child-5xl  md:space-y-10 cu-prose-first-last\">\n\n            <div class=\"cu-textmedia flex flex-col lg:flex-row mx-auto gap-6 md:gap-10 my-6 md:my-12 first:mt-0 max-w-5xl\">\n        <div class=\"justify-start cu-textmedia-content cu-prose-first-last\" style=\"flex: 0 0 100%;\">\n            <header class=\"font-light prose-xl cu-pageheader md:prose-2xl cu-component-updated cu-prose-first-last\">\n                                    <h1 class=\"cu-prose-first-last font-semibold !mt-2 mb-4 md:mb-6 relative after:absolute after:h-px after:bottom-0 after:bg-cu-red after:left-px text-3xl md:text-4xl lg:text-5xl lg:leading-[3.5rem] pb-5 after:w-10 text-cu-black-700 not-prose\">\n                        Seminar: A REINFORCEMENT LEARNING ALGORITHM FOR COORDINATION IN STOCHASTIC GAMES\n                    <\/h1>\n                \n                                \n                            <\/header>\n\n                    <\/div>\n\n            <\/div>\n\n    <\/div>\n<\/section>\n\n<p><strong>CARLETON WIRELESS SEMINAR SERIES<\/strong><\/p>\n\n\n\n<p><strong>Time:<\/strong> Tuesday, 13 August 2019, 2:00-3:00 pm<br>\n<strong>Place:<\/strong> Carleton University, Systems and Computer Engineering<br>\nThe Maker Lab, 4463 Mackenzie Building, <a href=\"https:\/\/carleton.ca\/campus\/map\">map.<\/a><\/p>\n\n\n\n<p><strong>Title:<\/strong> A REINFORCEMENT LEARNING ALGORITHM FOR COORDINATION IN STOCHASTIC GAMES<\/p>\n\n\n\n<p><strong>Speaker:<\/strong> Bora Yongacoglu<br>\nPhD Candidate, Department of Mathematics and Statistics, Queen&#8217;s University<br>\n(Supervisor: Professor Serdar Yuksel)<\/p>\n\n\n\n<p><strong>* ABSTRACT:<\/strong> Stochastic games provide a useful model for the decentralized control of a stochastic system. We study a class of games called common interest games, for which there exist globally optimal joint policies that minimize the long-run cost incurred by all agents. Despite the incentive to coordinate behaviour, achieving optimality in such a setting can be difficult because of communication constraints and lack of system knowledge. Existing methods either rely heavily on communication or otherwise fail to guarantee convergence to optimal policies. In this talk, we present a reinforcement learning algorithm for playing a common interest game that requires no control-sharing and comes with provable convergence guarantees.<\/p>\n\n\n\n<p><strong>* BIO:<\/strong> Bora Yongacoglu received his B.A. from McGill University, with majors in mathematics and economics. He received his M.Sc. in Applied Mathematics from Queen&#8217;s University, where he is currently a Ph.D. student. His research interests include learning in games, stochastic control, and decentralized control.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>CARLETON WIRELESS SEMINAR SERIES Time: Tuesday, 13 August 2019, 2:00-3:00 pm Place: Carleton University, Systems and Computer Engineering The Maker Lab, 4463 Mackenzie Building, map. Title: A REINFORCEMENT LEARNING ALGORITHM FOR COORDINATION IN STOCHASTIC GAMES Speaker: Bora Yongacoglu PhD Candidate, Department of Mathematics and Statistics, Queen&#8217;s University (Supervisor: Professor Serdar Yuksel) * ABSTRACT: Stochastic games [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":1984,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":"","_links_to":"","_links_to_target":""},"categories":[1],"tags":[],"class_list":["post-1982","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-news"],"acf":{"cu_post_thumbnail":""},"_links":{"self":[{"href":"https:\/\/carleton.ca\/sce\/wp-json\/wp\/v2\/posts\/1982","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/carleton.ca\/sce\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/carleton.ca\/sce\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/carleton.ca\/sce\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/carleton.ca\/sce\/wp-json\/wp\/v2\/comments?post=1982"}],"version-history":[{"count":2,"href":"https:\/\/carleton.ca\/sce\/wp-json\/wp\/v2\/posts\/1982\/revisions"}],"predecessor-version":[{"id":1985,"href":"https:\/\/carleton.ca\/sce\/wp-json\/wp\/v2\/posts\/1982\/revisions\/1985"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/carleton.ca\/sce\/wp-json\/wp\/v2\/media\/1984"}],"wp:attachment":[{"href":"https:\/\/carleton.ca\/sce\/wp-json\/wp\/v2\/media?parent=1982"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/carleton.ca\/sce\/wp-json\/wp\/v2\/categories?post=1982"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/carleton.ca\/sce\/wp-json\/wp\/v2\/tags?post=1982"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}