{"id":4611,"date":"2019-12-20T11:15:51","date_gmt":"2019-12-20T16:15:51","guid":{"rendered":"https:\/\/carleton.ca\/cuids\/?post_type=cu_event&#038;p=4611"},"modified":"2026-05-13T16:10:08","modified_gmt":"2026-05-13T20:10:08","slug":"seminar-reinforcement-learning-for-high-dimensional-problems","status":"publish","type":"cu_event","link":"https:\/\/carleton.ca\/cuids\/event\/seminar-reinforcement-learning-for-high-dimensional-problems\/","title":{"rendered":"Seminar &#8211; Reinforcement Learning for High-Dimensional Problems: From PDE Control to Model Learning"},"content":{"rendered":"<header class=\"mb-6 cu-pageheader cu-component-updated md:mb-12\">\n    <h1 class=\"cu-prose-first-last font-semibold !mt-2 mb-4 md:mb-6 text-3xl md:text-4xl lg:text-5xl lg:leading-[3.5rem] relative after:absolute after:h-px after:bottom-0 pb-5 after:w-10 after:bg-cu-red after:left-px\">\n        \n    <\/h1>\n    \n        <\/header>\n\n    \n    \n    \n    \n    <div class=\"cu-buttongroup cu-component-updated flex flex-wrap md:flex-1 gap-3 md:gap-5 justify-start\">\n                                                                        <\/div>\n    \n<figure class=\"wp-block-table\"><table class=\"has-fixed-layout\"><tbody><tr><td><\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<figure class=\"wp-block-image aligncenter\"><img loading=\"lazy\" decoding=\"async\" width=\"240\" height=\"72\" src=\"https:\/\/carleton.ca\/cuids\/wp-content\/uploads\/sites\/245\/Institute-for-Data-Science-color-small-240x72.png\" alt=\"\" class=\"wp-image-4616\" srcset=\"https:\/\/carleton.ca\/cuids\/wp-content\/uploads\/sites\/245\/Institute-for-Data-Science-color-small-240x72.png 240w, https:\/\/carleton.ca\/cuids\/wp-content\/uploads\/sites\/245\/Institute-for-Data-Science-color-small-160x48.png 160w, https:\/\/carleton.ca\/cuids\/wp-content\/uploads\/sites\/245\/Institute-for-Data-Science-color-small-400x119.png 400w, https:\/\/carleton.ca\/cuids\/wp-content\/uploads\/sites\/245\/Institute-for-Data-Science-color-small-360x107.png 360w, https:\/\/carleton.ca\/cuids\/wp-content\/uploads\/sites\/245\/Institute-for-Data-Science-color-small.png 587w\" sizes=\"auto, (max-width: 240px) 100vw, 240px\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p class=\"has-text-align-center\"><em>Data Science Distinguished Speaker Seminar Series&nbsp;&nbsp;<\/em><\/p>\n\n\n\n<p class=\"has-text-align-center\"><strong>Refreshments will be provided&nbsp;<\/strong><\/p>\n\n\n\n<p>&nbsp;<\/p>\n\n\n\n<p><strong>Abstract<\/strong><\/p>\n\n\n\n<p>Many real-world decision-making problems can be formulated as a reinforcement learning (RL) problem with high-dimensional state and action spaces. Due&nbsp;to the curse of dimensionality, however, solving these problems can be extremely challenging unless one exploits their intrinsic regularities. The regularities of&nbsp;an RL problem might be in the value function, policy, or the model of the environment.<\/p>\n\n\n\n<p>In the first part of the talk, I introduce the problem of data-driven control of Partial Differential Equations (PDE), which has many industrial applications, as an&nbsp;example of a high-dimensional RL problem. I formulate it as an RL problem and empirically show that methods that benefit from regularities of the value&nbsp;function can solve such a problem.<\/p>\n\n\n\n<p>In the second part of the talk, I focus on model-based RL (MBRL). I argue that conventional model learning approaches that are based on learning a good&nbsp;predictive model of the environment might be an overkill for MBRL. I discuss a new type of regularity of RL problems that arises due to the interaction of the&nbsp;dynamics of the environment and the value function, and introduce the Value-Aware Model Learning framework, which benefits from this regularity.<\/p>\n\n\n\n<p>&nbsp;<\/p>\n\n\n\n<p><strong>About the Speaker<\/strong><\/p>\n\n\n\n<p><a href=\"https:\/\/vectorinstitute.ai\/team\/amir-massoud-farahmand\/\">Amir-massoud Farahmand<\/a> is a faculty member, research scientist, and Canada CIFAR AI Chair at the Vector Institute in Toronto, Canada. He is also an&nbsp;assistant professor (status) at the Department of Computer Science, University of Toronto, with a cross-appointment at the Department of Mechanical and&nbsp;Industrial Engineering. His research interests are in reinforcement learning and machine learning with a focus on developing theoretically-sound algorithms&nbsp;for challenging industrial problems. He received his PhD from the University of Alberta in 2011, followed by postdoctoral fellowships at McGill University&nbsp;(2011\u20132014) and Carnegie Mellon University (CMU) (2014). Prior to joining the Vector Institute in 2018, he worked as a principal research scientist at&nbsp;Mitsubishi Electric Research Laboratories (MERL) in Cambridge, USA for three years.<\/p>\n\n\n\n<p>Amir-massoud received an NSERC Postdoctoral Fellowship (2012\u20132014) and the University of Alberta\u2019s Department of Computing Science Ph.D.&nbsp;Outstanding Thesis Award for the period of 2011\u20132012. His work has been published in top machine learning and AI (JMLR, MLJ, NeurIPS, ICML, ICLR,&nbsp;AISTATS, AAAI, IJCAI), control engineering (IEEE TAC, CDC, ACC), and robotics (IROS and ICRA) venues.<\/p>\n\n\n\n<h3 id=\"registration-for-this-event-is-now-closed\" class=\"wp-block-heading\">Registration for this event is now closed<\/h3>\n","protected":false},"author":2,"featured_media":0,"template":"","meta":{"_acf_changed":false,"footnotes":"","_links_to":"","_links_to_target":""},"cu_event_type":[24,16],"cu_event_audience":[],"class_list":["post-4611","cu_event","type-cu_event","status-publish","hentry","cu_event_type-external-news-events","cu_event_type-seminarseries"],"acf":{"cu_event_start_date":"2020-01-30T11:00:00","cu_event_end_date":"2020-01-30T12:00:00","cu_event_location_type":"in-person","cu_event_meeting_address_type":"on-campus","cu_building":"HP","cu_event_meeting_room":"5345","cu_event_meeting_address_full":null,"cu_event_virtual_type":"tbd","cu_event_virtual_meeting_link":"","cu_post_thumbnail":false,"cu_event_cost":"","cu_event_registration":"","cu_event_secondary_button":"","cu_event_contact_name":"","cu_event_email":"cuids@carleton.ca ","cu_event_phone":""},"_links":{"self":[{"href":"https:\/\/carleton.ca\/cuids\/wp-json\/wp\/v2\/cu_event\/4611","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/carleton.ca\/cuids\/wp-json\/wp\/v2\/cu_event"}],"about":[{"href":"https:\/\/carleton.ca\/cuids\/wp-json\/wp\/v2\/types\/cu_event"}],"author":[{"embeddable":true,"href":"https:\/\/carleton.ca\/cuids\/wp-json\/wp\/v2\/users\/2"}],"version-history":[{"count":4,"href":"https:\/\/carleton.ca\/cuids\/wp-json\/wp\/v2\/cu_event\/4611\/revisions"}],"predecessor-version":[{"id":4658,"href":"https:\/\/carleton.ca\/cuids\/wp-json\/wp\/v2\/cu_event\/4611\/revisions\/4658"}],"wp:attachment":[{"href":"https:\/\/carleton.ca\/cuids\/wp-json\/wp\/v2\/media?parent=4611"}],"wp:term":[{"taxonomy":"cu_event_type","embeddable":true,"href":"https:\/\/carleton.ca\/cuids\/wp-json\/wp\/v2\/cu_event_type?post=4611"},{"taxonomy":"cu_event_audience","embeddable":true,"href":"https:\/\/carleton.ca\/cuids\/wp-json\/wp\/v2\/cu_event_audience?post=4611"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}