{"id":6828,"date":"2023-11-01T10:19:41","date_gmt":"2023-11-01T14:19:41","guid":{"rendered":"https:\/\/carleton.ca\/cuids\/?post_type=cu-events&#038;p=6828"},"modified":"2025-03-13T12:36:56","modified_gmt":"2025-03-13T16:36:56","slug":"victims-of-circumstance-how-environment-manipulation-shapes-reinforcement-learning-behaviours","status":"publish","type":"cu-events","link":"https:\/\/carleton.ca\/cuids\/cu-events\/victims-of-circumstance-how-environment-manipulation-shapes-reinforcement-learning-behaviours\/","title":{"rendered":"[RECORDED] Distinguished Speaker Series &#8211; Victims of Circumstance: How Environment Manipulation Shapes Reinforcement-Learning Behaviours"},"content":{"rendered":"<h2>CUIDS Distinguished Speaker Series<\/h2>\n<h3>Victims of Circumstance: How Environment Manipulation Shapes Reinforcement-Learning Behaviours<\/h3>\n<p>Zinovi Rabinovich, Assistant Professor, School of Computer Science &#8211; Carleton University<\/p>\n<p>Machine learning algorithms have been subjected to a range of attacks, both to thwart and to subvert their learning. Such attacks are particularly easy against Reinforcement Learning (RL) algorithms, which depend heavily on their perceptions being reliable, their attempted actions being correctly executed, and the rewards they reap being indicative of progress towards their goal. Control any one of those aspects, and you can make an RL agent fail or, worse, learn a bad behaviour. But what if perceptions come with error-correcting codes, actions are verifiable, and the reward is strictly intrinsic to the agent? Are our RL agents safe from manipulation, then? It turns out they are not. It is possible, through environment poisoning (i.e., changing how the environment behaves in response to agent actions), to manipulate an RL agent into learning a target (bad) behaviour. 
In this talk, I will show how it can be done, discuss how flexible the approach is, and consider what the future holds for it.<\/p>\n<p>Zinovi Rabinovich is an Assistant Professor in the School of Computer Science at Carleton University. He obtained his Ph.D. in Computer Science from the Hebrew University of Jerusalem, spent some years as an Algorithms Engineer at Mobileye Vision Technologies Ltd, and then moved back into academia. His research focuses on how to leverage information asymmetry (in availability or in access) to manipulate decision processes. He has looked into action advice provision, strategic information disclosure, election manipulation, and, more recently, poisoning Reinforcement Learning.<\/p>\n<p><a href=\"https:\/\/scholar.google.com\/citations?user=JwJRnmAAAAAJ\">https:\/\/scholar.google.com\/citations?user=JwJRnmAAAAAJ<\/a><\/p>\n<p><a href=\"https:\/\/dblp.org\/pid\/93\/4009.html\">https:\/\/dblp.org\/pid\/93\/4009.html<\/a><\/p>\n<p><a 
href=\"http:\/\/www.zinovi.net\/\">www.zinovi.net<\/a><\/p>\n<p><\/p>\n<p><strong>Seminar Moderator:<\/strong><\/p>\n<p><a href=\"https:\/\/carleton.ca\/scs\/people\/alan-tsang\/\">Koon-Ho Alan Tsang<\/a> &#8211; Assistant Professor at the School of Computer Science at Carleton University<\/p>\n<p><\/p>\n<p>Light refreshments will be provided.<\/p>\n<p>Please RSVP below to help us prepare for the event.<\/p>\n\n<div class=\"\">\n\t<p>\n\t\t<a class=\"button__red button__red--solid\" href=\"https:\/\/carleton.ca\/cuids\/rsvp-cuids-distinguished-speaker-series\/\"  rel=\"noopener noreferrer\">RSVP<\/a>\n\t<\/p>\n<\/div>\n\n<p><iframe loading=\"lazy\" id=\"kmsembed-1_cwlnkd2f\" class=\"kmsembed\" title=\"CUIDS Distinguished Speaker Series - Victims of circumstance: How environment manipulation shapes reinforcement-learning behaviours\" src=\"https:\/\/mediaspace.carleton.ca\/embed\/secure\/iframe\/entryId\/1_cwlnkd2f\/uiConfId\/36153741\/st\/0\" width=\"640\" height=\"360\" frameborder=\"0\" sandbox=\"allow-downloads allow-forms allow-same-origin allow-scripts allow-top-navigation allow-pointer-lock allow-popups allow-modals allow-orientation-lock allow-popups-to-escape-sandbox allow-presentation allow-top-navigation-by-user-activation\" 
allowfullscreen=\"allowfullscreen\"><\/iframe><\/p>\n<p><\/p>\n<p><\/p>\n<p><\/p>\n","protected":false},"template":"","meta":{"_relevanssi_hide_post":"","_relevanssi_hide_content":"","_relevanssi_pin_for_all":"","_relevanssi_pin_keywords":"","_relevanssi_unpin_keywords":"","_relevanssi_related_keywords":"","_relevanssi_related_include_ids":"","_relevanssi_related_exclude_ids":"","_relevanssi_related_no_append":"","_relevanssi_related_not_related":"","_relevanssi_related_posts":"","_relevanssi_noindex_reason":"","_mi_skip_tracking":false,"_exactmetrics_sitenote_active":false,"_exactmetrics_sitenote_note":"","_exactmetrics_sitenote_category":0,"_links_to":"","_links_to_target":""},"daevent-type":[24,17,16],"event-audience":[37,38,39,40,41,42,43,44,45],"event-featured":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.2 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>[RECORDED] Distinguished Speaker Series - Victims of Circumstance: How Environment Manipulation Shapes Reinforcement-Learning Behaviours - Events - Institute for Data Science<\/title>\n<meta name=\"description\" content=\"CUIDS Distinguished Speaker Series Victims of Circumstance: How Environment Manipulation Shapes Reinforcement-Learning Behaviours Zinovi Rabinovich,\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/carleton.ca\/cuids\/cu-events\/victims-of-circumstance-how-environment-manipulation-shapes-reinforcement-learning-behaviours\/\" \/>\n<meta name=\"twitter:label1\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/carleton.ca\/cuids\/cu-events\/victims-of-circumstance-how-environment-manipulation-shapes-reinforcement-learning-behaviours\/\",\"url\":\"https:\/\/carleton.ca\/cuids\/cu-events\/victims-of-circumstance-how-environment-manipulation-shapes-reinforcement-learning-behaviours\/\",\"name\":\"[RECORDED] Distinguished Speaker Series - Victims of Circumstance: How Environment Manipulation Shapes Reinforcement-Learning Behaviours - Events - Institute for Data Science\",\"isPartOf\":{\"@id\":\"https:\/\/carleton.ca\/cuids\/#website\"},\"datePublished\":\"2023-11-01T14:19:41+00:00\",\"dateModified\":\"2025-03-13T16:36:56+00:00\",\"description\":\"CUIDS Distinguished Speaker Series Victims of Circumstance: How Environment Manipulation Shapes Reinforcement-Learning Behaviours Zinovi Rabinovich,\",\"breadcrumb\":{\"@id\":\"https:\/\/carleton.ca\/cuids\/cu-events\/victims-of-circumstance-how-environment-manipulation-shapes-reinforcement-learning-behaviours\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/carleton.ca\/cuids\/cu-events\/victims-of-circumstance-how-environment-manipulation-shapes-reinforcement-learning-behaviours\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/carleton.ca\/cuids\/cu-events\/victims-of-circumstance-how-environment-manipulation-shapes-reinforcement-learning-behaviours\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/carleton.ca\/cuids\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Events\",\"item\":\"https:\/\/carleton.ca\/cuids\/cu-events\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"[RECORDED] Distinguished Speaker Series &#8211; Victims of Circumstance: How 
Environment Manipulation Shapes Reinforcement-Learning Behaviours\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/carleton.ca\/cuids\/#website\",\"url\":\"https:\/\/carleton.ca\/cuids\/\",\"name\":\"Institute for Data Science\",\"description\":\"Carleton University\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/carleton.ca\/cuids\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"[RECORDED] Distinguished Speaker Series - Victims of Circumstance: How Environment Manipulation Shapes Reinforcement-Learning Behaviours - Events - Institute for Data Science","description":"CUIDS Distinguished Speaker Series Victims of Circumstance: How Environment Manipulation Shapes Reinforcement-Learning Behaviours Zinovi Rabinovich,","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/carleton.ca\/cuids\/cu-events\/victims-of-circumstance-how-environment-manipulation-shapes-reinforcement-learning-behaviours\/","twitter_misc":{"Est. 
reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/carleton.ca\/cuids\/cu-events\/victims-of-circumstance-how-environment-manipulation-shapes-reinforcement-learning-behaviours\/","url":"https:\/\/carleton.ca\/cuids\/cu-events\/victims-of-circumstance-how-environment-manipulation-shapes-reinforcement-learning-behaviours\/","name":"[RECORDED] Distinguished Speaker Series - Victims of Circumstance: How Environment Manipulation Shapes Reinforcement-Learning Behaviours - Events - Institute for Data Science","isPartOf":{"@id":"https:\/\/carleton.ca\/cuids\/#website"},"datePublished":"2023-11-01T14:19:41+00:00","dateModified":"2025-03-13T16:36:56+00:00","description":"CUIDS Distinguished Speaker Series Victims of Circumstance: How Environment Manipulation Shapes Reinforcement-Learning Behaviours Zinovi Rabinovich,","breadcrumb":{"@id":"https:\/\/carleton.ca\/cuids\/cu-events\/victims-of-circumstance-how-environment-manipulation-shapes-reinforcement-learning-behaviours\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/carleton.ca\/cuids\/cu-events\/victims-of-circumstance-how-environment-manipulation-shapes-reinforcement-learning-behaviours\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/carleton.ca\/cuids\/cu-events\/victims-of-circumstance-how-environment-manipulation-shapes-reinforcement-learning-behaviours\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/carleton.ca\/cuids\/"},{"@type":"ListItem","position":2,"name":"Events","item":"https:\/\/carleton.ca\/cuids\/cu-events\/"},{"@type":"ListItem","position":3,"name":"[RECORDED] Distinguished Speaker Series &#8211; Victims of Circumstance: How Environment Manipulation Shapes Reinforcement-Learning Behaviours"}]},{"@type":"WebSite","@id":"https:\/\/carleton.ca\/cuids\/#website","url":"https:\/\/carleton.ca\/cuids\/","name":"Institute for Data 
Science","description":"Carleton University","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/carleton.ca\/cuids\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"}]}},"acf":{"Location: Building":"herzberg","Event Location":"Virtual (Zoom)","show_cost":"yes","audience":[{"term_id":37,"name":"Alumni","slug":"alumni","term_group":0,"term_taxonomy_id":37,"taxonomy":"event-audience","description":"","parent":0,"count":8,"filter":"raw"},{"term_id":38,"name":"Anyone","slug":"anyone","term_group":0,"term_taxonomy_id":38,"taxonomy":"event-audience","description":"","parent":0,"count":22,"filter":"raw"},{"term_id":39,"name":"Carleton Community","slug":"carleton-community","term_group":0,"term_taxonomy_id":39,"taxonomy":"event-audience","description":"","parent":0,"count":17,"filter":"raw"},{"term_id":40,"name":"Current Students","slug":"current-students","term_group":0,"term_taxonomy_id":40,"taxonomy":"event-audience","description":"","parent":0,"count":18,"filter":"raw"},{"term_id":41,"name":"Faculty","slug":"faculty","term_group":0,"term_taxonomy_id":41,"taxonomy":"event-audience","description":"","parent":0,"count":12,"filter":"raw"},{"term_id":42,"name":"Media","slug":"media","term_group":0,"term_taxonomy_id":42,"taxonomy":"event-audience","description":"","parent":0,"count":5,"filter":"raw"},{"term_id":43,"name":"Prospective Students","slug":"prospective-students","term_group":0,"term_taxonomy_id":43,"taxonomy":"event-audience","description":"","parent":0,"count":6,"filter":"raw"},{"term_id":44,"name":"Staff","slug":"staff","term_group":0,"term_taxonomy_id":44,"taxonomy":"event-audience","description":"","parent":0,"count":6,"filter":"raw"},{"term_id":45,"name":"Staff and Faculty","slug":"staff-faculty","term_group":0,"term_taxonomy_id":45,"taxonomy":"event-audience","description":"","parent":0,"count":11,"filter":"raw"}],"Multi Day Event":"","End Time":"13:30","Start 
Time":"12:30","Date":"2023.11.08","Contact Name":"Ali Rofan","Contact Email":"cuids@carleton.ca","Contact Phone":"","More Info Link":"","Cost":"","Location: Room":"5345"},"_links":{"self":[{"href":"https:\/\/carleton.ca\/cuids\/wp-json\/wp\/v2\/cu-events\/6828"}],"collection":[{"href":"https:\/\/carleton.ca\/cuids\/wp-json\/wp\/v2\/cu-events"}],"about":[{"href":"https:\/\/carleton.ca\/cuids\/wp-json\/wp\/v2\/types\/cu-events"}],"version-history":[{"count":4,"href":"https:\/\/carleton.ca\/cuids\/wp-json\/wp\/v2\/cu-events\/6828\/revisions"}],"predecessor-version":[{"id":7309,"href":"https:\/\/carleton.ca\/cuids\/wp-json\/wp\/v2\/cu-events\/6828\/revisions\/7309"}],"wp:attachment":[{"href":"https:\/\/carleton.ca\/cuids\/wp-json\/wp\/v2\/media?parent=6828"}],"wp:term":[{"taxonomy":"daevent-type","embeddable":true,"href":"https:\/\/carleton.ca\/cuids\/wp-json\/wp\/v2\/daevent-type?post=6828"},{"taxonomy":"event-audience","embeddable":true,"href":"https:\/\/carleton.ca\/cuids\/wp-json\/wp\/v2\/event-audience?post=6828"},{"taxonomy":"event-featured","embeddable":true,"href":"https:\/\/carleton.ca\/cuids\/wp-json\/wp\/v2\/event-featured?post=6828"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}