{"id":4256,"date":"2018-05-31T13:26:14","date_gmt":"2018-05-31T20:26:14","guid":{"rendered":"http:\/\/www.dresan.com\/blog\/?p=4256"},"modified":"2023-03-02T16:20:42","modified_gmt":"2023-03-02T23:20:42","slug":"prm-rl-won-a-best-paper-award-at-icra","status":"publish","type":"post","link":"https:\/\/dresan.com\/blog\/2018\/05\/31\/prm-rl-won-a-best-paper-award-at-icra\/","title":{"rendered":"PRM-RL Won a Best Paper Award at ICRA!"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-4257\" src=\"http:\/\/www.dresan.com\/blog\/wp-content\/uploads\/2018\/05\/download_20180525_212050.jpg\" alt=\"\" width=\"600\" height=\"338\" srcset=\"https:\/\/dresan.com\/blog\/wp-content\/uploads\/2018\/05\/download_20180525_212050.jpg 2048w, https:\/\/dresan.com\/blog\/wp-content\/uploads\/2018\/05\/download_20180525_212050-300x169.jpg 300w, https:\/\/dresan.com\/blog\/wp-content\/uploads\/2018\/05\/download_20180525_212050-768x432.jpg 768w, https:\/\/dresan.com\/blog\/wp-content\/uploads\/2018\/05\/download_20180525_212050-600x338.jpg 600w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><\/p>\n<p>So, this happened! Our team&#8217;s paper on &#8220;PRM-RL&#8221; &#8211; a way to teach robots to navigate their worlds which combines human-designed algorithms that use roadmaps with deep-learned algorithms to control the robot itself &#8211; won a best paper award at the ICRA robotics conference!<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-4158\" src=\"http:\/\/www.dresan.com\/blog\/wp-content\/uploads\/2018\/01\/Screenshot-2018-01-12-21.49.45.png\" alt=\"\" width=\"600\" height=\"233\" srcset=\"https:\/\/dresan.com\/blog\/wp-content\/uploads\/2018\/01\/Screenshot-2018-01-12-21.49.45.png 1554w, https:\/\/dresan.com\/blog\/wp-content\/uploads\/2018\/01\/Screenshot-2018-01-12-21.49.45-300x117.png 300w, https:\/\/dresan.com\/blog\/wp-content\/uploads\/2018\/01\/Screenshot-2018-01-12-21.49.45-768x299.png 768w, https:\/\/dresan.com\/blog\/wp-content\/uploads\/2018\/01\/Screenshot-2018-01-12-21.49.45-600x233.png 600w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><\/p>\n<p>I talked a little bit about how PRM-RL works in the post &#8220;<a href=\"http:\/\/www.dresan.com\/blog\/2018\/01\/12\/learning-to-drive-by-learning-where-you-can-drive\/\">Learning to Drive &#8230; by Learning Where You Can Drive<\/a>&#8220;, so I won&#8217;t go over the whole spiel here &#8211; but the basic idea is that we&#8217;ve gotten good at teaching robots to control themselves using a technique called deep reinforcement learning (the RL in PRM-RL) that trains them in simulation, but it&#8217;s hard to extend this approach to long-range navigation problems in the real world; we overcome this barrier by using a more traditional robotic approach, probabilistic roadmaps (the PRM in PRM-RL), which build maps of where the robot can drive using point to point connections; we combine these maps with the robot simulator and, boom, we have a map of where the robot thinks it can successfully drive.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-4258\" src=\"http:\/\/www.dresan.com\/blog\/wp-content\/uploads\/2018\/05\/download_20180525_212105.jpg\" alt=\"\" width=\"600\" height=\"450\" srcset=\"https:\/\/dresan.com\/blog\/wp-content\/uploads\/2018\/05\/download_20180525_212105.jpg 2048w, https:\/\/dresan.com\/blog\/wp-content\/uploads\/2018\/05\/download_20180525_212105-300x225.jpg 300w, https:\/\/dresan.com\/blog\/wp-content\/uploads\/2018\/05\/download_20180525_212105-768x576.jpg 768w, https:\/\/dresan.com\/blog\/wp-content\/uploads\/2018\/05\/download_20180525_212105-600x450.jpg 600w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><\/p>\n<p>We were cited not just for this technique, but for testing it extensively in simulation and on two different kinds of robots. I want to thank everyone on the team &#8211; especially Sandra Faust for her background in PRMs and for taking point on the idea (and doing all the quadrotor work with Lydia Tapia), for Oscar Ramirez and Marek Fiser for their work on our reinforcement learning framework and simulator, for Kenneth Oslund for his heroic last-minute push to collect the indoor robot navigation data, and to our manager James for his guidance, contributions to the paper and support of our navigation work.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-4259\" src=\"http:\/\/www.dresan.com\/blog\/wp-content\/uploads\/2018\/05\/IMG_20180524_134532.jpg\" alt=\"\" width=\"600\" height=\"450\" srcset=\"https:\/\/dresan.com\/blog\/wp-content\/uploads\/2018\/05\/IMG_20180524_134532.jpg 4048w, https:\/\/dresan.com\/blog\/wp-content\/uploads\/2018\/05\/IMG_20180524_134532-300x225.jpg 300w, https:\/\/dresan.com\/blog\/wp-content\/uploads\/2018\/05\/IMG_20180524_134532-768x576.jpg 768w, https:\/\/dresan.com\/blog\/wp-content\/uploads\/2018\/05\/IMG_20180524_134532-600x450.jpg 600w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><\/p>\n<p>Woohoo! Thanks again everyone!<\/p>\n<p>-the Centaur<\/p>\n","protected":false},"excerpt":{"rendered":"<p>So, this happened! Our team&#8217;s paper on &#8220;PRM-RL&#8221; &#8211; a way to teach robots to navigate their worlds which combines human-designed algorithms that use roadmaps with deep-learned algorithms to control&#8230;<\/p>\n","protected":false},"author":2,"featured_media":4158,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[209,255,217],"tags":[165,19,8,245,5],"class_list":["post-4256","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-computing","category-featured","category-intelligence","tag-engineering-the-robot-apocalypse","tag-hard-science","tag-intelligence","tag-startuppery","tag-we-call-it-living","ratio-2-1","entry"],"_links":{"self":[{"href":"https:\/\/dresan.com\/blog\/wp-json\/wp\/v2\/posts\/4256","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dresan.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dresan.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dresan.com\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/dresan.com\/blog\/wp-json\/wp\/v2\/comments?post=4256"}],"version-history":[{"count":2,"href":"https:\/\/dresan.com\/blog\/wp-json\/wp\/v2\/posts\/4256\/revisions"}],"predecessor-version":[{"id":4261,"href":"https:\/\/dresan.com\/blog\/wp-json\/wp\/v2\/posts\/4256\/revisions\/4261"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dresan.com\/blog\/wp-json\/wp\/v2\/media\/4158"}],"wp:attachment":[{"href":"https:\/\/dresan.com\/blog\/wp-json\/wp\/v2\/media?parent=4256"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dresan.com\/blog\/wp-json\/wp\/v2\/categories?post=4256"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dresan.com\/blog\/wp-json\/wp\/v2\/tags?post=4256"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}