{"id":10253,"date":"2022-02-14T23:35:07","date_gmt":"2022-02-14T23:35:07","guid":{"rendered":"http:\/\/TheNextWeb=1380402"},"modified":"2022-02-14T23:35:07","modified_gmt":"2022-02-14T23:35:07","slug":"deepmind-reward-may-not-be-enough-for-agi-but-its-worth-a-try","status":"publish","type":"post","link":"https:\/\/www.londonchiropracter.com\/?p=10253","title":{"rendered":"DeepMind: Reward may NOT be enough for AGI \u2014 but it\u2019s worth a try"},"content":{"rendered":"\n<div><img decoding=\"async\" src=\"https:\/\/img-cdn.tnwcdn.com\/image\/neural?filter_last=1&amp;fit=1280%2C640&amp;url=https%3A%2F%2Fcdn0.tnwcdn.com%2Fwp-content%2Fblogs.dir%2F1%2Ffiles%2F2022%2F02%2FUntitled-design-5.jpg&amp;signature=46afb4770ad61e8052bcef3411496990\" class=\"ff-og-image-inserted\"><\/div>\n<p>DeepMind has been connected to artificial general intelligence since birth.<\/p>\n<p>The lab was launched with <a href=\"https:\/\/deepmind.com\/research\/publications\/2021\/Reward-is-Enough\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">a mission to<\/a> develop AGI, was cofounded by a researcher <a href=\"https:\/\/twitter.com\/ShaneLegg\/status\/1404405011241738247\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">who coined the term<\/a>, and has made <a href=\"https:\/\/deepmind.com\/blog\/article\/real-world-challenges-for-agi\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">some compelling advances<\/a> in the field.<\/p>\n<p>It also recently produced a <a href=\"https:\/\/thenextweb.com\/news\/deepmind-reinforcement-learning-enough-general-ai-syndication\" target=\"_blank\" rel=\"noopener noreferrer\">provocative paper<\/a>&nbsp;on the subject: \u201c<a href=\"https:\/\/deepmind.com\/research\/publications\/2021\/Reward-is-Enough\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">Reward is Enough<\/a>\u201d<\/p>\n<p>The study <span class=\"s1\">hypothesizes that AGI could be achieved through a single approach: reinforcement learning.<\/span><\/p>\n<p><span class=\"s1\">This technique provides feedback in the form of a \u201creward\u201d \u2014 a positive number that tells an algorithm that the action it just performed will benefit its goal. <\/span><\/p>\n<p><span class=\"s1\">The approach has shown promise in programs such as MuZero, which mastered multiple games mastered multiple games without being told their rules. DeepMind <a href=\"https:\/\/editorial.thenextweb.com\/?p=1380402&amp;preview=true\">called the system <\/a>a \u201csignificant step forward in the pursuit of general-purpose algorithms.\u201d&nbsp;<\/span><\/p>\n<p>\u201cReward is Enough\u201d suggests that reinforcement learning alone could lead to AGI.<\/p>\n<p><a href=\"https:\/\/bdtechtalks.com\/2021\/07\/07\/ai-reward-is-not-enough-herbert-roitblat\/\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">This theory has been challenged<\/a> by many computer scientists \u2014 including some at DeepMind. But <span class=\"s1\">Doina Precup, one of the paper\u2019s co-authors, told TNW that the study merely sought to probe the possibilities.<\/span><\/p>\n<p>\u201cU<span class=\"s1\">ltimately, we want to test this as a hypothesis and to think of it in the context of other methods as well,\u201d said Precup, who heads up DeepMind\u2019s Montreal office.<\/span><\/p>\n<p>Indeed, reinforcement learning is just one approach that the <a href=\"https:\/\/thenextweb.com\/topic\/google\" target=\"_blank\" rel=\"noopener noreferrer\">Google<\/a> subsidiary is exploring. In a new episode <span>of the <\/span><a href=\"https:\/\/deepmind.com\/learning-resources\/deepmind-the-podcast\" target=\"_blank\" rel=\"nofollow noopener noreferrer\" data-saferedirecturl=\"https:\/\/www.google.com\/url?q=https:\/\/deepmind.com\/learning-resources\/deepmind-the-podcast&amp;source=gmail&amp;ust=1644958346354000&amp;usg=AOvVaw1U9FBtO1cKTvSi7LRZEvUr\"><span>DeepMind podcast<\/span><\/a>, the lab\u2019s researchers discuss the promise of various pathways to AGI.<\/p>\n<p>Among the reward-is-enough skeptics is Raia Hadsell, the company\u2019s director of robotics, who notes the difficulty of designing an all-powerful reward that leads to AGI. DeepMind cofounder Shane Legg, meanwhile, suspects that reinforcement learning may have to combine with learning algorithms.<\/p>\n<p>Precup also has doubts that reward alone is enough, but she believes it could be a crucial ingredient in AGI.<\/p>\n<p class=\"p1\"><span class=\"s1\">\u201cBecause it\u2019s learning from interaction in an incremental way, it feels very much like what biological intelligence systems do,\u201d she said. <\/span><\/p>\n<p class=\"p1\"><span class=\"s1\">\u201cIs it at the end of the day going to be the only technology that contributes to AGI? Well, that\u2019s not clear at all \u2014 there\u2019s a lot of other really interesting things that are going on.\u201d<\/span><span class=\"s1\"><\/span><\/p>\n<p>Precup is nonetheless optimistic that we\u2019re already on a path to AGI. Ultimately, she\u2019s more concerned about the safety of the destination than the route that takes us there.<\/p>\n<p><em>\u201cThe road to AGI,\u201d the fifth episode in season two of \u201cDeepMind: The Podcast,\u201d is <a href=\"https:\/\/deepmind.com\/learning-resources\/deepmind-the-podcast\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">available here<\/a>&nbsp;from February 15.<\/em><\/p>\n<p> <a href=\"https:\/\/thenextweb.com\/news\/deepmind-reinforcement-learning-only-one-possible-pathway-to-agi\">Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>DeepMind has been connected to artificial general intelligence since birth. The lab was launched with a mission to develop AGI, was cofounded by a researcher who coined the term, and has made&#8230;<\/p>\n","protected":false},"author":1,"featured_media":10254,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[1],"tags":[],"_links":{"self":[{"href":"https:\/\/www.londonchiropracter.com\/index.php?rest_route=\/wp\/v2\/posts\/10253"}],"collection":[{"href":"https:\/\/www.londonchiropracter.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.londonchiropracter.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.londonchiropracter.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.londonchiropracter.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=10253"}],"version-history":[{"count":0,"href":"https:\/\/www.londonchiropracter.com\/index.php?rest_route=\/wp\/v2\/posts\/10253\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.londonchiropracter.com\/index.php?rest_route=\/wp\/v2\/media\/10254"}],"wp:attachment":[{"href":"https:\/\/www.londonchiropracter.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=10253"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.londonchiropracter.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=10253"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.londonchiropracter.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=10253"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}