{"id":7486,"date":"2021-08-31T15:17:43","date_gmt":"2021-08-31T15:17:43","guid":{"rendered":"http:\/\/TheNextWeb=1365554"},"modified":"2021-08-31T15:17:43","modified_gmt":"2021-08-31T15:17:43","slug":"gpt-3-mimics-human-love-for-offensive-reddit-comments-study-finds","status":"publish","type":"post","link":"https:\/\/www.londonchiropracter.com\/?p=7486","title":{"rendered":"GPT-3 mimics human love for \u2018offensive\u2019 Reddit comments, study finds"},"content":{"rendered":"\n<p><em>Did you know&nbsp;<a class=\"c-link\" href=\"https:\/\/thenextweb.com\/conference\/neural\" target=\"_blank\" rel=\"noopener noreferrer\" data-sk=\"tooltip_parent\">Neural is taking the stage this fall<\/a>? Together with an amazing line-up of experts, we will explore the future of AI during TNW Conference 2021.&nbsp;<a class=\"c-link\" href=\"https:\/\/thenextweb.com\/conference\/neural#tickets\" target=\"_blank\" rel=\"noopener noreferrer\" data-sk=\"tooltip_parent\">Secure your online ticket now<\/a>!<\/em><\/p>\n<p><span>Chatbots are getting better at mimicking human speech \u2014 for better and for worse. <\/span><\/p>\n<p><span>A new study of Reddit conversations found chatbots replicate our fondness for toxic language. <\/span>The analysis revealed that two prominent dialogue models<span>&nbsp;are almost twice as likely to agree with \u201coffensive\u201d comments than with \u201csafe\u201d ones.<\/span><\/p>\n<h2>Offensive contexts<\/h2>\n<p>The researchers, from the Georgia Institute of Technology and the University of Washington, explored contextually offensive language by developing<span>&nbsp;\u201cToxiChat,\u201d a dataset of 2,000 Reddit threads. <\/span><\/p>\n<p><span>To study the behavior of neural <a href=\"https:\/\/thenextweb.com\/topic\/chatbot\">chatbots<\/a>, they extended the threads with responses generated by <a href=\"https:\/\/thenextweb.com\/topic\/openai\">OpenAI\u2019s<\/a> GPT-3 and Microsoft\u2019s DialoGPT.<\/span><\/p>\n<p>They then paid workers on Amazon Mechanical Turk to annotate the responses as \u201csafe\u201d or \u201coffensive.\u201d Comments were deemed offensive if they were intentionally or unintentionally toxic, rude, or disrespectful towards an individual, like a Reddit user, or a group, such as feminists.<\/p>\n<p>The stance of the responses toward previous comments in the thread was also annotated, as \u201cAgree,\u201d \u201cDisagree,\u201d or \u201cNeutral.\u201d<\/p>\n<p>\u201cWe assume that a user or a chatbot can become offensive by aligning themselves with an offensive statement made by another user,\u201d the researchers wrote in their pre-print <a href=\"https:\/\/arxiv.org\/pdf\/2108.11830.pdf\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">study paper<\/a>.<\/p>\n<h2>Bad bots<\/h2>\n<p>The dataset contained further evidence of <a href=\"https:\/\/www.inverse.com\/article\/4566-why-is-being-offensive-so-much-fun\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">our love for the offensive<\/a>.&nbsp; The analysis revealed that 42% of user responses agreed with toxic comments, whereas only 13% agreed with safe ones.<\/p>\n<p>They also found that the chatbots mimicked this undesirable behavior. Per the study paper:<\/p>\n<blockquote readability=\"7\">\n<p>We hypothesize that the higher proportion of agreement observed in response to offensive comments may be explained by the hesitancy of Reddit users to engage with offensive comments unless they agree. This may bias the set of respondents towards those who align with the offensive statement.<\/p>\n<\/blockquote>\n<p>This human behavior was mimicked by the dialogue models: both DialoGPT and GPT-3 were almost twice as likely to agree with an offensive comment than a safe one.<\/p>\n<figure class=\"post-image post-mediaBleed aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-1365561 js-lazy\" src=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.40.54.png\" alt=\"Chatbots mimic human tendency towards agreeing with offensive comments.\" width=\"624\" height=\"428\" sizes=\"(max-width: 624px) 100vw, 624px\" data-srcset=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.40.54.png 624w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.40.54-280x192.png 280w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.40.54-197x135.png 197w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.40.54-394x270.png 394w\"><figcaption>Credit: Baheti et al.<\/figcaption><figcaption><a href=\"https:\/\/thenextweb.com\/news\/gpt-3-and-humans-twice-as-likely-agree-with-offensive-reddit-comments-chatbots#\" data-url=\"https:\/\/twitter.com\/intent\/tweet?url=https%3A%2F%2Feditorial.thenextweb.com%2Fneural%2F2021%2F08%2F31%2Fgpt-3-and-humans-twice-as-likely-agree-with-offensive-reddit-comments-chatbots%2F&amp;via=thenextweb&amp;related=thenextweb&amp;text=Check out this picture on: Reddit users were more likely to reply to offensive comments.\" data-title=\"Share Reddit users were more likely to reply to offensive comments. on Twitter\" data-width=\"685\" data-height=\"500\" class=\"post-image-share popitup\" title=\"Share Reddit users were more likely to reply to offensive comments. on Twitter\"><i class=\"icon icon--inline icon--twitter--dark\"><\/i><\/a>Reddit users were more likely to reply to offensive comments.<\/figcaption><noscript><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-1365561\" src=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.40.54.png\" alt=\"Chatbots mimic human tendency towards agreeing with offensive comments.\" width=\"624\" height=\"428\" srcset=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.40.54.png 624w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.40.54-280x192.png 280w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.40.54-197x135.png 197w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.40.54-394x270.png 394w\"><\/noscript><\/figure>\n<p>The responses generated by humans had some significant differences.<\/p>\n<p>Notably, the chatbots tended to respond with more personal attacks directed towards individuals, while Reddit users were more likely to target specific demographic groups.<\/p>\n<figure class=\"post-image post-mediaBleed aligncenter\"><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-1365565 js-lazy\" src=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.59.25.png\" alt=\"The distribution of target group frequencies.\" width=\"1342\" height=\"340\" sizes=\"(max-width: 1342px) 100vw, 1342px\" data-srcset=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.59.25.png 1342w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.59.25-280x71.png 280w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.59.25-270x68.png 270w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.59.25-540x137.png 540w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.59.25-796x202.png 796w\"><figcaption>Credit: Baheti et al.<\/figcaption><figcaption><a href=\"https:\/\/thenextweb.com\/news\/gpt-3-and-humans-twice-as-likely-agree-with-offensive-reddit-comments-chatbots#\" data-url=\"https:\/\/twitter.com\/intent\/tweet?url=https%3A%2F%2Feditorial.thenextweb.com%2Fneural%2F2021%2F08%2F31%2Fgpt-3-and-humans-twice-as-likely-agree-with-offensive-reddit-comments-chatbots%2F&amp;via=thenextweb&amp;related=thenextweb&amp;text=Check out this picture on: The top 10 target groups for Reddit user responses, DGPT responses, and GPT-3 responses. Target groups are organized in decreasing frequency in each decagon, starting clockwise from the top-right corner.\" data-title=\"Share The top 10 target groups for Reddit user responses, DGPT responses, and GPT-3 responses. Target groups are organized in decreasing frequency in each decagon, starting clockwise from the top-right corner. on Twitter\" data-width=\"685\" data-height=\"500\" class=\"post-image-share popitup\" title=\"Share The top 10 target groups for Reddit user responses, DGPT responses, and GPT-3 responses. Target groups are organized in decreasing frequency in each decagon, starting clockwise from the top-right corner. on Twitter\"><i class=\"icon icon--inline icon--twitter--dark\"><\/i><\/a>The top 10 target groups for Reddit user responses, DGPT responses, and GPT-3 responses. Target groups are organized in decreasing frequency in each decagon, starting clockwise from the top-right corner.<\/figcaption><noscript><img decoding=\"async\" loading=\"lazy\" class=\"size-full wp-image-1365565\" src=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.59.25.png\" alt=\"The distribution of target group frequencies.\" width=\"1342\" height=\"340\" srcset=\"https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.59.25.png 1342w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.59.25-280x71.png 280w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.59.25-270x68.png 270w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.59.25-540x137.png 540w, https:\/\/cdn0.tnwcdn.com\/wp-content\/blogs.dir\/1\/files\/2021\/08\/Screenshot-2021-08-31-at-11.59.25-796x202.png 796w\"><\/noscript><\/figure>\n<h2>Changing behavior<\/h2>\n<p>Defining \u201ctoxic\u201d behavior is a complicated and subjective task.<\/p>\n<p>One issue is that context often determines whether language is offensive. ToxiChat, for instance, contains replies that seem innocuous in isolation, but appear offensive when read alongside the preceding message.<\/p>\n<p>The role of context can make it difficult to mitigate toxic language in text generators.<\/p>\n<p>A solution used by GPT-3 and Facebook\u2019s Blender chatbot is to stop producing outputs when offensive inputs are detected. However, this can often generate false-positive predictions.<\/p>\n<p>The researchers experimented with an alternative method: preventing models from agreeing with offensive statements.<\/p>\n<p>They found that fine-tuning dialogue models on safe and neutral responses partially mitigated this behavior.<\/p>\n<p>But they\u2019re more excited by another approach: developing models that diffuse fraught situations by \u201cgracefully [responding] with non-toxic counter-speech.\u201d<\/p>\n<p>Good luck with that.<\/p>\n<p><em>Greetings Humanoids! Did you know we have a newsletter all about AI? You can subscribe to it&nbsp;<a href=\"https:\/\/share.hsforms.com\/1kUPK28s-Ro67GKE8E65Zwg47gef\" target=\"_blank\" rel=\"nofollow noopener noreferrer\">right here<\/a>.<\/em><\/p>\n<p> <a href=\"https:\/\/thenextweb.com\/news\/gpt-3-and-humans-twice-as-likely-agree-with-offensive-reddit-comments-chatbots\">Source<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Did you know&nbsp;Neural is taking the stage this fall? Together with an amazing line-up of experts, we will explore the future of AI during TNW Conference 2021.&nbsp;Secure your online ticket now! Chatbots&#8230;<\/p>\n","protected":false},"author":1,"featured_media":7487,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[1],"tags":[],"_links":{"self":[{"href":"https:\/\/www.londonchiropracter.com\/index.php?rest_route=\/wp\/v2\/posts\/7486"}],"collection":[{"href":"https:\/\/www.londonchiropracter.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.londonchiropracter.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.londonchiropracter.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.londonchiropracter.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=7486"}],"version-history":[{"count":0,"href":"https:\/\/www.londonchiropracter.com\/index.php?rest_route=\/wp\/v2\/posts\/7486\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.londonchiropracter.com\/index.php?rest_route=\/wp\/v2\/media\/7487"}],"wp:attachment":[{"href":"https:\/\/www.londonchiropracter.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=7486"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.londonchiropracter.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=7486"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.londonchiropracter.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=7486"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}