{"id":13138,"date":"2018-08-17T10:45:06","date_gmt":"2018-08-17T18:45:06","guid":{"rendered":"http:\/\/www.palada.net\/index.php\/2018\/08\/17\/news-6905\/"},"modified":"2018-08-17T10:45:06","modified_gmt":"2018-08-17T18:45:06","slug":"news-6905","status":"publish","type":"post","link":"http:\/\/www.palada.net\/index.php\/2018\/08\/17\/news-6905\/","title":{"rendered":"A Bot Panic Hits Amazon Mechanical Turk"},"content":{"rendered":"<p><img decoding=\"async\" src=\"https:\/\/media.wired.com\/photos\/5b6df22751297c21002b4536\/master\/pass\/HackerBot.jpg\"\/><\/p>\n<p><strong>Credit to Author: Emily Dreyfuss| Date: Fri, 17 Aug 2018 15:38:35 +0000<\/strong><\/p>\n<p><span class=\"lede\">For the past <\/span>week, psychologists all over America have been freaking out.<\/p>\n<p>The cause of their agita was an observation by a psychology graduate student from the University of Minnesota named Max Hui Bai. Like many researchers, Bai uses Amazon\u2019s <a href=\"https:\/\/www.wired.com\/tag\/mechanical-turk\/\">Mechanical Turk<\/a> platform, where individuals sign up to complete simple tasks, such as taking surveys for academics or marketers, and earn a low fee. On Tuesday, August 7, he posed a simple question in a <a href=\"https:\/\/www.facebook.com\/groups\/psychmap\/permalink\/656859794690946\/\" target=\"_blank\">Facebook group<\/a> for psychology researchers: &quot;Have anyone used Mturk in the last few weeks and notice any quality drop?&quot;<\/p>\n<p class=\"paywall\">As he would later elaborate in a <a href=\"https:\/\/www.maxhuibai.com\/blog\/evidence-that-responses-from-repeating-gps-are-random\" target=\"_blank\">blog post<\/a>, Bai had found that the surveys he conducted with MTurk were full of nonsense answers to open-ended questions and respondents with duplicate GPS locations. He said he had to throw out nearly half of the data in his most recent survey, a sharp increase from what he was used to seeing. His Facebook post garnered 181 comments, with other researchers describing similar signs of low-quality data in their own recent work. A number of them wondered if the culprit was <a href=\"https:\/\/www.wired.com\/tag\/bots\/\">bots<\/a>\u2014automated programs mimicking human behavior, not the actual human labor MTurk is supposed to supply.<\/p>\n<p class=\"paywall\">The discussion soon spread over <a href=\"https:\/\/twitter.com\/kurtjgray\/status\/1027543640489512965\" target=\"_blank\">Twitter<\/a> and email, until it appeared the whole field was worried about MTurk. By Friday, <a href=\"https:\/\/www.newscientist.com\/article\/2176436-bots-on-amazons-mechanical-turk-are-ruining-psychology-studies\/\" target=\"_blank\"><em>New Scientist<\/em><\/a> ran an article with the headline \u201cBots on Amazon\u2019s MTurk Are Ruining Psychology Studies.\u201d One psychology professor mused on Facebook, \u201cI wonder if this is the end of MTurk research?\u201d<\/p>\n<p class=\"paywall\">If that were the case, it would be a pretty big deal. Thousands of published social science studies use MTurk survey data every year, according to Panos Ipeirotis, a data scientist at New York University\u2019s Stern School of Business.<\/p>\n<p>&quot;When most of us think of bots, we think of large networks of criminals, but a bot is just a tool for automation.&quot;<\/p>\n<p name=\"inset-left\" class=\"inset-left-component__el\">Reid Tatoris, Distil<\/p>\n<p class=\"paywall\">But here\u2019s the thing: It\u2019s hard to know for sure if what Bai reported was the result of bots run amok. There are plenty of explanations for junk responses on MTurk. Bai recognizes this. \u201cIt might be bots, it might be human-augmented bots, or it might be humans who are tired of taking the survey and are just randomly clicking the buttons,\u201d he says. It could also be the result of poor survey design, as Joe Miele, who operates an MTurk data consultancy, <a href=\"http:\/\/turkrequesters.blogspot.com\/2018\/08\/the-bot-problem-on-mturk.html\" target=\"_blank\">pointed out<\/a> in response to the uproar.<\/p>\n<p class=\"paywall\">And not all bot-like behavior on MTurk is considered bad. The platform&#x27;s <a href=\"https:\/\/www.mturk.com\/worker\/acceptable-use-policy\" target=\"_blank\">Acceptable Use Policy<\/a> says that Amazon is \u201cgenerally OK with you using scripts and automated tools\u201d to more efficiently preview and pick tasks. It\u2019s not uncommon for MTurk workers, or Turkers, to use scripts to help them find high-paying tasks they\u2019re suitable for and to accept them quickly. What you cannot do is complete those tasks using automated tools, because then you aren\u2019t using your human intelligence to do the job, and that\u2019s the whole point of MTurk. That hasn\u2019t stopped some people from reportedly using tools to automate filling out forms, but it\u2019s not clear yet whether their use is on the rise, or even that common. Amazon will only say this behavior is against its rules.<\/p>\n<p class=\"paywall\">\u201cThere are bots on MTurk and have been for years,\u201d says digital labor researcher Rochelle LaPlante, a former moderator of Reddit\u2019s <a href=\"https:\/\/www.reddit.com\/r\/mturk\/\" target=\"_blank\">r\/mturk subreddit<\/a>. \u201cI don\u2019t know if this new flare of discussion is actually an increase in bots, or just an increase in researchers talking about it and actively searching their data for it.\u201d<\/p>\n<p class=\"paywall\">When it launched in 2005, Mechanical Turk was a game-changer. It offered a wider pool for researchers than undergraduates on campus, who before online crowdsourcing had been the main participants in many of these studies, and for a relatively low cost. MTurk ushered in a &quot;<a href=\"https:\/\/daily.jstor.org\/amazons-mechanical-turk-has-reinvented-research\/\" target=\"_blank\">golden age<\/a>&quot; for social science research. Today, data gathered on the platform is used in thousands of studies a year.<\/p>\n<p class=\"paywall\">But all along there have also been reservations about the site and that data\u2019s reliability. People have worried they could get scammed\u2014requesters about getting back bad work, and workers about taking on tasks that never pay out. Marketers and researchers worried that the Turker population wasn\u2019t representative enough for their surveys. And <a href=\"https:\/\/www.wired.com\/story\/amazons-turker-crowd-has-had-enough\/\">the incredibly low pay<\/a>\u2014just a few cents per task\u2014has disconcerted labor activists and ethicists, who wonder if it\u2019s kosher for scientists to rely on laborers who are making so little. It also provides an incentive for workers to complete tasks as quickly as possible. Researchers do have control over how much they compensate Turkers for work and can opt to pay more if they want. At the same time, scientists are concerned about keeping costs down, and institutional review boards have often expressed concerns (<a href=\"https:\/\/www.thehastingscenter.org\/irb_article\/paying-research-participants-outsized-influence-undue-influence\/\" target=\"_blank\">wrongly, some say<\/a>) that high pay for human research subjects could be coercive.<\/p>\n<p class=\"paywall\">As more researchers used MTurk, they\u2019ve also discovered ways to mitigate many of these concerns. Ipeirotis has <a href=\"http:\/\/journal.sjdm.org\/10\/10630a\/jdm10630a.html\" target=\"_blank\">found that<\/a> the Turker population is just as representative as university undergraduates for survey populations, and the data could be just as reliable as long as people took proper precautions in designing their studies. As for bot work, experts say that researchers can avoid problems by setting up their surveys with stringent parameters and designing tasks that are difficult to automate.<\/p>\n<p class=\"paywall\">\u201cMost people are, by now, smart enough to deal with noise that comes from workers that do not pay attention, or from bots,\u201d Ipeirotis says. Notably, Bai and many of the researchers who reported an uptick in bad data this summer were using Captcha and attention checks, as experts advise, though Miele and some researchers in the Facebook group suggest perhaps their participant qualifications could have been stronger.<\/p>\n<p class=\"paywall\">Kathryn Johnson, a psychology professor at Arizona State University, spent the past week going back over her data to see if what Bai reported on his blog was true for her research. \u201cI usually have one MTurker study going a month,\u201d she says.<\/p>\n<p class=\"paywall\">She found the same two troubling things in her most recent MTurk studies: repeated GPS locations and nonsense answers to open-ended questions. But location data, whether an IP address or GPS, is not by itself a reliable indicator of fraudulent behavior, four different MTurk and bot experts told WIRED. So if that\u2019s the only suspicious thing that researchers are seeing in their results, they should not worry.<\/p>\n<p class=\"paywall\">The nonsense answers are more meaningful. A bunch of researchers told Bai that they had repeated instances of survey respondents replying \u201cnice\u201d or \u201cgood\u201d to open-ended questions for which those words made no sense.<\/p>\n<p class=\"paywall\">\u201cThere are browser extensions that fill forms with random answers, so I\u2019m certain some of [what they are seeing] is this,\u201d LaPlante says, but she is quick to note it could also be people not answering the surveys carefully. Repeated bad answers could be Turkers copying and pasting quickly so they can complete more surveys and earn more money. It could also mean people with poor English skills are taking the surveys, notes Miele.<\/p>\n<p class=\"paywall\">Though Amazon explicitly disallows bots that complete jobs on Mechanical Turk, the company is not very forthcoming about how big a problem they are on the platform. Perhaps it\u2019s because they haven\u2019t needed to be. Unlike Twitter, which in light of its <a href=\"https:\/\/www.wired.com\/story\/how-americans-wound-up-on-twitters-list-of-russian-bots\/\">well-known bot infestations<\/a> has had to be vocal about <a href=\"https:\/\/www.wired.com\/story\/twitter-continues-cleanup-cracks-down-on-malicious-apps\/\">kicking them off<\/a>, the possibility of bots on MTurk struck many people as news this week.<\/p>\n<p class=\"paywall\">Amazon also makes setting up multiple or fake accounts a bit more difficult, by requiring workers to provide valid tax information. But that doesn\u2019t prevent a verified person from supplementing their own MTurk labor with an automated system. It would be much easier to set a script on your account to complete a bunch of jobs while you sleep or go to your other job, for instance.<\/p>\n<p class=\"paywall\">\u201cWhen most of us think of bots, we think of large networks of criminals, but a bot is just a tool for automation. It could be used by one individual to say, instead of me making $5 a day in Amazon Turk, I\u2019m going to use it to make $20 a day. It\u2019s not necessarily nefarious or evil, but there is a gray area,\u201d says Reid Tatoris, vice president of product outreach and marketing at Distil, which detects and protects clients from automated attacks and bots. \u201cBut it\u2019s definitely not in compliance,\u201d he added.<\/p>\n<p>&quot;There are a dozen people I know of personally who run bots, and they get away with it.&quot;<\/p>\n<p name=\"inset-left\" class=\"inset-left-component__el\">Kristy Milland<\/p>\n<p class=\"paywall\">In response to this week\u2019s bot worries, a representative for Amazon told WIRED that the company suspends or terminates anyone found completing MTurk tasks by automated means. \u201cWe have both automated and manual mechanisms to detect fraud and misuse of the service by bots, and we are always improving these mechanisms as we discover new forms of abuse,\u201d the representative said.<\/p>\n<p class=\"paywall\">Amazon wouldn\u2019t say whether there has been an uptick in automated behavior on MTurk recently, nor would the company discuss specific examples of bots or accounts.<\/p>\n<p class=\"paywall\">\u201cThis has been going on since the beginning of Mechanical Turk, since forever,\u201d says Kristy Milland, who has conducted research on the platform and <a href=\"https:\/\/www.wired.com\/story\/the-crazy-hacks-one-woman-used-to-make-money-on-mechanical-turk\/\">worked as a Turker herself<\/a> for 12 years. She describes herself as an MTurk labor activist, working to encourage fair pay on the platform. \u201cThere are a dozen people I know of personally who run bots, and they get away with it,\u201d she says, adding that it would take her 30 seconds to write a simple script to fill in survey information automatically on MTurk.<\/p>\n<p class=\"paywall\">Forums for Turkers are full of conversations about scripts, some of which would violate Amazon\u2019s terms. You can also find YouTube videos showing Turkers how to write a script to auto-fill answers. In Milland\u2019s opinion, this behavior is in some ways inevitable thanks to the platform\u2019s policies.<\/p>\n<p class=\"paywall\">\u201cMechanical Turk workers have been treated really, really badly for 12 years, and so in some ways I see this as a point of resistance,&quot; she says. &quot;If we were paid fairly on the platform, nobody would be risking their account this way.\u201d<\/p>\n<p class=\"paywall\">Anyone who finds a possible bot account, or an account otherwise violating MTurk\u2019s terms, can alert Amazon via a <a href=\"https:\/\/requester.mturk.com\/contactus\" target=\"_blank\">contact form<\/a> on the site. Milland says she has sent Amazon multiple MTurk IDs that are running bots, but the accounts are still active.<\/p>\n<p class=\"paywall\">\u201cThey don\u2019t want to admit [bots are on the platform],\u201d Milland says. \u201cThere are enough people who don\u2019t know that such a thing is possible that they don\u2019t want to even let a whisper of the fact that it\u2019s a possibility out. So they won\u2019t talk about it. I send it on. I do not get a reply at all.\u201d<\/p>\n<p class=\"paywall\">Amazon\u2019s silence on the topic is striking, considering the level of concern among researchers. Members of the Facebook group where Bai originally posted about MTurk say they\u2019ve reached out to Amazon this week, but do not report hearing back. Last week, Bai created a questionnaire for researchers\u2014not on MTurk\u2014and is now leading a crowdsourced effort among social scientists to figure out how much of the bad data he has seen is new, how large the problem is, and how to stop it. He\u2019s still analyzing that survey, but he plans to send their research to Amazon in the hopes that the data will force the company to respond.<\/p>\n<p class=\"paywall\">If we think of Bai\u2019s quest like a scientific experiment, we just passed the hypothesis phase (\u201cThere may be more bad data on MTurk, and it may be due to bots\u201d) and have entered data-collection mode. The results aren\u2019t in yet.<\/p>\n<p class=\"related-cne-video-component__dek\">When Steve Jobs launched the iPhone in 2007, he said it was 5 years ahead of the competition and he was right. But after a decade, it&#39;s starting to feel like Apple needs something big again. And now, on cue, here comes something big.<\/p>\n<p><a href=\"https:\/\/www.wired.com\/story\/amazon-mechanical-turk-bot-panic\" target=\"bwo\" >https:\/\/www.wired.com\/category\/security\/feed\/<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p><img decoding=\"async\" src=\"https:\/\/media.wired.com\/photos\/5b6df22751297c21002b4536\/master\/pass\/HackerBot.jpg\"\/><\/p>\n<p><strong>Credit to Author: Emily Dreyfuss| Date: Fri, 17 Aug 2018 15:38:35 +0000<\/strong><\/p>\n<p>Concerned social scientists turned their analytical skills onto one of their most widely used research tools this week: Amazon&#8217;s Mechanical Turk.<\/p>\n","protected":false},"author":4,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"colormag_page_container_layout":"default_layout","colormag_page_sidebar_layout":"default_layout","footnotes":""},"categories":[10378,10607],"tags":[714],"class_list":["post-13138","post","type-post","status-publish","format-standard","hentry","category-security","category-wired","tag-security"],"_links":{"self":[{"href":"http:\/\/www.palada.net\/index.php\/wp-json\/wp\/v2\/posts\/13138","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/www.palada.net\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/www.palada.net\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/www.palada.net\/index.php\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"http:\/\/www.palada.net\/index.php\/wp-json\/wp\/v2\/comments?post=13138"}],"version-history":[{"count":0,"href":"http:\/\/www.palada.net\/index.php\/wp-json\/wp\/v2\/posts\/13138\/revisions"}],"wp:attachment":[{"href":"http:\/\/www.palada.net\/index.php\/wp-json\/wp\/v2\/media?parent=13138"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/www.palada.net\/index.php\/wp-json\/wp\/v2\/categories?post=13138"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/www.palada.net\/index.php\/wp-json\/wp\/v2\/tags?post=13138"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}