<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-6538286886012474786</id><updated>2012-02-15T23:55:10.638-08:00</updated><category term='clustering'/><category term='yahoo'/><category term='meetup'/><category term='data mining'/><category term='funny'/><category term='Rudy Giuliani'/><category term='reputation'/><category term='comic'/><category term='privacy'/><category term='youtube'/><category term='structural models.'/><category term='mind maps'/><category term='demo'/><category term='large datasets'/><category term='bayesian'/><category term='FROC'/><category term='propublica'/><category term='presentation'/><category term='ranked xml querying'/><category term='academia'/><category term='psychology'/><category term='string matching'/><category term='adwords'/><category term='evaluation'/><category term='information extraction'/><category term='charity'/><category term='amazon'/><category term='spam'/><category term='market for lemons'/><category term='wikis'/><category term='industry analysis'/><category term='wisdom of the crowds'/><category term='presidential elections 2008'/><category term='open access'/><category term='lda'/><category term='Mitt Romney'/><category term='probability'/><category term='embed'/><category term='fraud'/><category term='teaching'/><category term='humor'/><category term='powerpoint'/><category term='frequentist'/><category term='power law'/><category term='deduplication'/><category term='computer science'/><category term='keyword bidding'/><category term='mechanical turk'/><category term='reviews'/><category term='advice'/><category term='research'/><category term='reduced models'/><category term='customer service'/><category term='tutorial'/><category term='outliers'/><category term='acm'/><category term='honda'/><category term='data cleaning'/><category term='dagstuhl'/><category term='hcomp'/><category term='prediction markets'/><category term='incentives'/><category term='human computation'/><category term='wikipedia'/><category term='economics'/><category term='drm'/><category term='csdm'/><category term='surveys'/><category term='online advertising'/><category term='peer reviewing'/><category term='intellectual property'/><category term='search'/><category term='efficient markets'/><category term='microsoft'/><category term='Hillary Clinton'/><category term='online labor'/><category term='independence'/><category term='readability'/><category term='statistics'/><category term='machine learning'/><category term='ROC'/><category term='crowdsourcing'/><category term='publishers'/><category term='cognitive dissonance'/><category term='gmail'/><category term='dirichlet'/><category term='merger'/><category term='google'/><category term='typesetting'/><title type='text'>Top Site Map</title><subtitle type='html'></subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://topsitemap.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><link rel='next' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default?start-index=101&amp;max-results=100'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>175</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-7563714552779440360</id><published>2011-04-15T17:04:00.000-07:00</published><updated>2011-04-17T19:00:19.129-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='human computation'/><category scheme='http://www.blogger.com/atom/ns#' term='hcomp'/><title type='text'>Deadline for HCOMP 2011 extended: Submission due on April 29th</title><content type='html'>Due to a significant number of requests, and a number of conflicts with other conferences and workshops, we decided to extend the submission deadline for &lt;a href="http://www.humancomputation.com/"&gt;HCOMP 2011&lt;/a&gt;. The &lt;b&gt;new deadline is April 29th&lt;/b&gt;.&lt;br /&gt;&lt;br /&gt;If you want to know more, you can see the &lt;a href="http://behind-the-enemy-lines.blogspot.com/2011/02/3rd-human-computation-workshop-hcomp.html"&gt;call for papers and workshop announcement&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-7563714552779440360?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7563714552779440360'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7563714552779440360'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2011/04/deadline-for-hcomp-2011-extended.html' title='Deadline for HCOMP 2011 extended: Submission due on April 29th'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-2545035869290041164</id><published>2011-04-15T08:29:00.000-07:00</published><updated>2011-04-17T19:00:19.136-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='crowdsourcing'/><category scheme='http://www.blogger.com/atom/ns#' term='wisdom of the crowds'/><title type='text'>Video from NYC Crowdsourcing Meetup</title><content type='html'>On April 13th, we hosted at NYU Stern the &lt;a href="http://behind-the-enemy-lines.blogspot.com/2011/04/nyc-crowdsourcing-meetup-april-13th.html"&gt;NYC Crowdsourcing Meetup&lt;/a&gt;.&amp;nbsp;For those who missed it, you can now download an&amp;nbsp;&lt;a href="http://echo360.stern.nyu.edu:8080/ess/echo/presentation/3cca744c-8dc7-4c0d-9564-d8868fe3f177/media.mp3"&gt;audio-only podcast version&lt;/a&gt;&amp;nbsp;or watch the&amp;nbsp;&lt;a href="http://echo360.stern.nyu.edu:8080/ess/echo/presentation/3cca744c-8dc7-4c0d-9564-d8868fe3f177"&gt;video from the event together with the slide presentations&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://echo360.stern.nyu.edu:8080/ess/echo/presentation/3cca744c-8dc7-4c0d-9564-d8868fe3f177"&gt;&lt;img border="0" height="313" src="http://4.bp.blogspot.com/-jFmqv6pObxM/TahjhbKIWBI/AAAAAAAAlu0/Y9GNVKlqIZM/s320/crowdsourcing-meetup-video-snapshot.PNG" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;The speakers at the event:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;John Horton&lt;/b&gt;, Staff Economist of oDesk. John talked on issues of matching employers with contractors in an online marketplace. Specifically he described mechanisms for forcing contractors to give an accurate description of their skills, avoiding issues of over-tagging a profile with irrelevant keywords or over-claiming qualifications.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Amanda Michel&lt;/b&gt;, Director of Distributed Reporting at ProPublica. Amanda talked about the crowdsourcing efforts of ProPublica, and how they use the crowd to enable better journalistic investigation of topics they are researching. At some point during the presentation, Amanda quoted from &lt;a href="http://www.propublica.org/article/our-stimulus-spot-check-summer-wave-of-projects-nears-crest-817"&gt;one of their studie&lt;/a&gt;s "&lt;i&gt;ProPublica pulled a random sample of 520 of the roughly 6,000 approved projects to examine stimulus progress around the country. That sample is large enough to estimate national patterns with a margin of error of plus or minus 4.5 percentage points&lt;/i&gt;." Honestly, a tear came down my eye when I compared that with the corresponding practices of Greek newsrooms that typically operate with samples of n=1 or n=0.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Todd Carter&lt;/b&gt;, CEO and Co-Founder of Tagasauris. Todd described Tagasauris, a system for annotating and tagging media files. Todd described the annotation effort for &lt;a href="http://www.magnumphotos.com/"&gt;Magnum Photos&lt;/a&gt;, (sample photos in their collection include the &lt;a href="http://www.magnumphotos.com/C.aspx?VP3=ViewBox_VPage&amp;amp;VBID=2K1HZOXHAFXW3&amp;amp;IT=ZoomImage01_VForm&amp;amp;IID=2S5RYDYF53IF&amp;amp;PN=8&amp;amp;CT=Search"&gt;Afghan refugee girl&lt;/a&gt;, &lt;a href="http://www.magnumphotos.com/C.aspx?VP3=ViewBox_VPage&amp;amp;VBID=2K1HZOXHAQ5TH&amp;amp;CT=Search&amp;amp;DT=Image"&gt;Merilyn Monroe on top of the vent&lt;/a&gt;, and many other iconic photos). A highlight was the discovery of a "lost" set of images from the shooting of the movie "American Graffiti". These images, shot by Dennis Stock, were in the Magnum archive but were not possible to find as they were lacking any tags and description. After the annotation effort from Tagasauris, the lost set of photos &lt;a href="http://www.magnumphotos.com/C.aspx?VP3=ViewBox_VPage&amp;amp;VBID=2K1HZOXHARHQF&amp;amp;CT=Search&amp;amp;DT=Image"&gt;were re-discovered&lt;/a&gt;.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Panos Ipeirotis&lt;/b&gt;, representing AdSafe Media. I talked about our efforts in AdSafe, on using crowdsourcing in order to create machine learning systems for classifying web pages.&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;It was a lively and&amp;nbsp;successful&amp;nbsp;event. If there is enough interest and participants, I think this is an event that can be repeated periodically.&lt;/div&gt;&lt;a href="http://www.blogger.com/"&gt;&lt;/a&gt;&lt;span id="goog_1773302289"&gt;&lt;/span&gt;&lt;span id="goog_1773302290"&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-2545035869290041164?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/2545035869290041164'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/2545035869290041164'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2011/04/video-from-nyc-crowdsourcing-meetup.html' title='Video from NYC Crowdsourcing Meetup'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/-jFmqv6pObxM/TahjhbKIWBI/AAAAAAAAlu0/Y9GNVKlqIZM/s72-c/crowdsourcing-meetup-video-snapshot.PNG' height='72' width='72'/></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-1322861131465573283</id><published>2011-04-10T08:16:00.000-07:00</published><updated>2011-04-17T19:00:19.140-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='crowdsourcing'/><category scheme='http://www.blogger.com/atom/ns#' term='meetup'/><category scheme='http://www.blogger.com/atom/ns#' term='wisdom of the crowds'/><category scheme='http://www.blogger.com/atom/ns#' term='online labor'/><title type='text'>NYC Crowdsourcing Meetup: April 13th, 6.30pm</title><content type='html'>Join us for its first ever &lt;a href="http://www.meetup.com/Distributed-Work/events/17149053/"&gt;New York City Crowdsourcing meetup&lt;/a&gt; hosted by NYU and sponsored by &lt;a href="http://crowdflower.com/"&gt;CrowdFlower&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;When&lt;/b&gt;: Wednesday, April 13,&amp;nbsp;6:30-9pm&lt;/li&gt;&lt;li&gt;&lt;b&gt;Where&lt;/b&gt;: NYU Stern School of Business,&amp;nbsp;Room M3-110,&amp;nbsp;&lt;a href="http://goo.gl/maps/rVxm"&gt;44 West 4th Street, New York, NY 10012&lt;/a&gt;&amp;nbsp;&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;Pizza, beer, and thought provoking conversation about the future of work.&amp;nbsp;Come listen, ask, and debate how crowdsourcing is changing everything from philanthropy and urban planing to creative design and enterprise solutions.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Confirmed Speakers:&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Lukas Biewald, CEO and Co-Founder of CrowdFlower&lt;/li&gt;&lt;li&gt;Todd Carter, CEO and Co-Founder of Tagasauris&lt;/li&gt;&lt;li&gt;John Horton, Chief Economist of oDesk&lt;/li&gt;&lt;li&gt;Panos Ipeirotis, Associate Professor at Stern School of Business, NYU&lt;/li&gt;&lt;li&gt;Amanda Michel, Director of Distributed Reporting at ProPublica&lt;/li&gt;&lt;li&gt;Bartek Ringwelski, CEO and Co-Founder of SkillSlate&lt;/li&gt;&lt;li&gt;Trebor Scholz, Associate Professor in Media &amp;amp; Culture at The New School University&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-1322861131465573283?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/1322861131465573283'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/1322861131465573283'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2011/04/nyc-crowdsourcing-meetup-april-13th.html' title='NYC Crowdsourcing Meetup: April 13th, 6.30pm'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-5726933194961468238</id><published>2011-04-05T13:08:00.000-07:00</published><updated>2011-04-17T19:00:19.146-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='crowdsourcing'/><category scheme='http://www.blogger.com/atom/ns#' term='surveys'/><category scheme='http://www.blogger.com/atom/ns#' term='wisdom of the crowds'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='tutorial'/><category scheme='http://www.blogger.com/atom/ns#' term='computer science'/><category scheme='http://www.blogger.com/atom/ns#' term='human computation'/><title type='text'>Tutorial on Crowdsourcing and Human Computation</title><content type='html'>Last week, together with Praveen Paritosh from Google, we presented a 6-hour tutorial at the WWW 2011 conference, on crowdsourcing and human computation. The title of the tutorial was "&lt;a href="http://www.www2011india.com/tutorialstr26.html"&gt;Managing Crowdsourced Human Computation&lt;/a&gt;".&lt;br /&gt;&lt;br /&gt;My slides from the tutorial are &lt;a href="http://www.slideshare.net/ipeirotis/managing-crowdsourced-human-computation"&gt;available now on Slideshare&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;center&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;&lt;div id="__ss_7526103" style="width: 425px;"&gt;&lt;iframe frameborder="0" height="355" marginheight="0" marginwidth="0" scrolling="no" src="http://www.slideshare.net/slideshow/embed_code/7526103" width="425"&gt;&lt;/iframe&gt;&lt;/div&gt;&lt;/div&gt;&lt;/center&gt;&lt;br /&gt;&lt;br /&gt;Once Praveen gets clearance from Google, we will post his slides as well. &lt;br /&gt;&lt;br /&gt;Judging from all the crap that I get to review lately, I was getting pessimistic about the quality of research on crowdsourcing. However, while preparing the tutorial, I realized the massive amount of high-quality research that is being published. We had 6 hours for the tutorial, and we did not have enough time to cover many really interesting papers. I had to refer people to other, more "specialized" tutorials (e.g., on linguistic annotation, on search relevance, etc), which I mention at the end of the slides.&lt;br /&gt;&lt;br /&gt;Special thanks go to my PhD student, &lt;a href="http://pages.stern.nyu.edu/~jwang5/"&gt;Jing Wang&lt;/a&gt;, for her slides on market design, &lt;a href="http://www.ischool.utexas.edu/~ml/"&gt;Matt Lease&lt;/a&gt; for his excellent &lt;a href="http://ir.ischool.utexas.edu/crowd/"&gt;list of pointers for crowdsourcing resources&lt;/a&gt;, &lt;a href="http://wwwcsif.cs.ucdavis.edu/~alonsoom/"&gt;Omar Alonso&lt;/a&gt; for his tutorial slides on crowdsourcing for search relevance, &lt;a href="http://alexquinn.org/"&gt;Alex Quinn&lt;/a&gt; and &lt;a href="http://www.cs.umd.edu/~bederson/"&gt;Ben Bederson&lt;/a&gt; for their survey on human computation, and Winter Mason for sharing his slides from his CSDM keynote. And all the other researchers for making crowdsourcing and human computation an exciting field for research!&lt;br /&gt;&lt;br /&gt;Last but not least: Luis von Ahn with Edith Law will be presenting another tutorial on human computation during AAAI, in San Francisco on August 8th. We will be organizing the &lt;a href="http://humancomputation.com/"&gt;HCOMP 2011 workshop&lt;/a&gt; in conjunction with AAAI as well! The submission deadline is April 22nd! Do not forget to submit!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-5726933194961468238?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/5726933194961468238'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/5726933194961468238'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2011/04/tutorial-on-crowdsourcing-and-human.html' title='Tutorial on Crowdsourcing and Human Computation'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-7293830741322824113</id><published>2011-04-05T05:41:00.000-07:00</published><updated>2011-04-17T19:00:19.150-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><title type='text'>An ingenious application of crowdsourcing: Fix reviews' grammar, improve sales</title><content type='html'>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;I have been doing &lt;a href="http://pages.stern.nyu.edu/~panos/publications/kdd2007.pdf"&gt;research&lt;/a&gt; &lt;a href="http://pages.stern.nyu.edu/~panos/publications/icis2010.pdf"&gt;on&lt;/a&gt; the &lt;a href="http://pages.stern.nyu.edu/~panos/publications/jrpm2009.pdf"&gt;economic&lt;/a&gt; &lt;a href="http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1024903"&gt;impact&lt;/a&gt; &lt;a href="http://pages.stern.nyu.edu/~panos/publications/tkde2010-usefulness.pdf"&gt;of&lt;/a&gt; &lt;a href="http://pages.stern.nyu.edu/~panos/publications/icec2007.pdf"&gt;product&lt;/a&gt; &lt;a href="http://pages.stern.nyu.edu/~panos/publications/acl2007.pdf"&gt;reviews&lt;/a&gt; &lt;a href="http://pages.stern.nyu.edu/~panos/publications/www2011.pdf"&gt;for a&lt;/a&gt; &lt;a href="http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO2&amp;amp;Sect2=HITOFF&amp;amp;p=1&amp;amp;u=/netahtml/PTO/search-bool.html&amp;amp;r=1&amp;amp;f=G&amp;amp;l=50&amp;amp;co1=AND&amp;amp;d=PTXT&amp;amp;s1=7848979.PN.&amp;amp;OS=PN/7848979&amp;amp;RS=PN/7848979"&gt;while&lt;/a&gt;. One thing that we have noticed is that the quality of the reviews can have an impact on product sales,&lt;i&gt; independently of the polarity of the review&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;High-quality reviews improve product sales&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;A well-written review tends to inspire confidence about the product, even if the review is negative. Typically such reviews are perceived as objective and thorough. So, if we have a high-quality, but negative, review this may serve as a guarantee that the negative aspects of the product are not that bad after all. For example, a negative review such as "&lt;i&gt;horrible battery life... in my tests battery lasts barely longer than 24 hours...&lt;/i&gt;" may be perceived as positive  by other customers that consider a 24-hour batter life to be more than sufficient.&lt;br /&gt;&lt;br /&gt;In our recent (&lt;a href="http://www.www2011india.com/"&gt;award-winning&lt;/a&gt;) &lt;a href="http://pages.stern.nyu.edu/~panos/publications/www2011.pdf"&gt;WWW2011 paper "Towards a Theory Model for Product Search"&lt;/a&gt;, we noticed that demand for a hotel increases if the online reviews on TripAdvisor and Travelocity are well-written, without spelling errors; this holds no matter if the review is positive or negative. In our &lt;a href="http://pages.stern.nyu.edu/~panos/publications/tkde2010-usefulness.pdf"&gt;TKDE paper "Estimating the Helpfulness and Economic Impact of Product Reviews: Mining Text and Reviewer Characteristics"&lt;/a&gt;, we observed similar trends for products sold and reviewed on Amazon.com.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;And what can we do knowing this?&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Being in a business school, these findings were considered informative but not deeply interesting. Do not forget, the focus of researchers in business schools is &lt;a href="http://behind-the-enemy-lines.blogspot.com/2009/09/different-attitudes-of-computer.html"&gt;centered on causality and on policy-making&lt;/a&gt;. Yes, we know that it is important for the reviews to be well-written and informative if we want the product to sell well. But if we cannot do anything about this, it is not deeply interesting. It is almost like knowing that during the cold months the demand for summer resorts drops!&lt;br /&gt;&lt;br /&gt;But here comes the twist...&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;The crowdsourcing solution&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Last week, over drinks during the WWW conference, I learned about a fascinating application of crowdsourcing that attacked exactly this issue.&lt;br /&gt;&lt;br /&gt;An online retailer noticed that indeed products with high-quality reviews are selling well. So, they decided to take action. They used Amazon Mechanical Turk to improve the quality of its reviews. Using the &lt;a href="http://groups.csail.mit.edu/uid/other-pubs/soylent.pdf"&gt;Find-Fix-Verify pattern&lt;/a&gt;, they used Mechanical Turk to examine a few &lt;b&gt;&lt;i&gt;millions &lt;/i&gt;&lt;/b&gt;of product reviews. (Here are the archived versions of the HITs: &lt;a href="http://mturk-tracker.com/hit/79f44798e8c296e29bddbd3a3aa8f60a/"&gt;Find&lt;/a&gt;, &lt;a href="http://mturk-tracker.com/hit/160L5FUZB7DDN03NIAB2D4J8HRZ022/"&gt;Fix&lt;/a&gt;, &lt;a href="http://mturk-tracker.com/hit/1QPDXYWLI7K28IZ6MAP7WEO0N6V03B/"&gt;Verify&lt;/a&gt;... and if you have not figured out the firm name by now, it is Zappos :-) ) For the reviews with mistakes, &lt;b&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="color: #660000;"&gt;they fixed the spelling and grammar errors! Thus they effectively improved the quality of the reviews on their website. And, correspondingly, they improved the demand for their products!&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;While I do not know the exact revenue improvement, I was told that it was substantial. Given that the e-tailer  spent at least 10 cents per review, and that they examined approximately 5 million reviews, this is an expense of a few hundred thousand dollars. (My archive on MTurk-Tracker kind of confirms these numbers.) So, the expected revenue improvement should have been at least a few million dollars!&lt;br /&gt;&lt;br /&gt;Ethical? I would say yes. Notice that they are not fixing the polarity or the content of the reviews. They just change the language to be correct and error-free. I can see the counter-argument that the writing style allows us to judge if the review is serious or not. So, artificially improving the writing style may be considered as interference with the perceived objectivity of the user-generated reviews.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: #660000;"&gt;But is it ingenious? Yes!&lt;/span&gt;&lt;/b&gt; It is one of these solutions that is sitting in front of you but you just cannot see it. And this is what makes it ingenious.&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-7293830741322824113?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7293830741322824113'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7293830741322824113'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2011/04/ingenious-application-of-crowdsourcing.html' title='An ingenious application of crowdsourcing: Fix reviews&amp;#39; grammar, improve sales'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-4324718604484868155</id><published>2011-03-22T12:25:00.000-07:00</published><updated>2011-04-17T19:00:19.154-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='crowdsourcing'/><category scheme='http://www.blogger.com/atom/ns#' term='industry analysis'/><category scheme='http://www.blogger.com/atom/ns#' term='efficient markets'/><title type='text'>Crowdsourcing goes professional: The rise of the verticals</title><content type='html'>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;Over the last few months, I see a trend. Instead of letting end-users interact directly with the crowd (e.g., on Mechanical Turk), we see a rise of the number of solutions that target a very specific vertical.&lt;br /&gt;&lt;ul style="text-align: left;"&gt;&lt;li&gt;&lt;a href="http://castingwords.com/"&gt;CastingWords&lt;/a&gt;: Audio transcription&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.speakertext.com/"&gt;SpeakerText&lt;/a&gt;: Video transcription&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.serv.io/translation"&gt;Serv.io Translate&lt;/a&gt;: Translation (by &lt;a href="http://www.cloudcrowd.com/"&gt;CloudCrowd&lt;/a&gt;)&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.serv.io/edit"&gt;Serv.io Edit&lt;/a&gt;: Proofreading (by &lt;a href="http://www.cloudcrowd.com/"&gt;CloudCrowd&lt;/a&gt;)&lt;/li&gt;&lt;li&gt;&lt;a href="http://crowdflower.com/"&gt;CrowdFlower&lt;/a&gt;&amp;nbsp;&lt;a href="http://crowdflower.com/solutions/blv/index"&gt;Business Listing Verification&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://crowdflower.com/"&gt;CrowdFlower&lt;/a&gt;&amp;nbsp;&lt;a href="http://crowdflower.com/solutions/search_rel/index"&gt;Search Relevance&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://crowdflower.com/"&gt;CrowdFlower&lt;/a&gt;&amp;nbsp;&lt;a href="http://crowdflower.com/solutions/prod_cat/index"&gt;Product Categorization&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.betterocr.com/"&gt;BetterOCR&lt;/a&gt;: Improving Optical Character Recognition&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.tagasauris.com/"&gt;Tagasauris&lt;/a&gt;: Photo tagging&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.mediapiston.com/"&gt;MediaPiston&lt;/a&gt;:&amp;nbsp;Targeted content generation&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;Add services like &lt;a href="http://www.trada.com/"&gt;Trada&lt;/a&gt;&amp;nbsp;for crowd-optimizing paid advertising campaigns, &lt;a href="http://www.utest.com/"&gt;uTest&lt;/a&gt;&amp;nbsp;for crowd-testing software applications, etc. and you will see that for most crowd applications there is now a professionally developed crowd-app.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Why do we see these efforts? This is the time that most people realize that crowdsourcing is not that simple. Using Mechanical Turk directly &lt;a href="http://engineeringblog.yelp.com/2011/02/towards-building-a-high-quality-workforce-with-mechanical-turk.html"&gt;is a very costly enterprise&lt;/a&gt; and &lt;b&gt;c&lt;i&gt;annot be done effectively by amateurs&lt;/i&gt;&lt;/b&gt;: The interface needs to be professionally designed, quality control needs to be done intelligently, and the crowd needs to be managed in the same way that any employee is managed.&amp;nbsp;Most companies do not have time or the resources to invest in such solutions. So, we see the rise of such verticals that address the most common tasks that were accomplished on Mechanical Turk.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;(Interestingly enough, if I remember correctly, the rise of vertical solutions was also a phase during web search. In the period in which AltaVista started being spammed and full of irrelevant results, we saw the rise of topic-specific search engines that were trying to eliminate the problems of polysemy by letting you search only for web pages within a given topic.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;For me, this is the signal that &lt;b&gt;crowdsourcing will stop being the fad of the day&lt;/b&gt;. Amateurish solutions will be shunned, and most people will find it cheaper to just use the services of the verticals above. Saying "oh, I paid just $[&lt;i&gt;add offensively low dollar amount&lt;/i&gt;] to do [&lt;i&gt;add trivial task&lt;/i&gt;] on Mechanical Turk" will stop being a novelty and people will just point to a company that does the same thing&amp;nbsp;professionally&amp;nbsp;and in a large scale.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;This also means that &lt;b&gt;the crowdsourcing space will become increasingly "boring."&lt;/b&gt; All the low-hanging fruits will be gone. Only people that are willing to invest time and effort in the long term will get into the space.&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;And it will be the time that we will get to separate the wheat from the chaff.&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-4324718604484868155?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/4324718604484868155'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/4324718604484868155'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2011/03/crowdsourcing-goes-professional-rise-of.html' title='Crowdsourcing goes professional: The rise of the verticals'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-535569957737925372</id><published>2011-03-16T10:50:00.000-07:00</published><updated>2011-04-17T19:00:19.158-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='online advertising'/><category scheme='http://www.blogger.com/atom/ns#' term='spam'/><category scheme='http://www.blogger.com/atom/ns#' term='fraud'/><title type='text'>Uncovering an advertising fraud scheme. Or "the Internet is for porn"</title><content type='html'>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;You have heard about fraud and online advertising. You may have seen the &lt;a href="http://online.wsj.com/video/digits-porn-sites-scam-advertisers/5FBF57B5-3063-4A45-B55D-47B98C006862.html"&gt;Wall Street Journal video &amp;nbsp;"Porn Sites Scam Advertisers"&lt;/a&gt;, or even read the story at today's Wall Street Journal about "&lt;a href="http://online.wsj.com/article/SB10001424052748704893604576200383793893712.html"&gt;Off Screen, Porn Sites Trick Advertisers&lt;/a&gt;" (&lt;i&gt;Hint&lt;/i&gt;: to avoid the WSJ paywall, search the title of the article through Google News and click from there, to read the full article).&lt;br /&gt;&lt;br /&gt;Since I am intimately familiar with the story covered by WSJ (i.e., I was part of the team at &lt;a href="http://www.adsafemedia.com/"&gt;AdSafe&lt;/a&gt; that uncovered it), I thought it would be also good to cover the technical aspects in more detail, uncovering the way in which this advertising fraud scheme operated.&lt;br /&gt;&lt;br /&gt;It is long but (I think) interesting. &lt;b&gt;It is a story of a one-man-making-a-million-dollar-per-month fraud scheme&lt;/b&gt;. It shows how a moderately sophisticated advertising fraud scheme can generate very significant monetary benefits for the fraudster: Profits of millions of dollars per year.&lt;br /&gt;&lt;br /&gt;If you want to skip the technical sleuthing details, you can skip directly to the &lt;a href="http://behind-the-enemy-lines.blogspot.com/2011/03/uncovering-advertising-fraud-scheme.html#overall"&gt;overall picture&lt;/a&gt; and the discussion.&lt;br /&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border-bottom-width: 1px; border-color: initial; border-left-width: 1px; border-right-width: 1px; border-style: initial; border-top-width: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: #990000;"&gt;Disclaimer&lt;/span&gt;: In the story below, I will only mention by name the sites performing the fraudulent activities. All the brand names that you see are just for illustration purposes. They are &lt;span class="Apple-style-span" style="color: #cc0000;"&gt;not &lt;/span&gt;the ones affected by this case of fraud. Also remember that this is a personal blog. The views and opinions that I express here are my own and do not necessarily represent the views of &lt;a href="http://www.adsafemedia.com/"&gt;AdSafe&lt;/a&gt;&amp;nbsp;or the views of &lt;a href="http://www.stern.nyu.edu/"&gt;New York University&lt;/a&gt;.&lt;/b&gt;&lt;/div&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border-bottom-width: 1px; border-color: initial; border-left-width: 1px; border-right-width: 1px; border-style: initial; border-top-width: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;i&gt;&lt;b&gt;The erroneous classifier&lt;/b&gt;&lt;/i&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;It all started while working at &lt;a href="http://www.adsafemedia.com/"&gt;AdSafe&lt;/a&gt;.  For those not familiar with AdSafe: The role of AdSafe is to provide brand protection services to online advertisers. In plain English, AdSafe analyzes website content and can block ads from appearing on individual web pages with content inappropriate for a brand. Porn, hate speech, gambling, celebrity gossip, torrents, are among the many categories that we detect.&lt;br /&gt;&lt;br /&gt;On a nice Monday, the data science team gets the notification: The web page classifier was detecting a large number of porn web pages within legitimate, clean, big-brand-name websites. Think of websites such as BabyCenter, MSN MoneyCentral, HGTV, and so on. These sites would never have anything racy in their pages. However, we could see them being classified as having hard-core porn!&lt;br /&gt;&lt;br /&gt;Why do we detect porn in clean sites? None of the pages within the sites contained anything offensive. No porn, no offensive material. Nothing. The website was clean as it gets. What was going on?&lt;br /&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border-bottom-width: 1px; border-color: initial; border-left-width: 1px; border-right-width: 1px; border-style: initial; border-top-width: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;i&gt;&lt;b&gt;The invisible iframe hosting &lt;/b&gt;&lt;/i&gt;&lt;/div&gt;&lt;br /&gt;The lifesaver was a technique developed at AdSafe: The key to the solution was the ability to read the address of the top frame that was hosting the ad&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;(*)&lt;/span&gt;. We were detecting porn because the ads that were supposed to appear within a "clean publisher" site &lt;b&gt;&lt;i&gt;were appearing within the frame of a porn website. &lt;/i&gt;&lt;/b&gt;Think of HGTV as an illustrative, &lt;i&gt;but not the real&lt;/i&gt;, example of such a "clean publisher."&lt;br /&gt;&lt;br /&gt;&lt;i&gt;(*) For the technically curious: reading the address of the top frame is a challenging problem. For security reasons, browsers do not allow cross-domain scripting. So, it is not possible to just call the "top" object and read its properties. We have a proprietary solution for this.&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;By using this technique, we got our explanation: The HGTV website was appearing within an iframe of a porn website. In our case, the porn website was &lt;a href="http://www.hqtubevideos.com/"&gt;www.hqtubevideos.com&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;WTF? This made no sense. Why would the porn website display HGTV (and the associated ads) within an iframe? Why would the porn site generate this "invisible" traffic towards HGTV? Just so that HGTV would get paid for the CPM ads? Or was the porn site trying to decrease the clickthrough rate of HGTV and ruin the performance the CPC campaigns? Did the porn website love HGTV so much and it was trying to increase its traffic? No way. Did HGTV employ a porn website to increase its traffic? No way, either.&lt;br /&gt;&lt;br /&gt;Made no sense whatsoever.&lt;br /&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Checking the structure of the porn website&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;So, we decided to investigate. Let's see what is going on. First, we go and see the HTML source of &lt;a href="http://www.hqtubevideos.com/play.html"&gt;www.hqtubevideos.com/play.html&lt;/a&gt; that was the top frame what we were detecting. Here is the source:&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh5.googleusercontent.com/-GXHdZyVj3DI/TXEktLQfshI/AAAAAAAAhsM/Xa5o-DOTgYE/s1600/hqtube-play_html.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="317" src="https://lh5.googleusercontent.com/-GXHdZyVj3DI/TXEktLQfshI/AAAAAAAAhsM/Xa5o-DOTgYE/s400/hqtube-play_html.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;/div&gt;&lt;br /&gt;The highlighted part shows an interesting redirection. We go to the &lt;a href="http://www.hqtubevideos.com/index.php"&gt;www.hqtubevideos.com/index.php&lt;/a&gt;, but with the parameter &lt;b&gt;?x=1&lt;/b&gt; at the end.&lt;br /&gt;&lt;br /&gt;Loading the page &lt;a href="http://www.hqtubevideos.com/index.php"&gt;www.hqtubevideos.com/index.php&lt;/a&gt; without this parameter loads a "vanilla" porn website. A few semi-suspicious attempts to add the website in the bookmarks in the beginning. Then porn pictures and links to affiliate sites. Plenty of porn but nothing to set an alarm. So far, so good.&lt;br /&gt;&lt;br /&gt;They key, though, is this parameter &lt;b&gt;?x=1&lt;/b&gt;. Loading the &lt;a href="http://www.hqtubevideos.com/index.php?x=1"&gt;www.hqtubevideos.com/index.php?x=1&lt;/a&gt; &lt;b&gt;we see a key part added in the website, at the very bottom. &lt;/b&gt;Here is the corresponding source.&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;&lt;span class="Apple-style-span" style="font-family: monospace;"&gt;&lt;span class="Apple-style-span" style="white-space: pre-wrap;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Times New Roman'; white-space: normal;"&gt;&lt;a href="http://3.bp.blogspot.com/-G28bMoOBVd0/TWXaItPW4iI/AAAAAAAAhnQ/7kXWSGX0iwQ/s1600/hqtubevideos-index_html-iframecode.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="50" src="http://3.bp.blogspot.com/-G28bMoOBVd0/TWXaItPW4iI/AAAAAAAAhnQ/7kXWSGX0iwQ/s400/hqtubevideos-index_html-iframecode.PNG" width="400" /&gt;&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;br /&gt;Aha! A 0x0 iframe, loading the following URLs:&lt;br /&gt;&lt;ul style="text-align: left;"&gt;&lt;li&gt;&lt;a href="http://www.hqtubevideos.com/counter.php"&gt;www.hqtubevideos.com/counter.php&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.hqtubevideos.com/counter2.php"&gt;www.hqtubevideos.com/counter2.php&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;The first URL seems to be loading some randomized hashid. Ignore.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The second URL, &lt;a href="http://www.hqtubevideos.com/counter2.php"&gt;www.hqtubevideos.com/counter2.php&lt;/a&gt;, is a little bit more interesting and puzzling:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-yBSORQvlIro/TWXcSly1Z5I/AAAAAAAAhnY/xJHTO-3H7Bk/s1600/hqtubevideos-counter2.php.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="101" src="http://4.bp.blogspot.com/-yBSORQvlIro/TWXcSly1Z5I/AAAAAAAAhnY/xJHTO-3H7Bk/s400/hqtubevideos-counter2.php.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;What is going on? Why would a porn site link to these domains? &lt;b&gt;What is the connection? &lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border-bottom-width: 1px; border-color: initial; border-left-width: 1px; border-right-width: 1px; border-style: initial; border-top-width: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;b&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;b&gt;&lt;i&gt;The parked domains&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;We started by doing a &lt;i&gt;whois &lt;/i&gt;to figure out the ownership of this domains. Unfortunately, the registration information for the hqtubevideos is private and protected. However, the registration info for all the other domains is available. Not surprisingly, we see a common ownership for all these seven domains:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;Registrant:&lt;br /&gt;Thomas Schneider&lt;br /&gt;519 S. York Road&lt;br /&gt;Dillsburg, Pennsylvania 17019&lt;br /&gt;United States&lt;br /&gt;Registered through: GoDaddy.com, Inc. (http://www.godaddy.com)&lt;br /&gt;Domain Name: RELAXHEALTH.COM&lt;br /&gt;Created on: 11-Mar-09&lt;br /&gt;Expires on: 11-Mar-12&lt;br /&gt;Last Updated on: 10-Jan-11&lt;br /&gt;Administrative Contact:&lt;br /&gt;Schneider, Thomas  garret.and@gmail.com&lt;br /&gt;519 S. York Road&lt;br /&gt;Dillsburg, Pennsylvania 17019&lt;br /&gt;United States&lt;br /&gt;7174327575      Fax --&lt;br /&gt;Technical Contact:&lt;br /&gt;Schneider, Thomas  garret.and@gmail.com&lt;br /&gt;519 S. York Road&lt;br /&gt;Dillsburg, Pennsylvania 17019&lt;br /&gt;United States&lt;br /&gt;7174327575      Fax --&lt;br /&gt;Domain servers in listed order:&lt;br /&gt;NS1.ROLENEWS.COM&lt;br /&gt;NS2.ROLENEWS.COM&lt;/blockquote&gt;&lt;br /&gt;Now we start seeing something being uncovered. Would this guy, Thomas Schneider, be behind this? Too easy to be true. We went and did a reverse whois to find other domains that contained the email &lt;b&gt;garret.and@gmail.com&lt;/b&gt;. And here we are: The email is associated with the registration of 89 other domains, which are registered under a variety of last names, but all listing &lt;b&gt;garret.and@gmail.com &lt;/b&gt;as the contact email:&lt;br /&gt;&lt;blockquote&gt;&lt;div style="text-align: center;"&gt;aboutclimax.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;aboutclinical.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;aboutcouples.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;abouterectile.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;abouterection.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;achieveday.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;achievedrugs.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;afterdeaths.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;afterdrugs.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;associatedmagazine.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;atlantea.org&lt;/div&gt;&lt;div style="text-align: center;"&gt;baldnesshealth.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;basehealth.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;becomeerect.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;begineducate.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;behaviordesire.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;beingdizzy.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;bestcialis.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;bestclimax.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;bigcouples.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;bodychemical.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;bodyclimax.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;bodyday.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;bundlehealth.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;calnam.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;cancerdamage.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;carecouples.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;carloschongdds.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;ceaifa.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;cialisc.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;cigarettesfinder.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;clubofheads.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;coacaz.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;college-grants1.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;college-scholarships1.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;collinshall.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;conditionnews.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;couponvi.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;criminaldefenseattorneys2.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;criminaldefenselawfirms1.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;detailedhealth.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;drinkershealth.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;drinkingmagazine.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;eurovision-2008.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;experiencemedical.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;fantasiesmagazine.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;fearhealth.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;gendergibe.org&lt;/div&gt;&lt;div style="text-align: center;"&gt;government-grants1.org&lt;/div&gt;&lt;div style="text-align: center;"&gt;groupovienna.net&lt;/div&gt;&lt;div style="text-align: center;"&gt;hardballdollars.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;hawgsandpaws.org&lt;/div&gt;&lt;div style="text-align: center;"&gt;impotencemagazine.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;letscurepeyronies.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;levitrav.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;medicationmagazine.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;moorehabitat.org&lt;/div&gt;&lt;div style="text-align: center;"&gt;nighttimemagazine.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;ownmeds.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;panimarock.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;playmeds.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;powerfulselling.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;printcoupons1.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;propeciav.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;relationshipmeds.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;relaxhealth.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;rml-inc.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;rxvis.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;savewhalompark.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;sex-tvs.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;shopwizz.biz&lt;/div&gt;&lt;div style="text-align: center;"&gt;signbysign.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;steve-magic.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;styleandmore.net&lt;/div&gt;&lt;div style="text-align: center;"&gt;syncsql.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;takemedical.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;taylor-training.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;testosteronehealth.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;thedongman.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;traumamedical.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;twohealth.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;viagracomp.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;viagraeds.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;viagramagazine.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;viagravi.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;washealth.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;waymagazine.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;weightmedical.com&lt;/div&gt;&lt;div style="text-align: center;"&gt;worldcuplive1.com&lt;/div&gt;&lt;/blockquote&gt;&lt;div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;Let's see what we have so far: The owner of a porn domain loads in a set of 0x0 iframes, a set of other websites, all operated by the same owner. But still, no clear motivation. Also, the connection with the publishers that we checked remains elusive.&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;/div&gt;&lt;hr align="center" style="background-color: #691f01; border-bottom-width: 1px; border-color: initial; border-left-width: 1px; border-right-width: 1px; border-style: initial; border-top-width: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;b&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;b&gt;&lt;i&gt;Re-directions within the parked domains&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;Now, let's see what is going on within these URL calls, such as &lt;a href="http://www.takemedical.com/go_with_post.php"&gt;www.takemedical.com/go_with_post.php&lt;/a&gt;. Here is the HTML source of one of those URLs:&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh6.googleusercontent.com/-OXJ0B8qFnNw/TXEstfh_dDI/AAAAAAAAhsU/6rSed97u-70/s1600/parked-domain-post-redirect.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="127" src="https://lh6.googleusercontent.com/-OXJ0B8qFnNw/TXEstfh_dDI/AAAAAAAAhsU/6rSed97u-70/s400/parked-domain-post-redirect.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;Interesting. Another redirection. The site automatically submits a search form, searching for the term "&lt;i&gt;hihijiji&lt;/i&gt;". Loading the page in the browser with the GET method (as opposed to the POST method indicated in the form), takes us to the normal page of a parked domain.&lt;br /&gt;&lt;br /&gt;But let's submit with the POST method:&lt;br /&gt;&lt;blockquote&gt;&lt;code&gt;curl www.takemedical.com/search.php -d token=hihijiji&lt;/code&gt;&lt;/blockquote&gt;&lt;/div&gt;&lt;div&gt;Ha! The result is different:&lt;/div&gt;&lt;div&gt;&lt;blockquote&gt;&lt;br /&gt;&lt;code&gt; &amp;lt;iframe frameborder="No" height="1" src="index2.php" width="1"&amp;gt; &amp;lt;/iframe&amp;gt; &lt;/code&gt;&lt;/blockquote&gt;&lt;/div&gt;&lt;div&gt;We uncovered a hidden URL. This is the point where everything will start falling into place.&lt;br /&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border-bottom-width: 1px; border-color: initial; border-left-width: 1px; border-right-width: 1px; border-style: initial; border-top-width: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;b&gt;&lt;b&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/b&gt;&lt;/b&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;b&gt;&lt;b&gt;&lt;i&gt;Re-directions and generating click fraud with normal click patterns&lt;/i&gt;&lt;/b&gt;&lt;/b&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;Within this hidden&amp;nbsp;URL&amp;nbsp;is where all the interesting things are happening!&lt;b&gt; Let's load &lt;a href="http://www.takemedical.com/index2.php"&gt;www.takemedical.com/index2.php&lt;/a&gt; and see the network activity.&lt;/b&gt; (In Chrome, go to Tools, Developer Tools, and then to the Network tab.). Here is the screenshot:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh5.googleusercontent.com/-R9lgD8LQfdw/TXEu7e2Sb5I/AAAAAAAAhsY/cT0tZn1-bDk/s1600/parked-domain-redirects.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="145" src="https://lh5.googleusercontent.com/-R9lgD8LQfdw/TXEu7e2Sb5I/AAAAAAAAhsY/cT0tZn1-bDk/s400/parked-domain-redirects.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Indeed, here is where all the action is happening: These innocent sounding parked domain load all sorts of ad sites, and then "clicks" on the ads. By click, we do not mean any actual click. Instead the site loads the URL in the ad, that is typically a redirection to the ad server, which then redirects to the advertised URL. After this "click" within the iframe we finally have the publisher website (the "HGTV" that we mentioned before)! &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Interestingly enough, the click fraud was very well-done: It was not loading all the time the same website. Sometimes it was mevio, other times it was tremor, other times bodyarchitect.tv, and so on. And once we have been redirected enough times from the same IP address, the final redirect was going to find.fm, to execute a straight-forward search. Clever! Engage in fraud, but be careful not to trigger any alarms.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Also, notice that the traffic patterns for the clicks are not bot-generated&lt;/b&gt;. These are actual users. With real and different web browsers. Different IP addresses. Different times of the day, following the usual traffic patterns per region. Good job: these click-fraud patterns are the least likely to be caught as &lt;b&gt;they have patterns very similar to normal traffic&lt;/b&gt;.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;For those interested in the details, here is the set of screenshots with the redirects:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh5.googleusercontent.com/-qVyp1ChmmRA/TXExPKhXnOI/AAAAAAAAhsc/qvg_K3bUaWI/s1600/parked-domain-1.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="135" src="https://lh5.googleusercontent.com/-qVyp1ChmmRA/TXExPKhXnOI/AAAAAAAAhsc/qvg_K3bUaWI/s400/parked-domain-1.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh3.googleusercontent.com/-3KulINgZhwA/TXExPdCASOI/AAAAAAAAhsg/iybgmapk3qg/s1600/parked-domain-2.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="142" src="https://lh3.googleusercontent.com/-3KulINgZhwA/TXExPdCASOI/AAAAAAAAhsg/iybgmapk3qg/s400/parked-domain-2.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh3.googleusercontent.com/-q3Cpp0uIM6g/TXExPsl3ycI/AAAAAAAAhsk/W1wY7lz8rdo/s1600/parked-domain-3.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="162" src="https://lh3.googleusercontent.com/-q3Cpp0uIM6g/TXExPsl3ycI/AAAAAAAAhsk/W1wY7lz8rdo/s400/parked-domain-3.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh4.googleusercontent.com/-zWpoE6IlyfI/TXExP36fWhI/AAAAAAAAhso/gdQL4qAbuew/s1600/parked-domain-4.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="123" src="https://lh4.googleusercontent.com/-zWpoE6IlyfI/TXExP36fWhI/AAAAAAAAhso/gdQL4qAbuew/s400/parked-domain-4.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh5.googleusercontent.com/-c0FdFkU0xX4/TXExQF5jHfI/AAAAAAAAhss/BkNvK0wy-QY/s1600/parked-domain-5.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="142" src="https://lh5.googleusercontent.com/-c0FdFkU0xX4/TXExQF5jHfI/AAAAAAAAhss/BkNvK0wy-QY/s400/parked-domain-5.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh4.googleusercontent.com/-oLleFeARs3o/TXExQS95chI/AAAAAAAAhsw/TnyPVDzQf0I/s1600/parked-domain-6.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="143" src="https://lh4.googleusercontent.com/-oLleFeARs3o/TXExQS95chI/AAAAAAAAhsw/TnyPVDzQf0I/s400/parked-domain-6.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh4.googleusercontent.com/-_-KqSY7ue5w/TXExQTVttwI/AAAAAAAAhs0/j039lVKHojE/s1600/parked-domain-7.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="146" src="https://lh4.googleusercontent.com/-_-KqSY7ue5w/TXExQTVttwI/AAAAAAAAhs0/j039lVKHojE/s400/parked-domain-7.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh4.googleusercontent.com/-Dxp-OfM0oXw/TXExQrQsoLI/AAAAAAAAhs4/5mH5KM0jBSY/s1600/parked-domain-8.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="122" src="https://lh4.googleusercontent.com/-Dxp-OfM0oXw/TXExQrQsoLI/AAAAAAAAhs4/5mH5KM0jBSY/s400/parked-domain-8.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: #660000;"&gt;Update (3/17/2011)&lt;/span&gt;&lt;/b&gt;: Initially, I did not want to post screenshots with the actual ad networks that were being defrauded, as it was not my intention to involve them in the story. However, since they were mentioned in the Wall Street Journal article already, and they have taken measures against this, I am posting them now. Here are the screenshots with ad loads from the networks. Warning: NSFW.&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh3.googleusercontent.com/-SUHgf5Tl7J8/TYIOyds7PBI/AAAAAAAAhzU/lpi6qSTWUdY/s1600/screenshots-with-adsloading.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="https://lh3.googleusercontent.com/-SUHgf5Tl7J8/TYIOyds7PBI/AAAAAAAAhzU/lpi6qSTWUdY/s400/screenshots-with-adsloading.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh4.googleusercontent.com/-YRnaz2nFI7I/TYIOytSnz9I/AAAAAAAAhzY/o8mAQFt4NLM/s1600/screenshots-with-adsloading-2.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="195" src="https://lh4.googleusercontent.com/-YRnaz2nFI7I/TYIOytSnz9I/AAAAAAAAhzY/o8mAQFt4NLM/s400/screenshots-with-adsloading-2.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh5.googleusercontent.com/-2F5fWyJcJxE/TYIOy1EfVQI/AAAAAAAAhzc/7wQBe5u2-Ao/s1600/screenshots-with-adsloading-3.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="192" src="https://lh5.googleusercontent.com/-2F5fWyJcJxE/TYIOy1EfVQI/AAAAAAAAhzc/7wQBe5u2-Ao/s400/screenshots-with-adsloading-3.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh4.googleusercontent.com/-0oK0TYKUp-s/TYIOzVkwEyI/AAAAAAAAhzg/qhApru-wz3M/s1600/screenshots-with-adsloading-4.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="183" src="https://lh4.googleusercontent.com/-0oK0TYKUp-s/TYIOzVkwEyI/AAAAAAAAhzg/qhApru-wz3M/s400/screenshots-with-adsloading-4.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh6.googleusercontent.com/-HTRY6sjthcM/TYIOzpONdaI/AAAAAAAAhzk/GlqC4kIH13Q/s1600/screenshots-with-adsloading-5.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="187" src="https://lh6.googleusercontent.com/-HTRY6sjthcM/TYIOzpONdaI/AAAAAAAAhzk/GlqC4kIH13Q/s400/screenshots-with-adsloading-5.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh6.googleusercontent.com/-k3tPj1W9Dmw/TYIO0NK3FeI/AAAAAAAAhzo/Y4KC52lC_4o/s1600/screenshots-with-adsloading-6.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="190" src="https://lh6.googleusercontent.com/-k3tPj1W9Dmw/TYIO0NK3FeI/AAAAAAAAhzo/Y4KC52lC_4o/s400/screenshots-with-adsloading-6.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh4.googleusercontent.com/-BF-RRQHDSTM/TYIO0bLYgLI/AAAAAAAAhzs/lHOTJo8FeGs/s1600/screenshots-with-adsloading-7.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="180" src="https://lh4.googleusercontent.com/-BF-RRQHDSTM/TYIO0bLYgLI/AAAAAAAAhzs/lHOTJo8FeGs/s400/screenshots-with-adsloading-7.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; text-align: left;"&gt;&lt;/div&gt;&lt;hr align="center" style="background-color: #691f01; border-bottom-width: 1px; border-color: initial; border-left-width: 1px; border-right-width: 1px; border-style: initial; border-top-width: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;b&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;b&gt;&lt;i&gt;The role of parked domains: Laundering traffic&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;At this point, we now know how this person makes money. Clearly, there is click-fraud: the scammer is employing click-fraud services to click on the pay-per-click ads "displayed" in his parked domains. If some of the ads are also pay-per-impression, he may also get paid for these invisible impressions that happen within the 0x0 iframe.&lt;br /&gt;&lt;br /&gt;Why the parked domains though? Why not doing the same directly within the porn site? The answer is simple: &lt;b&gt;Traffic laundering.&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;What do I mean by "traffic laundering"? First, the ad networks are unlikely to place many ads within a porn site. On the other hand, they have ad-placement services for parked domains. Second, the publishers that get the traffic from the parked domains see in the referral URLs some legitimately-sounding domain names, not a porn site. Even if they go and check the site, they will only see an empty site full of ads. Nothing too suspicious. Hats off to the scammer. Clever scheme.&lt;br /&gt;&lt;br /&gt;You think we are done? No. There is one more piece in the puzzle. How does the scammer attract visitors to the porn site? &lt;/div&gt;&lt;div&gt;&lt;div&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border-bottom-width: 1px; border-color: initial; border-left-width: 1px; border-right-width: 1px; border-style: initial; border-top-width: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;b&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;b&gt;&lt;i&gt;Generating traffic through an adult traffic exchange&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The other interesting part: The porn website does not really contain porn!  There are a few images but most of the links are to other porn website that actually host the video. In other words, the scammer does not even pay the cost of hosting porn!&lt;br /&gt;&lt;br /&gt;However, according to QuantCast and Compete, the website has a pretty significant number of unique visitors per month. Here is the traffic over the last year:&lt;/div&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: center;"&gt;&lt;iframe frameborder="0" height="205" marginheight="0px" marginwidth="0px" scrolling="no" src="http://www.quantcast.com/profile/embed?img=http%3A//www.quantcast.com/profile/trafficGraph%3Fwunit%3Dwd%253Acom.hqtubevideos%26drg%3D%26dty%3Dpp%26gl%3D1yr%26reachType%3Dperiod%26dtr%3Ddm%26width%3D522%26country%3DUK%26ggt%3Dlarge%26showDeleteButtons%3Dtrue&amp;amp;w=322&amp;amp;h=205&amp;amp;showDeleteButtons=false&amp;amp;wunit=Charts.Traffic.FrequencyGraph." width="322"&gt;&lt;/iframe&gt;&lt;/div&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;&lt;a href="http://siteanalytics.compete.com/hqtubevideos.com/?metric=uv"&gt;&lt;img src="http://grapher.compete.com/hqtubevideos.com_uv_310.png" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;This porn website gets 500K to 1M unique visitors per month! That is &lt;b&gt;a lot&lt;/b&gt; of traffic for a website without any real content! So, how does the guy get all the traffic?&lt;br /&gt;&lt;br /&gt;The answer surprised me. Apparently, there is an exchange (yes, my dear readers, an exchange!) for buying and selling adult traffic! Its name: &lt;a href="http://www.trafficholder.com/"&gt;TrafficHolder.com&lt;/a&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;Do you want to buy traffic for people interested in midget sex? The price is $2.94 per thousand visitors. Interested in latex? The cost is $2.54 per thousand visitors. Interested in HD video? $3.54 per thousand visitors. (The running price catalog and the available traffic volume is available at &lt;a href="http://www.trafficholder.com/cgi-bin/traffic/manager/buying100.cgi"&gt;http://www.trafficholder.com/cgi-bin/traffic/manager/buying100.cgi&lt;/a&gt;)&lt;/div&gt;&lt;div style="text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;How do the porn sites sell traffic to each other? Through pop-ups, pop-unders, by causing the first click to the website to redirect to the buyer's site. The term for this traffic is "&lt;a href="http://www.trafficholder.com/faq.html"&gt;skimmed traffic&lt;/a&gt;"&lt;/div&gt;&lt;div style="text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;Following the trail, we figured out the source of the traffic for hqtubevideos.com: It was coming from the (very popular, apparently) website &lt;a href="http://www.pornoxo.com/"&gt;www.pornoxo.com&lt;/a&gt;. The reports from &lt;a href="http://www.quantcast.com/pornoxo.com"&gt;QuantCast &lt;/a&gt;and &lt;a href="http://siteanalytics.compete.com/pornoxo.com/"&gt;Compete &lt;/a&gt;confirm that PornoXo gets approximately 1 million unique visitors per month. If you visit the PornoXo.com website, you will see that the first click will create a pop-under that loads the page hqtubevideos.com/play.html. This is the page responsible for all the fraud that I described above.&lt;br /&gt;&lt;br /&gt;Based on the exchange prices and the visitorship at PornoXo, this traffic has a cost of $3K/month for hqtubevideos.com, which is significant. So, we need to figure out how the scammer recovers this cost.&lt;br /&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border-bottom-width: 1px; border-color: initial; border-left-width: 1px; border-right-width: 1px; border-style: initial; border-top-width: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;b&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;b&gt;&lt;i&gt;How much money are we talking about?&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;So, the key question now: How much money the hqtubevideos.com generates through the scheme? To get a feeling of how much fraud is going on, please do the following:&lt;br /&gt;&lt;ul style="text-align: left;"&gt;&lt;li&gt;Open Chrome&lt;/li&gt;&lt;li&gt;Open the options, and then Tools, then Developer Tools. This will load the monitoring tool.&lt;/li&gt;&lt;li&gt;Switch to the "Network" tab&lt;/li&gt;&lt;li&gt;Visit &lt;a href="http://www.hqtubevideos.com/play.html"&gt;http://www.hqtubevideos.com/play.html&lt;/a&gt; and see what is being loaded in the background (my own counting was approximately 1 ad loading per 10 seconds)&lt;/li&gt;&lt;/ul&gt;Let's do some back-of-the-envelope, very conservative, approximations:&lt;br /&gt;&lt;ul style="text-align: left;"&gt;&lt;li&gt;The site gets 500K-1M visitors per month&lt;/li&gt;&lt;li&gt;The cost of this traffic is approximately $1.5K to $3K per month&lt;/li&gt;&lt;li&gt;Each unique visit loads 7 sites, which then generate clicks. Let's assume that there is no reload of the invisible sites, to keep the estimates low.&lt;/li&gt;&lt;ul&gt;&lt;li&gt;Assuming 500K visitors and that just one click out of the seven sites goes through, this is 500K clicks per month (low estimate)&lt;/li&gt;&lt;li&gt;Assuming 1M visitors and that all clicks, in all 7 sites, go through, this is 7M clicks per month (high estimate)&lt;/li&gt;&lt;/ul&gt;&lt;li&gt;The a low-end estimate for &lt;a href="http://www.clickz.com/clickz/stats/1706952/average-search-cpc-data-category-february-2010"&gt;CPC click costs is 30 cents&lt;/a&gt;, out of which we can assume that the scammer gets, say, 10 cents.&lt;/li&gt;&lt;li&gt;&lt;b&gt;This generates a total income of &lt;span class="Apple-style-span" style="color: #990000;"&gt;$50K to $700K per month&lt;/span&gt;&lt;/b&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="Apple-style-span" style="color: #990000;"&gt;&lt;b&gt;The scheme is running for 8 months now, generating total revenue of $400K to $5M so far.&lt;/b&gt;&amp;nbsp;&lt;i&gt;(And you thought that investment bankers were getting paid a lot...)&lt;/i&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;Notice that these approximations assume that the site only generates the direct clicks discussed above. You will notice that there is no end in the loading of ads, if you leave the website open for a while. Given that the site visitors come from PornoXo, there is a good chance they will keep watching the porn video at PornoXo, leaving hqtubevideos to load the ads in the background.&lt;br /&gt;&lt;br /&gt;But even with the modest estimates listed above, we are talking about a business that generates tens of thousands of dollars, with really minimum requirements. This is a scheme that a single person can set up in a week...&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border-bottom-width: 1px; border-color: initial; border-left-width: 1px; border-right-width: 1px; border-style: initial; border-top-width: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;b&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;a href="http://www.blogger.com/post-edit.g?blogID=7118563403027467631&amp;amp;postID=3825542681386408252" name="overall"&gt;&lt;/a&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Overall picture&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;b&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;Trying to put all pieces together, I created the following graphical summary to see what is going on:&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;b&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh6.googleusercontent.com/-OdTNSl6wHEM/TXF2a6jxXNI/AAAAAAAAhtE/b3kxxNFozIo/s1600/overall-diagram.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="302" src="https://lh6.googleusercontent.com/-OdTNSl6wHEM/TXF2a6jxXNI/AAAAAAAAhtE/b3kxxNFozIo/s400/overall-diagram.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;b&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;Let's follow the flow of the users:&lt;/div&gt;&lt;div style="text-align: left;"&gt;&lt;ol style="text-align: left;"&gt;&lt;li&gt;Scammer buys user traffic from PornoXo.com and sends it to HQTubeVideos. &lt;/li&gt;&lt;li&gt;HQTubeVideos loads, in invisible iframes, some parked domains with innocent-sounding names (relaxhealth.com, etc)&lt;/li&gt;&lt;li&gt;In the parked domains, ad networks serve display and PPC ads.&lt;/li&gt;&lt;li&gt;The click-fraud sites click on the ads that appear within the parked domains.&lt;/li&gt;&lt;li&gt;The legitimate publishers gets invisible/fraudulent traffic through the (fraudulently) clicked ads from parked domains.&lt;/li&gt;&lt;li&gt;Brand advertisers place their ad on the websites of the legitimate publishers, which in reality appear within the (invisible) iframe of HQTubeVideos. &lt;/li&gt;&lt;li&gt;AdSafe detects the attempted placement within the porn website, and prevents the ads of the brand publisher from appearing in the legitimate website, which is hosted within the invisible frame of the porn site.&lt;/li&gt;&lt;/ol&gt;&lt;div&gt;Notice how nicely orchestrated is the whole scheme: The parked domains "launder" the porn traffic. The ad networks place the ads in some legitimately-sounding parked domains, not in a porn site. The publishers get traffic from innocent domains such as RelaxHealth, not from porn sites. The porn site loads a variety of publishers, distributing the fraud across many publishers and many advertisers.&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;div&gt;&lt;div&gt;&lt;div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;/div&gt;&lt;hr align="center" style="background-color: #691f01; border-bottom-width: 1px; border-color: initial; border-left-width: 1px; border-right-width: 1px; border-style: initial; border-top-width: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;i&gt;Who has the incentives to fight this? &lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;And now let's see who has the incentives to fight this. It is fraud, right? But I think it is well-executed type of fraud. &lt;b&gt;It targets and defrauds the player that has the least incentives to fight the scam.&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;Who is affected? Let's follow the money:&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;/div&gt;&lt;ol style="text-align: left;"&gt;&lt;li&gt;The big brand advertisers (Continental, Coca Cola, Verizon, Vonage,...) pay the publishers and the ad networks for running their campaigns.&lt;/li&gt;&lt;li&gt;The publishers pay the ad network and the scammer for the fraudulent clicks.&lt;/li&gt;&lt;li&gt;The scammer pays PornoXo and TrafficHolder for the traffic.&lt;/li&gt;&lt;/ol&gt;&lt;/div&gt;&lt;div&gt;The ad networks see clicks on their ads, they get paid, so not much to worry about. They would worry if their advertisers were not happy. But here we have a piece of&amp;nbsp;genius:&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;The scammer &lt;i&gt;did not target sites that would measure conversions or cost-per-acquisition&lt;/i&gt;. Instead, the scammer was targeting mainly sites that sell pay-per-impression ads and video ads. If the publishers display CPM ads paid by impression, any traffic is good, all impressions count. It is not an accident that the &lt;b&gt;scammer targets publishers with video content, and plenty of pay-per-impression video ads&lt;/b&gt;. The publishers have no reason to worry if they get traffic and the cost-per-visit is low.&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Effectively, the only one hurt in this chain are the big brand advertisers, who feed the rest of the advertising chain. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Do the big brands care about this type of fraud? Yes and no, but not really deeply. Yes, they pay for some "invisible impressions". But this is a marketing campaign. In any case, not all marketing attempts are successful. Do all readers of Economist look at the printed ads? Hardly. Do all web users pay attention to the banner ads? I do not think so. Invisible ads are just one of the things that make advertising a little bit more expensive and harder. Consider it part of the cost of doing business. In any case, compared to the overall marketing budget of these behemoths, the cost of such fraud is peanuts. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The big brands do not want their brand to be hurt. If the ads do not appear in places inappropriate for the brand, things are fine. Fighting the fraud publicly? This will just associate the brand with fraud. No marketing department wants that.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Note also that the fraudster does not target a single publisher, does not target a single advertiser. The damage is amortized so nicely that nobody feels that it is a big deal. A mastery of the long tail...&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Well, but what if fraud is big? What if big bucks are wasted? Maybe some newspapers would like to investigate. Let's break the big story. What would be the effect? Publicizing that a significant source of their income (online advertising) is a dangerous thing, full of fraud? Who would like to shoot himself in the foot?&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;&lt;div&gt;&lt;hr align="center" style="background-color: #691f01; border-bottom-width: 1px; border-color: initial; border-left-width: 1px; border-right-width: 1px; border-style: initial; border-top-width: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;i&gt;Fraud as (harmless?) parasite &lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Really. Genius. Defraud a lot of rich guys a little bit, and ensure that nobody has the incentives to really fight and chase the fraud.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The guy essentially realized that this type of fraud is really behaving like a parasite within a much bigger ecosystem. And it is a parasite that is so costly to remove that it makes sense to leave it there. As long as the parasite does not annoy the host too much, things will be fine. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Only if fraud becomes really big there will be the real incentive to fight advertising fraud. Until then, you know how to make $500K/month...&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;&lt;a href="http://www.blogger.com/"&gt;&lt;/a&gt;&lt;span id="goog_208365650"&gt;&lt;/span&gt;&lt;span id="goog_208365651"&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-535569957737925372?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/535569957737925372'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/535569957737925372'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2011/03/uncovering-advertising-fraud-scheme-or.html' title='Uncovering an advertising fraud scheme. Or &amp;quot;the Internet is for porn&amp;quot;'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='https://lh5.googleusercontent.com/-GXHdZyVj3DI/TXEktLQfshI/AAAAAAAAhsM/Xa5o-DOTgYE/s72-c/hqtube-play_html.PNG' height='72' width='72'/></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-1023012225959458061</id><published>2011-03-14T16:41:00.000-07:00</published><updated>2011-04-17T19:00:19.167-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><title type='text'>Do Mechanical Turk workers lie about their location?</title><content type='html'>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;A few weeks back, Dahn Tamir graciously allowed me to take a peek at the data that he has been gathering about this workers on Mechanical Turk. He has assigned tasks over time to more than 50,000 workers on Mechanical Turk, so I consider his data to be one of the most representative samples of workers.&lt;br /&gt;&lt;br /&gt;One of the nice tasks that he has been running is a simple HIT in which he asks workers to report their location. At the same time, in this task, Dahn was recording the IP of the worker. Why the task was nice? Because there is absolutely no incentive for the workers to be truthful. The submission will be accepted and paid no matter what. In a sense,&lt;b&gt;&lt;i&gt; it is a test that check if workers will be truthful in cases where it is not possible to check their accuracy&lt;/i&gt;&lt;/b&gt;.&lt;br /&gt;&lt;br /&gt;So, we used this test to check how sincere are the workers: We can simply geocode the IP address and find out the actual location of the worker. (With some degree of error, but good enough for approximation purposes.) For the workers that reported to be based in the US (approximately 22,000 workers), the HIT was asking for the zip code of the worker, making it easy to assign an approximate long/lat location.&lt;br /&gt;&lt;br /&gt;To measure how accurately the worker report their location, we measured the distance between the location of the IP and the location of the zip code. The plot below shows the distribution of the differences:&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh6.googleusercontent.com/-shz_czOZqoI/TX6jVzh8ejI/AAAAAAAAhxo/nXAYPLjHsxE/s1600/distance-distribution.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="252" src="https://lh6.googleusercontent.com/-shz_czOZqoI/TX6jVzh8ejI/AAAAAAAAhxo/nXAYPLjHsxE/s400/distance-distribution.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="https://lh3.googleusercontent.com/-hX-EC7Xv8kI/TX6l4xltY2I/AAAAAAAAhxs/Qj-mS4n-9Ak/s1600/distance-cdf.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="230" src="https://lh3.googleusercontent.com/-hX-EC7Xv8kI/TX6l4xltY2I/AAAAAAAAhxs/Qj-mS4n-9Ak/s400/distance-cdf.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;As you can see, most of the workers were pretty truthful about their location. The difference in distance was less than 10 miles for more than 60% of the workers: this difference can be easily explained by the limited accuracy of the geocoding API's and by the approximation of using zipcode locations.&lt;br /&gt;&lt;br /&gt;Of course, the flip side of the coin is that a significant fraction of the workers were essentially lying about their location: For 10% of the workers (i.e., ~2250 of them) the IP address was more than 100 miles away from the reported zip code. For 2% of the workers (i.e., ~500 workers) the distance was more than 1000 miles away.&lt;br /&gt;&lt;br /&gt;The biggest lier? A worker from Chennai, India who reported a zip code corresponding to Tampa in Florida. The IP was a cool 9500 miles away from the reported location!&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-1023012225959458061?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/1023012225959458061'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/1023012225959458061'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2011/03/do-mechanical-turk-workers-lie-about.html' title='Do Mechanical Turk workers lie about their location?'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='https://lh6.googleusercontent.com/-shz_czOZqoI/TX6jVzh8ejI/AAAAAAAAhxo/nXAYPLjHsxE/s72-c/distance-distribution.PNG' height='72' width='72'/></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-4165177043398782297</id><published>2011-03-10T22:01:00.000-08:00</published><updated>2011-04-17T19:00:19.174-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='publishers'/><category scheme='http://www.blogger.com/atom/ns#' term='academia'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='acm'/><title type='text'>The Road to Serfdom, ACM Edition</title><content type='html'>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;&lt;b&gt;&amp;lt;rant&amp;gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;A couple of days back, I got the following email from &lt;a href="http://www.acm.org/"&gt;ACM&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;Dear Moderator/Chairs,&lt;br /&gt;&lt;br /&gt;This is being sent to everyone with the chairs cc'd as the last and final requeset for the eform below to be completed or your panel overview abstract will be removed from the WWW 2011 Companion Publication and will NOT appear in the ACM DL.&lt;br /&gt;&lt;br /&gt;Your prompt and immediate attention to the form below is needed.&lt;br /&gt;&lt;br /&gt;permission release form URL: ....&lt;br /&gt;&lt;br /&gt;ACM Copyrights &amp;amp; Permissions&lt;/blockquote&gt;&lt;br /&gt;Given that this was the "&lt;i&gt;last and final requeset&lt;/i&gt;"[sic], I assumed that somehow I missed the previous requests. So, I checked my email to find out how late I was. Nope. Nothing in the archive, nothing in the trash, nothing in the spam, no entry in the delivery log. This was the &lt;b&gt;&lt;i&gt;first&lt;/i&gt;&lt;/b&gt; notification sent by ACM. They have just forgotten about this. But since they were running late, why not just threaten the authors? It is so much easier to pass the blame to others and be the first one to be aggressive. &lt;br /&gt;&lt;br /&gt;What happened ACM, did you start get advice on customer service from your pals at Sheridan Printing, who tend to send requests &lt;a href="http://behind-the-enemy-lines.blogspot.com/2007/06/are-publishers-making-themselves.html"&gt;like this&lt;/a&gt;?&lt;br /&gt;&lt;br /&gt;But I should not have been so surprised. This email just reflects the overall attitude of ACM. I have experienced this many times in the past. Anyway, I decided to sign the e-form, without firing back.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Donating copyright to ACM&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Signing the form was a mechanic action before. However, after reading Matt Blaze's post &lt;a href="http://www.crypto.com/blog/copywrongs/"&gt;on copyright and academic publishing&lt;/a&gt;, I decided to read the form a little bit more carefully, to see exactly what I was signing.&lt;br /&gt;&lt;br /&gt;As usual, we start with a transfer of copyright to ACM. The authors agree to transfer all their copyright rights to ACM, blah blah...&lt;br /&gt;&lt;br /&gt;Wait a minute! Why does ACM needs to &lt;b&gt;&lt;i&gt;own &lt;/i&gt;&lt;/b&gt;the copyright? No good reason. To publish and distribute the article, ACM just needs a &lt;i&gt;&lt;b&gt;non-exclusive license to print and distribute&lt;/b&gt;&lt;/i&gt;. There is no need to &lt;b&gt;&lt;i&gt;own &lt;/i&gt;&lt;/b&gt;the copyright.&lt;br /&gt;&lt;br /&gt;If we follow ACM's logic, any artist that wants to see their work exhibited in any museum, they need to give up the ownership of their work and give full ownership of their creations to the museum. For free. Without expecting any royalties back in return. Ever. Furthermore, the museum instead of promoting the work, they would lock it in a "patron members access only". For all others, the museum would demand a separate entrance ticket to show &lt;b&gt;&lt;i&gt;each &lt;/i&gt;&lt;/b&gt;of the collection pieces. &amp;nbsp;(Say, for a friendly price of $5 to see each painting?)&amp;nbsp;.&lt;br /&gt;&lt;br /&gt;Anyway, let's not belabor the point with copyright. We know that ACM's policy sucks. We know that ACM is a bureaucracy serving just itself and not its members or the profession. Let's move on.&lt;br /&gt;&lt;br /&gt;Let's move to the point that really got me fired up.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Protecting ACM from liability&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;What got me really pissed was the &lt;a href="http://www.acm.org/publications/CopyReleaseProc-1.26.10.pdf"&gt;last part of the agreement&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;&lt;b&gt;Liability Waiver&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;i&gt;* Your grant of permission is conditional upon you agreeing to the terms set out below.&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;I hereby &lt;b&gt;&lt;i&gt;release and discharge ACM and other publication sponsors and organizers from any and all liability arising out of my inclusion in the publication, or in connection with the performance of any of the activities described in this document as permitted herein&lt;/i&gt;&lt;/b&gt;. This includes, but is not limited to, my right of privacy or publicity, copyright, patent rights, trade secret rights, moral rights or trademark rights.&lt;br /&gt;&lt;br /&gt;All permissions and releases granted by me herein shall be effective in perpetuity unless otherwise stipulated, and extend and apply to the ACM and its assigns, contractors, sublicensed distributors, successors and agents.&lt;br /&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;b&gt;&lt;i&gt;So, not only we should donate "voluntarily" ownership&amp;nbsp;&lt;/i&gt;&lt;/b&gt;&lt;b&gt;&lt;i&gt;of our copyright&lt;/i&gt;&lt;/b&gt;&lt;b&gt;&lt;i&gt;&amp;nbsp;to ACM . We also need to protect ACM from any  liability.&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;In other words, ACM wants to get all the upside from owning the copyright, &lt;b&gt;&lt;i&gt;without ever distributing royalties&lt;/i&gt;&lt;/b&gt; to the contributing authors. (Not that it would be worth much. It is a matter of principle and a signal of respect to the authors, not an issue of monetary importance.) At the same, ACM also wants the authors to provide guarantee that if there is any problem with the copyright, the author will be the one liable for the damages.&lt;br /&gt;&lt;br /&gt;All the upside for ACM, no revenue to the authors. All the downside to the authors, no obligations for ACM.&lt;br /&gt;&lt;br /&gt;Thank you ACM for caring so much about your members. You will not be missed when you disappear.&lt;br /&gt;&lt;br /&gt;Yours truly,&lt;br /&gt;A &lt;a href="http://plone.acm.org/membership/life"&gt;lifetime member&lt;/a&gt; of ACM.&lt;br /&gt;&lt;br /&gt;PS:&amp;nbsp;In retrospect, the title of the post is offensive: From Wikipedia's definition of &lt;a href="http://en.wikipedia.org/wiki/Serfdom"&gt;serfdom&lt;/a&gt;: "&lt;i&gt;Serfdom included the forced labor of serfs bound to a hereditary plot of land owned by a lord in return for protection&lt;/i&gt;". In other words, the&amp;nbsp;slave owners took the product of slaves' work, but in return they provided the protection and military support, to defend the slaves that were working the land. ACM also wants the slaves to "protect the land" as well. I owe an apology to the slave owners for the comparison.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&amp;lt;/rant&amp;gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-4165177043398782297?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/4165177043398782297'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/4165177043398782297'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2011/03/road-to-serfdom-acm-edition.html' title='The Road to Serfdom, ACM Edition'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-3994375461270838153</id><published>2011-03-03T11:41:00.000-08:00</published><updated>2011-04-17T19:00:19.181-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><title type='text'>The promise and fear of an assembly line for knowledge work</title><content type='html'>Last week, together with &lt;a href="http://www.propublica.org/site/author/amanda_michel"&gt;Amanda Michel from ProPublica&lt;/a&gt;, we were presenting at the &lt;a href="http://www.ire.org/training/conference/CAR11/"&gt;CAR 2011&lt;/a&gt; conference (CAR stands for Computer-Assisted Reporting), on how to best use Mechanical Turk for a variety of tasks pertaining to data-driven journalism.&lt;br /&gt;&lt;br /&gt;We discussed issues of &lt;a href="http://code.google.com/p/get-another-label/"&gt;quality assurance&lt;/a&gt;, how &lt;a href="http://groups.csail.mit.edu/uid/turkit/"&gt;TurkIt&lt;/a&gt;-like workflow-based tasks can generate nice outcomes, and briefly touched upon the &lt;a href="http://mybossisarobot.com/"&gt;CrowdForge work from Niki Kittur and the team at CMU&lt;/a&gt;, showing that crowdsourcing can potentially generate intellectual outcomes comparable to those of trained humans.&lt;br /&gt;&lt;br /&gt;The discussion after the session was a mix of excitement and fear. We have observed in the past how "assembly line" work for industrial production lead to massive productivity improvements and was the basis for much of the progress in the 19th and 20th century. But that was for mechanical work. Yes, it replaced centuries old crafts of the blacksmiths, carpenters, potters, but that was just part of progress.&lt;br /&gt;&lt;br /&gt;What happens if we see now the assembly line extended into tasks that were traditionally considered creative and intellectual in nature? What would be the effect of an &lt;b&gt;assembly line for knowledge work&lt;/b&gt;?&lt;br /&gt;&lt;br /&gt;A few months back, &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/12/excerpts-from-communist-manifesto.html"&gt;I quoted&lt;/a&gt; Marx and Engels who, back in 1848, wrote in their Communist manifesto:&lt;br /&gt;&lt;blockquote&gt;the &lt;b&gt;work of the proletarians has lost all individual character, and, consequently, all charm for the workman&lt;/b&gt;. ... [The workman] becomes an appendage of the machine, and it is only the most simple, most monotonous, and most easily acquired knack, that is required of him&lt;/blockquote&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;(Btw, TIME magazine &lt;a href="http://www.time.com/time/magazine/article/0,9171,2043450,00.html"&gt;liked that connection enough to put it into their own article&lt;/a&gt; about Mechanical Turk.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;But how likely it is to see this style of work to be extended further in the intellectual field? Are these Mechanical Turk experiments something generalizable, or just cute proof-of-concept experiments?&lt;br /&gt;&lt;br /&gt;I was reminded of this question today, when I realized that many intellectual tasks are already commoditized:&lt;br /&gt;&lt;br /&gt;The article "&lt;a href="http://www.citypages.com/2011-02-23/news/inside-the-multimillion-dollar-essay-scoring-business/"&gt;Inside the multimillion-dollar essay-scoring business: Behind the scenes of standardized testing&lt;/a&gt;" gives a dreadful view of now essays are being scored for the standardized tests.&lt;br /&gt;&lt;br /&gt;Based on the description of the article, the (human-based) scoring process "&lt;i&gt;goes too fast; relies on cheap, inexperienced labor; and does not accurately assess student learning&lt;/i&gt;." Needless to say, the workers were not exactly enthusiastic about their work. Match that with the computer-assisted scoring of essays, and you have an MTurk-like environment for much more intellectually-demanding tasks...&lt;br /&gt;&lt;br /&gt;After reading this essay-scoring mill story, I started feeling a little bit uneasy. The MTurk-style work seems too far away to be in my future, so the discussion is always, ahem, academic. But the essay scoring brought the concept a little bit too close for comfort.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-3994375461270838153?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/3994375461270838153'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/3994375461270838153'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2011/03/promise-and-fear-of-assembly-line-for.html' title='The promise and fear of an assembly line for knowledge work'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-2553309879931112990</id><published>2011-02-17T12:06:00.000-08:00</published><updated>2011-04-17T19:00:19.185-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='research'/><title type='text'>What was the main factor for Watson's success? Hardware, software, or data?</title><content type='html'>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;I can think of three thinks that may have allowed Watson to win Jeopardy:&lt;br /&gt;&lt;ul style="text-align: left;"&gt;&lt;li&gt;&lt;b&gt;Hardware&lt;/b&gt;: From a &lt;a href="http://www.scottaaronson.com/blog/?p=550#comment-20324"&gt;comment&lt;/a&gt;&amp;nbsp;at&amp;nbsp;&lt;a href="http://www.scottaaronson.com/blog/?p=550"&gt;Shtetl-Optimized&lt;/a&gt;, "&lt;i&gt;The hardware Watson was running on is said to be capable of 80 teraflops. According to the TOP500 list for November 2000, the fastest supercomputer (ASCI White) was capable of 4.9 teraflops.&lt;/i&gt;" So, computers became 40x faster over the last 10 years. Is this the winning factor?&lt;/li&gt;&lt;li&gt;&lt;b&gt;Software&lt;/b&gt;: A couple of months back &lt;a href="http://agtb.wordpress.com/2010/12/23/progress-in-algorithms-beats-moore%E2%80%99s-law/"&gt;Noam Nissan reported&lt;/a&gt;: "&lt;i&gt;while improvements in hardware accounted for an approximate 1,000 fold increase in calculation speed over a 15-year time-span, improvements in algorithms accounted for an over 43,000 fold increase&lt;/i&gt;." So, maybe it is just the better NLP and machine learning algorithms that played the crucial role in this success.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Data&lt;/b&gt;: 10 years back we did not have Wikipedia, and its derivatives, such as &lt;a href="http://en.wiktionary.org/wiki/Wiktionary:Main_Page"&gt;Wiktionary&lt;/a&gt;, &lt;a href="http://en.wikiquote.org/wiki/Main_Page"&gt;WikiQuote&lt;/a&gt;, &lt;a href="http://species.wikimedia.org/wiki/Main_Page"&gt;Wikispecies&lt;/a&gt;, &lt;a href="http://wiki.freebase.com/wiki/DBPedia"&gt;DBPedia&lt;/a&gt;, etc. Such resources add a tremendous value for finding connections between concepts.&amp;nbsp;&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;My gut feeling says that the crucial factor is the development of the data resources that allowed Watson to answer such trivia questions. Without discounting the importance of hardware and software development, without having such tremendously organized and rich data sources, it would not be possible for Watson to answer any of these questions. The &lt;a href="http://en.wikipedia.org/wiki/IBM_WebFountain"&gt;IBM WebFountain&lt;/a&gt; was around for a while, but trying to structure the unstructured web data, and get meaning out of such data, is much harder than taking and analyzing the nicely organized data in DBPedia.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;To paraphrase a loosely-related quote:&amp;nbsp;&lt;a href="http://anand.typepad.com/datawocky/2008/03/more-data-usual.html"&gt;&lt;i&gt;Better data usually beats better algorithms&lt;/i&gt;&lt;/a&gt;.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-2553309879931112990?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/2553309879931112990'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/2553309879931112990'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2011/02/what-was-main-factor-for-watson-success.html' title='What was the main factor for Watson&amp;#39;s success? Hardware, software, or data?'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-7157900805260502523</id><published>2011-02-16T11:14:00.000-08:00</published><updated>2011-04-17T19:00:19.189-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><title type='text'>Browsers of Mechanical Turk workers</title><content type='html'>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;Yesterday,&amp;nbsp;Michael Bernstein asked on Twitter:&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://twitter.com/#!/msbernst/status/37253829149986816"&gt;&lt;br /&gt;&lt;img border="0" height="120" src="http://1.bp.blogspot.com/-DS41tL9OSQo/TVsL5JWTqaI/AAAAAAAAhjQ/ud2OVeNOq58/s320/bernstein-turk-browers-twitter.PNG" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;I recalled that my favorite go-to source for Turk statistics, Dahn Tamir, used Mechanical Turk a couple of years back to&amp;nbsp;&lt;a href="http://www.evilsoft.org/2009/08/25/libertarians-are-just-crazy-for-firefox-and-other-observations"&gt;examine the connection between browser use and political orientation&lt;/a&gt;.&lt;br /&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I asked Dahn if he had collected more extensive data. He is running tasks using a very large number of workers, so his sample would have been representative.&lt;br /&gt;&lt;br /&gt;I was not disappointed:&amp;nbsp;Dahn had data from approximately 19,000 workers, based on 75,000 worker requests from the last 6 months, recording the "user-agent" part of the HTTP request. For each workerid, we counted how many different user-agents we have seen. The maximum was 19, with an average value of 1.3.&lt;br /&gt;&lt;br /&gt;Then, measured the workerid's per user-agent string. If a worker had registered multiple user-agent strings, we split the credit across browsers. For example, if the same workerid had one session with IE8 and one with IE9, then we gave 0.5 credit to IE8 and 0.5 credit to IE9.&lt;br /&gt;&lt;br /&gt;I processed further the data using the&amp;nbsp;&lt;a href="http://www.useragentstring.com/pages/api.php"&gt;UserAgentString API&lt;/a&gt;&amp;nbsp;and I generated this &lt;a href="https://spreadsheets.google.com/pub?hl=en&amp;amp;hl=en&amp;amp;key=0AjX1e06EhsXSdDRUQkdJZ3psOHozQk40RGtEZ1FLd2c&amp;amp;output=html"&gt;Google spreadsheet&lt;/a&gt;.&lt;/div&gt;&lt;br /&gt;&lt;div&gt;After processing the data, here are the high level results:&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Operating System Usage&lt;/b&gt;&lt;br /&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/-y-sc4lwNX1E/TVwdiX56xII/AAAAAAAAhjw/a1RUewe0_3I/s1600/OperatingSystems.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="217" src="http://3.bp.blogspot.com/-y-sc4lwNX1E/TVwdiX56xII/AAAAAAAAhjw/a1RUewe0_3I/s400/OperatingSystems.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;Yep, most (~85%-90%) of MTurk workers use Windows. I found very surprising the prevalence of Windows XP!&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Browser Usage&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/-1FyGto6rvWY/TVwdgGNbdHI/AAAAAAAAhjg/5DRbiumRNEk/s1600/Browsers.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="202" src="http://2.bp.blogspot.com/-1FyGto6rvWY/TVwdgGNbdHI/AAAAAAAAhjg/5DRbiumRNEk/s400/Browsers.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;I found very interesting the relatively low percentage of IE users.&lt;br /&gt;&lt;br /&gt;For reference, here are the most common versions for each of the most-used browsers by Mechanical Turk workers:&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/-VBm2SXfrJgc/TVwdhS51f0I/AAAAAAAAhjs/oMHa83e7vaE/s1600/IE-versions.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="206" src="http://1.bp.blogspot.com/-VBm2SXfrJgc/TVwdhS51f0I/AAAAAAAAhjs/oMHa83e7vaE/s400/IE-versions.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-_T1Bc4XFU5I/TVwdgaqBh5I/AAAAAAAAhjk/xfjB0niInsw/s1600/chrome-versions.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="220" src="http://4.bp.blogspot.com/-_T1Bc4XFU5I/TVwdgaqBh5I/AAAAAAAAhjk/xfjB0niInsw/s400/chrome-versions.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/-l-uZx-YSHtQ/TVwdg0ak6yI/AAAAAAAAhjo/Q54gZ_Ess-A/s1600/firefox-versions.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="193" src="http://1.bp.blogspot.com/-l-uZx-YSHtQ/TVwdg0ak6yI/AAAAAAAAhjo/Q54gZ_Ess-A/s400/firefox-versions.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-Jgyh7qEKQY8/TVwdi0MEIAI/AAAAAAAAhj0/JWfpX1s6DUc/s1600/safari-versions.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="231" src="http://4.bp.blogspot.com/-Jgyh7qEKQY8/TVwdi0MEIAI/AAAAAAAAhj0/JWfpX1s6DUc/s400/safari-versions.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;One of the good news is that &lt;b&gt;most MTurk workers tend to use new versions of the&amp;nbsp;browsers&amp;nbsp;with good support for the latest web technologies (css, javascript, etc)&lt;/b&gt;. Interestingly, though, a very significant fraction uses Windows XP!&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;In the future, we may repeat the measurements by also keeping statistics about Javascript versions, plugins, flash support etc.&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;Until then, enjoy and code safely, knowing that you do not have to support IE6 and IE7 in your MTurk HITs.&lt;/div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-7157900805260502523?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7157900805260502523'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7157900805260502523'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2011/02/browsers-of-mechanical-turk-workers.html' title='Browsers of Mechanical Turk workers'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/-DS41tL9OSQo/TVsL5JWTqaI/AAAAAAAAhjQ/ud2OVeNOq58/s72-c/bernstein-turk-browers-twitter.PNG' height='72' width='72'/></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-1793021572920131831</id><published>2011-02-13T21:20:00.000-08:00</published><updated>2011-04-17T19:00:19.193-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='hcomp'/><title type='text'>3rd Human Computation Workshop (HCOMP 2011), San Francisco, August 7 or 8</title><content type='html'>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;I am just posting this here, to build some awareness about HCOMP 2011, the 3rd Human Computation Workshop, which will be organized together with AAAI in San Francisco, on August 7 or 8. You can also find more detailed&amp;nbsp;information about the workshop at&amp;nbsp;&lt;a href="http://www.humancomputation.com/"&gt;http://www.humancomputation.com&lt;/a&gt;. The submission deadline is &lt;s&gt;April 22&lt;/s&gt;&amp;nbsp;April 29th.&lt;br /&gt;&lt;br /&gt;Human Computation is the study of systems where humans perform a major&amp;nbsp;part of the computation or are an integral part of the overall&amp;nbsp;computational system. Over the past few years, we have observed &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/10/explosion-of-crowdsourcing-workshops.html"&gt;a&amp;nbsp;proliferation of related workshops&lt;/a&gt;, &lt;a href="http://courses.ischool.utexas.edu/Lease_Matt/2011/Spring/INF385T/"&gt;new&lt;/a&gt; &lt;a href="http://www.cs.umass.edu/~dganesan/courses/fall10/"&gt;courses&lt;/a&gt;, and tutorials,&amp;nbsp;&lt;a href="http://ir.ischool.utexas.edu/crowd/"&gt;scattered across many conferences&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;In this 3rd Human Computation Workshop (HCOMP 2011), we hope to draw&amp;nbsp;together participants across disciplines -- machine learning, HCI,&amp;nbsp;mechanism and market design, information retrieval, decision-theoretic&lt;br /&gt;planning, optimization, computer vision -- for a stimulating full-day&amp;nbsp;workshop at AAAI in the beautiful San Francisco this summer.  There&amp;nbsp;will be presentation of new works, lively discussions, poster and demo&amp;nbsp;sessions, and invited talks by Eric Horvitz, Jennifer Wortman and&amp;nbsp;more.  There will also be a 4-hour tutorial called "Human Computation:&amp;nbsp;Core Research Questions and State of the Art" at AAAI on August 7, which will give newcomers and current researchers a bird�s eye view of the research landscape of human computation.&lt;br /&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Call for Papers&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;3rd Human Computation Workshop (HCOMP 2011)&lt;br /&gt;co-located with AAAI 2011&lt;br /&gt;August 7 or 8, San Francisco, CA&lt;br /&gt;&lt;a href="http://www.humancomputation.com/"&gt;http://www.humancomputation.com&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Human computation is a relatively new research area that studies how&amp;nbsp;to build intelligent systems that involves human computers, with each&amp;nbsp;of them performing computation (e.g., image classification,&amp;nbsp;translation, and protein folding) that leverage human intelligence,&amp;nbsp;but challenges even the most sophisticated AI algorithms that exist&amp;nbsp;today.   With the immense growth of the Web, human computation systems&amp;nbsp;can now leverage the abilities of an unprecedented number of Internet&amp;nbsp;users to perform complex computation. Various genres of human&amp;nbsp;computation applications are available today, including games with a&amp;nbsp;purpose (e.g., the ESP Game) that generates useful data through&amp;nbsp;gameplay, crowdsourcing marketplaces (e.g., Amazon Mechanical Turk)&amp;nbsp;that coordinate workers to perform tasks for monetary rewards, and&amp;nbsp;identity verification systems (e.g. reCAPTCHA) that generate useful&amp;nbsp;data through users performing computation for access to online&amp;nbsp;content.&lt;br /&gt;&lt;br /&gt;Despite the variety of human computation applications, there exist&amp;nbsp;many common core research issues. How can we design mechanisms for&amp;nbsp;querying human computers in such a way that incentivizes or encourages&amp;nbsp;truthful responses? What are the techniques for aggregating noisy&amp;nbsp;outputs from multiple human computers? How do we effectively assign&amp;nbsp;tasks to human computers to match their particular expertise and&amp;nbsp;interests? What are some programming paradigms for designing&amp;nbsp;algorithms that effectively leverage the humans in the loop? How do we&amp;nbsp;build human computation systems that involve the joint efforts of both&amp;nbsp;machines and humans, trading off each of their particular strengths&amp;nbsp;and weaknesses? Significant advances on such questions will likely&amp;nbsp;need to draw many disciplines, including machine learning, mechanism&amp;nbsp;and market design, information retrieval, decision-theoretic planning,&amp;nbsp;optimization, human computer interaction, etc.&lt;br /&gt;&lt;br /&gt;The workshop recognizes the growing opportunity for AI to function as&amp;nbsp;an enabling technology in human computation systems. At the same time, AI can leverage technical advances and data collected from human&lt;br /&gt;computation systems for its own advancement.  The goal of HCOMP 2011&amp;nbsp;is to bring together academic and industry researchers from diverse&amp;nbsp;subfields in a stimulating discussion of existing human computation&amp;nbsp;applications and future directions of this relatively new subject&amp;nbsp;area. The workshop also aims to broaden the scope of human computation&amp;nbsp;to more than the issue of data collection to a broader definition of&amp;nbsp;human computation, to study systems where humans perform a major part&amp;nbsp;of the computation or are an integral part of the overall&amp;nbsp;computational system.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Topics&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Topics of interest include, but are not limited to:&lt;br /&gt;&lt;br /&gt;&lt;ul style="text-align: left;"&gt;&lt;li&gt;Programming languages, tools and platforms to support human computation&lt;/li&gt;&lt;li&gt;Domain-specific challenges in human computation&lt;/li&gt;&lt;li&gt;Methods for estimating the cost, reliability, and skill of labelers&lt;/li&gt;&lt;li&gt;Methods for designing and controlling workflows for human computation tasks&lt;/li&gt;&lt;li&gt;Empirical and formal models of incentives in human computation systems&lt;/li&gt;&lt;li&gt;Benefits of one-time versus repeated labeling&lt;/li&gt;&lt;li&gt;Design of manipulation-resistance mechanisms in human computation&lt;/li&gt;&lt;li&gt;Concerns regarding the protection of labeler identities&lt;/li&gt;&lt;li&gt;Active learning from imperfect human labelers&lt;/li&gt;&lt;li&gt;Techniques for inferring expertise and routing tasks&lt;/li&gt;&lt;li&gt;Theoretical limitations of human computation&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Format&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;The workshop will consist of several invited talks from prominent&amp;nbsp;researchers in different areas related to human computation, selected&amp;nbsp;presentations of technical and position papers, as well as poster and&amp;nbsp;demo sessions, organized by theme.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Submission&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Technical papers and position papers may be up to 6 pages in length,&amp;nbsp;and should follow&lt;a href="http://www.aaai.org/Publications/Author/author.php"&gt; AAAI formatting guidelines&lt;/a&gt;. For demos and poster&amp;nbsp;presentations, authors should submit a short paper or extended&amp;nbsp;abstract (up to 2 pages). We welcome early work, and particularly&amp;nbsp;encourage submission of visionary position papers that are more&amp;nbsp;forward looking.  Papers must be submitted electronically via &lt;a href="https://cmt.research.microsoft.com/hcomp2011/"&gt;CMT&lt;/a&gt;.&amp;nbsp;The submission deadline is April 22, 2010.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Workshop Website&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;For more details, please consult &lt;a href="http://www.humancomputation.com/"&gt;our workshop website&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Organizers&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Luis von Ahn (co-chair)&lt;br /&gt;Panagiotis Ipeirotis (co-chair)&lt;br /&gt;Edith Law&lt;br /&gt;Haoqi Zhang&lt;br /&gt;Jing Wang&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Program Committee&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Foster Provost&lt;br /&gt;Winter Mason&lt;br /&gt;Eric Horvitz&lt;br /&gt;Ed Chi&lt;br /&gt;Serge Belongie&lt;br /&gt;Paul Bennett&lt;br /&gt;Jennifer Wortman&lt;br /&gt;Yiling Chen&lt;br /&gt;Kristen Grauman&lt;br /&gt;Raman Chandrasekar&lt;br /&gt;Rob Miller&lt;br /&gt;Deepak Ganesan&lt;br /&gt;Chris Callison-Burch&lt;br /&gt;Vitor R. Carvalho&lt;br /&gt;David Parkes&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-1793021572920131831?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/1793021572920131831'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/1793021572920131831'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2011/02/3rd-human-computation-workshop-hcomp.html' title='3rd Human Computation Workshop (HCOMP 2011), San Francisco, August 7 or 8'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-5983653813876726821</id><published>2011-02-09T18:18:00.000-08:00</published><updated>2011-04-17T19:00:19.197-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='csdm'/><category scheme='http://www.blogger.com/atom/ns#' term='crowdsourcing'/><category scheme='http://www.blogger.com/atom/ns#' term='wisdom of the crowds'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><title type='text'>Notes from "Crowdsourcing in Search and Data Mining" (CSDM) workshop</title><content type='html'>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;I was attending the &lt;a href="http://ir.ischool.utexas.edu/csdm2011/proceedings.html"&gt;Crowdsourcing in Search and Data Mining&lt;/a&gt; workshop at the WSDM 2011 conference in Hong Kong. I kept some (informal) notes about the workshop:&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Invited talk - Winter Mason&lt;/i&gt;, presenting his work on "Individual vs. Group Success in Social Networks".&amp;nbsp;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Winter started by presenting a summary of his HCOMP 2009 paper on how payment on Mechanical Turk affect quality and speed of completion (hint: affects speed, does not affect quality). Then he moved to talk on how agents can collaborate to solve complex problems. How structure of the communication network affect the results? How the position in the network affects the performance of an individual?&amp;nbsp;The participants were playing a game, where they were attempting to discover oil in a map. The users could see where their neighbors searched for oil in the field. So, they were kind of guiding each other in discovering oil. As part of the experiment, the underlying, invisible graph connecting the players was changed, to see the results. Typically the players were copying each other more, as the clustering coefficient of the graph increased (i.e, less exploration). However, this did not have a statistically significant effect in "finding the peak of oil production" and all graph structured performed similarly in terms of overall success, although there was some non-significant decrease in performance. This work points the direction to a lot of interesting future research: &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/11/wisdom-of-crowds-when-do-we-need.html"&gt;When should we let people talk to each other vs let them work independently&lt;/a&gt;? What is the structure of the solution space that indicates that people should collaborate vs explore independently (e.g., if there is a "hard to find" peak of oil production, and many areas with moderate oil production that are easier to find, you want people to explore independently; if there is a single peak, collaboration helps)&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Guido Zuccon&lt;/i&gt;, Teerapong Leelanupab, Stewart Whiting, Joemon Jose and Leif Azzopardi. Crowdsourcing Interactions - A proposal for capturing user interactions through crowdsourcing.&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;The main question is how to capture behavior of users of search engines, when you do not have a search engine available (ala Google, Microsoft, Yahoo, etc). The alternative is to have something done on the lab, but the population is homogeneous and expensive if you want to create a big data set.&lt;br /&gt;&lt;br /&gt;The author described how they build a task-oriented system on top of MTurk, intermediating on top of Bing, and examined how users used the search engine to identify answers to the questions posed (taken by IIR track of TREC).&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Richard McCreadie&lt;/i&gt;, Craig Macdonald and Iadh Ounis. Crowdsourcing Blog Track Top News Judgments at TREC.&lt;/b&gt;&lt;br /&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;The authors describe their&amp;nbsp;experiences&amp;nbsp;crowdsourcing relevance evaluations. "Find interesting stories on day d, for category c". Basic setup: display the results to the user, and measure speed, cost, and level of agreement (quality of&amp;nbsp;assessments). They observe everything in the server, and they conclude that MTurk is good, cheap, and fast. To ensure quality they need at least three&amp;nbsp;assessors&amp;nbsp;per document. Workers seemed to just skip through the documents (~15 seconds per document) but the results seemed pretty consistent with the overall TREC results.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Carsten Eickhoff&lt;/i&gt; and Arjen de Vries. How Crowdsourcable is Your Task?&lt;/b&gt;&lt;br /&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;The authors list their experiences of how they fell in love with crowdsourcing, but then they realized (surprise!) that there are cheaters out to get you and submit junk. Their major conclusions: the inherent reputation metrics on Turk are pretty much useless, and gold testing tends to be good in specific cases (well-defined, unambiguous answers). They suggest to make the HITs more "creative" and less "mechanical" to be less susceptible to spam. Novelty helps, as refreshes the mind of the worker.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Christopher Harris. You�re Hired! An Examination of Crowdsourcing Incentive Models in Human Resource Tasks.&lt;/b&gt;&lt;br /&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;How can we structure a resume screening HIT, in order to achieve both good precision and recall? HR screening is mainly a recall task (you do not want to lose good candidates). The initial worker screening included a English skill test, plus an "attention to detail" task, which examines that workers pay attention. The author presents an experiment with different treatment conditions in terms of incentives (no incentive, bonus always, bonus only on performance framed as negative). The basic result: incentives for performance increase completion time, and positive incentives increase performance.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Jing Wang, Siamak Faridani and Panagiotis Ipeirotis. Estimating Completion Time for Crowdsourced Tasks Using Survival Analysis Models.&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;An analysis of the MTurk market, to identify how long it takes a task to be completed. Effect of price: 10x price, gives a 40% speedup. Effect of grouping HITS: 1000 HITs grouped get done 7x faster than 1000 sequential HITs. &lt;a href="http://www.slideshare.net/ipeirotis/estimating-completion-time-for-crowdsourced-tasks-using-survival-analysis-models"&gt;The slides are available online&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Raynor Vliegendhart, Martha Larson, Christoph Kofler, Carsten Eickhoff and Johan Pouwelse. Investigating Factors Influencing Crowdsourcing Tasks with High Imaginative Load.&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;i&gt;[jet lag was hitting pretty hard at that point, and I had to take off and get some rest...]&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;i&gt;Invited talk -&amp;nbsp;&lt;/i&gt;&lt;/b&gt;&lt;/span&gt;&lt;i&gt;Thore Graepel&lt;/i&gt;: The Smarter Crowd: Active Learning, Knowledge Corroboration, and Collective IQs&lt;/b&gt;&lt;br /&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;Some ideas on how to use graphical models to model users and their expertise to match them with appropriate items to work on. Basic idea: user modeling helps in understanding the workers, and can improve active learning.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Omar Alonso. Perspectives on Infrastructure for Crowdsourcing.&lt;/b&gt;&lt;br /&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;Description of techniques that&amp;nbsp;facilitate&amp;nbsp;the development of advanced crowdsourcing systems (e.g., Mapreduce, reputation systems, workflows etc). Can we devise a general computation system for the HPU (human processing unit), building on existing paradigms for computational systems? What are the fundamental building blocks that we need?&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Abhimanu Kumar and Matthew Lease. Modeling Annotator Accuracies for Supervised Learning.&lt;/b&gt;&lt;br /&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;This paper examines where to allocate the labeling effort when building a machine learning task. Basic idea: If we know the quality of the workers, most of the solutions work pretty similarly.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;b&gt;&lt;i&gt;Invited talk -&amp;nbsp;&lt;/i&gt;&lt;/b&gt;&lt;/span&gt;&lt;i&gt;Panos Ipeirotis&lt;/i&gt;:&amp;nbsp;Crowdsourcing using Mechanical Turk: Quality Management and Scalability&lt;/b&gt;&lt;br /&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;I described our experiences in building systems for managing quality when dealing with imperfect human annotators (from our &lt;a href="http://pages.stern.nyu.edu/~panos/publications/hcomp2010.pdf"&gt;HCOMP 2010 paper&lt;/a&gt;), and on how to efficiently allocate resources in labeling when using the data to build machine learning models (&lt;a href="http://pages.stern.nyu.edu/~panos/publications/kdd2008.pdf"&gt;KDD 2008&lt;/a&gt; and &lt;a href="http://archive.nyu.edu/handle/2451/29799"&gt;working paper&lt;/a&gt;). I used plenty of examples from our &lt;a href="http://www.adsafemedia.com/"&gt;AdSafe&lt;/a&gt;&amp;nbsp;experience, and gave a brief glimpse into our latest explorations in using the use of psychology and biology to influence worker behavior.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Conclusions and Best Paper&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;The day ended with some overall discussion about the problems that we face with crowdsourcing. The discussion focused significantly on what we can do to separate inherent uncertainty in the signal from uninformative noise, and on how to be able to get the "informed minority" to get the truth out, without being drowned by the "tyranny of majority" (a good example is the question "Is Obama a Grammy winner?", where most people will intuitively say "no" but the correct answer is "yes"; in redundancy based approaches it is likely that the noise with bury the signal). Also people expressed concern that everyone is building his own little system from scratch, reinventing the wheel, instead of having a more coordinated effort to share experiences, and infrastructure. Also the paper "How Crowdsourcable is Your Task?" got the most-innovative paper award, and the discussion continued over beers and other distilled beverages....&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-5983653813876726821?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/5983653813876726821'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/5983653813876726821'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2011/02/notes-from-in-search-and-data-mining.html' title='Notes from &amp;quot;Crowdsourcing in Search and Data Mining&amp;quot; (CSDM) workshop'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-5329564528431187961</id><published>2011-02-06T15:56:00.000-08:00</published><updated>2011-04-17T19:00:19.202-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='academia'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><title type='text'>The unreasonable effectiveness of simplicity</title><content type='html'>&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;There are a few techniques, which are extremely easy to understand and implement. At the same time, they appear to be extremely basic and should be very easy to beat with more advanced techniques. However, this is often not the case. Consider the following examples:&lt;br /&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;br /&gt;&lt;b&gt;Majority voting and aggregating discrete votes.&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Let's say that we are trying to label objects using discrete labels (e.g., is this blog comment "spam" or "not spam"). For each object, we get multiple people to look at it, and label it, the current practice today on Mechanical Turk.&lt;br /&gt;&lt;br /&gt;The simplest aggregation technique: Use the majority vote as the correct answer.&lt;br /&gt;&lt;br /&gt;This seems a ridiculously easy baseline to beat. We can model quality of the workers. We can control for the varying difficulty of the examples that need to be rated. We can control for the different types of expertise of the workers, and match them with the examples that are best for them. &lt;a href="http://pages.stern.nyu.edu/~panos/publications/hcomp2010.pdf"&gt;Plenty&lt;/a&gt; &lt;a href="http://mplab.ucsd.edu/~jake/OptimalLabeling.pdf"&gt;of&lt;/a&gt; &lt;a href="http://books.nips.cc/papers/files/nips23/NIPS2010_0577.pdf"&gt;papers&lt;/a&gt; &lt;a href="http://www.computer.org/portal/web/csdl/doi/10.1109/ICDM.2010.114"&gt;were&lt;/a&gt; &lt;a href="http://www.cs.cmu.edu/~pinard/Papers/280_Donmez.pdf"&gt;published&lt;/a&gt; &lt;a href="http://jmlr.csail.mit.edu/papers/v11/raykar10a.html"&gt;around&lt;/a&gt; &lt;a href="http://www.cs.mcgill.ca/~colt2009/papers/037.pdf"&gt;this&lt;/a&gt; &lt;a href="http://www.cs.cmu.edu/~schneide/rsp767-donmez.pdf"&gt;topic&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;What is the improvement? Modest at best, and non-existent most of the time. The only (real) improvement, in most cases, &lt;b&gt;means kicking out the spammers and take the majority vote across the good workers&lt;/b&gt;. Do we need any advanced technique for that? No. A few gold cases here and there (ala Crowdflower), or a simple comparison of how often one workers agrees with the majority, is typically enough.&lt;br /&gt;&lt;br /&gt;Why is that? Because majority vote is a simple model. No parameters to estimate.&amp;nbsp;For anything more advanced, we need a lot of data for the model to generate robust parameter estimates. The errors introduced by incorrect parameter estimates typically alleviates the advantages of the more complex modeling.&lt;br /&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;br /&gt;&lt;b&gt;Averages and aggregating continuous probability estimates&lt;/b&gt;&lt;br /&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;Now consider the case of combining forecasts from multiple sources. For example, we want to predict the weather, and we have multiple sources each with its own forecast. Or we have many stock market analysts, covering the same stock and making predictions for future performance.&lt;br /&gt;&lt;br /&gt;Consider the simplest way to aggregate: average across all estimates $p_i$.&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;$\hat p = \frac{1}{N} \cdot \sum_i^N p_i$&lt;/div&gt;&lt;br /&gt;Very straightforward. As in the case of aggregating discrete labels, it is trivial to improve, in theory.&lt;br /&gt;&lt;br /&gt;This topic has a long history in the literature, and there are even meta-studies that examine the effectiveness of the various approaches. See for example the survey-style studies:&lt;br /&gt;&lt;ul style="text-align: left;"&gt;&lt;li&gt;&lt;a href="http://duke.edu/~dandan/Papers/camel.pdf"&gt;The Effects &amp;nbsp;of &amp;nbsp;Averaging &amp;nbsp;Subjective &amp;nbsp;Probability &amp;nbsp;Estimates&amp;nbsp;Between &amp;nbsp;and Within Judges&lt;/a&gt;, by Ariely et al. and&amp;nbsp;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.cob.ohio-state.edu/~butler_267/DAPapers/WP970009.pdf"&gt;Combining probability distributions from experts in risk analysis&lt;/a&gt;&amp;nbsp;by Clemen and Winkler.&lt;/li&gt;&lt;/ul&gt;Both studies reach similar conclusions: You can definitely improve simple averages, but most of the time the improvement is marginal, and you lose robustness. From Clemen and Winkler: "&lt;i&gt;...simple combination rules (e.g., simple average) tend to perform quite well. More complex rules, sometimes outperform the simple rules, but they can be somewhat sensitive, leading to poor performance in some instances&lt;/i&gt;."&lt;br /&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;br /&gt;Sometimes a simple baseline algorithm is so good for practical purposes, that any improvements have only academic interest. I know that we have papers to write, careers to advance, and grants to get, but sometimes it is good to stop and think: "&lt;i&gt;What did I gain, compared to a simpler alternative? Does it make sense to introduce so much additional complexity for the sake of some minor improvement?&lt;/i&gt;"&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-5329564528431187961?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/5329564528431187961'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/5329564528431187961'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2011/02/unreasonable-effectiveness-of.html' title='The unreasonable effectiveness of simplicity'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-6914318461641288564</id><published>2010-12-23T22:14:00.000-08:00</published><updated>2011-04-17T19:00:19.206-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><title type='text'>Amazon Reacts: Spammers Kicked Out of MTurk!</title><content type='html'>I got a &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/12/mechanical-turk-now-with-4092-spam.html#comment-117873368"&gt;notification in the comments&lt;/a&gt;&amp;nbsp;of the &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/12/mechanical-turk-now-with-4092-spam.html"&gt;blog post about spam on MTurk&lt;/a&gt; that Amazon seems to have taken seriously the spam problem reported in my previous blog post.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;Broken Turk wrote:&lt;br /&gt;&lt;br /&gt;Have you seen the site? &amp;nbsp;It looks as if MTurk liked your research?&lt;br /&gt;&lt;br /&gt;&lt;a href="https://www.mturk.com/mturk/searchbar?selectedSearchType=hitgroups&amp;amp;searchWords=&amp;amp;minReward=1.00&amp;amp;x=10&amp;amp;y=10&amp;amp;=%2Fsearchbar"&gt;(Link to top paying HITs, no spam!)&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://turkers.proboards.com/index.cgi?board=kgeneral&amp;amp;action=display&amp;amp;thread=7820"&gt;(Turker Nation post)&lt;/a&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;Indeed, I checked the available HITs and all the spam HITs seem to have magically disappeared! It seems that all the negative publicity convinced the guys at MTurk that spam IS a problem.&lt;br /&gt;&lt;br /&gt;I will consider that the goal of the &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/12/mechanical-turk-now-with-4092-spam.html"&gt;blog post&lt;/a&gt; is now achieved. Amazon listened!&lt;br /&gt;&lt;br /&gt;Good job Amazon!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-6914318461641288564?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/6914318461641288564'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/6914318461641288564'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/12/amazon-reacts-spammers-kicked-out-of.html' title='Amazon Reacts: Spammers Kicked Out of MTurk!'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-7511242498763507959</id><published>2010-12-16T15:49:00.000-08:00</published><updated>2011-04-17T19:00:19.212-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='spam'/><title type='text'>Mechanical Turk: Now with 40.92% spam.</title><content type='html'>At this point, &lt;a href="https://www.mturk.com/mturk/findhits?match=false"&gt;Amazon Mechanical Turk&lt;/a&gt; has reached the mainstream. Pretty much everyone knows about the concept. Post small tasks online, pay people cents, and get thousands of micro-tasks completed.&lt;br /&gt;&lt;br /&gt;Unfortunately, this resulted in some unfortunate trends. Anyone who frequents just a little bit the market will notice the tremendous number of spammy HITs. (HIT = a task posted for completion in the market; stands for Human Intelligence Task). "Test if the ads in my website work". "Create a Twitter account and follow me". "Like my YouTube video". "Download this app". "Write a positive review on Yelp". A seemingly endless amount of spam HITs come to the market, mainly with the purpose of spamming "social media" metrics.&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;br /&gt;So, with &lt;a href="http://dahn.tamir.com/"&gt;Dahn Tamir&lt;/a&gt; and &lt;a href="http://www.linkedin.com/in/priyakanth"&gt;Priya Kanth&lt;/a&gt; (MS student at NYU), we decided to examine how big is the problem. How many spammers join the market? How many spam HITs are there?&lt;br /&gt;&lt;br /&gt;Using the data from &lt;a href="http://www.mturk-tracker.com/general/"&gt;Mechanical Turk Tracker&lt;/a&gt;, we picked all the requesters that first joined the market in September 2010 and October 2010. Why new ones? Because we assumed that long term requesters are not spammers. (But this remains to be verified.)&lt;br /&gt;&lt;br /&gt;This process resulted in 1733 new requesters that first appeared in the marketplace in September and October 2010. We then took all the HITs that these requesters posted in the market. This was a total of 5842 HIT groups. The activity patterns of the new requesters were similar to those of the general requester population.&lt;br /&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;br /&gt;The next step was to post these HITs on Mechanical Turk, and asked workers to classify them as spam or not, using the following guidelines:&lt;br /&gt;&lt;br /&gt;&lt;table border="1"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td&gt;&lt;blockquote&gt;&lt;br /&gt;Use the following guidelines to classify the HIT as SPAM: &lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;SEO&lt;/b&gt;: Asks me to give a fake rating, vote, review, comment, or "like" on Facebook, YouTube, DIGG, etc., or to create fake mail or website accounts.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Fake accounts&lt;/b&gt;: Asks me to create an account on Twitter, Facebook, and then perform a likely spam action.&amp;nbsp;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Lead Gen&lt;/b&gt;: Asks me to go to a website and sign up for a trial, complete an offer, fill out a form requesting information, "test" a data-entry form, etc.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Fake clicks&lt;/b&gt;: Asks me to go to a website and click on ads.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Fake ads&lt;/b&gt;: Asks me to post an ad to Craigslist or other marketplace.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Personal Info&lt;/b&gt;: Asks me for my real name, phone number, full mailing address or email.&lt;/li&gt;&lt;li&gt;You can also use your&amp;nbsp;intuition&amp;nbsp;to classify the HIT&lt;/li&gt;&lt;/ul&gt;Please DO NOT classify as spam, HITs that are legitimate in nature but priced offensively low.&lt;/blockquote&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;br /&gt;Interestingly enough, we got a ridiculous amount of spam from the worker side. Even with 99% approval rate and 1000 HITs as qualification, we got plenty of spammers giving us random data.&lt;br /&gt;&lt;br /&gt;Since spam was a big problem, we posted the HIT using CrowdFlower and we used a set of 100 manually classified HITs as gold. (Without Crowdflower, we had to manually kick out the spammers and repost the HITs. So, Crowdflower saved the day.)&lt;br /&gt;&lt;br /&gt;We asked 11 workers to classify each HIT, and we ignored&amp;nbsp;votes from the untrusted workers (that failed to answer correctly at least 75% of the gold tests). So, with 11 trusted workers working on each HIT, we were reasonably sure that the majority vote across these 11 votes resulted in an accurate HIT classification.&lt;br /&gt;&lt;br /&gt;I also ran the "&lt;a href="http://code.google.com/p/get-another-label/"&gt;get another label&lt;/a&gt;" code and I noticed that all the workers were of reasonable quality. Since the results were similar to those of the majority vote, I decided to keep things simple and go with the majority vote as the correct answer.&lt;br /&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;br /&gt;&lt;b&gt;The results&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;The results were disturbing. &lt;b&gt;Out of the total of&amp;nbsp;5841 HITs, a total of 2390 HITs, or 40.92% were marked as spam HITs.&lt;/b&gt;&lt;br /&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_skxW9kDLt_c/TQpWpze7-VI/AAAAAAAAfn0/02yFP0s79eQ/s1600/spam-vs-notspam.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://2.bp.blogspot.com/_skxW9kDLt_c/TQpWpze7-VI/AAAAAAAAfn0/02yFP0s79eQ/s1600/spam-vs-notspam.PNG" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;This is not good! 40% of the HITs from new requesters are spam!&lt;br /&gt;&lt;br /&gt;Our next test was to examine whether there are accounts that post a mix of spam and not spam HITs. The analysis indicated that this is not the case. Very few accounts post both spam HITs and legitimate HITs:&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_skxW9kDLt_c/TQmaiAORwnI/AAAAAAAAfnI/jo5vgvXRT04/s1600/spam-percentage-per-requester.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="267" src="http://3.bp.blogspot.com/_skxW9kDLt_c/TQmaiAORwnI/AAAAAAAAfnI/jo5vgvXRT04/s400/spam-percentage-per-requester.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;The plot illustrates that &lt;b&gt;31.83% of the new requesters post only spam HITs&lt;/b&gt;.&lt;br /&gt;&lt;br /&gt;In total, 757 out of the 1733 new requesters posted at least a one spam HIT, and 552 accounts were posting only spam HITs. 56.46% of the new requesters post no spam HITs. This nice separation indicates that it is easy to separate spam requesters from legitimate ones. There are not that many requesters that post both spam HITs and legitimate ones.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;So, 31.8% of the new requesters are clear spammers, and 40.92% of the new HITs are spam-related! &lt;/b&gt;This is clearly a problem. &lt;br /&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;br /&gt;&lt;b&gt;Spam HITs and pricing&lt;/b&gt;&lt;br /&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;So, what are the quantitative characteristics of the spam HITs?&lt;br /&gt;&lt;br /&gt;First of all, they tend to contain much fewer "HITs available" compared to the legitimate HITs. 95% of the spam HITs contain just a single HIT, while only 75% of the legitimate HITs have one HIT available.&lt;br /&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_skxW9kDLt_c/TQqHqcz4FpI/AAAAAAAAfn4/aOjeQqaeaF8/s1600/hits_available_spam_not_spam.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="241" src="http://4.bp.blogspot.com/_skxW9kDLt_c/TQqHqcz4FpI/AAAAAAAAfn4/aOjeQqaeaF8/s400/hits_available_spam_not_spam.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;On the other hand, spammers tend to post HITs with higher rewards (perhaps because they do not pay?).&amp;nbsp;Approximately&amp;nbsp;80% of the legitimate HITs are priced below one dollar, while only 60% of the spam HITs are priced below this threshold. Actually, many of the best paying HITs tend to be spam-related ones.&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_skxW9kDLt_c/TQqRpWam1qI/AAAAAAAAfn8/9rMKuElheR4/s1600/price_spam_not_spam.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="248" src="http://4.bp.blogspot.com/_skxW9kDLt_c/TQqRpWam1qI/AAAAAAAAfn8/9rMKuElheR4/s400/price_spam_not_spam.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div style="text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;By combining the two charts above, we can plot the total value of the spam vs not spam HITs.&amp;nbsp;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_skxW9kDLt_c/TQqWdBnNEgI/AAAAAAAAfoA/8-_jn_yv5MI/s1600/totalvalue_spam_not_spam.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="247" src="http://3.bp.blogspot.com/_skxW9kDLt_c/TQqWdBnNEgI/AAAAAAAAfoA/8-_jn_yv5MI/s400/totalvalue_spam_not_spam.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;Overall, the findings are not really surprising: Most of the spam HITs require large number of workers to complete a task. They want 1000 users to click an ad, not a single user to click a thousand times at a single ad. Therefore, I suspect that most of these spam HITs have a very significant amount of redundancy, (which unfortunately we cannot observe). This means that the total value of the posted spam HITs is most probably much higher than the total value of the legitimate HITs.&lt;/div&gt;&lt;div style="text-align: left;"&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;&lt;b&gt;What to do?&lt;/b&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;These trends are very worrisome:&lt;/div&gt;&lt;div style="text-align: left;"&gt;&lt;/div&gt;&lt;ul&gt;&lt;li&gt;40% of the HITs from new requesters are spam.&amp;nbsp;&lt;/li&gt;&lt;li&gt;30% of the new requesters are clear spammers.&lt;/li&gt;&lt;li&gt;The spam HITs have bigger value than the legitimate ones.&amp;nbsp;&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;It is very clear that active action should be taken against spam requesters.&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;According to our measurements, we see approximately &lt;a href="http://www.mturk-tracker.com/arrivals/"&gt;1500 new HITs arriving in the market every day&lt;/a&gt; (from all requesters), and approximately 30 new requester accounts join the market every day. It should be trivial to review all the HITs&amp;nbsp;&lt;b&gt;manually by posting them to MTurk for review.&lt;/b&gt;&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;But even if this manual inspection is expensive, this is a task that can be very easily automated. In our current work,&lt;b&gt; we realized that it is very easy to accurately classify HITs as spam or not. A simple SVM linear classifier that uses bag of words as features can achieve a &lt;span class="Apple-style-span" style="color: #990000;"&gt;95% true positive and 95% true negative rate&lt;/span&gt;. &lt;/b&gt;With a moderately advanced scheme, it should be possible to have a strong system in place pretty quickly.&lt;br /&gt;&lt;br /&gt;For whomever is interested, the data is available here&lt;br /&gt;&lt;iframe frameborder="0" height="300" src="https://spreadsheets.google.com/pub?key=0AjX1e06EhsXSdEhHWndZQ21oV0dERUpwVXA1UlFfX3c&amp;amp;hl=en&amp;amp;single=true&amp;amp;gid=0&amp;amp;output=html&amp;amp;widget=true" width="400"&gt;&lt;/iframe&gt;&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The disheartening part was the response of Amazon when we informed them about the issue. They pretty much assured us that everything is fine, and they believed there is no problem! For me, this was more problematic than the existence of spam.&lt;/div&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;br /&gt;&lt;div&gt;&lt;b&gt;Why Amazon ignores spam?&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;To answer this question, I have asked Amazon for access to the data to investigate further. Unfortunately, I was denied access. (It does not pay to &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/10/plea-to-amazon-fix-mechanical-turk.html"&gt;criticize Amazon&lt;/a&gt;.) Interestingly&amp;nbsp;enough, the MTurk guys &lt;a href="http://www.quora.com/If-you-could-observe-all-transactions-on-the-Amazon-Mechanical-Turk-marketplace-what-would-you-like-to-know"&gt;share data with other academics&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The key piece for answering this question, which I cannot get from my data: &lt;b&gt;Do spammers pay the workers?&lt;/b&gt;&amp;nbsp;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;If the spam requesters &lt;b&gt;do not pay&lt;/b&gt; the workers, then Amazon &lt;b&gt;&lt;i&gt;should &lt;/i&gt;&lt;/b&gt;be more proactive in battling spammer requesters. Workers need to be protected! It is easy to see that it is a death spiral otherwise. The more spammers can get away with getting work done and not paying, the less the workers will trust new requesters. Legitimate new requesters will face a significant uphill battle to convince the workers about their intentions, they will abandon their plans, and let the spammers prevail. We have &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/07/mechanical-turk-low-wages-and-market.html"&gt;a market for lemons&lt;/a&gt; on the inverse.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;If the spam requesters &lt;b&gt;pay &lt;/b&gt;the workers, then there is a cynical explanation: Amazon does not take an active role in cleaning the market because they simply profit from the spam. And it is part of the growth. And nobody within the MTurk division would cut in half the growth rate at this point.&lt;br /&gt;&lt;br /&gt;However, this would be an incredibly short-sighted approach.&amp;nbsp;With the amount of spam in the worker side, and the amount of spam in the requester side, then &lt;b&gt;&lt;i&gt;Mechanical Turk would slowly turn into a market where spammers requester talk to spammer workers&lt;/i&gt;&lt;/b&gt;... Ah yes, and academics running experiments...&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-7511242498763507959?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7511242498763507959'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7511242498763507959'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/12/mechanical-turk-now-with-4092-spam.html' title='Mechanical Turk: Now with 40.92% spam.'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_skxW9kDLt_c/TQpWpze7-VI/AAAAAAAAfn0/02yFP0s79eQ/s72-c/spam-vs-notspam.PNG' height='72' width='72'/></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-7207064072102940277</id><published>2010-12-12T22:30:00.000-08:00</published><updated>2011-04-17T19:00:19.219-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='academia'/><category scheme='http://www.blogger.com/atom/ns#' term='readability'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='open access'/><category scheme='http://www.blogger.com/atom/ns#' term='google'/><title type='text'>Sharing code, API's, and a Readability API</title><content type='html'>Yesterday, I received an email from a student that wanted to have access to some code that we used in our recent TKDE paper "&lt;a href="http://pages.stern.nyu.edu/~panos/publications/tkde2010-usefulness.pdf"&gt;Estimating the Helpfulness and Economic Impact of Product Reviews: Mining Text and Reviewer Characteristics&lt;/a&gt;".&lt;br /&gt;&lt;br /&gt;Specifically, the student wanted to estimate the &lt;a href="http://en.wikipedia.org/wiki/Readability"&gt;readability&lt;/a&gt; test scores for the reviews. For those not familiar with &lt;a href="http://en.wikipedia.org/wiki/Category:Readability_tests"&gt;readability tests&lt;/a&gt;, they are simple formulas that examine the text and estimate what is the necessary level education required in order to read and understand a particular piece of text.&lt;br /&gt;&lt;br /&gt;I tried to send the code, but then I realized that it had some dependencies to some old libraries, which have been&amp;nbsp;deprecated. At that point, I realized that it would be a pain to send the code to the student, then give instructions about all the dependencies etc. On the other hand, not sending the code is simply unacceptable.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Sharing code as an API&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;This got me thinking: How can we make the code to be robust to changes? How can we share the code in a way that it can be easily used by others? Given that all software packages today have web API's, why not creating API's for our own (research) code?&lt;br /&gt;&lt;br /&gt;Since I have never tried in the past to do some serious web programming, I decided that I can spend a few hours to familiarize myself with the basics and make my library to be a set of RESTful API calls.&lt;br /&gt;&lt;br /&gt;Apparently, it was not that difficult. I uploaded the code to the Google App Engine, and I wrote a small servlet that was taking as input the text, and was returning the readability metric of choice. Almost an assignment for a first-year student learning about programming.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Readability API&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;After a few hours of coding, I managed to generate a first version of the demo at&amp;nbsp;&lt;a href="http://ipeirotis.appspot.com/readability.html"&gt;http://ipeirotis.appspot.com/readability.html&lt;/a&gt;. I also created a &lt;a href="http://ipeirotis.appspot.com/readability-api.html"&gt;basic API&lt;/a&gt; which can be easily used to estimate the readability scores of various texts.&lt;br /&gt;&lt;br /&gt;I followed the example of bit.ly and I allowed the API calls to return simple txt format, so that it can be possible to embed the Readability API calls in many places. For example, I really enjoy calling bit.ly within Excel or within R, in order to shorten URLs. Now, it is possible to do the same in order to compute readability scores.&lt;br /&gt;&lt;br /&gt;For example, if we want to compute the&amp;nbsp;&lt;a href="http://en.wikipedia.org/wiki/SMOG"&gt;SMOG score&lt;/a&gt; for for the text &lt;i&gt;"I do not like them in a box. I do not like them with a fox" &lt;/i&gt;and get back the score in simple text, you just need to call:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://ipeirotis.appspot.com/readability/GetReadabilityScores?output=txt&amp;amp;metric=SMOG&amp;amp;text=I%20do%20not%20like%20them%20in%20a%20box.%20I%20do%20not%20like%20them%20with%20a%20fox."&gt;http://ipeirotis.appspot.com/readability/GetReadabilityScores?output=&lt;b&gt;txt&lt;/b&gt;&amp;amp;metric=&lt;b&gt;SMOG&lt;/b&gt;&amp;amp;text=I%20do%20not%20like%20them%20in%20a%20box.%20I%20do%20not%20like%20them%20with%20a%20fox.&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;The result is the SMOG score for the text, which in this case is&amp;nbsp;3.129. You can &lt;a href="http://ipeirotis.appspot.com/readability.html"&gt;play with the demo&lt;/a&gt;&amp;nbsp;and type whatever text you want,&amp;nbsp;and &lt;a href="http://ipeirotis.appspot.com/readability-api.html"&gt;see the documentation&lt;/a&gt;&amp;nbsp;if you want to use the code. Of course, the &lt;a href="http://code.google.com/p/panos-ipeirotis/"&gt;source code&lt;/a&gt; is also available.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Future Plans&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;I actually like this idea and the result. I will be trying to port more of my code online, and make it available as an API. With the availability of sites such as Google App Engine, we do not have to worry about servers being taken down, or upgrades in OS, etc. The code can remain online and functioning.&amp;nbsp;Now, let's see how easy it will be to port some non-trivial code.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-7207064072102940277?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7207064072102940277'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7207064072102940277'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/12/sharing-code-api-and-readability-api.html' title='Sharing code, API&amp;#39;s, and a Readability API'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-144334337980056596</id><published>2010-12-06T17:08:00.000-08:00</published><updated>2011-04-17T19:00:19.224-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><title type='text'>Excerpts from "The Communist Manifesto"</title><content type='html'>&lt;blockquote&gt;... A class of laborers, who live only so long as they find work, and who find work only so long as their labor increases capital.&lt;br /&gt;&lt;br /&gt;&lt;i&gt;These laborers, who must sell themselves piecemeal, are a commodity&lt;/i&gt;, like every other article of commerce, and are consequently exposed to all the vicissitudes of competition, to all the fluctuations of the market.&lt;br /&gt;&lt;br /&gt;Owing to the extensive use of machinery, and to the division of labor, &lt;i&gt;the work of the proletarians has lost all individual character, and, consequently, all charm for the workman&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;&lt;i&gt;He becomes an appendage of the machine, and it is only the most simple, most monotonous, and most easily acquired knack, that is required of him.&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: right;"&gt;&lt;i&gt;&lt;b&gt;Excerpts from "The Communist Manifesto", 1848&lt;/b&gt;&lt;/i&gt;&lt;/div&gt;&lt;/blockquote&gt;&lt;br /&gt;162 years later, the &lt;a href="http://www.marxists.org/archive/marx/works/1848/communist-manifesto/"&gt;Communist Manifesto&lt;/a&gt;, by Marx and Engels, finds a new meaning in the online world of Amazon Mechanical Turk.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-144334337980056596?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/144334337980056596'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/144334337980056596'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/12/excerpts-from-communist-manifesto.html' title='Excerpts from &amp;quot;The Communist Manifesto&amp;quot;'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-3205379086319351686</id><published>2010-11-28T21:04:00.000-08:00</published><updated>2011-04-17T19:00:19.227-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='independence'/><category scheme='http://www.blogger.com/atom/ns#' term='wisdom of the crowds'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><title type='text'>Wisdom of the Crowds: When do we need Independence?</title><content type='html'>I have been thinking lately about the conditions and assumptions for the &lt;a href="http://en.wikipedia.org/wiki/The_Wisdom_of_Crowds"&gt;wisdom of crowds&lt;/a&gt; to work.&amp;nbsp;Surowiecki, in this popular book, gave the following four conditions for the crowd to arrive at&lt;br /&gt;the correct decision.&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Diversity of opinion&lt;/b&gt;: Each person should have private information even if it's just an eccentric interpretation of the known facts.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Independence&lt;/b&gt;: People's opinions aren't determined by the opinions of&amp;nbsp;those around them.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Decentralization&lt;/b&gt;:     People are able to specialize and draw on local knowledge.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Aggregation&lt;/b&gt;:  Some mechanism exists for turning private judgments&amp;nbsp;into a collective decision.&lt;/li&gt;&lt;/ul&gt;The part that got me mostly puzzled is the independence assumption. Actually, I can support pretty much any thesis. I can argue that independence is necessary. I can argue that we do not really need independence so much. And I can argue that independence is evil. And I will do all these things below.&lt;br /&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;br /&gt;&lt;b&gt;Independence is necessary&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;It is not difficult to understand why, in some cases, independence is necessary. If the contributions from the crowd are not independent, then it we may easily observe a &lt;b&gt;herding behavior&lt;/b&gt;. &lt;a href="http://thenoisychannel.com/2010/11/17/an-information-cascade/"&gt;Daniel Tunkelang discusses a nice, instructional example&lt;/a&gt; (from the book &lt;a href="http://www.cs.cornell.edu/home/kleinber/networks-book/"&gt;Networks, Crowds, and Markets&lt;/a&gt;, by &lt;a href="http://www.arts.cornell.edu/econ/deasley/"&gt;David Easley&lt;/a&gt; and &lt;a href="http://www.cs.cornell.edu/home/kleinber/"&gt;Jon Kleinberg&lt;/a&gt;), in which the influence of the crowd can lead often to incorrect decisions, while independence can easily avoid erroneous outcomes.&lt;br /&gt;&lt;br /&gt;The paper "&lt;a href="http://www.jstor.org/stable/170754?seq=6"&gt;Limits for the Precision and Value of Information from Dependent Sources&lt;/a&gt;" by Clemen and Winkler shows that in the presence of &lt;b&gt;positive correlation&lt;/b&gt;, when we aggregate information from multiple &lt;b&gt;dependent &lt;/b&gt;sources, the resulting accuracy does not increase as we would expect. &lt;br /&gt;&lt;br /&gt;The figure below shows in the x-axis the number of&amp;nbsp;&lt;b&gt;dependent &lt;/b&gt;sources, and in the y-axis the equivalent number of &lt;b&gt;independent &lt;/b&gt;sources, for various correlation coefficients ?.&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_skxW9kDLt_c/TPKswzulFKI/AAAAAAAAfiA/oU2sye1W4Tc/s1600/limit-correlation.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="333" src="http://4.bp.blogspot.com/_skxW9kDLt_c/TPKswzulFKI/AAAAAAAAfiA/oU2sye1W4Tc/s400/limit-correlation.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;Even at moderate levels of ?, we see how strong are the limitations. With ?=0.4 it is almost impossible to go above two independent sources. And if we have noisy input, &lt;a href="http://archive.nyu.edu/handle/2451/29799"&gt;we often need a large number of independent sources to separate signal from noise&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;i&gt;In other words, it is better to have a couple of independent opinions, rather than having thousands of correlated voices.&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;br /&gt;&lt;b&gt;Lack of independence: Perhaps not so bad&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;We have examples where lack of independence is not always bad.&lt;br /&gt;&lt;br /&gt;For example, according to the paper&amp;nbsp;"&lt;a href="http://www.edvul.com/pdf/VulPashler-PS-2008.pdf"&gt;Measuring the Crowd Within&lt;/a&gt;" by Vul and Pashler, even asking the &lt;i&gt;same person&lt;/i&gt; for a second time and getting the average can lead to improved outcomes.&lt;br /&gt;&lt;br /&gt;Or take the other poster-child application of wisdom of crowds: &lt;b&gt;prediction markets (or markets, in general)&lt;/b&gt;. In these markets, people trade based on their personal information. However, they can always see (and get influenced?) by the aggregated opinion of the crowd, as this is reflected in the market prices. And empirical evidence illustrates that (prediction) markets work &lt;a href="http://bpp.wharton.upenn.edu/jwolfers/Papers/Predictionmarkets.pdf"&gt;surprisingly&lt;/a&gt; &lt;a href="http://www.bocowgill.com/GooglePredictionMarketPaper.pdf"&gt;well&lt;/a&gt;, despite (or because of) the lack of independence. Prior work has even demonstrated that &lt;a href="http://www.jstor.org/stable/1831348"&gt;even non-public information spreads quickly&lt;/a&gt; through the market (and the SEC checks for &lt;a href="http://en.wikipedia.org/wiki/Insider_trading"&gt;insider trading&lt;/a&gt; if they detect unusual activity &lt;i&gt;before &lt;/i&gt;the public release of sensitive&amp;nbsp;information.)&lt;br /&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;&lt;b&gt;Wikipedia &lt;/b&gt;is another example: People do see what everyone else has done so far, before adding the extra information.&lt;br /&gt;&lt;br /&gt;One paper that I found to be of interest is the &lt;a href="http://www.stanford.edu/~bgolub/papers/naivelearning.pdf"&gt;Na�ve Learning in Social Networks and the&amp;nbsp;Wisdom of Crowds&lt;/a&gt; by Golub and Jackson. The authors address the following question: "&lt;i&gt;for which social network structures will a&amp;nbsp;society of agents who communicate and update na�vely come to aggregate decentralized&amp;nbsp;information completely and correctly?". &lt;/i&gt;The results are based on the ideas of convergence for Markov Chains. One of the basic result says that the Pagerank-score of a node in the network defines the weight of the node's influence in the final outcome.&lt;br /&gt;&lt;br /&gt;In all these cases, the participants get information from the crowd, they do not just follow blindly. So, there is some benefit in interacting.&lt;br /&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;br /&gt;&lt;b&gt;Independence is bad&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Going even further, we have cases where&amp;nbsp;complete independence of participants is bad!&lt;br /&gt;&lt;br /&gt;This typically happens when participants know only parts of the overall information. Through communication, it is possible to identify the complete picture, but lack of communication leads to suboptimal outcomes. Consider the example in Proposition 2 from the paper&amp;nbsp;"&lt;a href="http://www.polemarchakis.org/a14-cdf.pdf"&gt;We can't disagree forever&lt;/a&gt;"&amp;nbsp;by Geanakoplos and Polemarchakis:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;We have a 4-sided dice, with mutually exclusive outcomes A, B, C, and D, each one&amp;nbsp;occurring&amp;nbsp;with probability 0.25.&lt;/li&gt;&lt;li&gt;In reality, the dice rolled 1. But nobody knows that. Instead the knowledge of the players is:&lt;/li&gt;&lt;ul&gt;&lt;li&gt;Player 1 knows that the event "&lt;i&gt;A or B&lt;/i&gt;" happened&lt;/li&gt;&lt;li&gt;Player 2&amp;nbsp;knows that the event "&lt;i&gt;A&amp;nbsp;or C&lt;/i&gt;" happened&lt;/li&gt;&lt;/ul&gt;&lt;li&gt;Both players can bet on whether "&lt;i&gt;A or D&lt;/i&gt;" happened.&lt;/li&gt;&lt;/ul&gt;So, look what happens&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;No independence: &lt;/b&gt;If player 1 &lt;i&gt;can communicate&lt;/i&gt; directly with player 2, they can figure out that event A happened, and they are certain that&amp;nbsp;"&lt;i&gt;A or D&lt;/i&gt;"&amp;nbsp;occurred&amp;nbsp;with probability 1.0&lt;/li&gt;&lt;li&gt;&lt;b&gt;Independence: &lt;/b&gt;If player 1 &lt;i&gt;cannot communicate&lt;/i&gt;, then both players assign a probability of 0.5 to the event "&lt;i&gt;A or D&lt;/i&gt;". This is despite the fact that they collectively own enough information to figure out that A happened, and &lt;i&gt;there is&lt;/i&gt; a market to trade the event. In other words, the market fails to aggregate the available information.&lt;/li&gt;&lt;/ul&gt;So, we have a scenario where the inability to spread information actually results in a bad outcome. However, if we allowed the participants to be non-independent, we could have an improved outcome.&lt;br /&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;br /&gt;&lt;b&gt;Influence vs Information Spread&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;So, we can see actual examples where spread of information (and hence, lack of independence) can be both good and bad. Lack of independence, can lead to groupthink: and the individual voices get drowned in a sea of correlated opinions. At the other extreme, lack of communication leads to suboptimal outcomes.&lt;br /&gt;&lt;br /&gt;The paper by Plott and Sunder &lt;a href="http://www.jstor.org/stable/1911360"&gt;"Rational Expectations and the Aggregation of Diverse Information in Laboratory Security Markets&lt;/a&gt;" discusses the issue in the context of security markets and examines how market design affects the information aggregation properties of markets. (Thanks for &lt;a href="http://blog.oddhead.com/"&gt;David Pennock&lt;/a&gt; for the pointer.)&lt;br /&gt;&lt;br /&gt;The paper by Ostrovsky "&lt;a href="http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1321555"&gt;Information Aggregation in Dynamic Markets with Strategic Traders&lt;/a&gt;" (in EC'09, I think also forthcoming in Econometrica) provides a rigorous theoretical framework on what are the conditions for information to be aggregated in a market: essentially we have "separable" securities for which all the available information can be aggregated, and non-separable ones that do not have this property. However, I do not have the necessary background to fully understand and present the ideas in the paper. And I cannot see how to connect this with the literature of information spreading in social networks.&lt;br /&gt;&lt;br /&gt;In a more intuitive sense, it seems that &lt;b&gt;we need information to spread and not just influence&lt;/b&gt;.&lt;br /&gt;&lt;br /&gt;Unfortunately, I cannot grasp the full picture, despite the fact that I tried to look the problem from different angles (Ironic, eh?).&lt;br /&gt;&lt;br /&gt;I still not fully understand the implications of the above in the design of processes that involve human input. Does it make sense to show to people what other people have contributed so far? Will we see effects of &lt;a href="http://en.wikipedia.org/wiki/Anchoring"&gt;anchoring&lt;/a&gt;? Or will we see the &lt;a href="http://acl.ldc.upenn.edu/J/J97/J97-4008.pdf"&gt;establishment of a common ground&lt;/a&gt; and get people to coordinate better and understand each other's input?&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;How can we quantify and put in a common framework all the above?&lt;/i&gt;&lt;/b&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-3205379086319351686?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/3205379086319351686'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/3205379086319351686'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/11/wisdom-of-crowds-when-do-we-need.html' title='Wisdom of the Crowds: When do we need Independence?'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_skxW9kDLt_c/TPKswzulFKI/AAAAAAAAfiA/oU2sye1W4Tc/s72-c/limit-correlation.png' height='72' width='72'/></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-4307971595818491801</id><published>2010-11-23T23:21:00.000-08:00</published><updated>2011-04-17T19:00:19.231-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='psychology'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='cognitive dissonance'/><title type='text'>Mechanical Turk, "Interesting Tasks," and Cognitive Dissonance</title><content type='html'>&lt;div style="text-align: justify;"&gt;It is a well-known fact that the wages on Mechanical Turk are &lt;a href="http://behind-the-enemy-lines.blogspot.com/2008/09/how-much-turking-pays.html"&gt;horribly&lt;/a&gt;&amp;nbsp;&lt;a href="http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1596874"&gt;low&lt;/a&gt;. We can have &lt;a href="http://portal.acm.org/citation.cfm?id=1837885.1837891"&gt;endless&lt;/a&gt; &lt;a href="http://vonahn.blogspot.com/2010/07/work-and-internet.html"&gt;discussions&lt;/a&gt; about this, and my own belief is that it is &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/07/mechanical-turk-low-wages-and-market.html"&gt;due to the lack of a strong worker reputation system&lt;/a&gt;. Others believe that this is due to the global competition for unskilled labor. And others are agnostic, saying that everything is a matter of supply and demand.&lt;/div&gt;&lt;div style="text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: justify;"&gt;Other people try to explain the low wages by looking at the &lt;a href="http://behind-the-enemy-lines.blogspot.com/2008/03/why-people-participate-on-mechanical.html"&gt;motivation of the workers&lt;/a&gt;: Quite a few people find the tasks on Mechanical Turk to be interesting. Ergo, they are willing to work for less.&lt;/div&gt;&lt;div style="text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: justify;"&gt;Perfectly normal right? The task is interesting, people are willing to do it for less money. Sounds reasonable. Right? RIGHT? Well, be careful: &lt;b&gt;correlation does not imply causation!&lt;/b&gt;&lt;/div&gt;&lt;div style="text-align: justify;"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;&lt;hr style="background-color: #691f01; border-bottom-width: 1px; border-color: initial; border-left-width: 1px; border-right-width: 1px; border-style: initial; border-top-width: 1px; color: #691f01; display: block; height: 2px; text-align: justify;" width="50%" /&gt;&lt;div style="text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: justify;"&gt;Enter the region of social psychology (thanks &lt;a href="http://www.hiphoppsychology.org/?p=469"&gt;Konstantinos&lt;/a&gt;!): The &lt;b&gt;&lt;a href="http://en.wikipedia.org/wiki/Cognitive_dissonance"&gt;theory of cognitive dissonance&lt;/a&gt;&lt;/b&gt;&amp;nbsp;indicates that the causation may go in the entirely opposite direction: &lt;b&gt;The wages are low, so people justify their participation by saying the work is interesting!&lt;/b&gt;&lt;/div&gt;&lt;div style="text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: justify;"&gt;This surprising result is due to the paper "&lt;a href="http://psychclassics.yorku.ca/Festinger/"&gt;Cognitive Consequences of Forced Compliance&lt;/a&gt;" from&amp;nbsp;&lt;b&gt;Festinger and Carlsmith (1959). &lt;/b&gt;It is one of the classic papers in psychology.&lt;/div&gt;&lt;div style="text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: justify;"&gt;What did Festinger and Carlsmith say?&lt;/div&gt;&lt;div style="text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: justify;"&gt;That people that get low payment to do boring tasks, will convince themselves that they do this because the task is interesting. Otherwise, the conflict in their mind will be just too big: why do they work on such a boring task when the payment is horrible?&lt;/div&gt;&lt;div style="text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: justify;"&gt;In contrast, if someone gets paid well to do the same boring task, they will consider the task boring. These well-paid participant can easily justify that they do the work for the money, (so it makes sense to do a boring job).&lt;/div&gt;&lt;div style="text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: justify;"&gt;Amazingly enough, Festinger and Carlsmith verified this experimentally. Here is the experimental setup description from the &lt;a href="http://en.wikipedia.org/wiki/Cognitive_dissonance#The_Induced-Compliance_Paradigm"&gt;Wikipedia entry that describes this intriguing experiment&lt;/a&gt;:&lt;/div&gt;&lt;div style="text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;blockquote&gt;&lt;div style="text-align: justify;"&gt;Students were asked to spend an hour on boring and tedious tasks (e.g., turning pegs a quarter turn, over and over again). &lt;b&gt;The tasks were designed to generate a strong, negative attitude.&lt;/b&gt;&lt;/div&gt;&lt;div style="text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: justify;"&gt;Once the subjects had done this, the experimenters asked some of them to do a simple favor. They were asked to talk to another subject (actually an actor) and persuade them that the tasks were interesting and engaging.&lt;/div&gt;&lt;div style="text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: justify;"&gt;Some participants were paid $20 (inflation adjusted to 2010, this equates to $150) for this favor, another group was paid $1 (or $7.50 in "2010 dollars"), and a control group was not asked to perform the favor.&lt;/div&gt;&lt;div style="text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: justify;"&gt;&lt;b&gt;When asked to rate the boring tasks at the conclusion of the study (not in the presence of the other "subject"), those in the $1 group rated them more positively than those in the $20 and control groups.&lt;/b&gt;&lt;/div&gt;&lt;div style="text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: justify;"&gt;The researchers theorized that people experienced dissonance between the conflicting cognitions, "I told someone that the task was interesting", and "I actually found it boring." &lt;b&gt;When paid only $1, students were forced to internalize the attitude they were induced to express, because they had no other justification.&lt;/b&gt; &lt;b&gt;Those in the $20 condition, however, had an obvious external justification for their behavior (i.e., high payment), and thus experienced less dissonance.&lt;/b&gt;&lt;/div&gt;&lt;/blockquote&gt;&lt;div style="text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;hr style="background-color: #691f01; border-bottom-width: 1px; border-color: initial; border-left-width: 1px; border-right-width: 1px; border-style: initial; border-top-width: 1px; color: #691f01; display: block; height: 2px; text-align: justify;" width="50%" /&gt;&lt;div style="text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: justify;"&gt;So, when you read surveys (mine included) that indicate that Mechanical Turk workers participate on the platform because they "find the tasks interesting", (and so it makes sense to pay low wages), please have this&amp;nbsp;alternative&amp;nbsp;explanation in mind:&lt;/div&gt;&lt;div style="text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: justify;"&gt;&lt;i&gt;Turkers convince themselves that the work is interesting, otherwise they would be completely crazy sitting there doing mind-boggling&amp;nbsp;boring work, just to earn wage of a couple of bucks per hour.&lt;/i&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-4307971595818491801?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/4307971595818491801'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/4307971595818491801'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/11/mechanical-turk-tasks-and-cognitive.html' title='Mechanical Turk, &amp;quot;Interesting Tasks,&amp;quot; and Cognitive Dissonance'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-1336651669155361488</id><published>2010-11-23T17:22:00.000-08:00</published><updated>2011-04-17T19:00:19.234-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='teaching'/><category scheme='http://www.blogger.com/atom/ns#' term='data mining'/><category scheme='http://www.blogger.com/atom/ns#' term='large datasets'/><title type='text'>NYC, I Love You(r Data)</title><content type='html'>&lt;a href="http://behind-the-enemy-lines.blogspot.com/2009/11/using-nyc-data-mine-for-intro-database.html"&gt;Last year, I experimented&lt;/a&gt; with the &lt;a href="http://www.nyc.gov/html/datamine/html/home/home.shtml"&gt;NYC Data Mine repository&lt;/a&gt; as a source of data for our introductory course on information systems (for business students, mainly non-majors). The results of the assignment were great, so I repeated it this year. &lt;br /&gt;&lt;br /&gt;The goal of the assignment was to teach them how to grab and run database queries against large datasets. As part of an assignment, the students had to to go the  NYC Data Mine repository, pick two datasets of their interest, join them in Access, and perform some analysis of interest.&amp;nbsp;The ultimate goal was to get them to use some real data, and use them to perform an analysis of their interest.&lt;br /&gt;&lt;br /&gt;Last year, some students took the easy way out and joined the datasets manually(!) on the borough values (Manhattan, Bronx, Brooklyn, Queens, Staten Island). This year, I explicitly forbid them from doing so. Instead, I explicitly asked them to join only using attributed with a large number of values.&lt;br /&gt;&lt;br /&gt;The results are here and most of them are well-worth reading! The analyses below is almost like a tour guide on the New York's data sightseeings :-) The new generation of &lt;a href="http://fivethirtyeight.blogs.nytimes.com/"&gt;Nate Silver&lt;/a&gt;'s is coming. &lt;br /&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;br /&gt;Enjoy the projects:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/lla236/public/recycling/"&gt;Academia and Concern for the Environment!&lt;/a&gt; Is there a correlation between how much you recycle and how well students perform in school? Are kids who are more involved in school activities more likely to recycle? Does school really teach us to be environmentally conscious? To find out the answers check out our site! &lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/kc1294/public/itproject/IT_Group_Assignment/An_Analysis_of_NYC_Events.html"&gt;An Analysis of NYC Events&lt;/a&gt;: One of the greatest aspects about New York are the fun festivals, street fairs and block parties where you can really take in the culture. Our charts demonstrate which time to visit New York or what boroughs to attend events. We suggest that tourists and residents check out our research. Also organizers of events or people who make there money from events should also consult our analysis.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/nez204/public/incomevsafterschoolprograms.html"&gt;How are income and after school programs related?&lt;/a&gt;: This study is an analysis of how income levels are related to the number of after school programs in an area. The correlation between income and number of school programs was interesting to analyze across the boroughs because while they did follow a trend, the different environments of the boroughs also had an exogenous effect. This is most evident in Manhattan, which can be seen in the study. &lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/us266/public/Manhattan%20Restaurant%20Cleanliness.html"&gt;Restaurant Cleanliness in Manhattan&lt;/a&gt; What are the cleanest and dirtiest restaurants in Manhattan? What are the most common restaurant code violations? We analyzed data on restaurant inspection results and found answers to these questions and more.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu./ps1486/public/NYCReport.html"&gt;Ethnic Dissimilarity's Effect on New Business&lt;/a&gt;: This analysis focuses on the relationship between new businesses and specific ethnic regions. Do ethnically dominated zip codes deter or promote business owners of differing ethnicities to open up shop? &lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/ap2427/public/Section%203%20-%20Group%2013%20Part%20E.html"&gt;Does The Perception Of Safety In Manhattan Match With Reality?&lt;/a&gt; People�s perception of events and their surroundings influence their behavior and outlook, even though facts may present a different story. In this regard, we took a look at the reported perception of people�s safety within Manhattan and compared it to the actual crime rates reported by the NYPD. The purpose of our study was to evaluate the difference between the actual crime rate and perceived safety of citizens and measure any discrepancy. &lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/km1602/public/Parte.html"&gt;Women's Organizations love food stores!&lt;/a&gt;: We have concluded that a large percentage of women's organizations are located near casual dining and takeout restaurants as well as personal and professional service establishments compared to what we originally believed would be shopping establishments. &lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/bk940/public/datamine/index.htm"&gt;Hispanics love electronics!&lt;/a&gt;: Our goal for this project is to analyze the relationship between electronic stores and demographics in a particular zip code. We conducted a ratio analysis instead of a count analysis to lessen the effects of population variability as to create an "apples to apples" comparison. From our analysis, it can be seen that there is a greater presence of electronic stores in zip codes with a higher proportion of Hispanics. &lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/amo328/public/Index.html"&gt;Political Contributions and Expenditures&lt;/a&gt;: A comprehensive analysis of the political contributions and expenditures during the 2009 elections. The breakdown of who, in what areas of Manhattan contribute as well as how candidates spend their money are particularly interesting! &lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/wo253/public/GroupProject1/"&gt;How Dirty is Your Food?&lt;/a&gt; Our goal for this project is to analyze the various hygiene conditions of restaurants in New York City. We cross referenced the inspection scores of the restaurants with the cuisine they serve to find out if there was any correlation between these two sets of data. By ranking the average health score of the various cuisines, we can determine which kinds of cuisines were more likely to conform to health standards. &lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/aan261/public/businessnyc.html"&gt;Want to Start a Laundromat? An Electronic Store?&lt;/a&gt; The best possible places to start a Laundromat and an electronic store. For Laundromats we gave the area that had the lowest per capita income, as we noticed a trend that Laundromats do better in poorer neighborhoods. For electronic stores we found the lowest saturated areas that have the highest per capita income. &lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/am3036/public/safety.html"&gt;Where to Let Your Children Loose During the Day in NYC&lt;/a&gt;: For this analysis, we wondered whether there was a correlation between how safe people felt in certain areas in New York and the availability of after-school programs in the different community boards. &lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/mc3077/public/parte.html"&gt;Best Place to Live in Manhattan After Graduation&lt;/a&gt;: We analyzed what locations in Manhattan, classified by zip code, would be the best to live for a newly graduate. We used factors like shopping, nightlife, gyms, coffeehouses, and more! Visit the website to get the full analysis. &lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/sr1860/public/page3.html"&gt;Political Contributions and Structures&lt;/a&gt;: Our report analyzes the correlation between political contributions and structures in New York in varying zip codes. &lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/jsa314/public/parkingandeatinginnyc.html"&gt;Best Places to Eat and Find Parking in New York City&lt;/a&gt;: Considering the dread of finding parking in New York City, our analysis is aimed at finding the restaurants with the largest number of parking spaces in their vicinities. &lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/jpc406/public/Documents/index.html"&gt;Are the Cleanest Restaurants Located in the Wealthiest Neighborhoods?&lt;/a&gt; Our analysis between property value and restaurant rating for the top and bottom ten rated restaurants by zip codes in New York City&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/yp429/public/Popular%20Baby%20Names%20(NYC%20vs%20USA).html"&gt;Analysis of Popular Baby Names&lt;/a&gt; &lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/cc2739/public/infotechgroup9.html"&gt;Restaurant Sanitary Conditions&lt;/a&gt;: Our team was particularly interested in the various cuisines offered in various demographic neighborhoods, grouped by zip codes. We were especially curious about the sanitary level of various cuisines offered by restaurants. The questions we wanted to answer were: &lt;ul&gt;&lt;li&gt;What zip codes had the highest rated restaurants? What type of cuisines are found in these zip codes?&lt;/li&gt;&lt;li&gt;What zip codes had the lowest rated restaurants? What type of cuisines are found in these zip codes?&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/qhg200/public/page5.html"&gt;Does having more community facilities improve residents' satisfaction with city agencies?&lt;/a&gt; Does having more public and private community facilities in NYC such as schools, parks, libraries, public safety, special needs housing, health facilities, etc lead to greater satisfaction with city services? On intuition, the answer is a resounding YES! With more facilities, we would enjoy our neighborhood better and develop a better opinion of New York City services. But how accurate is this intuition? In this analysis, we put that to the test.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/smk483/public/housing.html"&gt;Housing Patterns in Manhattan&lt;/a&gt;: The objective of our analysis was to identify factors which play a role in determining vacancy rates in Manhattan�s community districts. We inferred that vacancy rates are representative of the population�s desire to live in a particular district. We examined determining factors of why people want to live in a particular district including: quality of education, health care services, crime control in each district, etc. &lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/prp247/public/projecte.html"&gt;Analysis of Cultural Presence and Building Age by Zip Code&lt;/a&gt;: Manhattan is a hub for cultural organizations and opportunities for community involvement. But does the amount of "community presence" differ based on area that you live? Is there any relationship between the year that buildings in various areas were built, and the available programs for students and cultural organizations for the general public in that area? We analyzed whether a relationship existed between the number of cultural organizations and after school programs available in a zip code, and the average year that the buildings in the zip code were built. To further our analysis we looked at whether the age of buildings in areas with greatest "cultural presence" affected the sales price of the buildings. &lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/jws377/public/Baby%20names.html"&gt;Analysis of Baby Names across the Boroughs&lt;/a&gt;: We decided to analyze the Baby Names given in 2008 across the boroughs of Manhattan, the Bronx and Brooklyn. We found the most popular names in each Borough, along with top names specific to each borough that were unpopular in other Boroughs. We also found certain factors that could be a determining factor in the naming of these babies. &lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/sek351/public/parte.html"&gt;Analysis of New York City Street Complaints&lt;/a&gt;: We analyzed the different kinds of street complaints made in New York City, how the city tends to respond to them, and which streets have the most overall complaints when you also bring building complaints into the picture. This analysis taught us that Broadway has the most street complaints but it also piqued our interest in conducting even further analyses.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/sw1262/public/infotech/index.html"&gt;Campaign Contributions and Community Service Programs&lt;/a&gt; The goal of our analysis was to determine if there is a correlation between contributions by NYC residents to election candidates and community service programs. We wanted to see if people who are more financially invested in elections are also more inclined to be involved in their neighborhoods through community programs. &lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/jdp343/public/libraries.html"&gt;Public Libraries in Queens&lt;/a&gt;: We looked at how many public libraries there were in each zip code in Queens. We also looked at the number of people and racial composition in each zip code, to see if these factors are related.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://files.nyu.edu/glh237/public/InfoTechGroupProject1Group11PartE.html"&gt;Sidewalk Cafe Clustering&lt;/a&gt;: Our study�s goal is to understand where sidewalk cafes cluster and some potential reasons why they cluster. We start by looking at what areas of the city are most populated with sidewalk cafes. Then we look to see if there are any trends related to gender or race demographics. We finally look to see if there is any influence on property value on the abundance of sidewalk cafes. &lt;/li&gt;&lt;/ul&gt;&lt;div&gt;&lt;hr align="center" style="background-color: #691f01; border: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;br /&gt;The surprise in this year: Most students could not understand what is the "CSV" data file. Many of them thought it was some plain text, and did not try to use it. (Hence the prevalence of electronic and&amp;nbsp;laundromat&amp;nbsp;analyses, which were based on datasets available in Excel format.) I guess next year I will need to explain that as well.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-1336651669155361488?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/1336651669155361488'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/1336651669155361488'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/11/nyc-i-love-your-data.html' title='NYC, I Love You(r Data)'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-7762667688472943880</id><published>2010-11-19T00:06:00.000-08:00</published><updated>2011-04-17T19:00:19.238-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='academia'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='advice'/><title type='text'>Introductory Research Course: Replicate a Paper</title><content type='html'>The transition to the happy life of a tenured professor meant that I get to be involved in the wonderful part of the job: Getting to sit in school-wide committees.&lt;br /&gt;&lt;br /&gt;Fortunately, I was assigned in an extremely interesting committee: We get to examine the PhD program for the school, see the best practices, see what works and what does not, and try to reconcile everything into a set of recommendations for the faculty to examine. The double benefit for me is that I get to understand how the other departments operate in the school, a thing which, for &amp;nbsp;a computer scientist in a business school, was still kind of a mystery to me.&lt;br /&gt;&lt;br /&gt;Anyway, as part of this task, I learned about an interesting approach to teach starting PhD students about research:&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;&lt;b&gt;A course in which students pick a paper and get to replicate it.&lt;/b&gt;&lt;/div&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; border: 1px; color: #691f01; display: block; height: 2px;" width="50%" /&gt;&lt;br /&gt;I think this is a great idea. First of all, I am a big fan of learning-by-doing.&lt;br /&gt;&lt;br /&gt;For example, to understand how an algorithm works, you need to actually implement it. Not get the code and re-run the experiments. Implement everything, going as deeply as possible. In C, in Java, in Perl, in Python, in MatLab, in Maple, in Stata, it does not matter. For theory, the same thing: replicate the proofs. Do not skip the details. For data analysis, the same.&amp;nbsp;Get your hands dirty.&lt;br /&gt;&lt;br /&gt;During such a process, it is great to have someone to serve as a sounding board. Ask questions about the basics. Why do we follow this rule of thumb? What is the assumption behind the use of this method?&amp;nbsp;Asking these questions is much easier while working on replicating someone else's work, rather then when working on your own research and trying to get a paper out.&lt;br /&gt;&lt;br /&gt;Myself, I still write code for this very same reason. I need to see how the algorithm behaves. I need to see the small peculiarities in behavior. This observation gets me to understand better not only the algorithm itself but also other techniques that are employed by the algorithm. I am trying to understand econometrics a little bit deeper the last few months, and I do the same. Frustrating? Yes. Slow? Yes. Helpful? You bet!&lt;br /&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; display:block; border: 1px; color: #691f01; height: 2px;" width="50%" /&gt;&lt;br /&gt;So, at the end of the seminar, if the students can replicate the results of the paper, great: They learned what it takes to create a paper and most probably understood deeper a few other topics in the way. &lt;br /&gt;&lt;br /&gt;If the results are different than in the original paper, then perhaps this is the beginning of a deeper investigation. Why things are different? Tuning? Settings? Bugs? Perhaps uncovering something not seen by the authors?&lt;br /&gt;&lt;br /&gt;Even if the data from the authors are not available, the students should be able to reproduce and get similar results perhaps with different data sets. If the results with different data sets are qualitative different, then the paper is essentially not reproducible. (And &lt;a href="http://www.site.uottawa.ca/ICML09WS/papers/w2.pdf"&gt;replicability is not reproducibility&lt;/a&gt;.)&lt;br /&gt;&lt;br /&gt;And in any case, no matter if the students can replicate the results or not, no matter if the paper is reproducible or not, the lesson from such an exercise can be valuable. &lt;br /&gt;&lt;br /&gt;Often the student who understands better the paper, falls in love with a topic, and gets to learn more and more about the area. Following the footsteps of someone is often the first step to find your own path.&lt;br /&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; display:block; border: 1px; color: #691f01; height: 2px;" width="50%" /&gt;&lt;br /&gt;I think this seminar will make it to the final set of recommendations to the school. I am wondering how many other schools have such a course.&lt;br /&gt;&lt;br /&gt;&lt;hr align="center" style="background-color: #691f01; display:block; border: 1px; color: #691f01; height: 2px;" width="50%" /&gt;&lt;br /&gt;&lt;b&gt;Update1&lt;/b&gt;: Needless to say, this is a class, not something that students try on their own. Therefore, the professor should pick a set of papers which are educational and useful to replicate. This can be either an easy "classic" paper, or an "important new" result, or even a paper that forces the students to use particular tools and data sources. The students choose from a predefined set, not from the wild.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Update2&lt;/b&gt;: Thanks to Jun, a commenter below, we have now a reference to the originator of the idea. Apparently,&amp;nbsp;&lt;a href="http://gking.harvard.edu/"&gt;Gary King&lt;/a&gt; has published a paper in 2006, titled ""&lt;a href="http://gking.harvard.edu/files/abs/paperspub-abs.shtml"&gt;Publication, Publication&lt;/a&gt;", in "Political Science and Politics". From the abstract: &lt;i&gt;"I show herein how to write a publishable paper by beginning with the replication of a published article. This strategy seems to work well for class projects in producing papers that ultimately get published, helping to professionalize students into the discipline, and teaching them the scientific norms of the free exchange of academic information. I begin by briefly revisiting the prominent debate on replication our discipline had a decade ago and some of the progress made in data sharing since."&lt;/i&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-7762667688472943880?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7762667688472943880'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7762667688472943880'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/11/introductory-research-course-replicate.html' title='Introductory Research Course: Replicate a Paper'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-781070935308399476</id><published>2010-10-28T08:15:00.000-07:00</published><updated>2011-04-17T19:00:19.240-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='academia'/><category scheme='http://www.blogger.com/atom/ns#' term='humor'/><category scheme='http://www.blogger.com/atom/ns#' term='funny'/><title type='text'>Cease and desist...</title><content type='html'>This was just too funny to resist posting.&lt;br /&gt;&lt;br /&gt;Here is the background: As part of the core undergraduate introductory class "Information Technology in Business and Society", students have to create a website. To make things more interesting, I ask them to pick a few queries on Google, and try to create a website that will show up on the top of the results for these queries. Essentially it is a mix of technical skills with the ability to understand how pages are ranked and how to analyze the "competition" for these keywords.&lt;br /&gt;&lt;br /&gt;So, a student of mine (John Cintolo), created a website about "&lt;a href="https://files.nyu.edu/jpc406/public/Documents/Hit%20Club%20Music%20Summer%202010.html"&gt;Hit Club Music Summer 2010&lt;/a&gt;", with &lt;b&gt;&lt;i&gt;links&lt;/i&gt;&lt;/b&gt; to YouTube videos. No copyright infringment or anything illegal.&lt;br /&gt;&lt;br /&gt;And one day later, he gets a "cease and desist" letter from HotNewClubSongs. It has so many gems that I will list it here, for your viewing pleasure. &lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;To whom it may concern&lt;br /&gt;&lt;br /&gt;It has come to my attention that your website "Hit Club Music Summer 2010" on this  &lt;a href="https://files.nyu.edu/jpc406/public/Documents/Hit%20Club%20Music%20Summer%202010.html"&gt;URL&lt;/a&gt; has &lt;b&gt;potential to threaten my Alexa page ranking&lt;/b&gt;. As a consequence, this may cause our website to&lt;b&gt; lose vital income which is generated from ad-space&lt;/b&gt; and it will not be tolerated. Due to the nature of your actions I am requesting a formal take-down of your website due to copyright infringement as the music posted on your "http://www.youtube.com" links is not endorsed by the rightful authors, as counseled by my attorney. Considering that you are also going through the New York University server, your actions may cost you and your educational institution unless you cease the aforementioned copyright infringement. If you continue hosting your service &lt;b&gt;I will be forced to file a civil suit in which you will be charged for any lost advertisement revenue, averaging $0.52 per day&lt;/b&gt;.&lt;br /&gt;&lt;br /&gt;In addition, your html markup shows your ineptitude in online web design, making your website an inefficient option for visitors who truly care about the Club Songs Industry. The listing of the dates on your monthly playlists go in ascending order rather than descending. This is just one of the many flaws of your clearly haphazardly designed website. However, I will give you neither my website URL nor my constructive criticism, for you are clearly trying to make money in an industry which doesn�t have room for your lack of music and website design knowledge. My page viewers have complimented me numerous times on the layout and content of my page.&lt;br /&gt;&lt;br /&gt;You may contact me at this e-mail for any further concerns, although it is clear there is not much more to say. Your carelessness, inefficiency, and utter incompetence have gotten you into this hole, and unless you find a way out by October 31st, when my ad-space revenue comes in, further action will be taken. Also, for legal purposes, when and where was this website created? In the chance that it was created before September 30th, 2010, &lt;b&gt;a law suit will be filed for the obvious decrease in revenue from my ads last month, totaling $7.34.&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Thank you for your time,&lt;br /&gt;&lt;br /&gt;HotNewClubSongs- A Forerunner in the Club Music Industry&lt;/blockquote&gt;&lt;br /&gt;Needless to say, I congratulated the student for achieving the goals of the assignment, and offered to cover the damages :-)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-781070935308399476?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/781070935308399476'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/781070935308399476'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/10/cease-and-desist.html' title='Cease and desist...'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-2915054893574343537</id><published>2010-10-26T15:57:00.000-07:00</published><updated>2011-04-17T19:00:19.243-07:00</updated><title type='text'>Student websites</title><content type='html'>I am just posting this to provide links to the pages of my students, so that Google indexes their websites quickly. Feel free to browse, of course...&lt;br /&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/aan261/public/"&gt;https://files.nyu.edu/aan261/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/abs452/public/"&gt;https://files.nyu.edu/abs452/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/aco241/public/"&gt;https://files.nyu.edu/aco241/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/ag2846/public/"&gt;https://files.nyu.edu/ag2846/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/ahr258/public/"&gt;https://files.nyu.edu/ahr258/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/am3036/public/"&gt;https://files.nyu.edu/am3036/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/amb748/public/"&gt;https://files.nyu.edu/amb748/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/amh513/public/"&gt;https://files.nyu.edu/amh513/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/aml552/public/"&gt;https://files.nyu.edu/aml552/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/amo328/public/"&gt;https://files.nyu.edu/amo328/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/ap1730/public/"&gt;https://files.nyu.edu/ap1730/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/ap2427/public/"&gt;https://files.nyu.edu/ap2427/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/arr284/public/"&gt;https://files.nyu.edu/arr284/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/asn255/public/"&gt;https://files.nyu.edu/asn255/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/aww243/public/"&gt;https://files.nyu.edu/aww243/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/bjh292/public/"&gt;https://files.nyu.edu/bjh292/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/bk940/public/"&gt;https://files.nyu.edu/bk940/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/bm1032/public/"&gt;https://files.nyu.edu/bm1032/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/bmw308/public/"&gt;https://files.nyu.edu/bmw308/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/cc2739/public/"&gt;https://files.nyu.edu/cc2739/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/chm270/public/"&gt;https://files.nyu.edu/chm270/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/cl1296/public/"&gt;https://files.nyu.edu/cl1296/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/dr1241/public/"&gt;https://files.nyu.edu/dr1241/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/dzw201/public/99-cent-pizza-places-in-nyc.html"&gt;https://files.nyu.edu/dzw201/public/99-cent-pizza-places-in-nyc.html&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/esj227/public/"&gt;https://files.nyu.edu/esj227/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/eze200/public/"&gt;https://files.nyu.edu/eze200/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/fh443/public/"&gt;https://files.nyu.edu/fh443/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/fm812/public/"&gt;https://files.nyu.edu/fm812/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/glh237/public/"&gt;https://files.nyu.edu/glh237/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/hdw217/public/"&gt;https://files.nyu.edu/hdw217/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/hrs260/public/"&gt;https://files.nyu.edu/hrs260/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/hws221/public/"&gt;https://files.nyu.edu/hws221/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/hxl203/public/"&gt;https://files.nyu.edu/hxl203/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/id398/public/"&gt;https://files.nyu.edu/id398/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/igm215/public/"&gt;https://files.nyu.edu/igm215/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/jdp343/public/"&gt;https://files.nyu.edu/jdp343/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/jil245/public/"&gt;https://files.nyu.edu/jil245/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/jjl442/public/"&gt;https://files.nyu.edu/jjl442/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/jkl324/public/"&gt;https://files.nyu.edu/jkl324/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/jl3093/public/"&gt;https://files.nyu.edu/jl3093/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/jm3894/public/"&gt;https://files.nyu.edu/jm3894/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/jnz213/public/"&gt;https://files.nyu.edu/jnz213/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/jp1961/public/"&gt;https://files.nyu.edu/jp1961/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/jpc406/public/"&gt;https://files.nyu.edu/jpc406/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/jsa314/public/"&gt;https://files.nyu.edu/jsa314/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/jwi208/public/"&gt;https://files.nyu.edu/jwi208/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/jws377/public/"&gt;https://files.nyu.edu/jws377/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/jz692/public/"&gt;https://files.nyu.edu/jz692/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/kac471/public/"&gt;https://files.nyu.edu/kac471/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/kc1294/public/"&gt;https://files.nyu.edu/kc1294/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/kcc282/public/"&gt;https://files.nyu.edu/kcc282/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/kl991/public/"&gt;https://files.nyu.edu/kl991/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/km1602/public/"&gt;https://files.nyu.edu/km1602/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/kpk256/public/"&gt;https://files.nyu.edu/kpk256/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/kr881/public/"&gt;https://files.nyu.edu/kr881/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/krg267/public/"&gt;https://files.nyu.edu/krg267/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/lla236/public/"&gt;https://files.nyu.edu/lla236/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/lrg275/public/"&gt;https://files.nyu.edu/lrg275/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/lsc291/public/"&gt;https://files.nyu.edu/lsc291/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/mam931/public/"&gt;https://files.nyu.edu/mam931/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/mc3077/public/"&gt;https://files.nyu.edu/mc3077/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/mjj282/public/"&gt;https://files.nyu.edu/mjj282/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/mkj233/public/"&gt;https://files.nyu.edu/mkj233/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/ml2550/public/"&gt;https://files.nyu.edu/ml2550/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/ms4761/public/"&gt;https://files.nyu.edu/ms4761/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/ms5579/public/"&gt;https://files.nyu.edu/ms5579/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/msk378/public/"&gt;https://files.nyu.edu/msk378/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/mss479/public/"&gt;https://files.nyu.edu/mss479/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/nel233/public/"&gt;https://files.nyu.edu/nel233/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/nez204/public/"&gt;https://files.nyu.edu/nez204/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/nrt222/public/"&gt;https://files.nyu.edu/nrt222/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/nsb268/public/"&gt;https://files.nyu.edu/nsb268/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/prp247/public/"&gt;https://files.nyu.edu/prp247/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/ps1486/public/"&gt;https://files.nyu.edu/ps1486/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/psr244/public/"&gt;https://files.nyu.edu/psr244/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/qhg200/public/"&gt;https://files.nyu.edu/qhg200/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/qy220/public/"&gt;https://files.nyu.edu/qy220/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/rc1600/public/"&gt;https://files.nyu.edu/rc1600/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/rf1048/public/"&gt;https://files.nyu.edu/rf1048/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/rp1244/public/"&gt;https://files.nyu.edu/rp1244/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/rrt221/public/"&gt;https://files.nyu.edu/rrt221/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/rs2898/public/"&gt;https://files.nyu.edu/rs2898/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/sa1386/public/"&gt;https://files.nyu.edu/sa1386/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/sc2532/public/"&gt;https://files.nyu.edu/sc2532/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/scs384/public/"&gt;https://files.nyu.edu/scs384/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/sek351/public/"&gt;https://files.nyu.edu/sek351/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/shk350/public/"&gt;https://files.nyu.edu/shk350/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/sk2742/public/"&gt;https://files.nyu.edu/sk2742/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/sl2663/public/"&gt;https://files.nyu.edu/sl2663/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/slc439/public/"&gt;https://files.nyu.edu/slc439/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/sly232/public/"&gt;https://files.nyu.edu/sly232/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/smk483/public/"&gt;https://files.nyu.edu/smk483/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/sr1860/public/"&gt;https://files.nyu.edu/sr1860/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/sw1262/public/"&gt;https://files.nyu.edu/sw1262/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/tpj214/public/"&gt;https://files.nyu.edu/tpj214/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/us266/public/"&gt;https://files.nyu.edu/us266/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/vl515/public/"&gt;https://files.nyu.edu/vl515/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/wfk212/public/"&gt;https://files.nyu.edu/wfk212/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/wo253/public/"&gt;https://files.nyu.edu/wo253/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/xl345/public/"&gt;https://files.nyu.edu/xl345/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/xl396/public/"&gt;https://files.nyu.edu/xl396/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/xy267/public/"&gt;https://files.nyu.edu/xy267/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/yl809/public/"&gt;https://files.nyu.edu/yl809/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/yp429/public/"&gt;https://files.nyu.edu/yp429/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/yz511/public/"&gt;https://files.nyu.edu/yz511/public/&lt;/a&gt;&lt;br /&gt;&lt;a href="https://files.nyu.edu/zsn202/public/"&gt;https://files.nyu.edu/zsn202/public/&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-2915054893574343537?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/2915054893574343537'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/2915054893574343537'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/10/student-websites.html' title='Student websites'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-7371043607069147140</id><published>2010-10-25T21:55:00.000-07:00</published><updated>2011-04-17T19:00:19.248-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='crowdsourcing'/><category scheme='http://www.blogger.com/atom/ns#' term='wisdom of the crowds'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><title type='text'>Can Crowdsourcing Scale? The Role of Active Learning</title><content type='html'>Nobody is denying the fact that crowdsourcing becoming mainstream. People use Mechanical Turk for all sorts of applications. And many startups create business plans assuming that crowdsourcing markets will be able to provide enough labor to complete the tasks that will be posted in the market.&lt;br /&gt;&lt;br /&gt;And at this point, things become a little tricky.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Can crowdsourcing markets scale?&lt;/b&gt;&amp;nbsp;MTurk can tag a thousand images within a few hours. But what will happen if we place one million images in the market?&lt;b&gt; Will there be enough labor to handle all of the posted tasks?&lt;/b&gt; &lt;b&gt;How long will the task take? And what will be the cost?&lt;/b&gt;&lt;br /&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;&lt;b&gt;Scaling by combining machine learning with crowdsourcing&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Unless you can come up with &lt;a href="http://www.gwap.com/gwap/gamesPreview/espgame/"&gt;ingenious&lt;/a&gt; &lt;a href="http://www.gwap.com/gwap/"&gt;ideas&lt;/a&gt;, the acquisition of data comes at a cost.&amp;nbsp;To reduce cost, we need to reduce the need for humans to label data. To reduce the need for humans, we need to automate the process. To automate the process, we need to build machine learning models. To build machine learning models, we need humans to label data.... Infinite loop? Yes and no.&lt;br /&gt;&lt;br /&gt;The basic idea is to use crowdsourcing in conjunction with machine learning. In particular, we leverage ideas from &lt;a href="http://en.wikipedia.org/wiki/Active_learning_(machine_learning)"&gt;active learning&lt;/a&gt;: The idea behind active learning is to use humans only for the uncertain cases, and not for everything. Machine learning can take care of the simple cases, and ask humans to help for the most important and ambiguous cases.&lt;br /&gt;&lt;br /&gt;We also need to have one extra thing in mind:&amp;nbsp;Crowdsourcing generates &lt;b&gt;&lt;i&gt;noisy &lt;/i&gt;&lt;/b&gt;training data, as opposed to the perfect data that most active learning algorithms expect from humans. So, we need to perform active learning not only towards identifying the cases that are ambiguous for the model, but also figure out which human labels are more likely to be noisy, and fix them. And we also need to be proactive in identifying the quality of the workers.&lt;br /&gt;&lt;br /&gt;In any case, after addressing the quality complications, and once we have enough data, we can use the acquired data to build basic machine learning models. The basic machine learning models can then take care of the simple cases, and free humans to handle the more ambiguous and difficult cases. Then, once we collect enough training data for the more difficult cases, we can then build an &lt;i&gt;even better&lt;/i&gt; machine learning model. The new model will then automate an even bigger fraction of the process, leaving humans to deal with only the harder cases. And we repeat the process.&lt;br /&gt;&lt;br /&gt;This idea was at the core of our &lt;a href="http://pages.stern.nyu.edu/~panos/publications/kdd2008.pdf"&gt;KDD 2008 paper&lt;/a&gt;, and since then we have significantly expanded these techniques to work with a wider variety of cases (see our current working paper: &lt;a href="http://archive.nyu.edu/handle/2451/29799"&gt;Repeated Labeling using Multiple Noisy Labelers&lt;/a&gt;.)&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Example: AdSafe Media.&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Here is an example application, deployed in practice through &lt;a href="http://www.adsafemedia.com/"&gt;AdSafe Media&lt;/a&gt;: Say that we want to build a classifier that recognizes porn pages. Here is an overview of the process, which follows the process of our KDD paper:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;We get a few web pages labeled as porn or not.&amp;nbsp;&lt;/li&gt;&lt;li&gt;We get &lt;a href="http://archive.nyu.edu/handle/2451/29799"&gt;multiple workers to label each page&lt;/a&gt;, to ensure quality.&lt;/li&gt;&lt;li&gt;&lt;a href="http://pages.stern.nyu.edu/~panos/publications/hcomp2010.pdf"&gt;We compute the quality of each labeler, fix biases&lt;/a&gt;, and get better labels for the pages.&lt;/li&gt;&lt;li&gt;We train a classifier that classifies pages as porn or not.&lt;/li&gt;&lt;li&gt;For incoming pages, we classify them using the automatic classifier.&lt;/li&gt;&lt;/ol&gt;&lt;ul&gt;&lt;ul&gt;&lt;li&gt;If the classifier is confident, we use the outcome of the classifier&lt;/li&gt;&lt;li&gt;If the classifier is not confident, the page is directed to humans for labeling (the more ambiguous the page, the more humans we need)&lt;/li&gt;&lt;/ul&gt;&lt;/ul&gt;&lt;ol start="6"&gt;&lt;li&gt;Once we get enough new training data, we move to Step 4 again.&lt;/li&gt;&lt;/ol&gt;&lt;i&gt;Benefits&lt;/i&gt;: Once the classifier is robust enough, there is &lt;b&gt;no need to use humans to handle basic tasks&lt;/b&gt;. The classifier takes care of the majority of tasks, ensuring that the speed of classification is high, and that the cost is low. (Even at a 0.1 cents per page, humans are simply too expensive when we deal with billions of pages.)&amp;nbsp;&lt;b&gt;Humans are reserved to handle pages that are difficult to classify&lt;/b&gt;. This ensures that for the difficult cases there is always someone to provide feedback, and this crowdsourced feedback ensures that the classifier improves over time.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Other example&lt;/b&gt;: &lt;b&gt;SpeakerText.&amp;nbsp;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;/b&gt;According &lt;a href="http://techcrunch.com/2010/09/07/speakertext-crowdsources-micro-tasks-to-automate-video-transcripts-100-beta-invites/"&gt;to the press&lt;/a&gt;, &lt;a href="http://www.speakertext.com/"&gt;SpeakerText&lt;/a&gt;&amp;nbsp;&lt;a href="http://www.metamorphblog.com/2010/06/the-symbiosis-between-machine-learning-crowdsourcing.html"&gt;is using (?) this idea&lt;/a&gt;: they use an automatic transcription package to generate a first rough transcript, and then use humans to improve the transcription. The high quality transcriptions can be then used to train a better model for automatic speech recognition. And the cycle continues.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Another example&lt;/b&gt;: &lt;b&gt;Google Books.&amp;nbsp;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;/b&gt;The&amp;nbsp;&lt;a href="http://www.google.com/recaptcha"&gt;ReCAPTCHA&lt;/a&gt; technique is used as as the crowdsourcing component for digitizing books for the Google Books project. As you may have imagined, Google is actively using optical character recognition (OCR) to digitize the scanned books and make them searchable. However, even the best OCR software &lt;a href="http://www.google.com/recaptcha/learnmore"&gt;will not be able to recognize some words&lt;/a&gt; from the scanned books.&lt;br /&gt;&lt;br /&gt;ReCAPTCHA uses the millions of users on the Internet (most notably, the 500 million Facebook users) as transcribers that fix whatever OCR cannot capture. I guess that Google reuses the fixed words in order to improve their internal OCR system, so that they can reach their goal of digitizing &lt;a href="http://booksearch.blogspot.com/2010/08/books-of-world-stand-up-and-be-counted.html"&gt;129,864,880 books&lt;/a&gt; a little bit faster.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;The limit?&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;/b&gt;I guess the Google Books and ReCAPTCHA project are really testing the scalability limits of this approach. The improvements in the accuracy of machine learning systems &lt;a href="http://www.aclweb.org/anthology-new/P/P01/P01-1005.pdf"&gt;start being marginal&lt;/a&gt; once we have enough training data, and we need orders of magnitude more training data to see &lt;i&gt;noticeable &lt;/i&gt;improvements.&lt;br /&gt;&lt;br /&gt;Of course, with 100 million books to digitize, even an "unnoticeable" improvement of 0.01% in accuracy corresponds to &lt;b&gt;1 billion more words being classified correctly&lt;/b&gt; (&lt;a href="http://answers.yahoo.com/question/index?qid=20080528121432AABk1LD"&gt;assuming 100K words per book&lt;/a&gt;), and results in&amp;nbsp;&lt;b&gt;1 billion less ReCAPTCHA's needed&lt;/b&gt;. But I am not sure how many&amp;nbsp;ReCAPTCHA's are needed in order to achieve this hypothetical 0.01% improvement. Luis, if you are reading, give us the numbers :-)&lt;br /&gt;&lt;br /&gt;But in any case, I think that 99.99% of the readers of this blog would be happy to hit this limit.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-7371043607069147140?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7371043607069147140'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7371043607069147140'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/10/can-crowdsourcing-scale-role-of-active.html' title='Can Crowdsourcing Scale? The Role of Active Learning'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-5679422892542446339</id><published>2010-10-20T22:56:00.000-07:00</published><updated>2011-04-17T19:00:19.253-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='crowdsourcing'/><category scheme='http://www.blogger.com/atom/ns#' term='wisdom of the crowds'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='advice'/><title type='text'>A Plea to Amazon: Fix Mechanical Turk!</title><content type='html'>It is now almost four years since &lt;a href="http://behind-the-enemy-lines.blogspot.com/2007/08/experiences-using-amazon-mechanical.html"&gt;I started experimenting with Mechanical Turk&lt;/a&gt;. Over these years I have been a great evangelist of the idea.&lt;br /&gt;&lt;br /&gt;But as Mechanical Turk becomes mainstream, it is now time for the service to get the basic stuff right. The last few weeks I found myself repeating the same things again and again, so I realized that it is now time to write these things down...&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;u&gt;&lt;span class="Apple-style-span" style="font-size: large;"&gt;Mechanical Turk, It is Time to Grow Up&lt;/span&gt;&lt;/u&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;The beta testing is over. If the platform wants to succeed, it needs to evolve. Many people want to build on top of MTurk, and the foundations are lacking important structural elements.&lt;br /&gt;&lt;br /&gt;Since the beginning of September, I have met with at least 15 different startups describing their ideas and their problems in using and leveraging Mechanical Turk. And hearing their stories, one after the other, I realized:&lt;b&gt; Every single requester has the same problems:&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Scaling up&lt;/b&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Managing the complex API&lt;/b&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Managing execution time&lt;/b&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Ensuring quality&lt;/b&gt;&lt;/li&gt;&lt;/ul&gt;&lt;b&gt;These problems were identified years ago. And the problems were never addressed.&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;The current status quo simply cannot continue. It is not good for the requesters, it is not good for the workers, it is not good for even completing the tasks. Amazon, pay attention. These are not just feature requests. These are fundamental requirements for any marketplace to function.&lt;br /&gt;&lt;br /&gt;Amazon likes to present the hands-off approach to Mechanical Turk as a strategic choice: In the same way that EC2, S3, and many other web services are targeted to developers, in the same way Mechanical Turk is a neutral clearinghouse of labor. It provides just the ability to match requesters and workers. Everything else is the responsibility of the two consenting parties.&lt;br /&gt;&lt;br /&gt;Too bad that this hands-off approach cannot work for a marketplace. The badly needed aspects can be easily summarized in four bullet points:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://www.blogger.com/post-edit.g?blogID=7118563403027467631&amp;amp;postID=7096192957312731847#req_interface"&gt;A Better Interface To Post Tasks (Useful for Requesters)&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.blogger.com/post-edit.g?blogID=7118563403027467631&amp;amp;postID=7096192957312731847#worker_reputation"&gt;A Worker Reputation System (Useful for for Requesters)&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.blogger.com/post-edit.g?blogID=7118563403027467631&amp;amp;postID=7096192957312731847#req_reputation"&gt;A Requester Trustworthiness Guarantee (Useful for Workers)&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.blogger.com/post-edit.g?blogID=7118563403027467631&amp;amp;postID=7096192957312731847#worker_interface"&gt;A Better Task Search Interface (Useful for Workers)&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;Below, I discuss these topics in more detail.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;b&gt;&lt;u&gt;&lt;a href="http://www.blogger.com/post-edit.g?blogID=7118563403027467631&amp;amp;postID=7096192957312731847" name="req_interface"&gt;&lt;/a&gt;&lt;span class="Apple-style-span" style="font-size: large;"&gt;Requesters Need: A Better Interface To Post Tasks&lt;/span&gt;&lt;/u&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;A major task of a marketplace is to reduce overhead, friction, transaction costs, and search costs. The faster and easier it is to transact, the better the market. And MTurk fails miserably on that aspect.&lt;br /&gt;&lt;br /&gt;I find it amazing that the last &lt;b&gt;&lt;i&gt;major &lt;/i&gt;&lt;/b&gt;change on Mechanical Turk for the requesters was the&lt;a href="http://behind-the-enemy-lines.blogspot.com/2008/07/mechanical-turk-allows-bulk-submissions.html"&gt; introduction of a UI to submit batch tasks&lt;/a&gt;. &lt;i&gt;This was back in the summer of 2008&lt;/i&gt;. George Bush was the president, Lehman Brothers was an investment bank, Greece had one of the highest growing GDP's in Europe, Facebook had less than 100 million users, and Twitter was still a novelty. It would take 8 more months for FourSquare to launch.&lt;br /&gt;&lt;br /&gt;It is high time to &lt;b&gt;make it easier to requesters to post tasks. It is ridiculous to call the command-line tools user-friendly!&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;What is the benefit of having access to a workforce for &lt;i&gt;&lt;b&gt;micro&lt;/b&gt;&lt;/i&gt;tasks, if a requester needs to &lt;b&gt;hire a full time developer (costing at least $60K) &lt;/b&gt;just to deal with all the complexities? How many microtasks someone should execute to recoup the cost of development? &lt;br /&gt;&lt;br /&gt;If every requester, in order to get good results, needs to: (a) build a quality assurance system from scratch, (b) ensure proper allocation of qualifications, (c) learn to break tasks properly into a workflow, (d) stratify workers according to quality, (e) [whatever else...], then the barrier is just too high. Only very serious requesters will devote the necessary time and effort. &lt;br /&gt;&lt;br /&gt;What is the expected outcome of this barrier? We expect to see a few big requesters and a long tail of small requesters that are posting tiny tasks. (&lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/10/mechanical-turk-requester-activity.html"&gt;Oh wait, this is the case already.&lt;/a&gt;) In other words: It is very difficult for small guys to grow. &lt;br /&gt;&lt;br /&gt;Since we are talking about allowing easy posting of tasks: &lt;b&gt;Amazon, please take a look at &lt;/b&gt;&lt;a href="http://groups.csail.mit.edu/uid/turkit/"&gt;&lt;b&gt;TurkIt&lt;/b&gt;&lt;/a&gt;&lt;b&gt;.&lt;/b&gt; Buy it, copy it, do whatever, but please allow easy implementation of such workflows in the market. Very few requesters have simple, one-pass tasks. Most requesters want to have crowdsourced &lt;b&gt;workflows&lt;/b&gt;. Give them the tools to do so easily. &lt;br /&gt;&lt;br /&gt;MTurk is shooting themselves in the foot by&lt;b&gt; encouraging requesters to build their own interfaces and own workflow systems from scratch&lt;/b&gt;! For many many HITs, the only way to have a decent interface is to build it yourself in an iframe. What is the problem with iframe? Doing that, MTurk makes it extremely easy for the requester to switch labor channels. The requester who has build an iframe-powered HIT can easily get non-Turk workers to work on these HITs. (Hint: just use different workerid's for other labor channels and get the other workers to visit directly  the iframe html page to complete the task.) Yes, it is good for the requester in the long term not to be locked in, but I guess all requesters would be happier if they did not have to build the app from scratch. &lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.blogger.com/post-edit.g?blogID=7118563403027467631&amp;amp;postID=7096192957312731847" name="worker_reputation"&gt;&lt;/a&gt;&lt;b&gt;&lt;u&gt;&lt;span class="Apple-style-span" style="font-size: large;"&gt;Requesters Need: A True Reputation System for Workers&lt;/span&gt;&lt;/u&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;My other big complaint. The current reputation system on Mechanical Turk is simply bad. "Number of completed HITs" and "approval rate" are &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/10/be-top-mechanical-turk-worker-you-need.html"&gt;easy to game&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Requesters need a better reputation profile for workers. Why? &lt;b&gt;A market without a reputation mechanism turns quickly into &lt;/b&gt;&lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/07/mechanical-turk-low-wages-and-market.html"&gt;&lt;b&gt;a market for lemons&lt;/b&gt;&lt;/a&gt;: When requesters cannot differentiate easily good from bad workers, they tend to assume that every worker is bad. This results in good workers getting paid the same amount as the bad ones. With so low wages, good workers leave the market. At the end, the only Turkers that remain in the market are the bad ones (or the crazy good ones willing to work for the same payment as the bad workers.)&lt;br /&gt;&lt;br /&gt;This in turn requires the same task to be completed from many workers,&lt;b&gt; way too many times to ensure quality&lt;/b&gt;. I am not against redundancy! (&lt;a href="http://archive.nyu.edu/handle/2451/29799"&gt;Quite the opposite!&lt;/a&gt;) But it should be a technique for taking moderate quality input to generate high quality output. A technique for capturing diverse points of view for the same HIT. Repeated labeling should NOT be the primary weapon against spam.&lt;br /&gt;&lt;br /&gt;The lack of a strong reputation system hurts everyone, and hurts the marketplace! Does Amazon want to run a market for lemons? I am sure that the margins will not be high.&lt;br /&gt;&lt;br /&gt;Here are a few suggestions on what a worker reputation mechanism should include.&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Have more public qualification tests:&lt;/b&gt;&lt;i&gt;Does the worker have the proper English writing skills? Can the worker proofread?&lt;/i&gt;Most marketplaces (eLance, oDesk, vWorker, Guru), allow participants to pass certification tests to signal their quality and knowledge in different areas. Same should happen on Turk. If Amazon does not want to build such tests, let requesters make their own qualification tests available to other requesters for a fee? Myself, I would pay to use the qualifications assigned by CastingWords and CrowdFlower. These requesters would serve as the certification authorities for MTurk, in the same way that universities certify abilities for the labor markets.&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-style: normal;"&gt;Keep track of working history: &lt;/span&gt;&lt;/b&gt;For which requester did the worker work in the past? How many HITs, for what payment? For how long?&lt;/i&gt; Long history of work with reputable requesters is a good sign. In the real world, working history matters. People list their work histories in their resumes. Why not on MTurk?&lt;/li&gt;&lt;li&gt;&lt;b&gt;Allow rating of workers:&lt;/b&gt; &lt;i&gt;What is the rating that the worker received for the submitted work?&lt;/i&gt; Please allow requesters to rate workers. We have it everywhere else. We rate films, books, electronics, we rate pretty much everything. &lt;/li&gt;&lt;li&gt;&lt;b&gt;Disconnect payment from rating:&lt;/b&gt; Tying reputation to acceptance rate is simply wrong. Currently, we can either accept the work and pay, or reject the work and refuse to pay. This is just wrong. We do not rate restaurants based on how often the customers refused to pay for the food! I should not have to reject and not pay for the work, if the only thing that I want to say is that the quality was not perfect. &lt;i&gt;Rejecting work should be an option reserved for spammers. &lt;/i&gt;It should never be used against honest workers that do not meet the expectations of the requester.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Separate HITs and ratings by type:&lt;/b&gt; &lt;i&gt;What was the type of the submitted work? Transcription? Image tagging? Classification? Content generation? Twitter spam?&lt;/i&gt; Workers are not uniformly good in all types of tasks. Writing an article requires a very different set of skills from those required for transcription, which in turn are different than the skills for image tagging. Allow requesters to see the rating across these different categories. Almost as good as the public qualification tests.&lt;/li&gt;&lt;li&gt;&lt;b&gt;And make all the above accessible from an API, for automatic hiring decisions.&lt;/b&gt;&lt;/li&gt;&lt;/ul&gt;It cannot be that hard to do the above! Amazon.com runs a huge marketplace with thousands of merchants, for years. The guys as Amazon know how to design, maintain, and protect a reputation system for a much bigger marketplace. How hard can it be to port it to Mechanical Turk?&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;(Amazon's response about the reputation system...&lt;/i&gt;&lt;/b&gt;&lt;i&gt; )&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;In a recent meeting, I asked this same question: Why not having a real reputation system?&lt;br /&gt;&lt;br /&gt;The MTurk representative defended the current setup, with the following argument:&lt;br /&gt;&lt;blockquote&gt;On the Amazon.com marketplace, the (large number of) buyers can rate the (small number of) merchants, but not vice versa. So, the same thing happens on MTurk. The (large number of) workers can rate the (small number of) requesters using &lt;a href="http://www.turkernation.com/"&gt;TurkerNation &lt;/a&gt;and &lt;a href="http://turkopticon.differenceengines.com/"&gt;TurkOpticon&lt;/a&gt;. So the opposite should not happen: requesters should &lt;i&gt;&lt;b&gt;not &lt;/b&gt;&lt;/i&gt;rate workers.&lt;/blockquote&gt;I felt that the answer made sense: two-sided reputation systems indeed have deficiencies. They often lead to &lt;a href="http://en.wiktionary.org/wiki/mutual_admiration_society"&gt;mutual-admiration schemes&lt;/a&gt;, so such systems end up being easy to hack (&lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/10/be-top-mechanical-turk-worker-you-need.html"&gt;not that the current system is too hard to beat&lt;/a&gt;.) So, I was satisfied with the given answer... For approximately 10 minutes! Then I realized: Humbug!&lt;br /&gt;&lt;br /&gt;There is &lt;b&gt;no need for a reputation system for product buyers on Amazon.com's marketplace&lt;/b&gt;! It is not like eBay, where a buyer can win the auction and never pay! The reputation of the buyer on Amazon.com is irrelevant. On Amazon, when a buyer buys a product, &lt;b&gt;as long as the credit card payment clears, the reputation of the buyer simply does not matter&lt;/b&gt;. There is no uncertainty, and no need to know anything about the buyer.&lt;br /&gt;&lt;br /&gt;Now let's compare the Amazon.com product marketplace with MTurk: The &lt;b&gt;uncertainty on MTurk is about the workers&lt;/b&gt; (who are the ones selling services of uncertain quality). The&lt;b&gt; requester is the buyer &lt;/b&gt;in the MTurk market. So, indeed, there should not be a need for a reputation system for requesters, but the workers should be rated.&lt;br /&gt;&lt;br /&gt;And at that point, people will protest: Why do we have the Hall of Fame/Shame on Turker Nation, why do we have TurkOpticon? Does Panos consider these efforts irrelevant and pointless?&lt;br /&gt;&lt;br /&gt;And here is my reply: The very fact that we have such systems means that there is something very wrong with the Mturk marketplace. I expand below.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.blogger.com/post-edit.g?blogID=7118563403027467631&amp;amp;postID=7096192957312731847" name="req_reputation"&gt;&lt;/a&gt;&lt;b&gt;&lt;u&gt;&lt;span class="Apple-style-span" style="font-size: large;"&gt;Workers Need: A Trustworthiness Guarantee for Requesters&lt;/span&gt;&lt;/u&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Amazon should really learn from its own marketplace on Amazon.com. Indeed, on Amazon.com, it is not possible to rate buyers. Amazon simply ensures that when a buyer buys a product online, the buyer pays the merchant. So, Amazon, as the marketplace owner, ensures the trustworthiness of at least one side of the market.&lt;br /&gt;&lt;br /&gt;Unfortunately, MTurk does not really guarantee the trustworthiness of the requesters. Requesters are free to reject good work and not pay for work they get to keep. Requesters do not have to pay on time. In a sense, the requesters are serving as the slave masters. The only difference is that on MTurk the slaves can choose their master.&lt;br /&gt;&lt;br /&gt;And so, Turker Nation and TurkOpticon were born for exactly this reason: To allow workers to learn more about their masters. To learn which requesters behave properly, which requesters abuse their power. &lt;br /&gt;&lt;br /&gt;However, this generates a wrong dynamic in the market. Why? Let's see how things operate.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;The Requester Initiation Process&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;When new requesters come to the market, they are treated with caution by the experienced, good workers. Legitimate workers will simply not complete many HITs of a new requester, until the workers know that the requester is legitimate, pays promptly, and does not reject work unfairly. Most of the good workers will complete just a few HITs of the newcomer, and then wait and observe how the requester behaves.&lt;br /&gt;&lt;br /&gt;Now, try to be on the requester's side.&lt;br /&gt;&lt;br /&gt;If the requester posts small batches, things may work well. A few good workers do a little bit of good work, and the results come back like magic. The requester is happy, pays, everyone is happy. The small requester will come back after a while, post another small batch, and so on. &lt;b&gt;This process generates a large number of happy small requesters.&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;However, what happens when the newcomers post big batches of HITs? Legitimate workers will do a little bit of work and then wait and see. Nobody wants to risk a mass rejection, which can be lethal for the reputation of the worker. Given the above, who are the workers who will be willing to work on HITs of the new, unproven requester? &lt;i&gt;You guessed right&lt;/i&gt;: Spammers and inexperienced workers. Result? The requester gets low quality results, gets disappointed and wonders what went wrong.&lt;br /&gt;&lt;br /&gt;In the best case, the new requesters will seek expert help, (if they can afford it). In the worst case, the new requesters leave the market and use more conventional solutions.&lt;br /&gt;&lt;br /&gt;At this point, it should be clear that &lt;b&gt;just having a subjective reputation system for requesters is simply not enough&lt;/b&gt;. We&lt;b&gt; need a trustworthiness guarantee &lt;/b&gt;for the requesters. Workers should not be afraid of working for a particular requester.&lt;br /&gt;&lt;br /&gt;Online merchant in the Amazon marketplace do not need to check the reputation of the people the sell to. Amazon ensures that the byers are legitimate and not fraudsters. Can you imagine if every seller on Amazon had to check the credit score and the trustworthiness of every buyer they sell to? What did you say? It would be a disaster? That people would only sell to a few selected buyers? Well, witness the equivalent disaster on Mechanical Turk.&lt;br /&gt;&lt;br /&gt;So, what is needed for the requesters? Since the requester is essentially the "buyer", there is &lt;b&gt;no need to have &lt;i&gt;subjective &lt;/i&gt;ratings&lt;/b&gt;. The worker should see a set of &lt;b&gt;objective &lt;/b&gt;characteristics of the requester, and decide whether to pick a specific posted HIT or not. Here are a few things that are &lt;b&gt;objective&lt;/b&gt;:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Show speed of payment: &lt;/b&gt;The requester payment already goes into an Amazon-controlled "escrow" account. The worker should know how fast the requester typically releases payment.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Show the rejection rate for the requester: &lt;/b&gt;Is a particular requesters litigious and reports frequently the work of the workers as spam?&lt;/li&gt;&lt;li&gt;&lt;b&gt;Show the appeal rate for the requester&lt;/b&gt;: A particular requester may have high rejection rate just due to an attack from spammers. However, if the rejected workers appeal and win frequently, then there is something wrong with the requester.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Disallow the ability to reject work that is not spam:&lt;/b&gt;&amp;nbsp;The requester should not be able to reject submitted work without paying. Rejection should be a last-resort mechanism, reserved only for obviously bad work. The worker should have the right to appeal (and potentially have the submitted work automatically reviewed by peers). This should take out a significant uncertainty in the market, allowing workers to be more confident to work with a new requester.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Show total volume of posted work: &lt;/b&gt;Workers want to know if the requester is going to come back to the market. Volume of posted work and the lifetime of the worker in the market are important characteristics: workers can use this information to decide whether it makes sense to invest the time to learn the tasks of the requester.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Make all the above accessible from an API:&lt;/b&gt; Let other people build worker-facing applications on top of MTurk.&lt;/li&gt;&lt;/ul&gt;So, a major role of a marketplace is to instill a sense of trust. Requesters should trust the workers to complete the work, and workers should not have to worry about unreasonable behavior of the workers. This minimizes the search costs associated with finding a trustworthy partner in the market.&lt;br /&gt;&lt;br /&gt;Let's see the final part that is missing.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.blogger.com/post-edit.g?blogID=7118563403027467631&amp;amp;postID=7096192957312731847" name="worker_interface"&gt;&lt;/a&gt;&lt;b&gt;&lt;u&gt;&lt;span class="Apple-style-span" style="font-size: large;"&gt;Workers Need: A Better User Interface&lt;/span&gt;&lt;/u&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;As mentioned earlier, beyond trust, the other important role of the market is to minimize as much as possible transaction overhead and search costs. The transacting parties should find each other as fast as possible, fulfill their goals, and move on. The marketplace should almost be invisible. In this market, where requesters post tasks and the tasks wait for the workers, it is important to make it as easy as possible for workers to find tasks the workers want to work on. &lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Current Problem: Unpredictable Completion Times&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Unfortunately, currently the workers are highly restricted by the current interface, in their ability to find tasks. Workers cannot search for a requester, unless the requester put their name in the keywords. Also workers have no way to navigate and browse through the available tasks, to find things of interest.&lt;br /&gt;&lt;br /&gt;At the end, &lt;a href="http://doi.acm.org/10.1145/1837885.1837889"&gt;workers end up using mainly two main sorting mechanisms&lt;/a&gt;: See the most recent HITs, or see the HITgroups with the most HITs. This means that &lt;b&gt;workers use priority queues to pick the tasks to work on&lt;/b&gt;.&lt;br /&gt;&lt;br /&gt;What is the result when tasks are being completed following priorities? &lt;a href="http://behind-the-enemy-lines.blogspot.com/2009/02/your-estimated-completion-time-infinite.html"&gt;The completion times of the tasks follow a power-law&lt;/a&gt;! (For details on the analysis, see the preprint of the XRDS report "&lt;a href="http://archive.nyu.edu/handle/2451/29801"&gt;Analyzing the Amazon Mechanical Turk Marketplace&lt;/a&gt;".) What is the implication? &lt;b&gt;It is effectively impossible to predict the completion time of the posted tasks&lt;/b&gt;. For the current marketplace (with a power-law exponent a=1.5), the distribution cannot even be used to predict the average waiting time: the theoretical average is infinite, i.e., in practice the mean completion time is expected to increase continuously as we observe the market for longer periods of time.&lt;br /&gt;&lt;br /&gt;The proposed solutions? So easy, so obvious solutions, that it even hurts to propose them:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Have a browsing system&lt;/b&gt; with tasks being posted under task categories. See for example, the main page for oDesk, where tasks are being posted under one or more categories. Is this really hard to do?&lt;br /&gt;&lt;br /&gt;&lt;a href="http://2.bp.blogspot.com/_skxW9kDLt_c/TLflHe_U2_I/AAAAAAAAfYw/fjaIaZr82oo/s1600/odesk.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="244" src="http://2.bp.blogspot.com/_skxW9kDLt_c/TLflHe_U2_I/AAAAAAAAfYw/fjaIaZr82oo/s320/odesk.PNG" style="cursor: move;" width="320" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Improve the search engine. &lt;/b&gt;Seriously, how hard is it to include all the fields of a HIT into the search index? Ideally it would be better to have a faceted interface on top, but I would be happy to just see the basic things done right.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Use a recommender system to propose HITs to workers. &lt;/b&gt;For this suggestion, I have to credit ba site on the Internet, with some nifty functionality: it monitors your past buying and rating history, and then recommends products that you may enjoy. It is actually pretty nice and helped that online store to differentiate itself from its competitors. Trying to remember the name of the site... The recommendations look like that:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://3.bp.blogspot.com/_skxW9kDLt_c/TLfnLNnlnmI/AAAAAAAAfY0/HLqtnI3tItM/s1600/recommender.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="124" src="http://3.bp.blogspot.com/_skxW9kDLt_c/TLfnLNnlnmI/AAAAAAAAfY0/HLqtnI3tItM/s320/recommender.PNG" width="320" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;It would be a good idea to have something like that on the &lt;b&gt;&lt;i&gt;Amazon &lt;/i&gt;&lt;/b&gt;Mechanical Turk. Ah! I remembered! The name of the site with the nice recommendations is Amazon! Seriously. Amazon cannot have a good recommender system for its own market?&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;b&gt;&lt;u&gt;&lt;span class="Apple-style-span" style="font-size: large;"&gt;Competition waits&lt;/span&gt;&lt;/u&gt;&lt;/b&gt;&lt;br /&gt;&lt;b&gt;&lt;u&gt;&lt;br /&gt;&lt;/u&gt;&lt;/b&gt;&lt;br /&gt;Repeat after me: A labor marketplace is not the same thing as a computing service. Even if everything is an API, the design of the market still matters.&lt;br /&gt;&lt;br /&gt;It is too risky to assume that MTurk can simply a bare-bones clearinghouse for labor, in the same way that S3 can be a bare-bones provider of cloud storage. There is simply no sustainable advantage and no significant added value. Network effects are &lt;b&gt;not &lt;/b&gt;strong (especially in the absence of reputation), and just clearing payments and dealing with Patriot Act and KYC is not a significant added value.&lt;br /&gt;&lt;br /&gt;Other marketplaces already do that, build API's, and have better design as well. It will not be difficult to get to the micro segment of the crowdsourcing market, and it may happen much faster than Amazon expects. Imho, oDesk and eLance are moving towards the space by having strong APIs for worker management, and good reputation systems. Current MTurk requesters that create their HITs using iframes, can very easily hire eLance and oDesk workers instead of using MTurk.&lt;br /&gt;&lt;br /&gt;The recent surge of &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/10/explosion-of-micro-crowdsourcing.html"&gt;microcrowdsourcing services&lt;/a&gt; indicates that there are many who believe that the position of MTurk in the market is ready to be challenged.&lt;br /&gt;&lt;br /&gt;Is it worth trying to challenge MTurk? Luis von Ahn, looking at an earlier post of mine, &lt;a href="http://twitter.com/#!/LuisvonAhn/status/27760350779"&gt;tweeted&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;MTurk is TINY (total market size is on the order of $1M/year): Doesn't seem like it's worth all the attention.&lt;/blockquote&gt;&lt;br /&gt;I will reply with a prior &lt;a href="http://twitter.com/#!/ipeirotis/status/26811384962"&gt;tweet &lt;/a&gt;of mine:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;Mechanical Turk is for crowdsourcing what AltaVista was for search engines. We now wait to see who will be the Google. &lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-5679422892542446339?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/5679422892542446339'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/5679422892542446339'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/10/plea-to-amazon-fix-mechanical-turk.html' title='A Plea to Amazon: Fix Mechanical Turk!'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_skxW9kDLt_c/TLflHe_U2_I/AAAAAAAAfYw/fjaIaZr82oo/s72-c/odesk.PNG' height='72' width='72'/></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-1481300473008845364</id><published>2010-10-15T08:41:00.000-07:00</published><updated>2011-04-17T19:00:19.259-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='crowdsourcing'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='propublica'/><title type='text'>Mechanical Turk and Data Driven Journalism: The Case of ProPublica</title><content type='html'>Last year, in a Mechanical Turk Meetup in New York, I met with &lt;a href="http://www.propublica.org/site/author/amanda_michel"&gt;Amanda Michel&lt;/a&gt; of&lt;a href="http://www.propublica.org/about/"&gt; Propublica, a "non-profit newsroom that produces investigative journalism in the public interest"&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;ProPublica had a set of very interesting ideas on how to use crowdsourcing, to improve their practices and increase their reporting reach. Amanda had some great ideas on how to use crowdsourcing, starting with operational aspects of data-driven journalism,&amp;nbsp;up to &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/05/crowdsourcing-not-just-cost-saver.html"&gt;more ambitious goals&lt;/a&gt;.&amp;nbsp;What was common, in all efforts, was a simple goal:&amp;nbsp;&lt;b&gt;Find, reveal, and fight corruption&lt;/b&gt;.&amp;nbsp;When you meet with such people, it is hard not to be inspired. So, over last year I kept interacting with ProPublica on how to use Mechanical Turk for their goals.&lt;br /&gt;&lt;br /&gt;Take a simple example.&amp;nbsp;ProPublica was facing a significant data integration problem. For one of their projects, they wanted to extract data from hundreds of different city, country, and state databases. Needless to say, building an integration system of such scale is difficult and beyond the reach of many advanced IT companies. Definitely not a problem that a journalism organization could solve for the purpose of writing a single story. How Mechanical Turk could help? The Turkers could be the ones interacting with the databases,&amp;nbsp;&lt;b&gt;creating an effective, human-powered hidden-web crawler, that was up and running in a couple of days.&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Mechanical Turk became quickly an integral part of ProPublica's newsroom operations. It became so valuable, that ProPublica today published an article describing how they are&amp;nbsp;&lt;a href="http://www.propublica.org/article/how-we-use-mechanical-turk-to-do-data-driven-reporting-and-how-you-can-too"&gt;using Amazon�s Mechanical Turk to do data-driven reporting&lt;/a&gt;&amp;nbsp;and they made public the&amp;nbsp;&lt;a href="http://www.propublica.org/article/propublicas-guide-to-mechanical-turk"&gt;ProPublica's Guide to Mechanical Turk&lt;/a&gt;. It goes step by step through all the challenges that a newcomer on Mechanical Turk may face, and shows how to best approach the tool. Needless to say, these links are being passed around on Twitter like crazy.&lt;br /&gt;&lt;br /&gt;ProPublica is a great case study, not because they did something artistic or fancy, but because they kept their focus razor-sharp in integrating crowdsourcing to their operations. &lt;a href="http://www.thesheepmarket.com/"&gt;The 10,000 sheep&lt;/a&gt; will be passed around virally and inspire ideas, but mainstream adoption will come after reading success stories like the one from ProPublica. At the end of the day, people want to know how to get things done.&lt;br /&gt;&lt;br /&gt;I kept the best part for the end. &lt;a href="http://www.propublica.org/article/propublica-wins-innovation-award"&gt;From this article&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;&lt;a href="http://www.j-lab.org/awards/category/2010kb_winners/"&gt;ProPublica has received a Special Distinction Award&lt;/a&gt; from the Knight-Batten Awards for Innovations in Journalism. &lt;a href="http://www.propublica.org/ion/reporting-network"&gt;ProPublica's Distributed Reporting Project&lt;/a&gt; was honored for "&lt;b&gt;systematizing the process of crowdsourcing, conducting experiments, polishing their process and tasking citizens with serious assignments&lt;/b&gt;." The judges called it "a major step forward with how we understand crowdsourcing."&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-1481300473008845364?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/1481300473008845364'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/1481300473008845364'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/10/mechanical-turk-and-data-driven.html' title='Mechanical Turk and Data Driven Journalism: The Case of ProPublica'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-2978158026512233957</id><published>2010-10-11T22:15:00.000-07:00</published><updated>2011-04-17T19:00:19.262-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='spam'/><title type='text'>Be a Top Mechanical Turk Worker: You Need $5 and 5 Minutes</title><content type='html'>&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;The current reputation system on Mechanical Turk is simply inadequate.&amp;nbsp;The only &lt;i&gt;built-in&lt;/i&gt; reputation metrics are the number of completed HITs and the approval rate.&amp;nbsp;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;Some people believe that they are adequate as a basic filtering mechanism. They are not.&amp;nbsp;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;For example, ask for all workers in your HITs to have 1000 completed HITs and 99% approval rate. You believe that you will only get high quality workers? You are wrong!&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;I tried to filter workers using just these metrics. I failed. Spammers got me again. (And once in, spammers submit &lt;b&gt;a lot&lt;/b&gt; of crap. It costs nothing.) I questioned why. How can it be? And then I realized: It is trivial to beat these metrics.&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;Let's see the effort it takes to beat the system.&amp;nbsp;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;i&gt;&lt;b&gt;The mission: Become a top Turker, 100% approval rate and 1000 completed HITs.&lt;/b&gt;&lt;/i&gt;&lt;b&gt;&amp;nbsp;&lt;/b&gt;&lt;/div&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;/div&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Step 1&lt;/b&gt;: Login as a requester. Post a task, with 1000 HITs. Each HIT pays 1 cent. Total cost: $15. Out of these, $10 go to the worker, $5 go to Amazon. The title of the HIT: "Write a 500 word review". No sane worker will touch these HITs. Done. Logout.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Step 2&lt;/b&gt;: Login as worker, using a different email. Complete and submit the 1000 HITs created in Step 1. You just need to click submit 1000 times. Bored? iMacro and Greasemonkey can help. Done. Logout.&lt;/li&gt;&lt;li&gt;&lt;b&gt;Step 3&lt;/b&gt;: Login as a requester again. Approve all submitted HITs. Pay the $15. Amazon gets $5. The worker account has the remaining $10. Done. Logout.&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;Your worker account is a top Turker now. 1000 completed HITs and 100% approval rate. Congratulations! You have a license to spam.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-2978158026512233957?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/2978158026512233957'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/2978158026512233957'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/10/be-top-mechanical-turk-worker-you-need.html' title='Be a Top Mechanical Turk Worker: You Need $5 and 5 Minutes'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-3872114860213042641</id><published>2010-10-10T21:18:00.000-07:00</published><updated>2011-04-17T19:00:19.265-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='crowdsourcing'/><category scheme='http://www.blogger.com/atom/ns#' term='wisdom of the crowds'/><title type='text'>The Explosion of Micro-Crowdsourcing Services</title><content type='html'>&lt;div&gt;In my last post, I expressed my surprise for the sudden explosion of the research-oriented workshops in computer science conferences that are explicitly focused on the concept of crowdsourcing.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I should also note though, that there is a parallel explosion of similar micro-crowdsourcing services. Here is a list of services that I have ran into:&amp;nbsp;&lt;/div&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="https://www.mturk.com/mturk/welcome"&gt;Amazon Mechanical Turk&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.agentanything.com/"&gt;Agent Anything&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.clickworker.com/en/"&gt;Clickworker&lt;/a&gt;&amp;nbsp;(aka HumanGrid)&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.cloudcrowd.com/"&gt;CloudCrowd&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://crowdflower.com/"&gt;CrowdFlower&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.domywork.net/"&gt;DoMyWork&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.jobboy.com/"&gt;JobBoy&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="https://www.livework.com/"&gt;LiveWork&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.microtask.com/"&gt;Microtask&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.microworkers.com/"&gt;microWorkers&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://minifreelance.com/"&gt;MiniFreelance&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://minijobz.com/"&gt;MiniJobz&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.minuteworkers.com/"&gt;MinuteWorkers&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://myeasytask.com/"&gt;MyEasyTask&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://mymicrojob.com/index.php"&gt;MyMicroJob&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.optask.com/"&gt;OpTask&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://rapidworkers.com/"&gt;RapidWorkers&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.samasource.org/"&gt;Samasource&lt;/a&gt; (NonProfit)&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.shorttask.com/"&gt;ShortTask&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.simpleworkers.com/"&gt;SimpleWorkers&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.smartsheet.com/crowdsourcing"&gt;SmartSheet&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;Some of the companies above are serious, some are new and upcoming, some are copycats, and some are there just to facilitate spamming.&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I thought of doing a more detailed comparison (similar to &lt;a href="http://www.smartsheet.com/blog/brent-frei/paid-crowdsourcing-march-toward-mainstream-business"&gt;the report that Brent Frei prepared last year for the more general area of paid crowdsourcing&lt;/a&gt;) but then I realized that I do not trust enough half of these companies to even give them my email.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;This growing list makes it clear that we enter the bubble period. Bubbles are not necessarily negative. During bubble periods you see many innovations coming into the industry from many different parties. While most of the entrants in the market will die sooner rather than later, I except to see interesting things coming out of this.&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Do not forget that the dotcom bubble generated the Pets.com failures but also gave birth to Google, who replaced the early dominant players, such as Lycos and Altavista.&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-3872114860213042641?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/3872114860213042641'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/3872114860213042641'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/10/explosion-of-micro-crowdsourcing.html' title='The Explosion of Micro-Crowdsourcing Services'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-8596351097562677830</id><published>2010-10-08T22:22:00.000-07:00</published><updated>2011-04-17T19:00:19.268-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='crowdsourcing'/><category scheme='http://www.blogger.com/atom/ns#' term='wisdom of the crowds'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><title type='text'>The Explosion of Crowdsourcing Workshops</title><content type='html'>Over the last couple of years, there has been an explosion of workshops that look at the topic of crowdsourcing from the academic point of view, within the broader computer science field. Here are the ones that I am aware of:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;&lt;a href="http://www.hcomp2009.org/Home.html"&gt;Human Computation Workshop (HCOMP 2009)&lt;/a&gt;, with KDD 2009&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.ischool.utexas.edu/~cse2010/call.htm"&gt;Workshop on Crowdsourcing for Search Evaluation&lt;/a&gt;, with SIGIR 2010&lt;/li&gt;&lt;li&gt;&lt;a href="http://hcomp.info/HComp2010/index.html"&gt;Second Human Computation Workshop (HCOMP 2010)&lt;/a&gt;, with KDD 2010&lt;/li&gt;&lt;li&gt;&lt;a href="http://ttic.uchicago.edu/~dparikh/acvhl2010.htm"&gt;Advancing Computer Vision with Humans in the Loop (ACVHL)&lt;/a&gt;, with CVPR 2010&lt;/li&gt;&lt;li&gt;&lt;a href="http://sites.google.com/site/amtworkshop2010/"&gt;Creating Speech and Language Data With Amazon�s Mechanical Turk&lt;/a&gt;, with NAACL 2010&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.cs.umass.edu/~wallach/workshops/nips2010css/"&gt;Computational Social Science and the Wisdom of Crowds&lt;/a&gt;, with&amp;nbsp;NIPS 2010&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.ukp.tu-darmstadt.de/scientific-community/coling-2010-workshop/"&gt;The People�s Web Meets NLP: Collaboratively Constructed Semantic Resources&lt;/a&gt;, with COLING 2010&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.personal.psu.edu/u1o/crowdsourcing/"&gt;Workshop on Ubiquitous Crowdsourcing&lt;/a&gt;, with UBIComp 2010&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.e-crowdsourcing.org/"&gt;Enterprise Crowdsourcing Workshop&lt;/a&gt;, with&amp;nbsp;ICWE 2010&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.wiki-translation.com/tiki-index.php?page=AMTA+2010+Workshop+--+Collaborative+Translation:+technology,+crowdsourcing,+and+the+translator+perspective"&gt;Collaborative Translation: technology, crowdsourcing, and the translator perspective&lt;/a&gt;, with&amp;nbsp;AMTA 2010&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.cs.umd.edu/hcil/monotrans/workshop/"&gt;Workshop on Crowdsourcing and Translation&lt;/a&gt;,&amp;nbsp;&lt;/li&gt;&lt;li&gt;&lt;a href="http://ir.ischool.utexas.edu/csdm2011/"&gt;Crowdsourcing for Search and Data Mining&lt;/a&gt;, with WSDM 2011&lt;/li&gt;&lt;/ol&gt;(If you think that I missed a relevant workshop, drop me a line, and I will add it to the list above)&lt;br /&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;In addition to the workshops above, we also have the &lt;a href="http://crowdconf.com/"&gt;CrowdConf 2010 conference&lt;/a&gt;, organized by CrowdFlower, with some academic presence but overall targeted mainly to industry.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Yes, one workshop in 2009, followed by ten (at least!) additional workshops in 2010, and who knows how many more in 2011. (I am already aware of 3 planned workshops, in addition to the one in WSDM 2011.)&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I am deeply interested in the topic and I already feel that I am losing track of the venues that I need to follow.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-8596351097562677830?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/8596351097562677830'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/8596351097562677830'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/10/explosion-of-crowdsourcing-workshops.html' title='The Explosion of Crowdsourcing Workshops'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-8050578132370698993</id><published>2010-10-07T22:47:00.000-07:00</published><updated>2011-04-17T19:00:19.272-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><title type='text'>Mechanical Turk Requester Activity: The Insignificance of the Long Tail</title><content type='html'>The &lt;a href="http://en.wikipedia.org/wiki/Pareto_principle"&gt;Pareto principle&lt;/a&gt; says that 80% of the effects come from 20% of the causes. It is a favorite anecdote to cite that 20% of the employees in an organization do 80% of the work, or that 20% of the customers are those that generate 80% of the profits.&lt;br /&gt;&lt;br /&gt;In online settings, such inequalities are often amplified. For Wikipedia we have the &lt;a href="http://en.wikipedia.org/wiki/1%25_rule_(Internet_culture)"&gt;1% rule&lt;/a&gt;, where &lt;a href="http://www.useit.com/alertbox/participation_inequality.html"&gt;1% of the contributors (this is 0.003% of the users) contribute two thirds of the content&lt;/a&gt;. In the &lt;a href="http://www.washingtonpost.com/wp-dyn/content/article/2009/04/21/AR2009042103786.html"&gt;Causes application on Facebook&lt;/a&gt;, there are 25 million users, but only 1% of them contribute a donation.&lt;br /&gt;&lt;br /&gt;So, adapting this question for Mechanical Turk, we want to see: &lt;b&gt;What is the distribution of activity across requesters?&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;The Activity Distribution: The (Insignificance of the) Long Tail of Requesters&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;To analyze the level of participation, &lt;a href="http://archive.nyu.edu/handle/2451/29801"&gt;for the XRDS paper&lt;/a&gt;, we took the requesters that posted a task on Mechanical Turk from January 2009 until April 2010, and we ranked them according to the total reward amount of the posted HITs. Then, we measured what percentage of the rewards comes from the top the requesters in the market. Here is the resulting plot:&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_skxW9kDLt_c/TK4BGwMdk9I/AAAAAAAAfXo/t3bHDrRV3aU/s1600/q-q-requesters-activity.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="322" src="http://3.bp.blogspot.com/_skxW9kDLt_c/TK4BGwMdk9I/AAAAAAAAfXo/t3bHDrRV3aU/s400/q-q-requesters-activity.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;Indeed, the result shows that Mechanical Turk is closer to the "1% rule" of Wikipedia, than to the general 80-20 principle. As in Wikipedia, the top 1% of the requesters, contribute two thirds of the activity in the market.&lt;br /&gt;&lt;br /&gt;By reading the graph, we see the following:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Castingwords, the top requester across the 10K requesters in the dataset, accounts for 10% of the dollar-weighted activity (!).&lt;/li&gt;&lt;li&gt;The top 0.1% of the requesters (i.e., the top-10 requesters) account for 30% of the dollar-weighted activity.&lt;/li&gt;&lt;li&gt;The top 1% of the requesters account for 60% of the dollar-weighted activity.&lt;/li&gt;&lt;li&gt;The top 10% of the requesters account for 90% of the dollar-weighted activity.&lt;/li&gt;&lt;li&gt;The long tail of the 90% of the requesters &lt;b&gt;is effectively insignificant&lt;/b&gt;.&lt;/li&gt;&lt;/ul&gt;A closer look at the distribution of requester activity shows that the activity per requester follows roughly a log-normal distribution.&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: auto;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_skxW9kDLt_c/TK6xPHy4ULI/AAAAAAAAfYE/0imcW9R36B4/s1600/req-activity.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="215" src="http://2.bp.blogspot.com/_skxW9kDLt_c/TK6xPHy4ULI/AAAAAAAAfYE/0imcW9R36B4/s400/req-activity.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;The average level of posted rewards is $58&lt;/b&gt;. This corresponds to an average level of activity of just four dollars per month.&lt;/li&gt;&lt;li&gt;&lt;b&gt;The median is just $1.60&lt;/b&gt;. Yes, this is not a typo: 1.6 dollars. In other words, 50% of the requesters never post more than a couple of dollars worth of tasks.&lt;/li&gt;&lt;li&gt;Only a small fraction of requesters (less than 1%) posted 1000 dollars worth of tasks or more over the period from January 2009 till April 2010.&amp;nbsp;&lt;/li&gt;&lt;/ul&gt;The lognormal distribution of activity, also shows that requesters increase their participation exponentially over time: They post a few tasks, they get the results. If the results are good, they increase &lt;b&gt;by a percentage&lt;/b&gt; the size of the tasks that they post next time. This multiplicative behavior is the basic process that generates the lognormal distribution of activity.&lt;br /&gt;&lt;br /&gt;I would like to try is to check if this model indeed corresponds to reality. Do we see a geometric growth in activity as the requester stays in the market for longer? Do we observe "deaths" of requesters? (The  &lt;a href="http://dx.doi.org/10.1002/dir.20074"&gt;Fader-Hardie model&lt;/a&gt; may be a nice, simple model to try.) What is the expected future activity of a requester?&lt;br /&gt;&lt;br /&gt;Such questions may be useful for guiding decisions of workers when deciding whether to invest time and effort to get a good reputation for a given requester (e.g., by completing qualification tests or completing the basic HITs that unlock access to the "protected" HITs.)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-8050578132370698993?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/8050578132370698993'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/8050578132370698993'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/10/mechanical-turk-requester-activity.html' title='Mechanical Turk Requester Activity: The Insignificance of the Long Tail'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_skxW9kDLt_c/TK4BGwMdk9I/AAAAAAAAfXo/t3bHDrRV3aU/s72-c/q-q-requesters-activity.PNG' height='72' width='72'/></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-7720504704420180402</id><published>2010-10-07T21:28:00.000-07:00</published><updated>2011-04-17T19:00:19.277-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><title type='text'>What Tasks Are Posted on Mechanical Turk?</title><content type='html'>A few months back, I got an invitation from &lt;a href="http://people.csail.mit.edu/msbernst/"&gt;Michael Bernstein&lt;/a&gt; (of &lt;a href="http://code.google.com/p/soylent/"&gt;Soylent&lt;/a&gt; &lt;a href="http://www.youtube.com/watch?v=n_miZqsPwsc"&gt;fame&lt;/a&gt;) to write a small article about Mechanical Turk for the student magazine of ACM, the &lt;a href="http://xrds.acm.org/"&gt;ACM XRDS&lt;/a&gt; (aka Crossroads). I could have written a summary of past research, a position paper, or anything that I find interesting.&lt;br /&gt;&lt;br /&gt;Instead of summarizing and resubmitting already published material, I decided to push myself and start analyzing some data that I have been collecting about the Mechanical Turk marketplace. In the past, I&lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/03/new-demographics-of-mechanical-turk.html"&gt; analyzed the data about the demographics of the workers on Mechanical Turk&lt;/a&gt;. However, we have limited analysis for the requester side of the market, and for the type of tasks being posted.&lt;br /&gt;&lt;br /&gt;My goal was to put some very preliminary analysis in place, just to scratch the surface of a variety of a questions that I have heard over time. Hopefully this will push me to start working more towards getting the answers, and will inspire some interesting new questions for the students and others that read the article. A &lt;a href="http://archive.nyu.edu/handle/2451/29801"&gt;preprint of the paper&lt;/a&gt; is available through the NYU Faculty Digital Archive and the print version should appear sometime early in 2011.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Data Set&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;I have been collecting data about the marketplace through my &lt;a href="http://mturk-tracker.com/general/"&gt;Mechanical Turk Tracker&lt;/a&gt;. The tracker collects complete snapshots of the marketplace, every hour, starting from January 2009. For the analysis, I took the data from the period of January 2009 till April 2010: The snapshots has a total of:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;165,368 HIT groups&lt;/li&gt;&lt;li&gt;6,701,406 HITs&lt;/li&gt;&lt;li&gt;9,436 requesters&lt;/li&gt;&lt;li&gt;$529,259 rewards&lt;/li&gt;&lt;/ul&gt;These numbers, of course, do not account for the redundancy of the posted HITs, or for HITs that were posted &lt;i&gt;and &lt;/i&gt;disappeared between our hourly crawls. Nevertheless, they should be good approximations (within an order of magnitude) of the activity of the marketplace.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Top Requesters&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;The first question that I looked at was an analysis of the tasks that are being posted on Mechanical Turk. One way to understand what types of tasks are being completed in the marketplace is to find the �top� requesters and analyze the HITs that they post. By ranking requesters according to the sum of the posted rewards, we get the following list, showing the level of activity and the type of tasks that these requesters post. &lt;span class="Apple-style-span" style="font-size: small;"&gt;(Note: To avoid skewing the data towards one-shot requesters, I excluded from the list a requesters that were active only for small periods of time or requesters that posted only a small number of HITs. The goal was to find not only the requesters that post big tasks, but also requesters that do that consistently over time.)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_skxW9kDLt_c/TK3_pVU9VGI/AAAAAAAAfXk/HF1u_aIvUDw/s1600/top-requesters.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="205" src="http://3.bp.blogspot.com/_skxW9kDLt_c/TK3_pVU9VGI/AAAAAAAAfXk/HF1u_aIvUDw/s400/top-requesters.PNG" style="cursor: move;" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;So, &lt;b&gt;transcription, classification, and content generation seems to be a common activity on Mechanical Turk&lt;/b&gt;. This indicates that people have &lt;b&gt;developed sufficient best practices and can actually get quality work done&lt;/b&gt;. (If not, they would not be posting so many tasks.)&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Top Keywords&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;We also wanted to get a feeling of the tasks that are being posted in the market, across &lt;b&gt;all&lt;/b&gt; requesters. The table below shows the top-50 most frequent HIT keywords in the dataset, ranked by total reward amount, # of HITgroups, and # of HITs.&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_skxW9kDLt_c/TK4UjK3MEKI/AAAAAAAAfXw/oMkEyxLZ_aI/s1600/top-keywords.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="640" src="http://1.bp.blogspot.com/_skxW9kDLt_c/TK4UjK3MEKI/AAAAAAAAfXw/oMkEyxLZ_aI/s640/top-keywords.PNG" style="cursor: move;" width="385" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;Beyond the tasks identified before, we also see &lt;b&gt;data collection, image tagging, website feedback, and usability tests&lt;/b&gt; to be common tasks being posted in the marketplace.&lt;br /&gt;&lt;br /&gt;In future posts, I will post further analysis of other aspects of the AMT marketplace.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-7720504704420180402?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7720504704420180402'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7720504704420180402'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/10/what-tasks-are-posted-on-mechanical.html' title='What Tasks Are Posted on Mechanical Turk?'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_skxW9kDLt_c/TK3_pVU9VGI/AAAAAAAAfXk/HF1u_aIvUDw/s72-c/top-requesters.PNG' height='72' width='72'/></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-5646587899262463851</id><published>2010-09-15T18:41:00.000-07:00</published><updated>2011-04-17T19:00:19.281-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='wisdom of the crowds'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><title type='text'>Worker Evaluation in Crowdsourcing: Gold Data or Multiple Workers?</title><content type='html'>Evaluating the quality of workers on crowdsourcing environments is &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/06/detecting-spammers-on-mechanical-turk.html"&gt;standing&lt;/a&gt; &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/07/detecting-spammers-on-mechanical-turk.html"&gt;problem&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;The two most common approaches for dealing with the problem are:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Use "gold" data:&lt;/b&gt;&amp;nbsp;Give to the workers questions for which we already know the answers and see how the workers are performing. (Crowdflower is &lt;a href="http://crowdflower.com/docs/gold"&gt;using&lt;/a&gt; &lt;a href="http://crowdflower.com/enterprise/examples"&gt;this approach&lt;/a&gt;.)&lt;/li&gt;&lt;li&gt;&lt;b&gt;Use multiple workers&lt;/b&gt;: Give the same answer to multiple workers and then &lt;a href="http://lingpipe-blog.com/2008/09/15/dolores-labs-text-entailment-data-from-amazon-mechanical-turk/"&gt;use a latent class model&lt;/a&gt; ala Dawid &amp;amp; Skene to estimate the quality of the workers.&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;The nice thing about the two approaches is that they can be seamlessly combined into a unified algorithm.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;What was not clear to me, though, was the relative importance of the two. How much better and faster can we estimate the quality of the workers if we use gold data? What is the rate of improving the quality estimation as we add more gold data? &lt;b&gt;Generating high-quality gold data is an expensive process,&lt;/b&gt; so having good answers to these questions is important.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Being a professor, the moment these questions came up, I knew what I had to do: Ask a PhD student to give me the answers! (&lt;a href="http://pages.stern.nyu.edu/~jwang5/"&gt;Thanks Jing!&lt;/a&gt;) So, I am just the messenger here, Jing did all he work and the analysis. You know where to send your thanks.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The assumptions: We have examples belonging to 2 categories. The&amp;nbsp;examples&amp;nbsp;are equally distributed in the two categories (i.e., 50% in each). We created a set of workers with their quality picked randomly and uniformly from the range (&lt;i&gt;55% correct&lt;/i&gt;) to (&lt;i&gt;100% correct&lt;/i&gt;), for an average quality of ~77%. The workers assigned (noisy) labels to the&amp;nbsp;examples, with an accuracy directly proportional to their quality.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;We examined the performance of the&amp;nbsp;Dawid &amp;amp; Skene algorithm, which we modified to take into consideration the existence of gold data. We measured two things:&lt;/div&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Classification error&lt;/b&gt;: How well the algorithm estimates the correct class of the examples&lt;/li&gt;&lt;li&gt;&lt;b&gt;Quality estimation error&lt;/b&gt;: How well the algorithm estimates the quality of the workers&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;&lt;div&gt;We experimented with having 0% gold examples in the data, 25% gold, 50% gold, and 75% gold.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;Results on classification error&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;So, we measured the accuracy of estimating the correct class of each example. The results are listed below:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_skxW9kDLt_c/TJFh6qa6q3I/AAAAAAAAW38/0R0ilqPM0_U/s1600/AvgClassErr_3.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"&gt;&lt;img border="0" height="224" src="http://4.bp.blogspot.com/_skxW9kDLt_c/TJFh6qa6q3I/AAAAAAAAW38/0R0ilqPM0_U/s320/AvgClassErr_3.png" width="320" /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="tr-caption" style="text-align: center;"&gt;3 workers per example&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_skxW9kDLt_c/TJFh_bQDmBI/AAAAAAAAW4E/3T8Jdp6LXYQ/s1600/AvgClassErr_5.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"&gt;&lt;img border="0" src="http://4.bp.blogspot.com/_skxW9kDLt_c/TJFh_bQDmBI/AAAAAAAAW4E/3T8Jdp6LXYQ/s320/AvgClassErr_5.png" /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="tr-caption" style="text-align: center;"&gt;5 workers per example&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_skxW9kDLt_c/TJFiBAxe2cI/AAAAAAAAW4M/goRylhJwehg/s1600/AvgClassErr_10.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"&gt;&lt;img border="0" src="http://2.bp.blogspot.com/_skxW9kDLt_c/TJFiBAxe2cI/AAAAAAAAW4M/goRylhJwehg/s320/AvgClassErr_10.png" /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="tr-caption" style="text-align: center;"&gt;10 workers per example&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;One immediate observation is that the value of having gold data is limited when we have significant number of workers per label. With 10 workers per example, no matter if we have gold data, the difference is minimal. Even with just 5 workers, the additional value of gold data is small.&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;The cases where it makes sense to use gold data is when we have only small number of workers per label. (Not an uncommon case!)&amp;nbsp;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;An interesting observation, though, is that &lt;b&gt;&lt;i&gt;we can achieve the same effect by simply forcing workers to work on more examples&lt;/i&gt;&lt;/b&gt;.&amp;nbsp;Once a worker has given us 30 answers, the completely unsupervised algorithm can work almost as well as the algorithm that uses 75% gold data. This holds even when we have just 3 workers per example.&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;Of course, on an environment like Mechanical Turk, forcing workers to work on a large number of HITs may not be feasible. But we can always bundle multiple questions in a single HIT, achieving the same result.&amp;nbsp;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;OK, so gold data do not seem to be very useful in for getting better accuracy in class estimation.&amp;nbsp;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;But, it should help in estimating the quality of workers, &lt;i&gt;right&lt;/i&gt;?&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;b&gt;Results on worker quality estimation&lt;/b&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;For the quality estimation, we also calculated the error when having 100% gold examples. (This is the lower bound for the estimation error, of course.)&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_skxW9kDLt_c/TJFnQP7soRI/AAAAAAAAW4U/VZmB4x0ztv8/s1600/AvgCostErr_3.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"&gt;&lt;img border="0" src="http://4.bp.blogspot.com/_skxW9kDLt_c/TJFnQP7soRI/AAAAAAAAW4U/VZmB4x0ztv8/s320/AvgCostErr_3.png" /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="tr-caption" style="text-align: center;"&gt;3 workers per example&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_skxW9kDLt_c/TJFnSJjhEPI/AAAAAAAAW4c/rnVJxaZpgXI/s1600/AvgCostErr_5.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"&gt;&lt;img border="0" src="http://1.bp.blogspot.com/_skxW9kDLt_c/TJFnSJjhEPI/AAAAAAAAW4c/rnVJxaZpgXI/s320/AvgCostErr_5.png" /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="tr-caption" style="text-align: center;"&gt;5 workers per example&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;table align="center" cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: center;"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_skxW9kDLt_c/TJFn_OrAOUI/AAAAAAAAW4k/8cDpF0T-GZU/s1600/AvgCostErr_10.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"&gt;&lt;img border="0" src="http://4.bp.blogspot.com/_skxW9kDLt_c/TJFn_OrAOUI/AAAAAAAAW4k/8cDpF0T-GZU/s320/AvgCostErr_10.png" /&gt;&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="tr-caption" style="text-align: center;"&gt;10 workers per example&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;As expected, with 10 workers per example, we gain almost nothing in terms of quality estimation when we use gold data.&amp;nbsp;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;With 3-5 workers per example, having gold data improves the quality estimation in a rather consistent manner. Again, though, we observe that if we have each worker completing a large enough number of assignments, we can get most of the benefits of having gold data, without actually having gold data.&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;b&gt;Why gold data then?&lt;/b&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;So, given the results above, someone would ask: Why do we even need gold data? The unsupervised approach seems to work very well!&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;In reality there are a few reasons for which we may still need to have gold data:&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;/div&gt;&lt;ul&gt;&lt;li&gt;&lt;i&gt;&lt;b&gt;Imbalanced datasets&lt;/b&gt;:&amp;nbsp;&lt;/i&gt;When we have very imbalanced data sets, the estimation becomes more challenging. For imbalanced data, we need to&amp;nbsp;quickly&amp;nbsp;and preemptively&amp;nbsp;test workers using data from all categories, rather than waiting for the occasional object from the minority category to appear. To give an example: if we monitor a security camera trying to detect the presence of people in prohibited areas, we want to ensure that the workers will be tested early on with images that have people in them. Otherwise it may take a long time to get a&amp;nbsp;reliable&amp;nbsp;estimate of their ability to classify correctly examples from the minority class.&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;b&gt;Very low quality of workers&lt;/b&gt;: &lt;/i&gt;When workers have very low quality, we need more workers per example and more labels per worker to replicate the results above. In this case, having gold data allows us quickly to get rid of the workers that do not meet quality standards. This is very useful on high-noise environments like the "unprotected"&amp;nbsp;Amazon Mechanical Turk marketplace (by unprotected I mean without using qualification tests or other quality assurance mechanisms).&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;b&gt;Giving confidence to non-technical people&lt;/b&gt;: &lt;/i&gt;If you say that you test the workers with known examples, everyone understands the process. If you say that you rely on agreement between workers, or on latent class models&amp;nbsp;("what?")&amp;nbsp;and on expectation maximization or Bayesian estimation&amp;nbsp;("come again?")&amp;nbsp;, most people will start feeling&amp;nbsp;uncomfortable. Everyone understands random tests, not everyone is willing to let unsupervised methods to direct the quality assurance process. So, even if the gold examples do not help much, it is a very reassuring factor for people that just want to know that there is a familiar, understandable, and easy-to-explain quality control mechanism in place.&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;b&gt;Calibrating results and giving feedback&lt;/b&gt;&lt;/i&gt;: One of the final reasons for having gold data is to be able to calibrate workers and give them feedback about the expected coding standards. For example, when rating pages as porn or not and into degrees of severity, different people have different levels of sensitivity. If we have enough "sensitive" workers in the workforce, we may end up with results that are consistent but shifted upwards in terms of severity. (Or vice versa if the coders are more tolerant, the results may be shifted downwards.) This can, incorrectly, give the impression that all data collected through crowdsourcing are wrong. However, if the final user of the data provides a few gold data as anchor points, the Dawid &amp;amp; Skene code gives back results that are more in line with expectations. At the same time, these gold data points can be used to give immediate feedback to the workers about their errors and implicitly direct them to use the expected rating guidelines and self-calibrate.&lt;/li&gt;&lt;/ul&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: left;"&gt;In practice, the last two reasons are often more important than the technical aspects of estimation. So, before starting any big crowdsourcing annotation project, spend some time and create some gold data. Or, alternatively, take a small dataset and label it using a large number of workers per example. Then verify the outcome, correct and calibrate some of the unexpected results, and run Dawid &amp;amp; Skene again. The generated data will be close enough to gold. Having such gold data will pay back the effort and cost multiple times during the overall process.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-5646587899262463851?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/5646587899262463851'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/5646587899262463851'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/09/worker-evaluation-in-crowdsourcing-gold.html' title='Worker Evaluation in Crowdsourcing: Gold Data or Multiple Workers?'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_skxW9kDLt_c/TJFh6qa6q3I/AAAAAAAAW38/0R0ilqPM0_U/s72-c/AvgClassErr_3.png' height='72' width='72'/></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-1525795062227067303</id><published>2010-09-14T08:52:00.000-07:00</published><updated>2011-04-17T19:00:19.285-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='academia'/><category scheme='http://www.blogger.com/atom/ns#' term='teaching'/><title type='text'>Analytics for Class Lectures</title><content type='html'>&lt;div class="separator" style="clear: both; text-align: justify;"&gt;The classes for the new academic year have started, so naturally I started thinking about teaching-related topics.&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;b&gt;Mining video interactions&lt;/b&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;A few days back, &lt;a href="http://palblog.fxpal.com/?p=4528"&gt;FXPal released TalkMiner&lt;/a&gt;,&amp;nbsp;a system for indexing and searching video of lecture broadcasts. One of the interesting ideas is that it is possible to mine the interactions of students with the video, to see what are the topics of interest for the students, what parts of the class get skipped, and so on. From the blog post of FXPal:&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;/div&gt;&lt;blockquote&gt;The Berkeley webcasting system (developed by our president Larry Rowe while he was a professor there) showed that&lt;/blockquote&gt;&lt;blockquote&gt;� students almost always watched the lectures on-demand rather than in real-time, and they rarely watched the entire lecture. &amp;nbsp;Students use the webcasts to study for exams � we could see this clearly by patterns of usage � and, they primarily wanted to review selected material covered by the instructor. &amp;nbsp;In one class we discovered that for over 50% of the lectures, students watched less than 10 minutes from a 50-minute lecture and students watched the entire lecture only 10% of the time. &amp;nbsp;Consequently, for using the system, effective search is a big issue.&lt;/blockquote&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;At Stern, all the classes get recorded and are available to students for reviewing the class material. The students get access to a layout like the following and have the ability to rearrange the layout, emphasizing the slides, or the video. (You can see a &lt;a href="http://bit.ly/aHSMpN"&gt;lecture of mine&lt;/a&gt;;&amp;nbsp;login: &lt;i&gt;scribe &lt;/i&gt;and password: &lt;i&gt;Scribe987!&lt;/i&gt;)&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_skxW9kDLt_c/TI-EK5myGjI/AAAAAAAAWm0/YWkWef-qcoE/s1600/class-capture.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://3.bp.blogspot.com/_skxW9kDLt_c/TI-EK5myGjI/AAAAAAAAWm0/YWkWef-qcoE/s320/class-capture.PNG" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;It seems to be a natural next step to show to the instructor the patterns of interaction that students have with the videos. It would be very interested to see what parts of the class go largely unexamined and which ones are played again and again. Needless to say, these are either complicated topics, or topics that the instructor did not explain clearly.&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;b&gt;Mining search queries using transcripts&lt;/b&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;Another interesting idea is to also have transcripts of the class. (For example, for &lt;a href="http://bit.ly/aHSMpN"&gt;this lecture&lt;/a&gt;&amp;nbsp;[login:&amp;nbsp;&lt;i&gt;scribe&amp;nbsp;&lt;/i&gt;and password:&amp;nbsp;&lt;i&gt;Scribe987!&lt;/i&gt;] &lt;a href="http://transcripts.castingwords.com/Mry/67481.html"&gt;see the transcript, done by CastingWords for $0.75/min.&lt;/a&gt;) This would allow students to search the class not only using text in the slides but also to recall particular points of the class discussion. This is especially important for courses that have a significant component of in-class discussion. We already know, from web search, that query logs are important source of information. Doing the same for class content would easily identify what students are looking for in the class recordings.&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;One problem with transcription is that it is rather expensive. &lt;a href="http://castingwords.com/"&gt;CastingWords&lt;/a&gt; and &lt;a href="http://www.speakertext.com/"&gt;SpeakerText&lt;/a&gt; seem to charge one or two dollars per minute for human-verified transcriptions. (Fully-automatic solutions are not ready for prime time, as the automatic transcription of &lt;a href="http://www.youtube.com/watch?v=QkhBcLk_8f0"&gt;these&lt;/a&gt;&amp;nbsp;&lt;a href="http://www.youtube.com/watch?v=Ac33dOAgqus"&gt;YouTube&lt;/a&gt; videos shows. Make sure to click the "cc" button and then "transcribe audio".) With approximately 28 lectures a semester, 75 minutes each, at 1-2 dollars per minute, we have a cost of $2000 to $4000 per semester. At this cost level, it is certainly more beneficial to hire an extra TA rather than provide the transcription of the lecture to the students.&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;b&gt;Mining class participation&amp;nbsp;&lt;/b&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;Another thing that I would love to have is the ability to transcribe not only what the instructor said but also who are the students that contributed to the discussion, together with what they said.&amp;nbsp;This would allow not only to track and quantify participation but also uncover some patterns that may not be obvious to the instructor.&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;For example, take a look at this diagram below, created as part of the yearly teaching evaluation that we undergo at Stern:&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_skxW9kDLt_c/TI1ka7NGAXI/AAAAAAAATT4/McLfZT9FdkQ/s1600/class-diagram-rotated.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="307" src="http://1.bp.blogspot.com/_skxW9kDLt_c/TI1ka7NGAXI/AAAAAAAATT4/McLfZT9FdkQ/s400/class-diagram-rotated.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;The diagram was created by an evaluator who sat in my class, tracked the composition of the student body, where each student was sitting in the amphitheater, how many times they raised their hand, and how many times I asked them to&amp;nbsp;answer&amp;nbsp;a question. &lt;span class="Apple-style-span" style="font-size: small;"&gt;(To answer the inevitable question: No, the teaching feedback is not focused only on such analyses. In my earlier years, the feedback was focused more on substantial issues, e.g., structuring lectures and discussions, encouraging participation, etc. Now, with feedback and experience, the more substantial and important issues are addressed. &amp;nbsp;So we focus on such, seemingly more superficial, but also important, stuff...)&lt;/span&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;The results? I was paying significantly more attention to the left part of the amphitheater: I asked 80% of the time students sitting in the left, and only 20% of the time I asked students on the right. Also, the percentage of female students participating in the discussion was significantly lower: 50% for male students participated, but only 21% of the female students did.&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;These are patterns that are hard to understand while teaching, but would be easier to find out if we had detailed transcripts of the class discussion together, potentially, with a standardized seat chart. I was also told that some universities (the rumor is about Harvard Business School) use software to track student participation. However, I was not able to locate any such software offerings.&amp;nbsp;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;b&gt;Moving forward&lt;/b&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;The ability to videotape lectures has been around for a while and is being used extensively for distance learning applications. (Columbia Engineering had a &lt;a href="http://www.cvn.columbia.edu/"&gt;well-established distance learning program&lt;/a&gt; when I joined the PhD program back in 1999.) However, it was mainly a broadcast mechanism, and not a medium for providing feedback to the instructor (and even to the students who can see that they are lacking in terms of participation).&amp;nbsp;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;It would be interesting to start having such technologies for providing feedback on teaching. Analytics have been changing many industries. Education has been surprisingly behind in that respect.&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: justify;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-1525795062227067303?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/1525795062227067303'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/1525795062227067303'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/09/analytics-for-class-lectures.html' title='Analytics for Class Lectures'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_skxW9kDLt_c/TI-EK5myGjI/AAAAAAAAWm0/YWkWef-qcoE/s72-c/class-capture.PNG' height='72' width='72'/></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-9065469249176761225</id><published>2010-08-07T22:29:00.000-07:00</published><updated>2011-04-17T19:00:19.289-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='crowdsourcing'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='market for lemons'/><category scheme='http://www.blogger.com/atom/ns#' term='spam'/><category scheme='http://www.blogger.com/atom/ns#' term='reputation'/><title type='text'>Reputation Integration and the Future of Reviews</title><content type='html'>Last week I was at Google for the annual Google Faculty Summit. While discussing research challenges related to social media and shopping, a common theme and questions emerged: How can we evaluate the trustworthiness of the reviews that abound on the Internet?&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Past Performance predicts Future Performance&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Current solutions focus on the trustworthiness and history of a reviewer in each site. For example, Amazon has the reviewer rank, which is computed using the amount of reviews contributed, the helpful votes that they&amp;nbsp;amassed, and other secret-sauce factors. Other sites, like Yelp, follow similar&amp;nbsp;approaches to identify the best reviewers.&lt;br /&gt;&lt;br /&gt;Of course, knowing the good reviewers is very valuable: Unlike investments, &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/01/did-you-find-this-helpful.html"&gt;past performance is a strong indicator of future performance&lt;/a&gt;. Reviewers that wrote good reviews in the past are likely to write good reviews in the future. Or, more general, high-quality users in the past are likely to be high-quality in the future.&lt;br /&gt;&lt;br /&gt;So, problem solved! We just need to know which user has high quality!&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Network Effects and Closed Reputation Platforms&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Well, at this point we have a problem. Today, we treat site-specific user profiles as separate individuals. A profile of a user on Amazon is not connected to the corresponding reviewer profile in Epinions, in NewEgg, in B&amp;amp;H. So, reviewers that have written tens of reviews and amassed thousands of helpful votes will be insignificant newbies if they decide to write some reviews on B&amp;amp;H website. Similarly, a person that has contributed plenty of reviews and discussions on Chowhound&amp;nbsp;over the years, will be an insignificant newbie on Yelp. The result? We cannot trust the reviews of these individuals, even if they have proven themselves trustworthy in the past!&lt;br /&gt;&lt;br /&gt;This is a &lt;a href="http://en.wikipedia.org/wiki/Vendor_lock-in"&gt;lock-in&lt;/a&gt; associated with &lt;a href="http://en.wikipedia.org/wiki/Network_effect"&gt;network&lt;/a&gt; &lt;a href="http://oz.stern.nyu.edu/io/network.html"&gt;effects&lt;/a&gt;, which is similar to the lock-in that happens&amp;nbsp;&lt;a href="http://en.wikipedia.org/wiki/Network_effect#Open_versus_closed_standards"&gt;under closed &amp;nbsp;and proprietary standards&lt;/a&gt;. For example, Microsoft has achieved domination in Office Productivity software by keeping proprietary the file formats for Office. Since no other office productivity suite could&amp;nbsp;inter-operate&amp;nbsp;with Office, smaller players could simply not compete: the minority of users that did not use Office could not exchange files with the Office users. Under such scenarios, either we see a market split into isolated markets (the case of reputation-based sites today), or a single dominant player (e.g., in Office).&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Closed Standards and Networks Effects&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Interestingly enough, under closed standards, when the market ends up being fragmented, this is not optimum. An excellent example is the SMS market in the United States. Before 2000, the different telecoms did not allow their subscribers to send SMS messages to subscribers of other companies. The main rationale was to force users to adopt the network already chosen by their friends, if they wanted to text each other. So the market looked like that:&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_skxW9kDLt_c/TF46hFYaLjI/AAAAAAAAJ04/0WGBKjwg4yQ/s1600/closed-networks.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://3.bp.blogspot.com/_skxW9kDLt_c/TF46hFYaLjI/AAAAAAAAJ04/0WGBKjwg4yQ/s320/closed-networks.png" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;Interesting approach but the result was not ideal: texting in the US was essentially non-existent before 2000. (I vividly remember when I arrived in the US how nobody was using SMS to communicate with each other. In Europe sending SMS was commonplace.) &lt;b&gt;However, once the networks decided to cooperate and allow SMS to flow freely across networks, the market took off, attracting more players and being much more useful.&amp;nbsp;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_skxW9kDLt_c/TF460C5n3fI/AAAAAAAAJ1I/MsZ_jq9HI8I/s1600/open-networks.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://4.bp.blogspot.com/_skxW9kDLt_c/TF460C5n3fI/AAAAAAAAJ1I/MsZ_jq9HI8I/s320/open-networks.png" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;In other words, if we do not have a monopoly, closed networks are suboptimal. Fragmented markets for goods with network effects are almost never optimal, even for the current dominant players.&lt;br /&gt;&lt;br /&gt;&lt;i&gt;&lt;b&gt;The Need for Open Reputation and Reputation Integration&lt;/b&gt;&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;I believe that we see a similar case in the reputation identities. Identities are fragmented across sites and users have little incentive to contribute high-quality content in sites in which they do not have established presence: Nobody pays attention to them in any case.&lt;br /&gt;&lt;br /&gt;This also makes it difficult to start from scratch any website that requires the existence of trustworthy reviews in order to attract viewers. Visitors will not come as there are no trustworthy reviews, and reviewers already established in other sites will not join.&lt;br /&gt;&lt;br /&gt;So, it is important to start thinking on how to integrate reputations across different websites. It is simply not optimal to have fragmented identities. Even though there are privacy concerns for anonymous profiles, there should be at least the capability to connect identities across sites for users that do not try to preserve their anonymity.&amp;nbsp;By integrating profiles across websites, we can allow reputation to flow across websites and creating a better, comprehensive profile of the participants in today's Internet.&lt;br /&gt;&lt;br /&gt;This can allow participants to build their reputation profiles without worrying about lock in. OpenID moves towards the right direction. Next versions should allow explicit, distributed connection of profiles, without requiring a dominant players to control the profile. (Thank you for the offer, Facebook. I will pass.)&lt;br /&gt;&lt;br /&gt;Third-party services can play this role as well. By identifying profiles across sites that belong to the same individual, we can quickly learn about the identity of a new contributor. The history can follow the users that are not worried about anonymity. Such strong reputation signals can help improve any existing site that relies on user contributed content. In the presence of strong reputations, low-quality contributors will simply never have a chance of getting a dominant position in the marketplace.&lt;br /&gt;&lt;br /&gt;As I mentioned &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/07/mechanical-turk-low-wages-and-market.html"&gt;in my earlier blog post&lt;/a&gt;, lack of credible reputation mechanisms simply degenerates any market into a market for lemons. In the attention economy, lack of strong reputation signals simply gives the incentives to spammer to come and pollute. Reputation integration mechanisms can solve this issue.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://blog.reppify.com/reputation/reputation-integration-and-the-future-of-reviews/"&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;Cross-posted at the Reppify blog&lt;/span&gt;&lt;/i&gt;&lt;/a&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt; (disclaimer: I am an advisor for Reppify)&amp;nbsp;&lt;/span&gt;&lt;/i&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-9065469249176761225?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/9065469249176761225'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/9065469249176761225'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/08/reputation-integration-and-future-of.html' title='Reputation Integration and the Future of Reviews'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_skxW9kDLt_c/TF46hFYaLjI/AAAAAAAAJ04/0WGBKjwg4yQ/s72-c/closed-networks.png' height='72' width='72'/></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-8396908801408455016</id><published>2010-07-27T20:18:00.000-07:00</published><updated>2011-04-17T19:00:19.296-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='crowdsourcing'/><category scheme='http://www.blogger.com/atom/ns#' term='wisdom of the crowds'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='market for lemons'/><title type='text'>Mechanical Turk, Low Wages, and the Market for Lemons</title><content type='html'>In HCOMP this year, one of the memorable and discussed presentations (although highly unconventional) was by M. Six Silberman who discussed the "Sellers' problems in human computation markets". The basic question: can we protect the workers there from exploitation and from sweatshop salaries?&lt;br /&gt;&lt;br /&gt;Luis von Ahn posted &lt;a href="http://vonahn.blogspot.com/2010/07/work-and-internet.html"&gt;a similar post on his blog&lt;/a&gt;. In the comments of the blog post, someone suggested that the low wages on Mechanical Turk is simply the result of high supply of workers and low demand for their work. As there is more supply, the salaries drop. And having minimum wages, would interfere with the free market.&lt;br /&gt;&lt;br /&gt;I actually disagree with this interpretation. First of all, there is no oversupply of labor on Mechanical Turk. The distribution of completion times (follows a power law), suggests that the market operates at maximum capacity. My gut instinct actually tells me that &lt;b&gt;there are not enough workers available for the posted work&lt;/b&gt;, not vice versa.&lt;br /&gt;&lt;br /&gt;I can hear the protests: If there is not enough supply of workers, why don't requesters simply increase the offered prices?&lt;br /&gt;&lt;br /&gt;My explanation: The requesters already pay minimum wages for work that is worth minimum wage. How is that possible given the effective hourly rate of $2/hour?&lt;br /&gt;&lt;br /&gt;The basic problem: Spammers. Given that many large tasks attract spammers, most requesters rely on redundancy to ensure quality. So instead of having a single worker to do a task, they get 5 workers to work on it. This increases the effective rate from $2/hr to $10/hr.&lt;br /&gt;&lt;br /&gt;Effectively, what Amazon Mechanical Turk is today is a &lt;a href="http://en.wikipedia.org/wiki/The_Market_for_Lemons"&gt;market for lemons&lt;/a&gt;, following the terminology of Akerlof's famous paper, for which he got the 2001 Nobel prize.&lt;br /&gt;&lt;br /&gt;A &lt;b&gt;market for lemons&lt;/b&gt; is a market where the sellers cannot evaluate beforehand the quality of the goods that they are buying. So, if you have two types of products (say good workers and low quality workers) and cannot tell who is whom, the price that the buyer is willing to pay will be proportional to the average quality of the worker. So the offered price will be between the price of a good worker and a low quality worker. What a good worker would do? Given that good workers will not get enough payment for their true quality, they leave the market. This leads the buyer to lower the price even more towards the price for low quality workers. At the end, we only have low quality workers in the market (or workers willing to work for similar wages) and the offered price reflects that.&lt;br /&gt;&lt;br /&gt;This is exactly what is happening on Mechanical Turk today. Requesters pay everyone as if they are low quality workers, assuming that extra quality assurance techniques will be required on top of Mechanical Turk.&lt;br /&gt;&lt;br /&gt;So, how can someone resolve such issues? The basic solution is the concept of&amp;nbsp;&lt;a href="http://en.wikipedia.org/wiki/Signalling_(economics)"&gt;signalling&lt;/a&gt;. Good workers need a method to signal to the buyer their higher quality. In this way, they can differentiate themselves from low quality workers. Unfortunately, Amazon has not implemented a good reputation mechanism. The "number of HITs worked" and the "acceptance percentage" are simply not sufficient signalling mechanisms.&lt;br /&gt;&lt;br /&gt;Here are some ideas:&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Allowing workers to get endorsements from &lt;b&gt;reputable&lt;/b&gt; requesters (to avoid scam rings like on eBay)&lt;/li&gt;&lt;li&gt;Allowing requesters to post machine readable feedback on the performance of the workers, disconnecting evaluation from the approval rate.&lt;/li&gt;&lt;li&gt;Certifications and qualification tests that indeed measure ability on different tasks (e.g., language abilities, reading comprehension tests, etc)&lt;/li&gt;&lt;li&gt;Publishing the reputation history of the workers, so that requesters can evaluate the quality of the worker.&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;Of course, similar measures can be adopted for requesters! There is a symmetric market for lemons on that side! Scam requesters post HITs, behave badly, and cause good workers to avoid any newcomer. New requesters then get only low quality workers, get disappointed with the quality of the results and they leave the market.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;In other words, Amazon can only gain by taking the time to build a more robust reputation system on top of Mechanical Turk. Trust is at the very core of marketplaces. If Mechanical Turk wants to "grow up", then a good reputation system for both sides of the market is grossly overdue.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-8396908801408455016?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/8396908801408455016'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/8396908801408455016'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/07/mechanical-turk-low-wages-and-market.html' title='Mechanical Turk, Low Wages, and the Market for Lemons'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-4023750210230078972</id><published>2010-07-27T10:39:00.000-07:00</published><updated>2011-04-17T19:00:19.299-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='academia'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='peer reviewing'/><title type='text'>Peer Reviewing for Oral Presentations?</title><content type='html'>Everyone who has attended a conference knows that the quality of the talks is very uneven. There are talks that are highly engaging, entertaining, and describe nicely the research challenges and solutions. And there are talks that are a waste of time. Either the presenter cannot present clearly, or the presented content is impossible to digest within the time frame of the presentation.&lt;br /&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;So, my question is: &lt;b&gt;&lt;i&gt;Why don't we have peer reviewing for oral presentations?&amp;nbsp;&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;We already have reviewing for the written part. The program committee examines the quality of the &lt;b&gt;written&lt;/b&gt; paper and vouch for its technical content. However, by looking at a paper it is impossible to know how nicely it can be presented. Perhaps the seemingly solid but boring paper can be a very entertaining presentation. Or an excellent paper may be written by a horrible presenter.&lt;br /&gt;&lt;br /&gt;Why not having a second round of reviewing, where the authors of &lt;b&gt;&lt;i&gt;accepted &lt;/i&gt;&lt;/b&gt;papers submit their &lt;b&gt;&lt;i&gt;presentations &lt;/i&gt;&lt;/b&gt;(slides and a YouTube video) for presentation to the conference. The paper will be accepted and be included in the proceedings anyway but having a paper does not mean that the author gets a slot for an oral presentation.&lt;br /&gt;&lt;br /&gt;Under an oral presentation peer review, a committee looks at the presentation, votes on accept/reject and potentially provides feedback to the presenter. The best presentations get a slot on the conference program. This also allows the conference to accept more papers that are worthy of inclusion to the proceedings, without worrying about capacity constraints.&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Some other side benefits of this scheme:&lt;/div&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;Presentations are accessible in an archival format&lt;/li&gt;&lt;li&gt;Authors have hard incentives to be better presenters&lt;/li&gt;&lt;li&gt;The time of the attendees in conferences is not wasted in clearly sub-par presentations.&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;&lt;div&gt;And if someone says that this system is biased towards good and sleek presenters, I would argue that the system is already biased towards good authors. A well-written paper will eventually have a higher impact than one that is badly written. Same thing for presentations.&lt;br /&gt;&lt;br /&gt;Learning to communicate properly the results of our research should be a goal, not an afterthought.&lt;/div&gt;&lt;div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-4023750210230078972?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/4023750210230078972'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/4023750210230078972'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/07/peer-reviewing-for-oral-presentations.html' title='Peer Reviewing for Oral Presentations?'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-6438097227079528720</id><published>2010-07-26T12:31:00.000-07:00</published><updated>2011-04-17T19:00:19.301-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='wisdom of the crowds'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='hcomp'/><title type='text'>Liveblogging from HCOMP 2010</title><content type='html'>Following last year's practice, I am blogging about the workshop this year as well. (If I do it well for a few more years, I hope to be a tenured blogger for HCOMP.)&lt;br /&gt;&lt;br /&gt;The workshop was very well-attended, despite the strong competition for attention (there are &lt;a href="http://www.kdd.org/kdd2010/workshops.shtml"&gt;14 workshops at KDD&lt;/a&gt; this year).&lt;br /&gt;&lt;br /&gt;&lt;b&gt;8.40am, Invited Talk&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;The workshop started with an invited talk by Ross Smith from Microsoft, who talked about "Using Games to Improve Productivity in Software Engineering". He described how Microsoft used games internally to improve the quality of its products.&lt;br /&gt;&lt;br /&gt;He particularly emphasized the attempts on using engineers to help with the localization/internationalization of the Microsoft products. One of the things that was interesting was the pride that engineers had when translating the messages into their own native language. The "my-work-is-in-the-product-that-you-are-using-mom"-factor was a strong motivation for engineers to contribute their time and effort to such volunteer efforts.&lt;br /&gt;&lt;br /&gt;Ross also covered a variety of behavioral aspects of the process. For example, leaderboards has different effect, depending on how competitive the workers are in this particular country. When people compete, they have a positive factor. However, if someone has big difference than the rest, this is typically a demotivating factor, as many workers know that they cannot reach the top in any case.&lt;br /&gt;&lt;br /&gt;Another interesting factor is that games should not be similar to the duties of the worker. If a programmer writes C++, then playing a game that requires the worker to write C++ is bad. First, the programmer may spend time in the game that he would have otherwise spend programming. Second if a worker is way too good in a task, and spends a lot of time there, other workers cannot compete easily and get demotivated.&lt;br /&gt;&lt;br /&gt;An interesting&amp;nbsp;aspect&amp;nbsp;going forward is that games are increasingly being used to tap into the "discretionary" time of the workers, so there is now competition to make the games more interesting, more attractive, more meaningful etc. For example, currently workers accumulate points in the games, and their reward is that these points are translated into dollars that can be donated to disaster relief efforts.&lt;br /&gt;&lt;br /&gt;Finally, games should not come with a mandate from the management. &lt;i&gt;(Luis von Ahn mentioned that when you say "play the game" people do not.)&lt;/i&gt;&amp;nbsp;The counterexample was Japan, in which the linguistic pride to get the applications translated into Japanese worked (despite? due to?) the clear mandate from the management to play the game that aids into the localization of Microsoft products.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;9:10am, Session 1: Market Design&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Task Search in a Human Computation Market&amp;nbsp;&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;Lydia Chilton (University of Washington) (*** presenting)&lt;br /&gt;John Horton (Harvard University)&lt;br /&gt;Robert Miller (MIT CSAIL)&lt;br /&gt;Shiri Azenkot (University of Washington)&lt;br /&gt;&lt;br /&gt;The paper described how workers tend to search for tasks on Amazon Mechanical Turk. The analysis indicated that "Most HITs Available" and "Most Recently Posted" are the most commonly ranking techniques for users to find tasks. By monitoring HITs and scraping the website every 30 seconds, the authors figured out how quickly different tasks are being done.&lt;br /&gt;&lt;br /&gt;Plus they run a survey, trying to target rankings that are NOT frequently chosen by workers and compared "best case" scenario and "worst case" scenario. Interestingly enough, there is almost a 30x factor in the rate of completion. This explains all the gaming that is going on today, where major requesters keep posting HITs within their HITgroups to keep their HITs in the first page.&lt;br /&gt;&lt;br /&gt;This paper really highlights the negative aspects of the&amp;nbsp;prioritization&amp;nbsp;schemes currently used on Mechanical Turk. Allowing workers to find easier tasks to work on, and employing some randomization in the presentation, Mechanical Turk can really contribute to&amp;nbsp;&lt;a href="http://behind-the-enemy-lines.blogspot.com/2009/02/your-estimated-completion-time-infinite.html"&gt;more predictable&amp;nbsp;completion&amp;nbsp;times for the&amp;nbsp;tasks&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;The Anatomy of a Large-Scale Human Computation Engine&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;Shailesh Kochhar (Metaweb Technologies)&amp;nbsp;(*** presenting)&lt;br /&gt;Stefano Mazzocchi (Metaweb Technologies)&lt;br /&gt;Praveen Paritosh (Metaweb Technologies)&lt;br /&gt;&lt;br /&gt;This paper described Rabj, the platform used by Metaweb to improve the quality of FreeBase by having humans to look at the ambiguous cases, that cannot be handled well by automatic techniques. The basic goal is to take a system that is 99% accurate, and improve precision well above 99%&lt;br /&gt;&lt;br /&gt;Metaweb did &lt;b&gt;&lt;i&gt;not &lt;/i&gt;&lt;/b&gt;use Mechanical Turk for this task. Instead, they hired people through oDesk, by first training them for a day, so that they can do their tasks properly and then let them work. By building some long term relationship, they were able to improve the quality of the results, without employing too complicated solutions for solving the worker quality problem. They use the oDesk API as well, and pay an hourly wage that varies from $5 to $15 per hour, depending&lt;br /&gt;&lt;br /&gt;One thing that was interesting is that&lt;b&gt;&lt;i&gt; they are paying per hour, and not per piece. &lt;/i&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;This is a conscious choice. The distribution of completion times for various tasks follows a lognormal distribution. At the very tail, we have the hard tasks that need a lot of time. These are actually the tasks that MetaWeb cares a lot to get right. Paying by piece means that workers have the incentives to do these tasks quickly, and move to the next. Paying for time means that workers can spend some time more in such hard tasks. The quality control process of Metaweb includes testing workers for throughput (if a worker is very significantly slower than the others gets warned and then dismissed).&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Sellers' problems in human computation markets&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;M. Six Silberman (University of California, Irvine)&amp;nbsp;(*** presenting)&lt;br /&gt;Joel Ross (University of California, Irvine)&lt;br /&gt;Lilly Irani (University of California, Irvine)&lt;br /&gt;Bill Tomlinson (University of California, Irvine)&lt;br /&gt;&lt;br /&gt;M. Six Silberman discussed the problems that workers (aka sellers) face in the marketplace of Mechanical Turk. There are bad requesters that reject good work and do not pay, or try to scam workers. Given the &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/03/new-demographics-of-mechanical-turk.html"&gt;increasing number of workers that rely on Turk for income&lt;/a&gt;, it is not surprising that workers start demanding guarantees of fairness. The paper even points to a "Turker Bill of Rights". Tools like Turkopticon help in that respect. (Btw, readers that are interested in the labor law aspects of crowdsourcing, should definitely read the paper "&lt;a href="http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1593853"&gt;Working the Crowd: Employment and Labor Law in the Crowdsourcing Industry&lt;/a&gt;" by Alek Felstiner.)&lt;br /&gt;&lt;br /&gt;One interesting aspect of this presentation was its slideless nature. The speaker just read the conclusions from his notes.&amp;nbsp;Although I found the mode of presentation difficult to follow, I think the message was clear: Do we care about the workers? Do we pay them fairly? There was significant discussion afterwards, and I bet this is the only place in KDD (or in any other CS conference) where people engaged into discussion about the fairness of minimum wage laws, issues of immigration and labor, and so on :-)&lt;br /&gt;&lt;br /&gt;&lt;b&gt;10 am,&amp;nbsp;Session 2: Human Computation in Practice[Coffee, Demos &amp;amp; Posters, all in parallel]&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;In the next session, we had a set of very interesting demos and posters. I did not have time to see them all, so find below my notes about each one of them,&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Frontiers of a Paradigm -- Exploring Human Computation with Digital Games&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;Markus Krause (University of Bremen)&lt;br /&gt;Aneta Takhtamysheva (University of Bremen)&lt;br /&gt;Marion Wittstock (University of Bremen)&lt;br /&gt;Rainer Malaka (University of Bremen)&lt;br /&gt;&lt;br /&gt;Markus had a very interesting game, for discovering synonyms and antonyms. You control a spaceship, and you try to shoot down the antonyms of the word given to you, and you try to collect the synonyms. This was a real arcade game, with graphics, collision detection, and so on. Markus mentioned that he writes it in Flash, because it is fast, and because there are websites where you post your game for people to procrastinate, and then you get effortlessly users. He routinely gets 3 million players a month. Even very simply games (e.g., click the boxes) get 5000 users to play them.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;GiveALink Tagging Game: An Incentive for Social Annotation&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;Li Weng (Indiana University)&lt;br /&gt;Filippo Menczer (Indiana University)&lt;br /&gt;&lt;br /&gt;This was a game to find words/tags that will take you from one page to another, essentially uncovering the semantic connections between pages.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Crowdsourcing Participation Inequality: A SCOUT Model for the Enterprise Domain&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;Osamuyimen Stewart (IBM Research)&lt;br /&gt;David Lubensky (IBM Research)&lt;br /&gt;Juan M. Huerta (IBM Research)&lt;br /&gt;Julie Marcotte (IBM GBS)&lt;br /&gt;Cheng Wu (IBM Research)&lt;br /&gt;Andrzej Sakrajda (IBM Research)&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Mutually Reinforcing Systems&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;John Ferguson (University of Glasgow)&lt;br /&gt;Marek Bell (University of Glasgow)&lt;br /&gt;Matthew Chalmers (University of Glasgow)&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;A Note on Human Computation Limits&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;Paul Rohwer (Indiana University)&lt;br /&gt;&lt;br /&gt;This a case study of two attempts to crowdsource writing a novel. The &amp;nbsp;first attempt by Penguin Books and De Montfort University used a wiki&amp;nbsp;to crowd source a novel. The result was a failure. No organization, disconnected elements, incoherent result. When BBC attempted the same a couple of years later, the result was a success. The difference? BBC assigned a curator, who overlooked the process. Lesson? Any attempt to harness the wisdom of the crowds needs a reliable aggregator that will kick out the junky contributors and their contributions, keeping only the good contributions from the crowd.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Reconstructing the World in 3D: Bringing Games with a Purpose Outdoors&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;Kathleen Tuite (University of Washington)&lt;br /&gt;Noah Snavely (Cornell University)&lt;br /&gt;Dun-Yu Hsiao (University of Washington)&lt;br /&gt;Adam Smith (UC Santa Cruz)&lt;br /&gt;Zoran Popovic (University of Washington)&lt;br /&gt;&lt;br /&gt;Interesting real-life game: The goal is to cover and create a 3D reconstruction of a city. Players get points when they go out, take a photo, and cover a part of a city/building that was not covered before. Using the images, they can reconstruct in 3D the buildings without gaping holes.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Improving Music Emotion Labeling Using Human Computation&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;Brandon G. Morton (Drexel University)&lt;br /&gt;Jacquelin A. Speck (Drexel University)&lt;br /&gt;Erik M. Schmidt (Drexel University)&lt;br /&gt;Youngmoo E. Kim (Drexel University)&lt;br /&gt;&lt;br /&gt;A game in which you listen to a song and try to guess the tempo and sentiment of the song, and agree with a co-listener. There is a continuous, intermittent feedback about the choice of the other player. The player that moves first to the agreed location gets extra points as influencer. I make it sound more complicated that it seems. I played it and it was very very intuitive and easy to play.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Webpardy: Harvesting QA by HC&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;Hidir Aras (University of Bremen)&lt;br /&gt;Markus Krause (University of Bremen)&lt;br /&gt;Andreas Haller (University of Bremen)&lt;br /&gt;Rainer Malaka (University of Bremen)&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Measuring Utility of Human-Computer Interaction&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;Michael Toomim (University of Washington)&lt;br /&gt;James A. Landay (University of Washington)&lt;br /&gt;&lt;br /&gt;A very interesting study about how the design of a HIT can influence participation. They change HIT parameters (price/design/etc) and examine for how long users will keep doing HITs. Reminded me a little bit of Dan Ariely's work on how motivation affects desire to work on a task.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Translation by Iterative Collaboration between Monolingual Users&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;Chang Hu (University of Maryland)&lt;br /&gt;Benjamin B. Bederson (University of Maryland)&lt;br /&gt;Philip Resnik (University of Maryland)&lt;br /&gt;&lt;br /&gt;A demo that showed how two monolingual humans can collaborate to translate a document. They start with a human translation, and the human examines which part of the human translation do not make sense. After rephrasing and sending back (again through machine translation), the other human check if the translation makes sense and whether it corresponds to the original sentence that was translated. What I missed was how users can get motivated to participate in this system.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;11:00am, Session 3: Task and Process Design&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Sentence Recall Game: A Novel Tool for Collecting Data to Discover Language Usage Patterns&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;Jun Wang (Syracuse University) ****&lt;br /&gt;Bei Yu (Syracuse University)&lt;br /&gt;&lt;br /&gt;This game worked as follows: The user looks at a sentence, then the sentence disappears, and the user has to type the sentence again. Typically people cannot retype the exact sentence but type something similar. The main outcome is that through this game we can discover paraphrases and (especially when played by non-natives) typical mistakes in specific language constructs.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Word Sense Disambiguation via Human Computation&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;Nitin Seemakurty (Carnegie Mellon University)&lt;br /&gt;Jonathan Chu (Carnegie Mellon University)&lt;br /&gt;Luis von Ahn (Carnegie Mellon University)&lt;br /&gt;Anthony Tomasic (Carnegie Mellon University) *****&lt;br /&gt;&lt;br /&gt;The goal of this game is to disambiguate words (e.g., think of the different meanings of "bass" in "I can hear bass sounds" and "I like grilled bass"). The idea follows the ESP game, and asks users to type alternate words for the given underlined word in a phrase. If two people agree, then move on. Taboo words appear when their usage does not allow the disambiguation of a word (e.g., the word is associated with two senses). The experimental results clearly showed the fact that users are learning over time and perform better.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Quality Management on Amazon Mechanical Turk&amp;nbsp;&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;Panagiotis G. Ipeirotis (New York University)&lt;br /&gt;Foster Provost (New York University)&lt;br /&gt;Jing Wang (New York University)&lt;br /&gt;&lt;br /&gt;For many tasks on Mechanical Turk, there are spammers submitting wrong results. Using repeated labeling and an algorithm like Dawid and Skene, we can estimate the error rates of the workers. The question is, can we infer from the confusion matrixes who is a spammer? Error rate alone is not enough: Spammers that put everything in the majority class have lower rates than honest but imperfect workers. Also, biased workers who are systematically off (e.g., more conservative or more liberal than other workers) end up having very high error rates. The solution is to compensate for the errors and see how the assigned class looks like after compensating for the errors. If the corrected labels are concentrated in one class, the worker is good. If they are spread across all classes, the worker is bad.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Exploring Iterative and Parallel Human Computation Processes&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Greg Little (MIT) ****&lt;br /&gt;Lydia B. Chilton (University of Washington)&lt;br /&gt;Max Goldman (MIT CSAIL)&lt;br /&gt;Robert C. Miller (MIT CSAIL)&lt;br /&gt;&lt;br /&gt;The TurkIt toolkit introduced the idea of iterative tasks, introducing the ideas of iterative elimination voting, the idea of iternative tasks in which workers build on each others results, and so on. This paper examines the outcomes of different task designs.&amp;nbsp;Basic question: Does it make sense to run tasks in parallel, or does it make sense to let workers build on each other's results?&amp;nbsp;For description of images, iterative tends to be better, as people really build on each other's results. Similarly for transcriptions of highly noisy results.&amp;nbsp;However, for tasks with shorter answers (e.g., coming up with company names) there is an interesting tradeoff:&amp;nbsp;Iterative process tends to have higher average, but parallel has higher variance. If you are interested in the &lt;b&gt;max &lt;/b&gt;and not in the average rating of the responses, then parallel is better. Iterative will find the consensus, but it will not be great. Parallel will generate some disasters, but also some gems.&amp;nbsp;So if the goal is to find the "best", then parallel processes (i.e., independence) should work best. However, if you are afraid of disastrous outcomes, then workers should interact to eliminate outliers.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Toward Automatic Task Design: A Progress Report&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;Eric Huang (Harvard University)&lt;br /&gt;Haoqi Zhang (Harvard University) ****&lt;br /&gt;David C. Parkes (Harvard University)&lt;br /&gt;Krzysztof Z. Gajos (Harvard University)&lt;br /&gt;Yiling Chen (Harvard University)&lt;br /&gt;&lt;br /&gt;The final talk of the workshop focused on optimizing task design, an area that I see as having significant potential for follow-up work. The basic question asked is: How should we design optimally a task for crowdsourcing, given a set of constraints? What will generate best quality? What design aspects will&amp;nbsp;improve&amp;nbsp;speed? In a sense, how can we start moving crowdsourcing from an ad-hoc execution, into a mode in which we specify the task, and a black box optimizer selects all the appropriate aspects of the design for us. The paper gave some first results on predicting the quality and quantity of tags assigned to an image and showed that designs that are predicted to be optimal before execution indeed perform much better than designs that are suboptimal.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;12:00noon, Concluding Remarks&lt;/b&gt;&lt;br /&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;Yours truly, at the end, was assigned with the task of coming up with conclusions and describing the overall themes. I think the keyword is this workshop was "Design". Design for individual tasks (either games or MTurk HITs), and design of processes in handling such crowdsources tasks in marketplaces. One theme that I would have liked to see more is incentive designs to motivate people to participate and contribute. But I was very happy overall.&lt;br /&gt;&lt;br /&gt;After the concluding remarks there was some discussion of quality control and examining the robustness of crowdsourcing systems to manipulation attacks. While we have no definite answers on how to guarantee protection from coordinated attacks in the absence of ground truth, in the current settings we rarely see extensive collusion and coordination across attackers. Most of the current spammers are there to make an easy buck, and not to spend extensive amounts of time trying to scam pennies. (There are better targets for that.)&lt;br /&gt;&lt;br /&gt;Of course, having ground truth for verification of answers and for worker evaluation can help significantly in that respect: Luis von Ahn mentioned the attack on &lt;a href="http://www.google.com/recaptcha"&gt;reCAPTCHA&lt;/a&gt; from the 4chan clan, which randomly entered the word "penis" as one of the two words, hoping to fill in the digitizations of books with the word "penis". (Given that they were failing in 50% of the attempts, it was easy to isolate them and remove their entries.)&lt;br /&gt;&lt;br /&gt;The problem of handling a completely anonymous crowd, without any ground truth knowledge, and getting good results is hard to solve. Perhaps some security people will need to take a look and examine the theoretical guarantees.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-6438097227079528720?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/6438097227079528720'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/6438097227079528720'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/07/liveblogging-from-hcomp-2010.html' title='Liveblogging from HCOMP 2010'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-1707217683930092303</id><published>2010-07-04T05:51:00.000-07:00</published><updated>2011-04-17T19:00:19.305-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><title type='text'>Detecting Spammers on Mechanical Turk, Part II</title><content type='html'>In my &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/06/detecting-spammers-on-mechanical-turk.html"&gt;previous post&lt;/a&gt;, I gave a brief overview of different techniques used to improve the quality of the results on Amazon Mechanical Turk. The main outcome of these techniques is a matrix that describes the error rate of each worker.&lt;br /&gt;&lt;br /&gt;For example, consider the task of categorizing webpages as porn or not. We have three target categories:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;G-rated: Pages appropriate for a general audience, children included.&lt;/li&gt;&lt;li&gt;PG-rated: Pages with adult themes but without any explicit sexual scenes, appropriate only for children above 13&lt;/li&gt;&lt;li&gt;R-rated: Pages with content appropriate for adults only.&lt;/li&gt;&lt;/ul&gt;In this case, the confusion matrix of a worker, inferred using the techniques described in my &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/06/detecting-spammers-on-mechanical-turk.html"&gt;previous post&lt;/a&gt;, would look like:&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;$ \left( \begin{array}{ccc} Pr[G \rightarrow G] &amp;amp; Pr[G \rightarrow PG] &amp;amp; Pr[G \rightarrow R] \\ Pr[PG \rightarrow G] &amp;amp; Pr[PG \rightarrow PG] &amp;amp; Pr[PG \rightarrow R] \\ Pr[R \rightarrow G] &amp;amp; Pr[R \rightarrow PG] &amp;amp; Pr[R \rightarrow R] \end{array} \right) $&lt;/div&gt;&lt;br /&gt;where $Pr[X \rightarrow Y]$ is the probability that the worker will give the answer $Y$ when given a question where the correct answer is $X$. (The sum of the elements in each line should sum up to 1.)&lt;br /&gt;&lt;br /&gt;And here is the question that seems trivially easy: Given the confusion matrix, how can we detect the spammers?&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Computing the Error Rate&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;The simple answer is: Just sum up the elements out of the diagonal! Since every non-diagonal element corresponds to an error, if the sum is high, the worker is a spammer.&amp;nbsp;Of course, this ignores the fact that class priors will often differ. So, instead of giving equal weights to each category, we weight the errors according to the class priors (i.e., how often we expect to see each &lt;i&gt;correct &lt;/i&gt;answer). &lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;$\\ Pr[G] (Pr[G \rightarrow PG] + Pr[G \rightarrow R]) + \\ Pr[PG] (Pr[PG \rightarrow G] + Pr[PG \rightarrow R]) + \\ Pr[R] (Pr[R \rightarrow G] + Pr[R \rightarrow PG]) $&lt;/div&gt;&lt;br /&gt;For example, if in confusion matrix is&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;$\left( \begin{array}{ccc} 0.5 &amp;amp; 0.3 &amp;amp; 0.2 \\ 0.2 &amp;amp; 0.6 &amp;amp; 0.2 \\ 0.1 &amp;amp; 0.1 &amp;amp; 0.8 \end{array} \right) $&lt;/div&gt;&lt;br /&gt;and the class priors are 80% $G$, 15% $PG$, and 5% $R$, then the weighted error rate is&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;$80\% \cdot  (0.3 + 0.2) + 15\% \cdot (0.2 + 0.2) + 5\% \cdot  (0.1 + 0.1) = 0.47$&lt;/div&gt;&lt;br /&gt;Notice that the error rates for the first line, which correspond to category $G$, got weighted more heavily.&lt;br /&gt;&lt;br /&gt;Unfortunately, this method does not work very well.&lt;br /&gt;&lt;br /&gt;When we started using this technique, we ended up marking legitimate workers as spammers (false positives), and classifying spammers as legitimate workers (false negatives). Needless to say, both mistakes were hurting us. Legitimate workers were complaining and (understandably) badmouthing us, and spammers kept polluting the results.&lt;br /&gt;&lt;br /&gt;Let me give some more details on how such errors appear.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;False Negatives: Strategic Spammers and Uneven Class Priors&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Spammers on Mechanical Turk are often smart &lt;i&gt;and&lt;/i&gt; lazy. They will try to submit answers that seem legitimate but without spending too much time. (Otherwise, they may as well do the work :-)&lt;br /&gt;&lt;br /&gt;In our case, we were categorizing sites as porn or not. Most of the time the sites were not porn, and only 10%-20% of the time we had sites that were falling into one of the porn categories. Some workers noticed this fact, and realized that they could keep their error rate low by simply classifying&amp;nbsp;everything&amp;nbsp;as not-porn.&lt;br /&gt;&lt;br /&gt;Following the standard way of computing an error rate, these workers were faring much better than legitimate workers that were misclassifying some of the not-porn sites.&lt;br /&gt;&lt;br /&gt;Here is an illustration. With three categories (G-, PG13-, and R-rated), the confusion matrix for a spammer looks like this:&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;$\left( \begin{array}{ccc} 1.0 &amp;amp; 0.0 &amp;amp; 0.0 \\ 1.0 &amp;amp; 0.0 &amp;amp; 0.0 \\ 1.0 &amp;amp; 0.0 &amp;amp; 0.0 \end{array} \right) $&lt;/div&gt;&lt;br /&gt;With class priors are 80% in G, 15% in PG13, and 5% in R, the weighted error rate of the spammer is:&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;$80\% \cdot (0.0 + 0.0) + 15\% \cdot  (1.0 + 0.0) + 5\% \cdot (1.0 + 0.0) = 0.2$&lt;/div&gt;&lt;br /&gt;Compare this overall error rate with the one of a legitimate worker, with rather modest error rates:&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;$\left( \begin{array}{ccc} 0.8 &amp;amp; 0.2 &amp;amp; 0.0 \\ 0.1 &amp;amp; 0.8 &amp;amp; 0.1 \\ 0.0 &amp;amp; 0.25 &amp;amp; 0.75 \end{array} \right) $&lt;/div&gt;&lt;br /&gt;The error rate for this legitimate worker is:&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;$80\% \cdot (0.2 + 0.0) + 15\% \cdot (0.1 + 0.1) + 5\% \cdot (0.25 + 0.0) = 0.2025$&lt;/div&gt;&lt;br /&gt;Yes, the legitimate worker appears to be worse than the spammer!&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;False Positives: Biased Workers&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;The second type of error is when we classify honest workers as spammers. Interestingly enough, when we started evaluating workers, the top "spammers" ended up being members of the internal team. Take a look at the error rate of this worker:&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;$\left( \begin{array}{ccc} 0.35 &amp;amp; 0.65 &amp;amp; 0.0 \\ 0.0 &amp;amp; 0.0 &amp;amp; 1.0 \\ 0.0 &amp;amp; 0.0 &amp;amp; 1.0 \end{array} \right) $&lt;/div&gt;&lt;br /&gt;The error rate is:&lt;br /&gt;&lt;br /&gt;&lt;div style="text-align: center;"&gt;$80\% \cdot (0.6 + 0.0) + 15\% \cdot  (0.0 + 1.0) + 5\% \cdot (0.0 + 0.0) = 0.67$&lt;/div&gt;&lt;br /&gt;The error rate would imply that this worker is essentially random. A clear case of a worker that should be banned.&lt;br /&gt;&lt;br /&gt;After a careful inspection though, you can see that this is not the case. This is the confusion matrix of a worker that tends to be much more conservative than others and classifies 65% of the "G" pages as "PG13". Similarly, all the pages tat are in reality "PG13" are classified as "R".&amp;nbsp;(This worker was a parent with young children and was much more strict on what content would pass as "G" vs "PG13.)&lt;br /&gt;&lt;br /&gt;In a sense, this is a a pretty careful worker!&amp;nbsp;Even though this worker does mix up R and PG13 pages, there is a very clear separation between G and PG13/R pages.&amp;nbsp;Still the error rate alone would put this worker very clearly in the spam category.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Solution: Examine Ambiguity not Errors&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;You will notice that one thing that separates spammers from legitimate workers is the information provided by their answers. A spammer that gives the same reply all the time does not give us any information.&amp;nbsp;In contrast, when the biased worker gives the answer "PG13", we know that this corresponds to a page that in reality belongs to the "G" class. Even if the answer is wrong, we can always guess the correct answer!&lt;br /&gt;&lt;br /&gt;So, by "reversing" the errors, we can see how ambiguous are the answers of a worker, and use this information to decide whether to reject a worker or not.&lt;br /&gt;&lt;br /&gt;You can find more details about the process in our HCOMP 2010 paper "&lt;a href="http://pages.stern.nyu.edu/~panos/publications/hcomp2010.pdf"&gt;Quality Management on Amazon Mechanical Turk&lt;/a&gt;".&lt;br /&gt;&lt;br /&gt;You can also find a demo of the algorithm at&amp;nbsp;&lt;a href="http://qmturk.appspot.com/"&gt;http://qmturk.appspot.com/&lt;/a&gt;&amp;nbsp;and you can plug your own data to see how it works. The code will take as input the responses of the workers, the misclassification costs, and the "gold" data points, if you have any. The demo returns the confusion matrix for each worker, and the estimated "cost" of each worker. The output is plain text and kind of ugly but you can find what you need.&lt;br /&gt;&lt;br /&gt;The code is also open source and available at&amp;nbsp;&lt;a href="http://code.google.com/p/get-another-label/"&gt;Google Code&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;If you have any questions, just drop me an email!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-1707217683930092303?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/1707217683930092303'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/1707217683930092303'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/07/detecting-spammers-on-mechanical-turk.html' title='Detecting Spammers on Mechanical Turk, Part II'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-5453146053459365353</id><published>2010-06-02T20:08:00.000-07:00</published><updated>2011-04-17T19:00:19.310-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='spam'/><title type='text'>Detecting Spammers on Mechanical Turk, Part I</title><content type='html'>Anyone who has tried to use Mechanical Turk for any large scale task knows what is the biggest problem: Spammers!&lt;br /&gt;&lt;br /&gt;Many people try initially to post small tasks on Turk and are amazed to get back high-quality results of high quality, very quickly, and with very small cost. Then, they post similar, but much larger, tasks and they realize that now the results are pretty much crap. Spammers start giving random answers, and since they spend no effort, they can easily be the most prolific workers! For example, in the &lt;a href="http://blog.doloreslabs.com/wp-content/uploads/2008/09/amt_emnlp08_final.pdf"&gt;Snow et al. 2008 paper&lt;/a&gt;, almost&amp;nbsp;&lt;a href="http://lingpipe-blog.com/2008/09/19/filtering-out-bad-annotators/"&gt;40% of the submissions came from spam workers&lt;/a&gt;, according to Bob Carpenter's analysis.&lt;br /&gt;&lt;br /&gt;Unfortunately, this problem is widespread, and tends to discourage new requesters. People just want to get things done on Mechanical Turk, they do not want to spend their time fighting spammers.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Spam Protection Measures: Qualification tests and "best of 3" policy&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Qualification tests: &lt;/b&gt;Some people, to avoid spammers, use qualification tests. While some measures, such as qualification tests help, they are not a panacea. Determined spammers simply pass the qualification test, and proceed with spamming. Nevertheless, it is a good idea to have a qualification test, as it also tends to filter out workers that come and complete only a couple of HITs. The main disadvantage is that you need programming or use of the command-line tools to create and require a qualification test (i.e., the web interface is not enough).&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Best of Three:&lt;/b&gt; Other requesters use a "best of 3" (or "2+1") policy: if two workers agree on their answer, the answer is accepted. This technique is relatively widespread across seasoned requesters (&lt;a href="http://turkers.proboards.com/index.cgi?board=shame&amp;amp;action=display&amp;amp;thread=4789&amp;amp;page=1"&gt;and drives some legitimate workers crazy&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;Why do legitimate workers hate this verification mechanism? Because every time a worker does not agree with the other two, the submission gets rejected. This drops the acceptance ratio, an important reputation characteristics on Mechanical Turk.&lt;br /&gt;&lt;br /&gt;Interestingly, the "best of 3" strategy led to an interesting counter-strategy from spammers: they come in pairs and vote concurrently at the same task, giving the same votes. This defeats the simple "best of 3" approach. &lt;i&gt;(&lt;/i&gt;&lt;b&gt;&lt;i&gt;Update&lt;/i&gt;&lt;/b&gt;&lt;i&gt;: Josh &lt;/i&gt;&lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/06/detecting-spammers-on-mechanical-turk.html?showComment=1275660742549#c3795231787500348058"&gt;&lt;i&gt;in the comments&lt;/i&gt;&lt;/a&gt;&lt;i&gt; suggests a good strategy to avoid this: Randomize the HITs by picking examples from a pool, instead of having "static" HITs. This is a generally good suggestion, even if someone uses the more sophisticated tools described below.)&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;While the "best of 3" approach ensures some basic quality of the data, it does not work well in the long run. Legitimate workers tend to avoid such HITs, as their approval rating can plummet if there are a couple of spammers lurking around. Also, legitimate workers get penalized too severely for honest mistakes.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;&lt;i&gt;Spam Protection Measures: Using "gold standard" data and/or unsupervised EM algorithm&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;A better approach for detecting the quality of the workers is to perform the analysis over a larger set of HITs, rather than operating on single HITs, as "best of 3" does.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Using gold standard&lt;/b&gt;: One approach, followed often, is to slip into the stream of HITs some tasks with known correct answers. By comparing the answer given by the worker with the correct answer, we can compute the "confusion matrix" for each worker. In other words, we can see how often a worker gives answer B when the correct answer is A, how often workers says B when correct answer is B, and so on.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Using unsupervised, expectation maximization algorithm: &lt;/b&gt;Using gold standard works well when we have in our disposal plenty of such "gold" cases. Unfortunately, this is not always possible, especially when we want to post new tasks frequently. In this case, it makes sense to use the&lt;a href="http://www.jstor.org/pss/2346806"&gt; iterative expectation-maximization algorithm proposed by Dawid and Skene&lt;/a&gt;:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Initialize the error-rates of the workers (e.g., assume all workers are perfect)&lt;/li&gt;&lt;li&gt;Initialize the most likely class for each example (e.g., using majority vote)&lt;/li&gt;&lt;li&gt;Compute the confusion matrix (aka error rates) for each worker: Compare the answers given by the worker with the estimated correct answers.&lt;/li&gt;&lt;li&gt;Compute the most likely class for each example: Use the answers given by the workers, and weight the answers using the confusion matrix for each worker.&lt;/li&gt;&lt;li&gt;Go to step 3, until convergence&lt;/li&gt;&lt;/ol&gt;&lt;div&gt;Interestingly enough, &lt;i&gt;the supervised and unsupervised techniques can be naturally combined&lt;/i&gt;: In step 4, for all the examples for which we know the correct answer, we simply do not update the class label. Trivially easy. (In a dual fashion, we can also incorporate trusted workers by not updating the error rates in step 3.) By providing some known data, the algorithm can actually estimate with less data the quality of the workers and the correct labels for the unknown examples.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The outcome of these algorithms is:&lt;/div&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;The correct label for each example&lt;/li&gt;&lt;li&gt;The confusion matrix for each worker&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;If you want to try this algorithm, go to our demo at&amp;nbsp;&lt;a href="http://qmturk.appspot.com/"&gt;http://qmturk.appspot.com/&lt;/a&gt;&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;As the output of this process, we know for each worker its corresponding confusion matrix. So, for a task with 3 possible answers, A, B, C, the confusion matrix would look like:&lt;br /&gt;&lt;br /&gt;&lt;div&gt;$ \left( \begin{array}{ccc} Pr[A \rightarrow A] &amp;amp; Pr[A \rightarrow B] &amp;amp; Pr[A \rightarrow C] \\ Pr[B \rightarrow A] &amp;amp; Pr[B \rightarrow B] &amp;amp; Pr[B \rightarrow C] \\ Pr[C \rightarrow A] &amp;amp; Pr[B \rightarrow C] &amp;amp; Pr[C \rightarrow C] \end{array} \right) $&lt;/div&gt;&lt;br /&gt;where $Pr[A \rightarrow B]$ is the probability that the worker will classify an object that belongs to class A into class B. Of course, a perfect worker will have a confusion matrix that is perfectly diagonal&lt;br /&gt;&lt;br /&gt;$ \left( \begin{array}{ccc} Pr[A \rightarrow A] = 1 &amp;amp; Pr[A \rightarrow B] = 0 &amp;amp; Pr[A \rightarrow C] = 0 \\ Pr[B \rightarrow A] = 0 &amp;amp; Pr[B \rightarrow B] = 1  &amp;amp; Pr[B \rightarrow C] = 0 \\ Pr[C \rightarrow A] =0 &amp;amp; Pr[B \rightarrow C] = 0 &amp;amp; Pr[C \rightarrow C] = 1 \end{array} \right) $&lt;br /&gt;&lt;br /&gt;Interestingly enough, the opposite question is not trivial to answer: How can we identify spammers, by examining their confusion matrices? The answer &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/07/detecting-spammers-on-mechanical-turk.html"&gt;in the next blog post&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-5453146053459365353?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/5453146053459365353'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/5453146053459365353'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/06/detecting-spammers-on-mechanical-turk.html' title='Detecting Spammers on Mechanical Turk, Part I'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-5236788910664504095</id><published>2010-05-28T21:35:00.000-07:00</published><updated>2011-04-17T19:00:19.313-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='hcomp'/><title type='text'>Accepted Papers for HCOMP 2010</title><content type='html'>The list of accepted papers for&lt;a href="http://hcomp.info/HComp2010/"&gt; HCOMP 2010&lt;/a&gt; is now available. We received a total of 35 submissions (18 long paper submissions, 11 short paper submissions and 6 demo submissions). This was a healthy increase compared to last years numbers (12 long, 9 short, 9 demos).&lt;br /&gt;&lt;br /&gt;Unfortunately, due to the half-day duration of the workshop, we had space for only 8 papers (4 long and 4 short).&amp;nbsp;We really had to make very hard decisions to fit as many papers as possible in the program. We reduced the length of the talks, "downgraded" some papers to short, some others to posters, and at the end of the day we even had to reject papers that had no reject ratings!&amp;nbsp;Quite a few of the rejected papers were interesting and I would love the opportunity to talk to the authors about their work. Let's hope that next time we will manage to get a full day for the workshop in order to&amp;nbsp;accommodate&amp;nbsp;the demand. Having acceptance rates of ~25% for a workshop is simply too low.&lt;br /&gt;&lt;br /&gt;I expect to see a very exciting program! If you can be in DC on July 25th, you should make the effort to come to the workshop!&lt;br /&gt;&lt;br /&gt;Special thanks go to &lt;a href="http://research.microsoft.com/en-us/people/ramanc/"&gt;Chandra&lt;/a&gt; for running the show this year!&lt;br /&gt;&lt;br /&gt;Below, I list the accepted papers, with links to the unofficial versions that the authors posted on their websites. Once we have the camera-ready versions available, I will put the proper links.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Long Papers&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://www.eecs.harvard.edu/~hq/papers/turk.pdf"&gt;Toward Automatic Task Design: A Progress Report&lt;/a&gt;&lt;br /&gt;Eric Huang, Haoqi Zhang, David Parkes, Krzysztof Gajos, Yiling Chen&lt;br /&gt;&lt;i&gt;Abstract:&amp;nbsp;A central challenge in human computation is in understanding how to design task environments that effectively attract participants and coordinate the problem solving process. In this paper, we consider a common problem that requesters face on Amazon Mechanical Turk: how should a task be designed so as to induce good output from workers? In posting a task, a requester decides how to break down the task into unit tasks, how much to pay for each unit task, and how many workers to assign to a unit task. These design decisions affect the rate at which workers complete unit tasks, as well as the quality of the work that results. Using image labeling as an example task, we consider the problem of designing the task to maximize the number of quality tags received within given time and budget constraints. We consider two different measures of work quality, and construct models for predicting the rate and quality of work based on observations of output to various designs. Preliminary results show that simple models can accurately predict the quality of output per unit task, but are less accurate in predicting the rate at which unit tasks complete. At a fixed rate of pay, our models generate different designs depending on the quality metric, and optimized designs obtain significantly more quality tags than baseline comparisons.&lt;/i&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://glittle.org/Papers/CHI2010.pdf"&gt;Exploring Iterative and Parallel Human Computation Processes&lt;/a&gt;&lt;br /&gt;Greg Little, Lydia Chilton, Max Goldman, Robert Miller&lt;br /&gt;&lt;i&gt;Abstract:&amp;nbsp;Services like Amazon's Mechanical Turk have opened the door for exploration of processes that outsource computation to humans. These human computation processes hold tremendous potential to solve a variety of problems in novel and interesting ways. However, we are only just beginning to understand how to design such processes. This paper explores two basic approaches: one where workers work alone in parallel and one where workers iteratively build on each other's work. We present a series of experiments exploring tradeoffs between each approach in several problem domains: writing, brainstorming, and transcription. In each of our experiments, iteration increases the average quality of responses. The increase is statistically significant in writing and brainstorming. However, in brainstorming and transcription, it is not clear that iteration is the best overall approach, in part because both of these tasks benefit from a high variability of responses, which is more prevalent in the parallel process. Also, poor guesses in the transcription task can lead subsequent workers astray.&lt;/i&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://dl.dropbox.com/u/420874/search%20-%20final.pdf"&gt;Task Search in a Human Computation Market&lt;/a&gt;&lt;br /&gt;Lydia Chilton, John Horton, Robert Miller, Shiri Azenkot&lt;br /&gt;&lt;i&gt;Abstract:&amp;nbsp;&amp;nbsp;"In order to understand how a labor market for human computation &amp;nbsp;functions, it is important to know how workers search for tasks. This &amp;nbsp;paper uses two complementary methods to gain insight into how workers &amp;nbsp;search for tasks on Mechanical Turk. &amp;nbsp;First, we perform a high &amp;nbsp;frequency scrape of 36 pages of search results and analyze it by &amp;nbsp;looking at the rate of disappearance of tasks across key ways &amp;nbsp;Mechanical Turk allows workers to sort tasks. Second, we present the &amp;nbsp;results of a survey in which we paid workers for self-reported &amp;nbsp;information about how they search for tasks. &amp;nbsp;Our main findings are &amp;nbsp;that on a large scale, workers sort by which tasks are most recently &amp;nbsp;posted and which have the largest number of tasks available. &amp;nbsp;Furthermore, &amp;nbsp;we find &amp;nbsp;that workers look mostly at the first page of the most recently posted &amp;nbsp;tasks and the first two pages of the tasks with the most available &amp;nbsp;instances but in both categories the position on the result page is &amp;nbsp;unimportant to workers. &amp;nbsp;We observe that &amp;nbsp;at least some employers try to manipulate the position of their task in the search results to &amp;nbsp;exploit the tendency to search for recently posted tasks. &amp;nbsp;On an individual level, we observed &amp;nbsp;workers searching by almost all the possible categories and looking &amp;nbsp;more than 10 pages deep. &amp;nbsp; For a task we posted to Mechanical Turk, we confirmed that a favorable position in the search &amp;nbsp; results do matter: our task with favorable positioning was completed 30 times faster and for less money than when its position was unfavorable. &amp;nbsp; &amp;nbsp;"&amp;nbsp;&lt;/i&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://kochhar.org/f/pubs/hcomp10-anatomy.pdf"&gt;The Anatomy of a Large-Scale Human Computation Engine&lt;/a&gt;&lt;br /&gt;Shailesh Kochhar, Stefano Mazzocchi, Praveen Paritosh&lt;br /&gt;&lt;i&gt;Abstrat:&amp;nbsp;In this paper we describe RABJ, an engine designed to simplify collecting human input. We have used RABJ to collect over 2.3 million human judgments to augment data mining, data entry, data validation and curation problems at Freebase over the course of a year. We illustrate several successful applications that have used RABJ to collect human judgment. &amp;nbsp;We describe how the architecture and design decisions of RABJ are affected by the constraints of content agnosticity, data freshness, latency and visibility. We present work aimed at increasing the yield and reliability of human computation efforts. Finally, we discuss empirical observations and lessons learned in the course of a year of operating the service.&lt;/i&gt;&lt;/li&gt;&lt;/ul&gt;&lt;b&gt;Short Papers&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Sellers' problems in human computation markets&lt;/b&gt;&lt;br /&gt;M. Six Silberman, Joel Ross, Lilly Irani, Bill Tomlinson&lt;br /&gt;&lt;i&gt;Abstract:&amp;nbsp;"Tools for human computers" is an underexplored design space in human computation research, which has focused on techniques for buyers of human computation rather than sellers. We characterize the sellers in one human computation market, Mechanical Turk, and describe some of the challenges they face. We list several projects developed to approach these problems, and conclude with a list of open questions relevant to sellers, buyers, and researchers.&lt;/i&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Sentence Recall Game: A Novel Tool for Collecting Data to Discover Language Usage Patterns&lt;/b&gt;&lt;br /&gt;Jun Wang, Bei Yu&lt;br /&gt;&lt;i&gt;Abstract:&amp;nbsp;Recently we ran a simple memory test experiment, called sentence recall, in which participants were asked to recall sentences that they had just seen on the screen. &amp;nbsp;Many participants, especially non-native English speakers, made various deviations in their recalled sentences. &amp;nbsp;Some deviations represent alternative ways to express the same meaning, but others suggest that there are missing pieces in the participants' language knowledge. &amp;nbsp;The deviation data, on the one hand, can provide individual users valuable feedback on their language usage patterns that they may never notice, on the other hand, can be used as training data for automatically discovering language usage patterns in a subpopulation of language learners. &amp;nbsp;This paper presents our attempts to create an enjoyable sentence recall game for collecting a large amount of deviation data. &amp;nbsp;Our results show that the game is fun to play and the collected deviation data can reveal common language usage patterns among non-native speakers.&amp;nbsp;&lt;/i&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://pages.stern.nyu.edu/~panos/publications/hcomp2010.pdf"&gt;Quality Management on Amazon Mechanical Turk&lt;/a&gt;&lt;br /&gt;Panagiotis Ipeirotis, Jing Wang, Foster Provost&lt;br /&gt;&lt;i&gt;Abstract:&amp;nbsp;Crowdsourcing services, such as Amazon Mechanical Turk, allow for easy distribution of small tasks to a large number of workers. Unfortunately, since manually verifying the quality of the submitted results is hard, &amp;nbsp;malicious workers often take advantage of the verification difficulty and submit answers of low quality. Currently, most requesters rely on redundancy to identify the correct answers. However, redundancy is not a panacea. Massive redundancy is expensive, increasing significantly the cost of crowdsourced solutions. Therefore, we need techniques that will accurately estimate the quality of the workers, allowing for the rejection and blocking of the low-performing workers and spammers. However, existing techniques cannot separate the true (unrecoverable) error rate from the (recoverable) biases that some workers exhibit. This lack of separation leads to incorrect assessments of a worker's quality. We present algorithms that improve the existing state-of-the-art techniques, enabling the separation of bias and error. Our algorithm generates a scalar score representing the inherent quality of each worker. We illustrate how to incorporate cost-sensitive classification errors in the overall framework and how to seamlessly integrate unsupervised and supervised techniques for inferring the quality of the workers. &amp;nbsp;We present experimental results demonstrating the performance of the proposed algorithm under a variety of settings.&amp;nbsp;&lt;/i&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Human Computation for Word Sense Disambiguation&lt;/b&gt;&lt;br /&gt;Nitin Seemakurty, Jonathan Chu, Luis von Ahn, Anthony Tomasic&lt;br /&gt;&lt;i&gt;Abstract:&amp;nbsp;One formidable problem in language technology is the word sense disambiguation (WSD) problem: disambiguating the true sense of a word as it occurs in a sentence (e.g., recognizing whether the word "bank" refers to a river bank or to a financial institution). This paper explores a strategy for harnessing the linguistic abilities of human beings to develop datasets that can be used to train machine learning algorithms for WSD. To create such datasets, we introduce a new interactive system: a fun game designed to produce valuable output by engaging human players in what they perceive to be a cooperative task of guessing the same word as another player. Our system makes a valuable contribution by tackling the knowledge acquisition bottleneck in the WSD problem domain. Rather than using conventional and costly techniques of paying lexicographers to generate training data for machine learning algorithms, we delegate the work to people who are looking to be entertained.&lt;/i&gt;&lt;/li&gt;&lt;/ul&gt;&lt;b&gt;Poster/Demos&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Frontiers of a Paradigm: Exploring Human Computation with Digital Games&lt;/b&gt;&lt;br /&gt;Markus Krause&lt;/li&gt;&lt;li&gt;&lt;b&gt;GiveALink Tagging Game: An Incentive for Social Annotation&lt;/b&gt;&lt;br /&gt;Li Weng, Filippo Menczer&lt;/li&gt;&lt;li&gt;&lt;b&gt;Crowdsourcing Participation Inequality; A SCOUT Model for the Enterprise Domain&lt;/b&gt;&lt;br /&gt;Osamuyimen Stewart, David Lubensky, Juan Huerta, Julie Marcotte, Cheng Wu, Andrzej Sakrajda&lt;/li&gt;&lt;li&gt;&lt;b&gt;Mutually Reinforcing Systems: A Method For The Acquisition Of Specific Data From Games With By-Products&lt;/b&gt;&lt;br /&gt;John Ferguson, Marek Bell, Matthew Chalmers&lt;/li&gt;&lt;li&gt;&lt;b&gt;A Note on Human Computation Limits&lt;/b&gt;&lt;br /&gt;Paul Rohwer&lt;/li&gt;&lt;/ul&gt;&lt;b&gt;Demos&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Reconstructing the World in 3D: Bringing Games with a Purpose Outdoors&lt;/b&gt;&lt;br /&gt;Kathleen Tuite, Noah Snavely, Zoran Popovic, Dun-Yu Hsiao, Adam Smith&lt;/li&gt;&lt;li&gt;&lt;b&gt;Improving Music Emotion Labeling Using Human Computation&lt;/b&gt;&lt;br /&gt;Brandon Morton, Jacquelin Speck, Erik Schmidt, Youngmoo Kim&lt;/li&gt;&lt;li&gt;&lt;b&gt;Webpardy: Harvesting QA by HC&lt;/b&gt;&lt;br /&gt;Hidir Aras, Markus Krause, Andreas Haller, Rainer Malaka&lt;/li&gt;&lt;li&gt;&lt;b&gt;Measuring Utility of Human-Computer Interactions&lt;/b&gt;&lt;br /&gt;Michael Toomim, James Landay&lt;/li&gt;&lt;li&gt;&lt;b&gt;Translation by Iterative Collaboration between Monolingual Users&lt;/b&gt;&lt;br /&gt;Chang Hu, Benjamin Bederson, Philip Resnik&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-5236788910664504095?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/5236788910664504095'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/5236788910664504095'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/05/accepted-papers-for-hcomp-2010.html' title='Accepted Papers for HCOMP 2010'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-1770936761556332542</id><published>2010-05-21T16:00:00.000-07:00</published><updated>2011-04-17T19:00:19.320-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='data mining'/><category scheme='http://www.blogger.com/atom/ns#' term='machine learning'/><title type='text'>Prediction Optimizers</title><content type='html'>The announcement of the Google Prediction API created quite a &lt;a href="http://www.neverreadpassively.com/2010/05/machine-learning-as-service-google.html"&gt;lot&lt;/a&gt; &lt;a href="http://hunch.net/?p=1383"&gt;of&lt;/a&gt; &lt;a href="http://mark.reid.name/iem/prediction-services.html"&gt;discussion&lt;/a&gt; about the future of predictive modeling. The reactions were mainly positive, but there was one common concern: Google Predict API operates as a black box. You give the data, you train, you get predictions. No selection of the training algorithm, no parameter tuning, no nothing. Black box. Data in, predictions out.&lt;br /&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;So, the natural question arises: Is it possible to do machine learning like that? Can we trust something if we do not understand the internals of the prediction mechanism?&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;i&gt;Declarative query execution and the role of query optimizers&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;For me this approach, being trained as a database researcher, directly corresponds to the approach of a query optimizer. In relational databases, you upload your data, and issue declarative SQL queries to get answers. The internal query optimizer selects how to evaluate the SQL query in order to return the results in the fastest possible manner. Most of the users of relational databases today do not even know how a query is executed. Is the join executed as a hash-join or as a sort-merge join? In which order do we join the tables? How is the GROUP BY aggregation computed? I bet that 99% of the users have no idea, and they do not want to know.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Definitely, knowing how a database works can help. If you know that you will perform mainly lookup queries on a given column, you want to build an index. Or create a histogram with the distribution statistics for another column. Or create a materialized view for frequently executed queries. But even these tasks today are increasingly automated. The database tuning advisor on SQL Server routinely suggests indexes and partitionings for my databases that I would never thought of building. (And I have a PhD in databases!)&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;i&gt;Declarative predictive modeling&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I see an exact parallel of this approach in the Google Prediction API. You upload the data and Google selects for you the most promising model. (Or, more likely, they build many models and do some form of meta-learning on top of these predictions.) I would call this a "&lt;b&gt;prediction optimizer&lt;/b&gt;" making the parallel with the query optimize that is built within every relational database system today. My prediction (pun intended) is that such prediction optimizers will be an integral part of every predictive analytics suite in the future.&amp;nbsp;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Someone may argue that it would be better to have the ability to hand tune some parts. For example, if you know something about the structure of the data, you may pass hints to the "prediction optimizer", indicating that a particular learning strategy is better. This has a direct correspondence in the query optimization world: if you know that a particular execution strategy is better, you can pass a HINT to the optimizer as part of the SQL query.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;i&gt;Can we do better? Yes (but 99% of the people do not care)&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;The obvious question is, can we do better than relying on a prediction optimizer to build a model? The answer is pretty straightforward: Yes!&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;In fact, if you build any custom solution for handling your data, it will be significantly better than any commercial database. Databases have a lot of extra baggage (e.g., transactions) that are not useful in all applications but are slowing down execution considerably. I will not even go to the discussion about web crawlers, financial trading systems, etc. However, these solutions come at a cost (time, people, money...). Many people just want to store and manage their data. For them, existing database systems and their behind-the-scenes optimizers&amp;nbsp;are good enough!&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Similarly, you can expect to see many people building predictive models "blindly" using a blackbox approach, relying on a prediction optimizer to handle the details. If people do not care about hash joins vs. sort-merge joins, I do not think that anyone will care if the prediction came from a support vector machine with a radial basis function, from a random forest, or from a Mechanical Turk worker (Yes, I had to put MTurk in the post).&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;i&gt;The future&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I know that predictions, especially about the future, are hard, but here is my take: We are going to see a market for predictive modeling suites, similar to the market for databases. Multiple vendors will built (or have already&amp;nbsp;&lt;a href="http://mark.reid.name/iem/prediction-services.html"&gt;built&lt;/a&gt;)&amp;nbsp;similar suites. In the same way that today Oracle, Microsoft, IBM, Teradata and so on, compete for the best SQL engine, we will see competition for such turnkey solutions for predictive modeling. You upload the data and then the engines complete for scalability, speed of training, and for the best ROC curve.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Let's see: Upload-train-predict. Waiting for an answer...&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-1770936761556332542?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/1770936761556332542'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/1770936761556332542'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/05/prediction-optimizers.html' title='Prediction Optimizers'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-4790567722123691082</id><published>2010-05-19T21:50:00.000-07:00</published><updated>2011-04-17T19:00:19.325-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='research'/><title type='text'>Google Prediction API: Commoditization of Large-Scale Machine Learning?</title><content type='html'>Today Google has announced the availability of the &lt;a href="http://code.google.com/apis/predict/"&gt;Google Prediction API&lt;/a&gt;: In brief, it allows users to upload massive data sets into the Google Datastore and then let Google built a supervised machine learning model (aka classifier) from the data. This is simply big news!&lt;br /&gt;&lt;br /&gt;Google seems to promise great simplicity: Upload data in CSV format and Google takes care of the rest. They select the appropriate model for the data, train the model, report accuracy statistics, and let you classify new instances. Building classifiers from large-scale datasets becomes trivial.&lt;br /&gt;&lt;br /&gt;While I have not had the chance to access the API, this seems to be a game changer. The ability to scale models to work with massive datasets was beyond the reach of many, and now suddenly becomes a commodity. Research labs that wanted to built classifiers as tools (and not as the focus of their research) will be able to do so without requiring much expertise. Similarly, startups will be able to use a scalable machine learning infrastructure, without having access to an inhouse expert.&lt;br /&gt;&lt;br /&gt;In a sense, it seems to bring machine learning to the masses, bringing the performance baseline to very high level. If Google Predict is "good enough", will people seek for more advanced solutions? The optimizer of MySQL pretty much sucks but it is "good enough" for many.&lt;br /&gt;&lt;br /&gt;Will Google Predict make large-scale machine learning a commodity? Does it mean that the value is now in having the data and in feature engineering? Unclear, but definitely a plausible scenario.&lt;br /&gt;&lt;br /&gt;I will withhold further commentary until I manage to get access to the API. But I am excited!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-4790567722123691082?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/4790567722123691082'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/4790567722123691082'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/05/google-prediction-api-commoditization.html' title='Google Prediction API: Commoditization of Large-Scale Machine Learning?'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-7799397365684780270</id><published>2010-05-09T19:55:00.000-07:00</published><updated>2011-04-17T19:00:19.335-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='academia'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='google'/><title type='text'>Google Scholar now Supports Email Alerts</title><content type='html'>While searching Google Scholar, I noticed a new icon:&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_skxW9kDLt_c/S-dyuextbxI/AAAAAAAAEjY/8PR1GSbWNG0/s1600/gscholar.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="180" src="http://4.bp.blogspot.com/_skxW9kDLt_c/S-dyuextbxI/AAAAAAAAEjY/8PR1GSbWNG0/s400/gscholar.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;By clicking this button, you can create an email alert, which notifies you for any new papers that may appear for the given query. You can use it:&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;For normal queries, getting notifications for queries such as [author:&lt;i&gt;lastname&lt;/i&gt;] or [intitle:&lt;i&gt;titleword&lt;/i&gt;], or by any other query on Google Scholar&lt;/li&gt;&lt;li&gt;For citation queries, getting information about new papers that cite a given paper. For that, you need to first go to the "Cited by X" page, and then click the alert icon.&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;At the end, you get a nice list of alerts that can notify you when new papers of a particular author appear on Google Scholar, when new papers about a topic get indexed, or when a paper gets cited (the almighty citation!). Here is how the alert list looks like:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_skxW9kDLt_c/S-d1LBhMgQI/AAAAAAAAEjk/NaF-mQL2cCM/s1600/gscholar2.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="165" src="http://2.bp.blogspot.com/_skxW9kDLt_c/S-d1LBhMgQI/AAAAAAAAEjk/NaF-mQL2cCM/s400/gscholar2.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I tries to find some official announcement for this feature and I could not find anything. Not sure if this is rolled out to everyone but it is certainly very very useful!&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;My &lt;a href="http://citation-tracker.com/"&gt;Citation Tracker&lt;/a&gt; &lt;a href="http://behind-the-enemy-lines.blogspot.com/2009/09/citation-tracker-monitoring-citations.html"&gt;tool&lt;/a&gt; becomes less useful now, but I am glad that we will stop trying to implement this feature on top of Google Scholar and Google will start doing it natively. Next step Google: RSS alerts!&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-7799397365684780270?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7799397365684780270'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7799397365684780270'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/05/google-scholar-now-supports-email.html' title='Google Scholar now Supports Email Alerts'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_skxW9kDLt_c/S-dyuextbxI/AAAAAAAAEjY/8PR1GSbWNG0/s72-c/gscholar.PNG' height='72' width='72'/></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-6845871145209451366</id><published>2010-05-04T19:13:00.000-07:00</published><updated>2011-04-17T19:00:19.341-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='crowdsourcing'/><title type='text'>Crowdsourcing: Not just a cost saver</title><content type='html'>Whenever I am discussing crowdsourcing, people tend to ask about the "killer app" for crowdsourcing. In my mind, a new technology is being used first for cost reduction, then due to the afforded flexibility, and finally to achieve tasks that were previously infeasible.&lt;br /&gt;&lt;br /&gt;As with many technologies (e.g., VoIP), the first applications of crowdsourcing tend to focus on &lt;b&gt;cost reduction&lt;/b&gt;. Label images, moderate comments, verify addresses, collect emails... All tasks that were traditionally done by low-paid interns, are now crowdsourced, for significant cost savings.&amp;nbsp;Although these applications tend to be useful, it is hard to see applications focused on cost cutting to be the ones that will drive the wide adoption of crowdsourcing. In any case, if someone has big tasks like that, they can always contact an outsourcing shop in a developing country, and achieve similar rates.&lt;br /&gt;&lt;br /&gt;So, &lt;b&gt;flexibility of deployment&lt;/b&gt; tends to be another angle that tends to drive the adoption of crowdsourcing. Pretty much as in the case of cloud computing, crowdsourcing can come pretty handy for companies that need significant resources for just a small period of time. &lt;a href="http://www.liveops.com/"&gt;LiveOps&lt;/a&gt; was a pioneer of this model, applying "crowdsourcing" before the terms was invented, to dynamically staff call centers for telemarketing companies, handling sudden spikes in demand, and so on.&lt;br /&gt;&lt;br /&gt;However, the really interesting applications are those that are &lt;b&gt;truly being enabled by such technology&lt;/b&gt;. One such idea was given to me by &lt;a href="http://www.propublica.org/site/author/amanda_michel"&gt;Amanda Michel&lt;/a&gt; of &lt;a href="http://www.propublica.org/"&gt;Propublica&lt;/a&gt;, last September after a Mechanical Turk Meetup: Propublica wanted to monitor the spending of stimulus money, and examine if the projects assigned to different contractors were being completed properly. By analyzing the collected data, it would be possible to identify systematic problems or outright corruption.&lt;br /&gt;&lt;br /&gt;Of course, collecting such data would have been impossible for a journalist, or even for a team of journalists. However, using crowdsourcing it would be pretty easy for local residents to check if a bridge has been fixed, if a road was properly paved, and so on. I loved the idea!&amp;nbsp;It was a clear demonstration of how to take an infeasible task, and use crowdsourcing to make it feasible. Even better: it was encouraging the involvement of citizens and their active collaboration with the government. A win-win project! &lt;br /&gt;&lt;br /&gt;Interestingly enough, a recent article on BusinessWeek,&lt;a href="http://www.businessweek.com/magazine/content/10_19/b4177029171961.htm"&gt; indicates that the Obama administration is also employing crowdsourcing for the same task&lt;/a&gt;! From the article: "&lt;i&gt;It's the surest way to prevent, say, a convicted contractor from reincorporating a new company under his wife's name and applying for stimulus money, explains Earl Devaney, the special inspector general who oversees stimulus spending. "Only local folks can connect those kinds of dots," he says.&lt;/i&gt;&lt;br /&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;br /&gt;Indeed! If only we had such ideas in Greece...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-6845871145209451366?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/6845871145209451366'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/6845871145209451366'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/05/crowdsourcing-not-just-cost-saver.html' title='Crowdsourcing: Not just a cost saver'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-6017907188374107139</id><published>2010-05-03T15:26:00.000-07:00</published><updated>2011-04-17T19:00:19.346-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='hcomp'/><title type='text'>KDD Accepted Papers, Deadlines for HCOMP 2010 and SNAKDD 2010 workshops</title><content type='html'>The accepted papers for KDD 2010 are now posted and available at&amp;nbsp;&lt;a href="http://www.kdd.org/kdd2010/papers.shtml"&gt;http://www.kdd.org/kdd2010/papers.shtml&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;As a reminder, the KDD this year will take place in Washington DC, from July 25th to July 28th.&lt;br /&gt;&lt;br /&gt;I would also like to draw your attention to two KDD workshops that I am involved with, the Human Computation Workshop (HCOMP 2010) and the Workshop on Social Network Mining and Analysis (SNAKDD 2010). Both have &lt;b&gt;submission deadlines on May 7th. &lt;/b&gt;It is pretty easy to travel to DC, so if you have any cool idea, or demo that would be appropriate for these workshops, please submit!&lt;br /&gt;&lt;br /&gt;For those too bored to visit the respective websites, here are the call for papers:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;&lt;b&gt;Human Computation Workshop (HCOMP 2010) - Call for Papers&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Most research in data mining and knowledge discovery relies heavily on the availability of datasets. With the rapid growth of user generated content on the internet, there is now an abundance of sources from which data can be drawn. Compared to the amount of work in the field on techniques for pattern discovery and knowledge extraction, there has been little effort directed at the study of effective methods for collecting and evaluating the quality of data.&lt;br /&gt;&lt;br /&gt;Human computation is a relatively new research area that studies the process of channeling the vast internet population to perform tasks or provide data towards solving difficult problems that no known efficient computer algorithms can yet solve. There are various genres of human computation applications available today. Games with a purpose (e.g., the ESP Game) specifically target online gamers who, in the process of playing an enjoyable game, generate useful data (e.g., image tags). Crowdsourcing marketplaces (e.g. Amazon Mechanical Turk) are human computation applications that coordinate workers to perform tasks in exchange for monetary rewards. In identity verification tasks, users need to perform some computation in order to access some online content; one example of such a human computation application is reCAPTCHA, which leverages millions of users who solve CAPTCHAs every day to correct words in books that optical character recognition (OCR) programs fail to recognize with certainty.&lt;br /&gt;&lt;br /&gt;Human computation is an area with significant research challenges and increasing business interest, making this doubly relevant to KDD. KDD provides an ideal forum for a workshop on human computation as a form of cost-sensitive data acquisition. The workshop also offers a chance to bring in practitioners with complementary real-world expertise in gaming and mechanism design who might not otherwise attend this academic conference.&lt;br /&gt;&lt;br /&gt;The first Human Computation Workshop (HComp 2009) was held on June 28th, 2009, in Paris, France, collocated with KDD 2009. The overall themes that emerged from this workshop were very clear: on the one hand, there is the experimental side of human computation, with research on new incentives for users to participate, new types of actions, and new modes of interaction. This includes work on new programming paradigms and game templates designed to enable rapid prototyping, allow partial completion of tasks, and aid in reusability of game design. On the more theoretic side, we have research modeling these actions and incentives to examine what theory predicts about these designs. Finally, there is work on noisy results generated by such games and systems: how can we best handle noise, identify labeler expertise, and use the generated data for data mining purposes?&lt;br /&gt;&lt;br /&gt;Learning from HComp 2009, we have expanded the topics of relevance to the workshop. The goal of HComp 2010 is to bring together academic and industry researchers in a stimulating discussion of existing human computation applications and future directions of this new subject area. We solicit papers related to various aspects of both general human computation techniques and specific applications, e.g. general design principles; implementation; cost-benefit analysis; theoretical approaches; privacy and security concerns; and incorporation of machine learning / artificial intelligence techniques. An integral part of this workshop will be a demo session where participants can showcase their human computation applications. Specifically, topics of interests include, but are not limited to:&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Abstraction of human computation tasks into taxonomies of mechanisms&lt;/li&gt;&lt;li&gt;Theories about what makes some human computation tasks fun and addictive&lt;/li&gt;&lt;li&gt;Differences between collaborative vs. competitive tasks&lt;/li&gt;&lt;li&gt;Programming languages, tools and platforms to support human computation&lt;/li&gt;&lt;li&gt;Domain-specific implementation challenges in human computation games&lt;/li&gt;&lt;li&gt;Cost, reliability, and skill of labelers&lt;/li&gt;&lt;li&gt;Benefits of one-time versus repeated labeling&lt;/li&gt;&lt;li&gt;Game-theoretic mechanism design of incentives for motivation and honest reporting&lt;/li&gt;&lt;li&gt;Design of manipulation-resistance mechanisms in human computation&lt;/li&gt;&lt;li&gt;Effectiveness of CAPTCHAs&lt;/li&gt;&lt;li&gt;Concerns regarding the protection of labeler identities&lt;/li&gt;&lt;li&gt;Active learning from imperfect human labelers&lt;/li&gt;&lt;li&gt;Creation of intelligent bots in human computation games&lt;/li&gt;&lt;li&gt;Utility of social networks and social credit in garnering data&lt;/li&gt;&lt;li&gt;Optimality in the context of human computation&lt;/li&gt;&lt;li&gt;Focus on tasks where crowds, not individuals, have the answers&lt;/li&gt;&lt;li&gt;Limitations of human computation&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;blockquote&gt;&lt;b&gt;Workshop on Social Network Mining and Analysis - Call for Papers&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Social networks research has come a long way since the notable �six-degree separation� experiment. In recent years, social network research has advanced significantly, thanks to the prevalence of the online social websites and the availability of a variety of offline large-scale social network systems such as collaboration networks. These social network systems are usually characterized by the complex network structures and rich accompanying contextual information. Researchers are increasingly interested in addressing a wide range of challenges residing in these disparate social network systems, including identifying common static topological properties and dynamic properties during the formation and evolution of these social networks, and how contextual information can help in analyzing the pertaining social networks. These issues have important implications on community discovery, anomaly detection, trend prediction and can enhance applications in multiple domains such as information retrieval, recommendation systems, security and so on.&lt;br /&gt;&lt;br /&gt;The fourth SNA-KDD '2010 aims to bring together practitioners and researchers with a specific focus on the emerging trends and industry needs associated with the traditional Web, the social Web, and other forms of social networking systems.  Both theoretical and experimental submissions are encouraged. The interesting topics include (1) data mining advances on the discovery and analysis of communities, on personalization for solitary activities (like search) and social activities (like discovery of potential friends), on the analysis of user behavior in open fora (like conventional sites, blogs and fora) and in commercial platforms (like e-auctions) and on the associated security and privacy-preservation challenges; (2) social network modeling, scalable, customizable social network infrastructure construction, dynamic growth and evolution patterns identification and discovery using machine learning approaches or multi-agent based simulation.&lt;br /&gt;&lt;br /&gt;The fourth SNA-KDD '2010 solicits contributions on social network analysis and graph mining, including the emerging applications of the Web as a social medium. Papers should elaborate on data mining methods, issues associated to data preparation and pattern interpretation, both for conventional data (usage logs, query logs, document collections) and for multimedia data (pictures and their annotations, multi-channel usage data). Topics of interest include but are not limited to:&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Communities discovery and analysis in large scale online and offline social networks&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Personalization for search and for social interaction&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Recommendations for product purchase, information acquisition and establishment of social relations&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Data protection inside communities&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Misbehavior detection in communities&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Web mining algorithms for clickstreams, documents and search streams&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Preparing data for web mining&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Pattern presentation for end-users and experts&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Evolution of patterns in the Web&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Evolution of communities in the Web&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Dynamics and evolution patterns of social networks, trend prediction&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Contextual social network analysis&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Temporal analysis on social networks topologies&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Search algorithms on social networks&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Multi-agent based social network modeling and analysis&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Application of social network analysis&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Anomaly detection in social network evolution&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-6017907188374107139?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/6017907188374107139'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/6017907188374107139'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/05/kdd-accepted-papers-deadlines-for-hcomp.html' title='KDD Accepted Papers, Deadlines for HCOMP 2010 and SNAKDD 2010 workshops'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-7560295361561661493</id><published>2010-04-15T10:03:00.000-07:00</published><updated>2011-04-17T19:00:19.352-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='academia'/><title type='text'>Yahoo!'s Key Scientific Challenges: Your student is a winner!</title><content type='html'>I got the following email:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;Dear Panos,&lt;/blockquote&gt;&lt;blockquote&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;blockquote&gt;The judging is done. From an outstanding group of 200 proposals, twenty-two exceptional PhD students have been selected to be part of Yahoo!�s 2010 Key Scientific Challenges (KSC) Program. Your student, &lt;a href="http://pages.stern.nyu.edu/~narchak/"&gt;Nikolay Archak&lt;/a&gt; has been chosen to receive this very competitive award. Congratulations!&lt;/blockquote&gt;&lt;blockquote&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;blockquote&gt;Supporting the academic community is a top priority at Yahoo!. We created the KSC Program to support a limited number of outstanding PhD students who we believe are doing research in very important and challenging areas.  The Program provides each student with $5,000 of unrestricted funds for the support of their research activities (e.g., conference fees and travel, lab materials, professional society membership dues, etc.).  The funds are distributed through the university and paid directly to the student for use at their discretion. As part of this program, the student also receives an exclusive invitation to a unique workshop (most likely to be held in August at our location in Sunnyvale, California), where we will focus on novel disciplines and important technical challenges for the Internet research community, slanted specifically for graduate students whose innovative work is just emerging.  We think that the opportunity to interact with Yahoo! scientists and other top graduate students in an informal and supportive environment without having to submit papers for review will provide a unique forum for open and stimulating discussion of work in its early stages.&lt;/blockquote&gt;&lt;blockquote&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;blockquote&gt;Students will also have the opportunity to work with select datasets through our Webscope program. Yahoo! will cover all travel expenses to the KSC Workshop independent of the $5,000 award.&lt;/blockquote&gt;&lt;blockquote&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;blockquote&gt;We are excited to be able to support these students with KSC grants, and we look forward to having them as part of our broad research alliance of outstanding students and faculty.  Please feel free to contact Jamie Lockwood at jamieloc@yahoo-inc.com if you have any questions regarding our KSC Program.&lt;/blockquote&gt;&lt;blockquote&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;blockquote&gt;I offer you my warmest congratulations.  We eagerly look forward to hosting Nikolay at our KSC Workshop as part of the Yahoo! family.&lt;/blockquote&gt;&lt;blockquote&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;blockquote&gt;Best regards,&lt;/blockquote&gt;&lt;blockquote&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;blockquote&gt;Ken Schmidt&lt;/blockquote&gt;&lt;blockquote&gt;Director, Academic Relations&lt;/blockquote&gt;&lt;br /&gt;&amp;nbsp;Nikolay is not a stranger to winning competitions. He has won in the past 3 times the TopCoder competition, ended up 2nd two more times. He also has a streak of accepted publications, including a &lt;i&gt;single-authored&amp;nbsp;&lt;/i&gt;&lt;a href="http://pages.stern.nyu.edu/~narchak/wfp0004-archak.pdf"&gt;paper at WWW2010&lt;/a&gt; this year.&lt;br /&gt;&lt;br /&gt;Nikolay, congratulations!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-7560295361561661493?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7560295361561661493'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7560295361561661493'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/04/yahoo-key-scientific-challenges-your.html' title='Yahoo!&amp;#39;s Key Scientific Challenges: Your student is a winner!'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-1920710410114226050</id><published>2010-04-10T21:40:00.000-07:00</published><updated>2011-04-17T19:00:19.356-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='academia'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='peer reviewing'/><title type='text'>Stop Publishing!</title><content type='html'>The last few months, I feel that I have an endless queue of reviewing tasks to complete. WWW, followed by DBRank, followed by EC, followed by KDD, followed by VLDB, followed by WebDB, plus an NSF panel, plus some journal reviews, and I have rejected invitations for a few additional conferences including SIGIR, SIGMOD, and a few others. This puts my count at least 40 reviews over the last 4-5 months. (&lt;a href="http://behind-the-enemy-lines.blogspot.com/2008/04/how-much-paper-submission-costs.html"&gt;Just to break even&lt;/a&gt;, I will need to submit 10-12 papers.)&lt;br /&gt;&lt;br /&gt;Needless to say, having such a reviewing load means that I cannot really do a good job in reviewing. My reviews have been declining in quality, signalling that I need to learn to say no.&lt;br /&gt;&lt;br /&gt;At the same time, I also notice that the &lt;i&gt;other &lt;/i&gt;reviews that are being submitted are not that great either. While on the one hand I feel happy ("OK, I am not that bad"), on the other hand I feel that this cannot be good. If nobody has time to review thoroughly, what is the whole point of peer reviewing?&lt;br /&gt;&lt;br /&gt;One solution is to accept fewer invitations for PCs, allowing for more time per paper. However, I know that without volunteering time and effort for reviewing the system cannot work! There are simply not that many reviewers available!&lt;br /&gt;&lt;br /&gt;Part of this problem is, of course, the increased need to get papers published: For tenure, for getting a job, even for being admitted to a PhD program! I feel that there is something wrong when, to be admitted to a PhD program, you need to have already research experience. This increased need for more and more publications, overloads the reviewing system which unfortunately has a limited capacity.&lt;br /&gt;&lt;br /&gt;Unfortunately, it is not easy to reverse this trend. The incentives are setup in a way to encourage quantity of publications, preferably in good venues. Once the paper gets accepted in a good venue, the goal is achieved. This encourages publications that are "good enough" to pass the reviewing process, not papers that have stellar quality. And with the increased noise in the reviewing process, the distinction between "good enough to be published" and "what the hell, send it, we may get lucky" is getting&amp;nbsp;blurrier&amp;nbsp;and&amp;nbsp;blurrier. In fact, I have cases in &lt;b&gt;&lt;i&gt;my &lt;/i&gt;&lt;/b&gt;papers that the reviewers did such a poor job that I never understood at the end if my paper was worth getting published, or I got just lucky.&lt;br /&gt;&lt;br /&gt;I noticed though a positive development! Through the &lt;a href="http://www.gurforum.org/2010/04/10/%CE%BD%CE%AD%CE%BF%CE%B9-%CE%BA%CE%B1%CE%BD%CF%8C%CE%BD%CE%B5%CF%82-%CE%B1%CF%80%CF%8C-%CF%84%CE%B7%CE%BD-dfg/"&gt;Greek University Reform Forum&lt;/a&gt;, I learned that:&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;The German Research Society (DFG) &lt;a href="http://www.hochschulverband.de/cms1/newsletter-2010-030.html"&gt;has introduced new guidelines for applications and evaluations of proposals&lt;/a&gt;, which will be valid as of July 1, 2010.&lt;br /&gt;&lt;br /&gt;A rough-and-ready translation of the main points:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Applicants should cite in their CV only up to FIVE publications, those which are most relevant for the proposal at hand;&lt;/li&gt;&lt;li&gt;In reports about running projects, a maximum of TWO publications PER YEAR. In case of projects with more than one PIs, a maximum of THREE publications PER YEAR.&lt;/li&gt;&lt;li&gt;The goal of the new guideleines is to put emphasis on quality instead of quantity and to stop the flood of publishing for the sake of the numbers.&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;It has caused quite some stir here in Germany and the voices to enforce such rules also for decisions on faculty positions are getting louder.&lt;/blockquote&gt;&lt;br /&gt;While some of these ideas are already in place (e.g., NSF also allows only five publications in the CV), the idea of "counting" only two publications per year for each project is definitely a step towards the right direction. It is not going to be trivial to reverse the "get as many publications in top venues as possible" trend, but every step towards de-emphasizing quantity counts.&lt;br /&gt;&lt;br /&gt;After all, &lt;a href="http://en.wikipedia.org/wiki/Jackson_Pollock"&gt;Pollock&lt;/a&gt; was also getting paid by the piece when he worked for the&lt;a href="http://en.wikipedia.org/wiki/Federal_Art_Project"&gt; Federal Art Project&lt;/a&gt;, but none of his famous paintings come from that period.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-1920710410114226050?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/1920710410114226050'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/1920710410114226050'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/04/stop-publishing.html' title='Stop Publishing!'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-8395917511108864214</id><published>2010-03-31T19:16:00.000-07:00</published><updated>2011-04-17T19:00:19.361-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><title type='text'>Getting High Quality Results on MTurk</title><content type='html'>In &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/03/new-demographics-of-mechanical-turk.html"&gt;a previous post&lt;/a&gt;, an anonymous commenter left a &lt;a href="http://behind-the-enemy-lines.blogspot.com/2010/03/new-demographics-of-mechanical-turk.html?showComment=1268595311475#c1337719675687832820"&gt;pretty interesting comment&lt;/a&gt;, which I think is worth repeating:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;On the subject of higher pay not rendering better work, there are some AMT requesters who pay well, and in the instructions for the task, put in bold print a note that says something along the lines of:&lt;br /&gt;&lt;br /&gt;&lt;b&gt;"We pay very well for these tasks, because we expect perfect, error-free work. You don't have to rush through these tasks to make money doing them, so slow down and take your time. Poor quality work WILL be rejected and you WILL be blocked from working for us in the future if your work is poor."&lt;/b&gt; &lt;br /&gt;&lt;br /&gt;My gut feeling, as a task worker at AMT, is that this is an effective method for increasing the quality of the work (high pay, plus a message reminding the worker to take their time because they don't HAVE to rush through these tasks to make money as an incentive, plus the motivating threat of having work rejected and getting blocked as a disincentive to rushing) but it would be neat to see it officially demonstrated.&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;Anyone tried this or willing to try? I would be very interesting in hearing the results.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-8395917511108864214?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/8395917511108864214'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/8395917511108864214'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/03/getting-high-quality-results-on-mturk.html' title='Getting High Quality Results on MTurk'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-2380717924528314180</id><published>2010-03-15T11:42:00.000-07:00</published><updated>2011-04-17T19:00:19.378-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='academia'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><title type='text'>Citation Tracker: Now with an API</title><content type='html'>A few months back I &lt;a href="http://behind-the-enemy-lines.blogspot.com/2009/09/citation-tracker-monitoring-citations.html"&gt;announced the availability of Citation Tracker&lt;/a&gt;, a tool that allows monitoring of your publications for incoming citations and mentions on the web. We received plenty of suggestions and we have been trying to implement them, with the hope of moving out of the alpha version by the end of the year.&lt;br /&gt;&lt;br /&gt;Today, although we are still in alpha, I am happy to announce that we reached a major milestone: &lt;a href="http://github.com/mickek/Citation-Tracker-API"&gt;Citation Tracker now has a public API available!&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;We have posted documentation online and a sample PHP client for those that want to experiment. For details you can see the online documentation, but here is some brief description of the things that you can do through the API:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Publications&lt;/b&gt;: Add, remove, get, and edit publications. Pretty self-explanatory&lt;/li&gt;&lt;ul&gt;&lt;li&gt;Add publications&lt;/li&gt;&lt;li&gt;Delete publication&lt;/li&gt;&lt;li&gt;Edit publication&lt;/li&gt;&lt;li&gt;Get publications&lt;/li&gt;&lt;/ul&gt;&lt;li&gt;&lt;b&gt;Monitoring&lt;/b&gt;: Add, remove, get, and edit monitoring channels. A monitoring channel is an citation-oriented&amp;nbsp;site (such as Google Scholar, Libra, SSRN), or a general web search engine (Google, Bing, Ask, Yahoo). We monitor these sites for new results that match the publications and return back the new citations (or mentions, if we refer to general web results).&lt;/li&gt;&lt;ul&gt;&lt;li&gt;Add monitoring channel&lt;/li&gt;&lt;li&gt;Delete monitoring channel&lt;/li&gt;&lt;li&gt;Get active monitoring channels&lt;/li&gt;&lt;li&gt;Update monitoring channel&lt;/li&gt;&lt;/ul&gt;&lt;li&gt;&lt;b&gt;Citations:&amp;nbsp;&lt;span class="Apple-style-span" style="font-weight: normal;"&gt;Get, update, and edit&amp;nbsp;citations or web mentions, as returned by the different monitoring channels.&lt;/span&gt;&lt;/b&gt;&lt;/li&gt;&lt;ul&gt;&lt;li&gt;Get citations&lt;/li&gt;&lt;li&gt;Update citation state (new, accepted, discarded, review-later)&lt;/li&gt;&lt;li&gt;Update citation&lt;/li&gt;&lt;/ul&gt;&lt;/ul&gt;&lt;br /&gt;&lt;div&gt;Although we are still in alpha, NYU Library started using the tool and the API for creating a new service on top. I am sure that other people will have ideas of what they can do with the API. Enjoy and let me know if you have any feedback.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-2380717924528314180?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/2380717924528314180'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/2380717924528314180'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/03/citation-tracker-now-with-api.html' title='Citation Tracker: Now with an API'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-969776997930952829</id><published>2010-03-09T17:06:00.000-08:00</published><updated>2011-04-17T19:00:19.382-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><title type='text'>The New Demographics of Mechanical Turk</title><content type='html'>Past surveys on the demographics on Mechanical Turk users indicated that most of the workers come from the US, are younger and more educated than the general population, and work on MTurk as a way to get some spare cash.&lt;br /&gt;&lt;br /&gt;Since the last survey, a few things have changed. First, Amazon allows now workers in India to get paid in cash in rupees, essentially encouraging many people from India to start using Mechanical Turk as workers. Second, the recession has affected many households, leaving many people at home looking for cash to cover their needs. These two forces has changed the demographics of the participants, so a new survey was needed to capture the new demographics of the Mechanical Turk workers.&lt;br /&gt;&lt;br /&gt;So, in February 2010, I conducted a new survey on Mechanical Turk, paying the workers10 cents for participating.&lt;br /&gt;&lt;br /&gt;The first major change was the country of origin. In the past 70%-80% of the workers were coming from the US, but now the percentage is closer to 50% and it may decrease even more. India is now a major contributor of workers, with almost 35% of the workers coming from the subcontinent. The remaining workers come from 66 different countries. The exact numbers in the survey:&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;United States: 46.80%&lt;/li&gt;&lt;li&gt;India: 34.00%&lt;/li&gt;&lt;li&gt;Miscellaneous: 19.20%&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;The analysis also indicated that the profile of the Indian workers is quite different from the profile of the U.S-based workers. So, below I present the results broken down by country.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Gender Breakdown&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;The first analysis focus on the gender breakdown. Across US-based workers, there are significantly more females than males, while the situation is reversed for Indian workers. &lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_skxW9kDLt_c/S5bhZ-i8EBI/AAAAAAAAEb0/RzSvlYP5KYY/s1600-h/gender.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="206" src="http://4.bp.blogspot.com/_skxW9kDLt_c/S5bhZ-i8EBI/AAAAAAAAEb0/RzSvlYP5KYY/s400/gender.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;The main reason for the overrepresentation of females in the US-based workforce is the nature of the tasks and work on Mechanical Turk. Most participants in the US use Mechanical Turk as a supplementary source of income, and often Mechanical Turk is used by stay-at-home parents, unemployed and underemployed workers, and so on. Since females are more likely to fit into these categories, there is a corresponding increase in representation. On the contrary, more Indian workers treat Mechanical Turk as a primary (or at least significant) source of income, and we see more males working on Mechanical Turk.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Age Distribution&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;In terms of age distribution, there is definitely an overrepresentation of younger workers, compared to the general population of Internet users. While this holds both for the US and for India, we see an even higher skew towards younger workers among Indians. &lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_skxW9kDLt_c/S5bhzBTD34I/AAAAAAAAEb8/UdDG6wxOqx0/s1600-h/yob-us.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="223" src="http://4.bp.blogspot.com/_skxW9kDLt_c/S5bhzBTD34I/AAAAAAAAEb8/UdDG6wxOqx0/s400/yob-us.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_skxW9kDLt_c/S5bh4JNpcII/AAAAAAAAEcE/MhjDR45co_s/s1600-h/yob-india.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="223" src="http://2.bp.blogspot.com/_skxW9kDLt_c/S5bh4JNpcII/AAAAAAAAEcE/MhjDR45co_s/s400/yob-india.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Educational Level&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;We also asked the Mechanical Turk workers to declare their educational level. In general, the (self-declared) educational level of the workers is higher than the general US and Indian population. There are two factors that may contribute to this. First, many of the workers are younger than the overall population and, ceteris paribus, this leads to higher educational level. Finally, while we may not necessarily discount the possibility of false disclosure, there are no incentives that would bias workers towards lying in this survey.&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_skxW9kDLt_c/S5biMTGXPhI/AAAAAAAAEcM/x0KnVb1bBTU/s1600-h/education.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="198" src="http://1.bp.blogspot.com/_skxW9kDLt_c/S5biMTGXPhI/AAAAAAAAEcM/x0KnVb1bBTU/s400/education.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Income Level&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;We were also interested to examine the income level of the workers on Mechanical Turk. In the US, the shape of the distribution roughly matches the income distribution in the general US population. However, it is noticeable that the income level of US workers on Mechanical Turk is shifted towards lower income levels. For example, while 45% of the US Internet population has income below $60K/yr, the corresponding percentage across US-based Mechanical Turk workers is 66.7%. (This finding is consistent with the earlier surveys that compared income levels on MTurk workers with income level of the general US population of Internet users.) The picture is drastically different across US-based and Indian workers. Workers based in India have significantly lower incomes, as expected, and more than 55% of the workers declared an income of less than $10K/year.&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_skxW9kDLt_c/S5bi19YAcvI/AAAAAAAAEcU/ZH6t2wE5D60/s1600-h/income-us.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="222" src="http://4.bp.blogspot.com/_skxW9kDLt_c/S5bi19YAcvI/AAAAAAAAEcU/ZH6t2wE5D60/s400/income-us.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_skxW9kDLt_c/S5bi5ju2i9I/AAAAAAAAEcc/wcQbwMJWbQk/s1600-h/income-india.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="235" src="http://1.bp.blogspot.com/_skxW9kDLt_c/S5bi5ju2i9I/AAAAAAAAEcc/wcQbwMJWbQk/s400/income-india.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Marital Status, Children, and Household Size&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;In terms of marital status and household size, the answers tend to match the age demographic of the workers reported earlier. The majority of the workers, both in India and in the US, do not have children, and a significant fraction of them are single. An interesting contrast is the household size, which seems more to reflect cultural norms than anything specific to Mechanical Turk: While more Indian workers are single and without children, they seem to stay in houses with larger number of household members, compared to US workers: Indian workers either stay with their family, or they tend to have a comparatively larger number of roommates, compared to US workers.&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_skxW9kDLt_c/S5bjkWztkII/AAAAAAAAEck/hi_iGDF_tSI/s1600-h/marital-us.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="151" src="http://3.bp.blogspot.com/_skxW9kDLt_c/S5bjkWztkII/AAAAAAAAEck/hi_iGDF_tSI/s400/marital-us.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_skxW9kDLt_c/S5bjqXiB1GI/AAAAAAAAEcs/4uYvK_5gU9I/s1600-h/marital-india.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="166" src="http://3.bp.blogspot.com/_skxW9kDLt_c/S5bjqXiB1GI/AAAAAAAAEcs/4uYvK_5gU9I/s400/marital-india.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_skxW9kDLt_c/S5bjuEQ2hfI/AAAAAAAAEc0/Ow_TyfD0JcA/s1600-h/hh-size.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="223" src="http://3.bp.blogspot.com/_skxW9kDLt_c/S5bjuEQ2hfI/AAAAAAAAEc0/Ow_TyfD0JcA/s400/hh-size.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;b&gt;Level of Engagement on Mechanical Turk&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;We also asked a set of questions for evaluating the level of engagement of Mechanical Turk workers on the marketplace. Since we did not detect significant deviations across countries, we will be reporting the results in aggregate form, without separating by country of origin of the worker. In general most workers spend a day or less per week working on Mechanical Turk, and tend to complete 20-100 HITs per week. Correspondingly, this generates a relatively low income stream for Mechanical Turk work, which is often less than $20 per week. Of course, there are a few workers that devote a significant amount of time and effort, completing thousands of HITs, and generating a respectable income of more than $1000/month. For these workers, Mechanical Turk tends to be the primary source of income, of course. For Indian-based workers, such salary levels are typically satisfactory for the type of work that is available on Mechanical Turk (i.e., tedious tasks that do not require significant specialized skills)&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_skxW9kDLt_c/S5bkTF_a04I/AAAAAAAAEc8/JbMMBNLG--o/s1600-h/time-spent.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="212" src="http://3.bp.blogspot.com/_skxW9kDLt_c/S5bkTF_a04I/AAAAAAAAEc8/JbMMBNLG--o/s400/time-spent.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_skxW9kDLt_c/S5bkWzdnPWI/AAAAAAAAEdE/afNYvjp3DVE/s1600-h/hits-per-week.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="238" src="http://3.bp.blogspot.com/_skxW9kDLt_c/S5bkWzdnPWI/AAAAAAAAEdE/afNYvjp3DVE/s400/hits-per-week.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_skxW9kDLt_c/S5bka-WDLQI/AAAAAAAAEdM/hPq-aLy2JNQ/s1600-h/income-per-week.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="233" src="http://3.bp.blogspot.com/_skxW9kDLt_c/S5bka-WDLQI/AAAAAAAAEdM/hPq-aLy2JNQ/s400/income-per-week.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Motivations for Participating on Mechanical Turk&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;To understand better why people participate on Mechanical Turk, we asked for both qualitative (i.e., free text) and a set of structured questions. The main structured question that we asked was the following:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;&lt;i&gt;Why do you complete tasks in Mechanical Turk? Please check any of the following that applies:&lt;/i&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;i&gt;Fruitful way to spend free time and get some cash (e.g., instead of watching TV)&lt;/i&gt; &lt;/li&gt;&lt;li&gt;&lt;i&gt;For "primary" income purposes (e.g., gas, bills, groceries, credit cards)&lt;/i&gt; &lt;/li&gt;&lt;li&gt;&lt;i&gt;For "secondary" income purposes, pocket change (for hobbies, gadgets, going out)&lt;/i&gt; &lt;/li&gt;&lt;li&gt;&lt;i&gt;To kill time&lt;/i&gt; &lt;/li&gt;&lt;li&gt;&lt;i&gt;I find the tasks to be fun&lt;/i&gt; &lt;/li&gt;&lt;li&gt;&lt;i&gt;I am currently unemployed, or have only a part time job&lt;/i&gt;&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/blockquote&gt;&lt;br /&gt;The answers were quite different across Indian and US-workers. Very few Indian workers participate on MTurk for "killing time", and significantly more Indians treat MTurk as a primary source of income. (Not surprising given the average income level of an Indian worker vs the income level of the US workers.)&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_skxW9kDLt_c/S5bsYOrM4zI/AAAAAAAAEdU/SBy9yuMAo4A/s1600-h/motivation-fruitful.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="192" src="http://1.bp.blogspot.com/_skxW9kDLt_c/S5bsYOrM4zI/AAAAAAAAEdU/SBy9yuMAo4A/s400/motivation-fruitful.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_skxW9kDLt_c/S5bsaRkxo1I/AAAAAAAAEdc/bTRqHn-u_tI/s1600-h/motivation-fun.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="215" src="http://3.bp.blogspot.com/_skxW9kDLt_c/S5bsaRkxo1I/AAAAAAAAEdc/bTRqHn-u_tI/s400/motivation-fun.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_skxW9kDLt_c/S5bsbl1j4XI/AAAAAAAAEdk/l4asgDq9MS8/s1600-h/motivation-killtime.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="266" src="http://2.bp.blogspot.com/_skxW9kDLt_c/S5bsbl1j4XI/AAAAAAAAEdk/l4asgDq9MS8/s400/motivation-killtime.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_skxW9kDLt_c/S5bsdHSNBsI/AAAAAAAAEds/ggZYJg0rqrU/s1600-h/motivation-primaryincome.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="260" src="http://2.bp.blogspot.com/_skxW9kDLt_c/S5bsdHSNBsI/AAAAAAAAEds/ggZYJg0rqrU/s400/motivation-primaryincome.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_skxW9kDLt_c/S5bsePJxXvI/AAAAAAAAEd0/nna-qxlPckg/s1600-h/motivation-secondaryincome.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="262" src="http://2.bp.blogspot.com/_skxW9kDLt_c/S5bsePJxXvI/AAAAAAAAEd0/nna-qxlPckg/s400/motivation-secondaryincome.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_skxW9kDLt_c/S5bsfSda5rI/AAAAAAAAEd8/8wn5Rg9fyFs/s1600-h/motivation-unemployed.PNG" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="255" src="http://4.bp.blogspot.com/_skxW9kDLt_c/S5bsfSda5rI/AAAAAAAAEd8/8wn5Rg9fyFs/s400/motivation-unemployed.PNG" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;&lt;/div&gt;While these graphs are ok, I would actually encourage everyone to go through the textual responses of the workers. Below you can find the data embedded in a Google Spreadsheet. Go through the column "EngagementQ1" and I am sure that you will enjoy reading all the answers given by the workers.&lt;br /&gt;&lt;br /&gt;&lt;script src="http://spreadsheets.google.com/gpub?url=http%3A%2F%2Ftngmqk5kknht7idkbhrks3qtltpmeg9f.spreadsheets.gmodules.com%2Fgadgets%2Fifr%3Fup__table_query_url%3Dhttp%253A%252F%252Fspreadsheets.google.com%252Ftq%253Frange%253DA1%25253AV1001%2526headers%253D-1%2526gid%253D1%2526key%253D0AjX1e06EhsXSdDZ0OGx6SzFkb2x5T0tWdy11cnpCV3c%2526pub%253D1%26up_title%3D%26up_last_query_hash%3D%26up_groupbycolumn%3D%26up__table_query_refresh_interval%3D300%26up_showfilters%3D1%26up_aggregateby%3D%26up_enablegrouping%3D1%26url%3Dhttp%253A%252F%252Fwww.google.com%252Fig%252Fmodules%252Ftable.xml&amp;amp;height=400&amp;amp;width=400"&gt;&lt;/script&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;More Details&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;The set of blog posts about the demographics of Mechanical Turk have been a little bit too popular, reaching a point where people were asking me how to cite these surveys. While I thought that pointing to the blog would be enough, I was surprised to find out that many people consider a blog post to be too "informal" to cite.&lt;br /&gt;&lt;br /&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;Although I find this academic conservatism kind of funny,&amp;nbsp;I am not sure whether such type of work &lt;i&gt;&lt;b&gt;should &lt;/b&gt;&lt;/i&gt;appear in an academic conference, journal, or magazine. Maybe a magazine-style journal would be ok but I am still not 100% convinced.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;Anyway, as a compromise, I now created a working paper with the results of these demographic studies, and now whomever is interested can cite the "official" working paper with the results on the &lt;a href="http://archive.nyu.edu/handle/2451/29585"&gt;demographics of Mechanical Turk&lt;/a&gt;. (As you will notice, it is essentially this blog post, pasted in a PDF file.) As an added value, you can also find there the Excel spreadsheet with all the results and perform your own analyses and studies.&lt;br /&gt;&lt;br /&gt;I would like to consider this working paper as a true working paper, i.e., update it over time with the most current results, if I consider this necessary. Until then, enjoy the current results and let me know if there are other questions that you would like to see answered.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-969776997930952829?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/969776997930952829'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/969776997930952829'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/03/new-demographics-of-mechanical-turk.html' title='The New Demographics of Mechanical Turk'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_skxW9kDLt_c/S5bhZ-i8EBI/AAAAAAAAEb0/RzSvlYP5KYY/s72-c/gender.PNG' height='72' width='72'/></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-7225078511782809733</id><published>2010-02-24T13:17:00.000-08:00</published><updated>2011-04-17T19:00:19.386-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><title type='text'>Why Mechanical Turk Allows Only US-based Requesters?</title><content type='html'>Many people read the blog from outside the United States. All these readers learn about Mechanical Turk and are excited about the concept, so they want to try it out. Unfortunately, no such luck! You need a US credit card to be able to fund the account (or you need to work as a Turker to accumulate the amount necessary to fund your own tasks.)&lt;br /&gt;&lt;br /&gt;So, many people are asking: Why Amazon does not open the service internationally? Why restrict Mechanical Turk only to people that have US credit cards?&lt;br /&gt;&lt;br /&gt;This was kind of puzzling to me as well, given that other Amazon Web Services (e.g., EC2, S3, etc) are open to international customers. Why other web services are open but MTurk is not?&lt;br /&gt;&lt;br /&gt;Today I met with John Hoskins, Senior Manager of Business Development of Mechanical Turk, and asked the very same question. The answer was clarifying: For all other web services, the customer is consuming Amazon services and pays Amazon. For Mechanical Turk, Amazon receives funds from requesters and then distributes them to workers.&lt;br /&gt;&lt;br /&gt;This flow of payments forces Amazon to comply with the &lt;a href="http://en.wikipedia.org/wiki/USA_PATRIOT_Act"&gt;US Patriot Act&lt;/a&gt;, especially the &lt;a href="http://en.wikipedia.org/wiki/USA_PATRIOT_Act,_Title_III"&gt;provisions about money laundering and financing of terrorist activities&lt;/a&gt;. The basic idea, known as the "&lt;a href="http://en.wikipedia.org/wiki/Know_your_customer"&gt;Know Your Customer (KYC)&lt;/a&gt;" doctrine, is that Amazon should know from whom they get money and to whom they send the money. This is possible for US credit cards and for US bank accounts, due to the regulations of the US banking system. (I also guess that this is also possible for India, given that &lt;a href="https://www.mturk.com/mturk/help?helpPage=worker#how_recv_rupees"&gt;Amazon now pays workers in India using rupees&lt;/a&gt;.)&lt;br /&gt;&lt;br /&gt;I find it kind of fascinating that Mechanical Turk could be used as a venue for money laundering but, in retrospect, not unlikely. In fact, given the relatively low fees that Amazon charges for funds to change hands, it is almost appealing. I can easily see a person posting one million $10 do-nothing HITs, available only to a single qualified worker, who can then consume them and get paid "clean" money.&lt;br /&gt;&lt;br /&gt;I guess the news is a disappointment for many aspiring international requesters, but there is some hope: If you can open a US-based credit card (e.g., have a pre-paid credit card or a gift credit card), then it should be possible to open a Mechanical Turk requester account. Or, simply go and use &lt;a href="https://crowdflower.com/"&gt;CrowdFlower&lt;/a&gt;&amp;nbsp;that will serve as an intermediary and submit your tasks to Mechanical Turk, providing many other value-added services along the way (thanks to&amp;nbsp;S�rgio Nunes for reminding me about that!).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-7225078511782809733?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7225078511782809733'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/7225078511782809733'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/02/why-mechanical-turk-allows-only-us.html' title='Why Mechanical Turk Allows Only US-based Requesters?'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-8116611991382158694</id><published>2010-01-26T14:10:00.000-08:00</published><updated>2011-04-17T19:00:19.388-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='academia'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><title type='text'>Universities and Intellectual Property: A Minefield?</title><content type='html'>One of the things that I never understood at NYU is what are the rights that the university has on the work produced by the faculty members and students. &lt;br /&gt;&lt;br /&gt;Following the intellectual properly law, there are four basic types of intellectual properly:&lt;br /&gt;&lt;ul&gt;&lt;li&gt; Copyright&lt;/li&gt;&lt;li&gt; Patents&lt;/li&gt;&lt;li&gt; Trademarks&lt;/li&gt;&lt;li&gt; Trade Secrets&lt;/li&gt;&lt;/ul&gt;If someone works for a corporation, things are pretty clear. Any paper, before being published needs to get approval. Any developed algorithms and code written is the intellectual property of the company and the company owns the copyright for the code, and can treat the algorithms as a trade secret. The company may also patent useful inventions and register some valuable trademarks. But, in all these cases, everything that is being produced within the corporation is work made for hire and owned by the corporation. The employee has typically no ownership of the produced work and it is commonly prohibited for the employee to work for another company or provide any sort of consulting services..&lt;br /&gt;&lt;br /&gt;For the work of faculty, I always felt that everything falls into a grey area. Most of the work is made public as soon as possible. Code is often released as open source, following some pretty liberal licensing scheme, or even released to the public domain. Papers are written and publicized without much, if any, vetting and the algorithms and methods described there are typically in the public domain. The only case where a university has some control over intellectual property is when a patent is filed and granted. &lt;br /&gt;&lt;br /&gt;Now, the great confusion arises when the faculty wants to work with a corporation and the university allows faculty members to engage into consulting agreements. Who owns and controls the expertise and discoveries of the faculty member? &lt;br /&gt;&lt;br /&gt;Let's say that myself, Panos, invented an algorithm in area X, wrote a paper, and published the code in an open source format. Corporation A, comes to me and asks me to consult them on area X. What is the control that my employer, NYU, has on my work? Yes, Corporation A wants to hire me because of the IP that I produced while at NYU. This IP though is publicly available, so I do not really transfer anything protected under copyright law. &lt;br /&gt;&lt;br /&gt;I have asked this question to our own tech transfer office. Unfortunately, I did not get back a clear answer. They told me that I cannot transfer code and that any patent is owned by NYU. Correct, these are indeed intellectual property assets. (Although for the case of open source code, this is again confusing.) But what about the expertise that a faculty member develops? In corporations this is often protected using some no-compete clauses in the employment contract, effectively preventing employees from directly transferring know-how. In universities, there is no such provision.&lt;br /&gt;&lt;br /&gt;I find this merging of academic and corporate worlds to be particularly confusing and I find this to be a potential minefield. Who owns what? Any ideas? Any experiences? How other universities treat the concept of tech transfer?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-8116611991382158694?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/8116611991382158694'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/8116611991382158694'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/01/universities-and-intellectual-property.html' title='Universities and Intellectual Property: A Minefield?'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-2191532417536980731</id><published>2010-01-25T20:59:00.000-08:00</published><updated>2011-04-17T19:00:19.391-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='reviews'/><category scheme='http://www.blogger.com/atom/ns#' term='academia'/><category scheme='http://www.blogger.com/atom/ns#' term='wisdom of the crowds'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='data mining'/><title type='text'>Did you find this helpful?</title><content type='html'>Last week, the New York Times Sunday magazine had an article titled &lt;a href="http://www.nytimes.com/2010/01/17/magazine/17FOB-Medium-t.html"&gt;The Reviewing Stand&lt;/a&gt;, starting with the following:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;Here�s a challenge for students of expository writing: review a popular product on Amazon and aim to get your review chosen by readers as �most helpful.� It�s dead hard. The product review, as a literary form, is in its heyday. Polemical, evocative, witty, narrative, exhortative, furious, ironic, off the cuff....&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;What I found amusing was the fact that, after reading this article, I got a notification that the journal version of the paper &lt;a href="http://ssrn.com/abstract=1261751"&gt;Estimating the Helpfulness and Economic Impact of Product Reviews: Mining Text and Reviewer Characteristics&lt;/a&gt;, co-authored with my frequent co-author, &lt;a href="http://pages.stern.nyu.edu/~aghose/"&gt;Anindya Ghose&lt;/a&gt;, has been accepted for publication at the&amp;nbsp;IEEE Transactions on Knowledge and Data Engineering (TKDE) journal.&lt;br /&gt;&lt;br /&gt;As the title suggests, one of the problems that we attack in the paper is how to predict the usefulness of a product review. For example, if you go on Amazon, you will see, on top of many reviews, how many people considered a particular product review helpful:&lt;br /&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://www.amazon.com/review/RW2POAX24DGXE/ref=cm_cr_rdp_perm"&gt;&lt;img border="0" height="86" src="http://1.bp.blogspot.com/_skxW9kDLt_c/S156AnCy_NI/AAAAAAAAEX4/bIYbPZ-t7_0/s400/helpfulness.PNG" width="400" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;So, the question is: &lt;b&gt;Can we predict how helpful a particular review will be?&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;Our first attempts to address this problem appeared in the &lt;a href="http://pages.stern.nyu.edu/~panos/publications/wits2006.pdf"&gt;WITS 2006&lt;/a&gt; and the &lt;a href="http://pages.stern.nyu.edu/~panos/publications/icec2007.pdf"&gt;ICEC 2007&lt;/a&gt; papers. Following the &lt;a href="http://lingpipe-blog.com/2008/09/11/scientific-zeitgeist-vs-independent-discoveries/"&gt;scientific zeitgeist&lt;/a&gt;, a large number of other papers appeared these years, all tackling the question of predicting helpfulness of reviews. (See the actual paper for references.)&lt;br /&gt;&lt;br /&gt;What I found rather surprising was the relative easiness of the task. A few relatively straightforward features can be used to predict with good accuracy whether a review will be deemed helpful or not.&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Check the &lt;b&gt;readability &lt;/b&gt;of the article, &lt;a href="http://en.wikipedia.org/wiki/Readability"&gt;as measured by one of the many readability metrics&lt;/a&gt;,&amp;nbsp;check the number of spelling errors, and measure basic statistics of the text, such as review length.&amp;nbsp;Using just the readability and the fraction of spelling errors in the article we can estimate with 70%-80%&amp;nbsp;accuracy&amp;nbsp;whether a review will be deemed helpful or not.&lt;/li&gt;&lt;li&gt;Check the &lt;b&gt;history &lt;/b&gt;of the reviewer. If the reviewer has been writing helpful reviews in the past, it is highly likely that reviews in the future will also be helpful. Also, if a reviewer has disclosed personal details (name, location, etc) the reviews are more likely to be helpful. Again, using just reviewer history and disclosure details, we get&amp;nbsp;70%-80%&amp;nbsp;accuracy, as measured with the AUC metric.&lt;/li&gt;&lt;li&gt;Check the "&lt;b&gt;subjectivity&lt;/b&gt;" of the review. We call a review objective if it contains mainly information that can be found in the product description and specs. A subjective review contains information that depends on the personal experiences of the reviewer. Helpful reviews tend to contain a mix of both.&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;Interestingly enough, all three feature sets seem to have equivalent predictive power. Even using them all together does not seem to increase substantially the predictive performance.&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;While preparing the final version of the paper, I also checked other papers that were attacking the same problem. While many papers were trying to predict helpfulness using textual features, I noticed that a few papers were using a set of&amp;nbsp;alternative&amp;nbsp;and interesting features:&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;&lt;b&gt;Coverage of product features&lt;/b&gt;. Many products can be considered an aggregation of multiple product features. For example, a digital camera has resolution, size, battery life, sensor size, etd. How many product features are being discussed in the review? This feature tends to have predictive power, according to (&lt;a href="http://acl.ldc.upenn.edu/D/D07/D07-1035.pdf"&gt;Liu et al, EMNLP 2007&lt;/a&gt;).&lt;/li&gt;&lt;li&gt;&lt;b&gt;Dynamics &lt;/b&gt;of reviews. Reviews that are posted early on get a higher fraction of helpful votes. In contrast, later reviews need to be more informative and comprehensive to attract the same fraction of helpful votes&amp;nbsp;(&lt;a href="http://acl.ldc.upenn.edu/D/D07/D07-1035.pdf"&gt;Liu et al, EMNLP 2007&lt;/a&gt;).&amp;nbsp;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Controversy&lt;/b&gt;. The helpfulness of a review depends not only on its own content but also on how controversial is the product under consideration (&lt;a href="http://www.cs.cornell.edu/home/kleinber/www09-helpfulness.pdf"&gt;Danescu-Niculescu-Mizil, WWW 2009&lt;/a&gt;).&lt;/li&gt;&lt;li&gt;&lt;b&gt;Social network of reviewers&lt;/b&gt;. If reviewer A trusts the reviews of reviewer B, then the reviews of B are likely to be more helpful than the reviews of A. ("Exploiting Social Context for Review Quality Prediction"; by Lu, Tsaparas, Ntoulas, and Polanyi; WWW 2010)&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;Although I have not seen a paper combining all the&amp;nbsp;above&amp;nbsp;features in order to predict the helpfulness of a review (or for ranking reviews by helpfulness), I guess that these set of features will bring predictive accuracy pretty close to its limit for this task.&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;What is next? I guess personalized recommendations are going to appear sooner or later, matching users with reviews that are more likely to benefit them.&lt;i&gt; (&lt;/i&gt;&lt;b&gt;&lt;i&gt;Update&lt;/i&gt;&lt;/b&gt;&lt;i&gt;: See Eugene's comment below for related papers.)&lt;/i&gt; For example, a beginner in photography will be interested in a&amp;nbsp;different&amp;nbsp;type of review when buying an SLR, compared to a seasoned professional. We already know that reviews from similar users can be used for recommending products (see Netflix) so it is not unlikely that different types of reviews will be deemed helpful by different types of users.&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;So, did you find this blog post useful?&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-2191532417536980731?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/2191532417536980731'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/2191532417536980731'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2010/01/did-you-find-this-helpful.html' title='Did you find this helpful?'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_skxW9kDLt_c/S156AnCy_NI/AAAAAAAAEX4/bIYbPZ-t7_0/s72-c/helpfulness.PNG' height='72' width='72'/></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-1825029497739676166</id><published>2009-12-05T16:33:00.000-08:00</published><updated>2011-04-17T19:00:19.394-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='mechanical turk'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><title type='text'>Prisoner's Dilemma and Mechanical Turk</title><content type='html'>I have been reading lately, about the differences between mathematical models of behavior and real human behavior. So, I decided to try on Mechanical Turk the classical game theory model of &lt;a href="http://en.wikipedia.org/wiki/Prisoner's_dilemma"&gt;Prisoner's Dilemma&lt;/a&gt;. (See also Brendan's &lt;a href="http://anyall.org/blog/2007/07/game-outcome-graphs-prisoners-dilemma-with-fun-arrows/"&gt;nice explanations and diagrams&lt;/a&gt; if you have never been exposed to game theory before.)&lt;br /&gt;&lt;br /&gt;From Wikipedia:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;In its classical form, the prisoner's dilemma ("PD") is presented as follows:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;Two suspects are arrested by the police. The police have insufficient evidence for a conviction, and, having separated both prisoners, visit each of them to offer the same deal. If one testifies (defects from the other) for the prosecution against the other and the other remains silent (cooperates with the other), the betrayer goes free and the silent accomplice receives the full 10-year sentence. If both remain silent, both prisoners are sentenced to only six months in jail for a minor charge. If each betrays the other, each receives a five-year sentence. Each prisoner must choose to betray the other or to remain silent. Each one is assured that the other would not know about the betrayal before the end of the investigation. How should the prisoners act?&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;If we assume that each player cares only about minimizing his or her own time in jail, then the prisoner's dilemma forms a non-zero-sum game in which two players may each cooperate with or defect from (betray) the other player. In this game, as in all game theory, the only concern of each individual player (prisoner) is maximizing his or her own payoff, without any concern for the other player's payoff. The unique equilibrium for this game is a Pareto-suboptimal solution, that is, rational choice leads the two players to both play defect, even though each player's individual reward would be greater if they both played cooperatively.&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;My first attempt was to post to Mechanical Turk this dilemma in a setting of the following game:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;You are playing a game together with a stranger.&amp;nbsp;Each of you have two choices to play: "trust" or "cheat".&lt;br /&gt;&lt;ul&gt;&lt;li&gt;If both of you play "trust", you win $30,000 each.&lt;/li&gt;&lt;li&gt;If both of you play "cheat", you get $10,000 each.&lt;/li&gt;&lt;li&gt;If one player plays "trust" and the other plays "cheat", then the player that played "cheat" gets $50,000 and the player that played "trust" gets $0.&lt;/li&gt;&lt;/ul&gt;You cannot communicate during the game, and CANNOT see the final action of the other player. Both actions will be revealed simultaneously.&lt;br /&gt;&lt;br /&gt;What would you play? "Cheat" or "Trust"?&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;Basic &lt;a href="http://en.wikipedia.org/wiki/Prisoner's_dilemma"&gt;game theory predicts&lt;/a&gt; that the participants will choose "cheat" resulting in a suboptimal equilibrium. However, participants on Mechanical Turk did not behave like that. Instead, 48 out of the 100 participants decided to play "trust", which is above the 33% observed in the lab experiments of&amp;nbsp;&lt;a href="http://dx.doi.org/10.1016/0010-0285(92)90015-T"&gt;(Shafir and Tversky, 1992)&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Next, I wanted to make the experiment more realistic. Would anything change if instead of playing an imaginary game, I promised actual monetary benefits to the participants?&amp;nbsp;So, I modified the game, and asked the participants to play against each other. Here is the revised task description.&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;You are playing a game against another Turker. Your action here will be matched with an action of another Mechanical Turk worker.&lt;br /&gt;&lt;br /&gt;Each of you have two choices to play: "trust" or "cheat".&lt;br /&gt;&lt;ul&gt;&lt;li&gt;If both of you play "trust", you both get a bonus of $0.30.&lt;/li&gt;&lt;li&gt;If both of you play "cheat", you both get a bonus of $0.10.&lt;/li&gt;&lt;li&gt;If one Turker plays "trust" and the other plays "cheat", then the Turker that played "cheat" gets a bonus of $0.50 and the Turker that played "trust" gets nothing.&lt;/li&gt;&lt;/ul&gt;What is your action? "Cheat" or "Trust"?&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;I asked 120 participants to play the game, paying just 1 cent for the participation. Interestingly enough, I had a perfect split in the results. 60 Turkers decided to cheat, and 60 Turkers decided to cheat. The final result was 20 pairs of trust-trust, 20 pairs of cheat-cheat, and 20 pairs of cheat-trust.&lt;br /&gt;&lt;br /&gt;In other words, the theory prediction that people will be locked in a non-optimal equilibrium was not correct, neither in the "imaginary" game, nor in the case where the workers had to gain some actually monetary benefit.&lt;br /&gt;&lt;br /&gt;Finally, I decided to change the payoff matrix, and replicate the structure of the &lt;a href="http://en.wikipedia.org/wiki/Friend_or_Foe%3F"&gt;TV game show "Friend or Foe"&lt;/a&gt;. There, participants get $50K each if they cooperate, $0 if they do not, and if one chooses trust and the other cheat, the "cheat" gets $100K and the "trust" gets $0.&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;You are playing a game together with a stranger.&lt;br /&gt;&lt;br /&gt;Each of you have two choices to play: "trust" or "cheat".&lt;br /&gt;&lt;ul&gt;&lt;li&gt;If both of you play "trust", you both win $50,000. &lt;br /&gt;&lt;/li&gt;&lt;li&gt;If both of you play "cheat", you both get $0. &lt;br /&gt;&lt;/li&gt;&lt;li&gt;If one player plays "trust" and the other plays "cheat", then the player that played "cheat" gets $100,000 and the player that played "trust" gets $0. &lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;You cannot communicate during the game, and CANNOT see the final action of the other player. Both actions will be revealed simultaneously.&lt;br /&gt;&lt;br /&gt;What would you play? "Cheat" or "Trust"?&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;Interestingly enough, in this setting &lt;b&gt;ALL 100 players ended up playing "trust"&lt;/b&gt;, which was quite different from the previous game and from the &lt;a href="http://dx.doi.org/10.1162/rest.88.3.463"&gt;behavior of the players in the TV show&lt;/a&gt;, where, in&amp;nbsp;almost 25% of the played games, both players chose "cheat" ending up with $0, and in 25% of the games the players collaborated and played "trust" getting $50K each.&lt;br /&gt;&lt;br /&gt;So, in my final attempt, I asked Turkers to play this "Friend of Foe" game, having monetary incentives. Here is the task that I posted on Mechanical Turk.&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;You are playing a game against another Turker. Your action here will be matched with an action of another Mechanical Turk worker.&lt;br /&gt;&lt;br /&gt;Each of you have two choices to play: "trust" or "cheat".&lt;br /&gt;&lt;ul&gt;&lt;li&gt;If both of you play "trust", you both get a bonus of $0.50.&lt;/li&gt;&lt;li&gt;If both of you play "cheat", you both get $0.&lt;/li&gt;&lt;li&gt;If one Turker plays "trust" and the other plays "cheat", then the Turker that played "cheat" gets a bonus of $1.00 and the Turker that played "trust" gets nothing.&lt;/li&gt;&lt;/ul&gt;What is your action? "Cheat" or "Trust"?&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;In this game, 33% of the users decided to cheat, resulting in 6/50 games where both players got nothing, 23/50 games where both players got a 50 cent bonus, and 21/50 games where one player got $1 and the other player got nothing.&lt;br /&gt;&lt;br /&gt;I found the difference in behavior between the imaginary game and the actual one to be pretty interesting. Also, the deviation from the predictions of the game-theoretic model is striking.&lt;br /&gt;&lt;br /&gt;Although I am not the first to actually observe that, this deviation got me wondering: Why do we use elaborate game theory models for modeling user behavior, when not even the simplest such models do not correspond to reality? How can someone take seriously  the concept of an equilibrium when a game, introduced in the intro chapter of every game theory textbook, simply does not correspond to reality? Do we really understand the limitations of our tools, or mathematical and analytic elegance end up being more important than reality?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6538286886012474786-1825029497739676166?l=topsitemap.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/1825029497739676166'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/6538286886012474786/posts/default/1825029497739676166'/><link rel='alternate' type='text/html' href='http://topsitemap.blogspot.com/2009/12/prisoner-dilemma-and-mechanical-turk.html' title='Prisoner&amp;#39;s Dilemma and Mechanical Turk'/><author><name>Internet at Every Where</name><uri>http://www.blogger.com/profile/13430497579682715412</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author></entry><entry><id>tag:blogger.com,1999:blog-6538286886012474786.post-7338618755286416170</id><published>2009-11-30T17:58:00.000-08:00</published><updated>2011-04-17T19:00:19.397-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='academia'/><category scheme='http://www.blogger.com/atom/ns#' term='research'/><category scheme='http://www.blogger.com/atom/ns#' term='hcomp'/><title type='text'>Anchoring and Mechanical Turk</title><content type='html'>Over the last few days, I have been reading the book &lt;a href="http://www.amazon.com/Predictably-Irrational-Hidden-Forces-Decisions/dp/006135323X/"&gt;Predictably Irrational&lt;/a&gt;&amp;nbsp;by &lt;a href="http://web.mit.edu/ariely/www/MIT/"&gt;Dan Ariely&lt;/a&gt;, which (predictably) describes many biases that we exhibit when making decisions. These biases are not just effects of random chance but are rather expected and predictable. Such biases and the "irrationality" of human agents is one of the focuses of behavioral economics; these biases have been also extensively studied in the field of cognitive psychology, which examines the ways that human agents process information.&lt;br /&gt;&lt;br /&gt;One of the classic biases is the bias of "&lt;a href="http://en.wikipedia.org/wiki/Anchoring"&gt;anchoring&lt;/a&gt;". Dan Ariely in his book shows how he got students to bid higher or lower for a particular bottle of wine: He asked students to write down the &lt;i&gt;last digit of their social security number&lt
