{"id":5305,"date":"2010-12-31T16:11:20","date_gmt":"2010-12-31T16:11:20","guid":{"rendered":"http:\/\/scienceblogs.com\/principles\/2010\/12\/31\/benfords-law-of-amazon-ranking\/"},"modified":"2010-12-31T16:11:20","modified_gmt":"2010-12-31T16:11:20","slug":"benfords-law-of-amazon-ranking","status":"publish","type":"post","link":"http:\/\/chadorzel.com\/principles\/2010\/12\/31\/benfords-law-of-amazon-ranking\/","title":{"rendered":"Benford&#8217;s Law of Amazon Rankings"},"content":{"rendered":"<p>Late last year, Matthew Beckler was nice enough to make a <a href=\"http:\/\/www.mbeckler.org\/dog_physics\/\">sales rank tracker<\/a> for <a href=\"http:\/\/dogphysics.com\/\"><cite>How to Teach Physics to Your Dog<\/cite><\/a>. Changes in the Amazon page format made it stop working a while ago, though, and now Amazon reports roughly equivalent data via its AuthorCentral feature, with the added bonus of BookScan sales figures. So I&#8217;ve got a new source for my book sales related cat-vacuuming.<\/p>\n<p>Still, there&#8217;s this great big data file sitting there with thousands of hourly sales rank numbers, and I thought to myself &#8220;I ought to be able to do something else amusing with this&#8230;&#8221; And then Corky at the Virtuosi did a <a href=\"http:\/\/thevirtuosi.blogspot.com\/2010\/12\/benfords-law.html\">post about Benford&#8217;s Law<\/a>, and I said &#8220;Ah-ha!&#8221;<\/p>\n<p>Benford&#8217;s Law, if you&#8217;re not familiar with it, says that in a large assortment of numbers generated by some process, you expect the first non-zero digits of all the numbers to be distributed in a logarithmic fashion. About 30% of the first digits should be &#8220;1,&#8221; and only about 4% of the first digits should be &#8220;9.&#8221; This goes against the naive expectation that the numbers ought to be evenly distributed, and is actually used by &#8220;forensic accountants&#8221; to catch people who are cooking their books&#8211; someone who is making up numbers to fill a phony set of books is fairly likely to pick numbers that don&#8217;t follow a Benford&#8217;s Law distribution.<\/p>\n<p>So, I&#8217;ve got 6,818 hourly values of the Amazon sales rank for my book, spanning almost three orders of magnitude. How do those digits match up with Benford&#8217;s Law? Well:<\/p>\n<p><!--more--><\/p>\n<p><img decoding=\"async\" src=\"http:\/\/scienceblogs.com\/principles\/wp-content\/blogs.dir\/467\/files\/2012\/04\/i-c92e76f3144932da122da397066ff84e-benford_rank.png\" alt=\"i-c92e76f3144932da122da397066ff84e-benford_rank.png\" \/><\/p>\n<p>That&#8217;s&#8230; pretty good, really. The blue diamonds are the actual frequency of the digit, the red squares are the prediction of Benford&#8217;s Law. There&#8217;s a slight shortage of 1&#8217;s and a surplus of 5&#8217;s and 6&#8217;s, but all the actual frequencies are within about 5% of the expected values. The most basic assumption about the statistics of this sort of data set would lead you to expect an uncertainty of about 1% (that is, 1 over the square root of 6818), but that&#8217;s pretty crude.<\/p>\n<p>What does this tell us? Not a whole lot, really. if Amazon is somehow fudging their sales rank data (which I have no reason to suspect them of doing), they&#8217;re clever enough not to get caught by this really crude analysis of one book&#8217;s figures. <\/p>\n<p>Making this graph has, however, given me a way to put off some tedious and annoying work for another hour or so, so let&#8217;s hear it for Benford&#8217;s Law!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Late last year, Matthew Beckler was nice enough to make a sales rank tracker for How to Teach Physics to Your Dog. Changes in the Amazon page format made it stop working a while ago, though, and now Amazon reports roughly equivalent data via its AuthorCentral feature, with the added bonus of BookScan sales figures.&hellip; <a class=\"more-link\" href=\"http:\/\/chadorzel.com\/principles\/2010\/12\/31\/benfords-law-of-amazon-ranking\/\">Continue reading <span class=\"screen-reader-text\">Benford&#8217;s Law of Amazon Rankings<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"1","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[67,9,147,143,11],"tags":[572,573,574,326,157,520],"class_list":["post-5305","post","type-post","status-publish","format-standard","hentry","category-book_writing","category-math","category-playing-with-graphs","category-sales","category-science","tag-amazon-sales-rank","tag-benfords-law","tag-cat-vacuuming","tag-how-to-teach-physics-to-your-dog","tag-math-2","tag-statistics","entry"],"_links":{"self":[{"href":"http:\/\/chadorzel.com\/principles\/wp-json\/wp\/v2\/posts\/5305","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/chadorzel.com\/principles\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/chadorzel.com\/principles\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/chadorzel.com\/principles\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"http:\/\/chadorzel.com\/principles\/wp-json\/wp\/v2\/comments?post=5305"}],"version-history":[{"count":0,"href":"http:\/\/chadorzel.com\/principles\/wp-json\/wp\/v2\/posts\/5305\/revisions"}],"wp:attachment":[{"href":"http:\/\/chadorzel.com\/principles\/wp-json\/wp\/v2\/media?parent=5305"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/chadorzel.com\/principles\/wp-json\/wp\/v2\/categories?post=5305"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/chadorzel.com\/principles\/wp-json\/wp\/v2\/tags?post=5305"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}