{"id":2480,"date":"2004-05-11T18:11:33","date_gmt":"2004-05-12T08:11:33","guid":{"rendered":"http:\/\/michaelhans.com\/eclecticism\/2004\/05\/12\/google-bits-redactions-and-spam\/"},"modified":"2019-12-10T09:38:18","modified_gmt":"2019-12-10T17:38:18","slug":"google-bits-redactions-and-spam","status":"publish","type":"post","link":"https:\/\/michaelhans.com\/eclecticism\/2004\/05\/11\/google-bits-redactions-and-spam\/","title":{"rendered":"Google bits: redactions and spam"},"content":{"rendered":"<div class='__iawmlf-post-loop-links' style='display:none;' data-iawmlf-post-links='[{&quot;id&quot;:6664,&quot;href&quot;:&quot;http:\\\/\\\/www.google.com&quot;,&quot;archived_href&quot;:&quot;https:\\\/\\\/web-wp.archive.org\\\/web\\\/20260309014042\\\/https:\\\/\\\/www.google.com\\\/&quot;,&quot;redirect_href&quot;:&quot;&quot;,&quot;checks&quot;:[{&quot;date&quot;:&quot;2026-03-09 07:42:11&quot;,&quot;http_code&quot;:200},{&quot;date&quot;:&quot;2026-03-16 19:56:46&quot;,&quot;http_code&quot;:200},{&quot;date&quot;:&quot;2026-03-24 23:48:45&quot;,&quot;http_code&quot;:200},{&quot;date&quot;:&quot;2026-03-29 04:40:19&quot;,&quot;http_code&quot;:200},{&quot;date&quot;:&quot;2026-04-01 08:28:11&quot;,&quot;http_code&quot;:200}],&quot;broken&quot;:false,&quot;last_checked&quot;:{&quot;date&quot;:&quot;2026-04-01 08:28:11&quot;,&quot;http_code&quot;:200},&quot;process&quot;:&quot;done&quot;},{&quot;id&quot;:11111,&quot;href&quot;:&quot;http:\\\/\\\/www.google.com\\\/googleblog\\\/2004\\\/05\\\/going-out-of-our-way-to-find-right.html&quot;,&quot;archived_href&quot;:&quot;&quot;,&quot;redirect_href&quot;:&quot;&quot;,&quot;checks&quot;:[],&quot;broken&quot;:false,&quot;last_checked&quot;:null,&quot;process&quot;:&quot;done&quot;},{&quot;id&quot;:11112,&quot;href&quot;:&quot;http:\\\/\\\/diveintomark.org\\\/archives\\\/2004\\\/05\\\/11\\\/google-watcher&quot;,&quot;archived_href&quot;:&quot;https:\\\/\\\/web-wp.archive.org\\\/web\\\/20110806092707\\\/http:\\\/\\\/diveintomark.org\\\/archives\\\/2004\\\/05\\\/11\\\/google-watcher&quot;,&quot;redirect_href&quot;:&quot;&quot;,&quot;checks&quot;:[{&quot;date&quot;:&quot;2026-03-11 12:19:17&quot;,&quot;http_code&quot;:503},{&quot;date&quot;:&quot;2026-04-03 09:13:11&quot;,&quot;http_code&quot;:503}],&quot;broken&quot;:false,&quot;last_checked&quot;:{&quot;date&quot;:&quot;2026-04-03 09:13:11&quot;,&quot;http_code&quot;:503},&quot;process&quot;:&quot;done&quot;},{&quot;id&quot;:11113,&quot;href&quot;:&quot;http:\\\/\\\/slashdot.org\\\/comments.pl?sid=107211&amp;cid=9119940&quot;,&quot;archived_href&quot;:&quot;&quot;,&quot;redirect_href&quot;:&quot;&quot;,&quot;checks&quot;:[],&quot;broken&quot;:false,&quot;last_checked&quot;:null,&quot;process&quot;:&quot;done&quot;},{&quot;id&quot;:11114,&quot;href&quot;:&quot;http:\\\/\\\/www.metafilter.com\\\/mefi\\\/33035&quot;,&quot;archived_href&quot;:&quot;https:\\\/\\\/web-wp.archive.org\\\/web\\\/20070213183426\\\/http:\\\/\\\/www.metafilter.com:80\\\/mefi\\\/33035&quot;,&quot;redirect_href&quot;:&quot;&quot;,&quot;checks&quot;:[{&quot;date&quot;:&quot;2026-03-11 12:19:25&quot;,&quot;http_code&quot;:200},{&quot;date&quot;:&quot;2026-04-03 09:13:08&quot;,&quot;http_code&quot;:200}],&quot;broken&quot;:false,&quot;last_checked&quot;:{&quot;date&quot;:&quot;2026-04-03 09:13:08&quot;,&quot;http_code&quot;:200},&quot;process&quot;:&quot;done&quot;},{&quot;id&quot;:11115,&quot;href&quot;:&quot;http:\\\/\\\/hello.typepad.com\\\/hello\\\/2004\\\/05\\\/google_on_outso.html&quot;,&quot;archived_href&quot;:&quot;https:\\\/\\\/web-wp.archive.org\\\/web\\\/20250911040314\\\/https:\\\/\\\/hello.typepad.com\\\/hello\\\/2004\\\/05\\\/google_on_outso.html&quot;,&quot;redirect_href&quot;:&quot;&quot;,&quot;checks&quot;:[{&quot;date&quot;:&quot;2026-03-11 12:19:26&quot;,&quot;http_code&quot;:403},{&quot;date&quot;:&quot;2026-04-03 09:13:10&quot;,&quot;http_code&quot;:403}],&quot;broken&quot;:false,&quot;last_checked&quot;:{&quot;date&quot;:&quot;2026-04-03 09:13:10&quot;,&quot;http_code&quot;:403},&quot;process&quot;:&quot;done&quot;},{&quot;id&quot;:8962,&quot;href&quot;:&quot;http:\\\/\\\/ranchero.com\\\/netnewswire&quot;,&quot;archived_href&quot;:&quot;https:\\\/\\\/web-wp.archive.org\\\/web\\\/20210115155306\\\/https:\\\/\\\/ranchero.com\\\/netnewswire\\\/&quot;,&quot;redirect_href&quot;:&quot;&quot;,&quot;checks&quot;:[{&quot;date&quot;:&quot;2026-03-10 15:54:12&quot;,&quot;http_code&quot;:404},{&quot;date&quot;:&quot;2026-03-19 00:53:04&quot;,&quot;http_code&quot;:404},{&quot;date&quot;:&quot;2026-03-31 14:04:21&quot;,&quot;http_code&quot;:404}],&quot;broken&quot;:true,&quot;last_checked&quot;:{&quot;date&quot;:&quot;2026-03-31 14:04:21&quot;,&quot;http_code&quot;:404},&quot;process&quot;:&quot;done&quot;}]'><\/div>\n<p>Two interesting <a href=\"http:\/\/www.google.com\/\" title=\"Google\">Google<\/a>-related bits today.<\/p>\n<p>Firstly, a paragraph about outsourcing jobs mysteriously disappeared from the <a href=\"http:\/\/www.google.com\/googleblog\/2004\/05\/going-out-of-our-way-to-find-right.html\" title=\"Going out of our way to find the right people.\">Google Weblog<\/a> at some point during the day. <a href=\"http:\/\/diveintomark.org\/archives\/2004\/05\/11\/google-watcher\" title=\"Google Watcher\">Mark Pilgrim pointed this out<\/a> (along with <a href=\"http:\/\/slashdot.org\/comments.pl?sid=107211&amp;cid=9119940\" title=\"\/.: Comment on outsourcing disappeared\">\/.<\/a>, <a href=\"http:\/\/www.metafilter.com\/mefi\/33035\" title=\"Google's blog is highly dynamic indeed!\">MeFi<\/a>, and <a href=\"http:\/\/hello.typepad.com\/hello\/2004\/05\/google_on_outso.html\" title=\"Google on outsourcing\">Hello Typepad<\/a>) and quite rightly took Google to task for the unremarked changes:<\/p>\n<blockquote><p>\n  This kind of revisionist history is unacceptable, regardless of who does it. If you don&#8217;t want it saved for all time, don&#8217;t publish it on the Internet. Putting &#8220;blog&#8221; on the top of the page does not absolve you of all responsibility.\n<\/p><\/blockquote>\n<p><a href=\"http:\/\/ranchero.com\/netnewswire\/\" title=\"NetNewsWire\">NetNewsWire<\/a>&#8216;s &#8220;show changes&#8221; feature caught the edits, though, so here&#8217;s a quick screen capture showing just how the post was reworded:<\/p>\n<p><a href=\"https:\/\/michaelhans.com\/eclecticism\/graphics\/2004\/05\/graphics\/google_outsourcing.jpg\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/michaelhans.com\/eclecticism\/graphics\/2004\/05\/graphics\/google_outsourcing-tm.jpg\" height=\"252\" width=\"400\" alt=\"Google's outsourcing remarks\" \/><\/a><\/p>\n<p>The second bit is more on the amusing side, and has less to do directly with Google. I got a piece of comment spam earlier that, when I looked at it, made me laugh, simply because in an effort to make it look <em>almost<\/em> like a real comment, the spammer had mixed links in with a paragraph of real text. It just so happens that the paragraph they chose was one from Google&#8217;s website, discussing how pages are indexed after being submitted to Google. I&#8217;ve replaced the links with bolded text in the following snippet, of course:<\/p>\n<blockquote><p>\n  When a URL is submitted to Google, <strong>Sex Toy Shop<\/strong> we look for it in our <strong>Hotel Booking<\/strong> next crawl. If <strong>Low Interest Credit Card<\/strong> you&#8217;ve already submitted your <strong>Buy Cialis<\/strong> URL, your site could easily <strong>Atkins Diet<\/strong> appear in our new index, which will go <strong>Nude Celebrity<\/strong> up when the current crawl is completed. However, <strong>Online Casinos<\/strong> if no other site links to yours, it <strong>Dating Personals<\/strong> may be difficult for our crawler to find <strong>Tag Watch<\/strong> you. Conversely, if many sites link to <strong>Seiko Watch<\/strong> your page, there is a good <strong>Car Hire<\/strong> chance we will find you without your submitting your <strong>Register Domain Name<\/strong> URL. Occasionally, websites are not reachable <strong>Ladies Watches<\/strong> when we try to crawl them because of <strong>Coral Bookmaker<\/strong> network or hosting problems.\n<\/p><\/blockquote>\n<p>It <em>almost<\/em> makes sense when you read it&#8230;<\/p>\n<p><strong>iTunes:<\/strong> &#8220;Another One Bites the Dust (Wyclef Jean)&#8221; by Queen feat. Free\/Jean, Wyclef\/Pras from the album <em>Small Soldiers<\/em> (1998, 4:22).<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I got a piece of comment spam earlier that, when I looked at it, made me laugh, simply because in an effort to make it look almost like a real comment, the spammer had mixed links in with a paragraph of real text. It just so happens that the paragraph they chose was one from Google&#8217;s website, discussing how pages are indexed after being submitted to Google.<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[2040],"tags":[39,599],"class_list":["post-2480","post","type-post","status-publish","format-standard","hentry","category-blog","tag-links","tag-weblogs"],"_links":{"self":[{"href":"https:\/\/michaelhans.com\/eclecticism\/wp-json\/wp\/v2\/posts\/2480","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/michaelhans.com\/eclecticism\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/michaelhans.com\/eclecticism\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/michaelhans.com\/eclecticism\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/michaelhans.com\/eclecticism\/wp-json\/wp\/v2\/comments?post=2480"}],"version-history":[{"count":0,"href":"https:\/\/michaelhans.com\/eclecticism\/wp-json\/wp\/v2\/posts\/2480\/revisions"}],"wp:attachment":[{"href":"https:\/\/michaelhans.com\/eclecticism\/wp-json\/wp\/v2\/media?parent=2480"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/michaelhans.com\/eclecticism\/wp-json\/wp\/v2\/categories?post=2480"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/michaelhans.com\/eclecticism\/wp-json\/wp\/v2\/tags?post=2480"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}