{"id":2524,"date":"2003-03-18T05:48:36","date_gmt":"2003-03-18T05:48:36","guid":{"rendered":"https:\/\/ezoons.com\/?p=2524"},"modified":"2003-03-18T05:48:36","modified_gmt":"2003-03-18T05:48:36","slug":"email_crawlers","status":"publish","type":"post","link":"https:\/\/ezoons.com\/?p=2524","title":{"rendered":"Email Crawlers are Evil"},"content":{"rendered":"<p><a href=\"http:\/\/philringnalda.com\/blog\/2003\/03\/allnewer_fast_again_weblogscom.php\">Phil Ringnalda writes<\/a> about <a href=\"http:\/\/www.scripting.com\/\">Dave<\/a> upgrading the weblogs.com machine and finding someone crawling it.<\/p>\n<blockquote>\n<p>I&#8217;m trying really hard not to think about how Dave was seeing a heavy load during the changeover because someone was crawling all over the Radio discussion group. Whether he meant radio.userland.com\/discuss\/ or radiocomments.userland.com\/discuss\/, in either case if I were crawling it, it would be because of all those juicy email addresses, sitting out at the end of \/profiles\/$ URLs all over the place. I&#8217;ve always thought those were fat enough targets to be well worth writing a special purpose crawler. (Note for the irony impaired: I&#8217;m not actually a spammer, or a writer of email harvesters.)<\/p>\n<\/blockquote>\n<p>I hate to break it to you, but they&#8217;ve been crawled a bunch (and I know I&#8217;ve griped about it a few times here).  They grabbed one of my addresses and I&#8217;ve gotten span from it.  I personally think Userland needs to start encoding email addresses anywhere it prints them out on a web page (or find a way not to print them out at all, I personally don&#8217;t trust the endcoding thing).<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Phil Ringnalda writes about Dave upgrading the weblogs.com machine and finding someone crawling it. I&#8217;m trying really hard not to think about how Dave was seeing a heavy load during the changeover because someone was crawling all over the Radio discussion group. Whether he meant radio.userland.com\/discuss\/ or radiocomments.userland.com\/discuss\/, in either case if I were crawling [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-2524","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/ezoons.com\/index.php?rest_route=\/wp\/v2\/posts\/2524","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/ezoons.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ezoons.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ezoons.com\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/ezoons.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=2524"}],"version-history":[{"count":0,"href":"https:\/\/ezoons.com\/index.php?rest_route=\/wp\/v2\/posts\/2524\/revisions"}],"wp:attachment":[{"href":"https:\/\/ezoons.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=2524"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ezoons.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=2524"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ezoons.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=2524"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}