Web Page Packrats: The Top 25 Del.icio.us Users…

As a casual user or del.icio.us (the popular web 2.0 bookmarking site), it always amazes me to see individuals who have even a few hundred bookmarks. My curiosity piqued, I took a few minutes to throw together a script that, in conjuction with Google and del.icio.us, was able to find some of the biggest users of del.icio.us. Below are the top 25 by my calculations, with estimates of the total number of bookmarks they have on del.icio.us. Did you make the list? User Bookmarks planetoid 10260 alancordova 10000 katiesays17 9930 julan 9450 yerfatma 7870 Preoccupations 7680 snfg 7440 tolvuvit 7390 maidenhalo 7310 yesmar 7120 tomz 6940 joaom 6630 mymarkup 6500 hustwj 6470 maratimba 6230 tsupo 6070 bswrchrd 6000 rblackwe 5350 owen 5240 erniesthings 5240 ravee_27 5230...

Surfing As GoogleBot – Their IP, Their User-Agent, Their Bot Characteristics

After reading this article and this article which give frustratingly over-simplifications on user-agent spoofing to get past cloaked websites, I figured I should write something on how to REALLY behave like Google. Cloaking often goes well beyond this, using IP delivery, User Agent cloaking, javascript and cookie detection, and referer detection – all of which can be used to determine that you are you and not a bot. So, how do you beat all 5 major types of cloaking? 1. Beat IP Delivery: Use Google Translate as a Proxy, translating from spanish->english even though the site is already in English. 2. Beat User-Agent Cloaking: Use the FirefoxUser-Agent Switcher to spoof as GoogleBot 3. Beat Javascript Detection: Use the Firefox Web Developer Toolbar to turn...