About the author

Steven HarmanSteven Harman is a passionate developer who believes that writing great software isn't just a job, its a craft.

ASP.NET MVP

For recent posts and more about me, scroll to the bottom.

Subscribe

  • Subscribe to my feed. via RSS
  • Subscribe via email via email

Jobs

Badges

  • Subtext Project
  • Support Subtext
  • HiddenNetwork.com Banner

Preventing Comment Spam - Google's Approach

I'm sure anyone who has their own Blog, or has even come across someone else's Blog, would agree that comment spam can be quit a nuisance. Google has recently announced their idea for how to prevent such abuse. Google's idea is to add a new attribute to "any [hyper]link that a user can create on your site automatically." Google claims that when their spiders see hyperlinks with the attribute (which takes the form rel="nofollow"), they won't give any credit to the links when they rank their search results. The thinking is this will eliminate the benefit the spammer would gain (higher ranking resulting in more page visits) from posting such links. Having a high ranking result for a search can mean a lot of traffic, and so a lot of revenue for any site.

While this solution may very well result in lower page rankings for spammer's sites, it doesn't seem to solve the problem of your Blog getting spammed. Also, this seems to work directly against one of the greatest (and inherently brilliant) features of Blogs, Forums, etc... As they grow they automatically build vast, interconnected bases of information on all sorts of topics. There are Blogs out there for nearly every topic imaginable, and while many may seem (and probably are) useless to you, there are also those that you (or someone) will find very valuable. The great thing is that when you find one Blog (or forum post, etc...) with information you need, it will usually point to to more resources along the same topic. Sound eerily familiar?  Perhaps that's b/c its precisely what Google's PageRank is designed to exploit.

So what is a good solution? As we've seen time and time again, there is no end-all-be-all, no magic bullet (err... piece of code) to stop spammers. They will continue to find ways to circumvent whatever measures we take to protect ourselves from their constant barrage of propaganda. However, to quote the old cliché, lets not "cut off our nose, to spite our face." I think that something like CAPTCHA is a much more viable solution. Most of those are for .NET, but there are CAPTCHA controls out there for all kinds of languages/frameworks. Well, I'm going to give it a try on my Blog anyhow.

Update: Since I'm now involved in the subText project, I'll just wait till I install it (soon) since some anti-spam measures have already been implemented.

What others are saying.

# re: Preventing Commnet Spam - Google's Approach
Gravatar Cliff
Jan 19, 2005
Let me know how well CAPTCHA works out. Right now, my CM software emails me about each comment, which I then have to (manually) approve. For now thats fine - I think a total of 3 people (including myself) have seen the site. If I were a /. moderator, on the other hand...
# re: Preventing Commnet Spam - Google's Approach
Gravatar Cliff
Jan 19, 2005
hmmm...interesting handling of link tags...
# re: Preventing Commnet Spam - Google's Approach
Gravatar Steve Harman
Jan 19, 2005
Fixed your comments... tho I had to do it manually. I think that once I get the CAPTCHA working, I'm going to tweak the .Text code to allow commenters to post some HTML tags.
# re: Preventing Comment Spam - Google's Approach
Gravatar Steve Harman
Jan 31, 2005
Looks like there may be a new breed of CAPTCHA... well at least in theory.

"An Effective Solution for Spam" (using 3-D models):
http://home.nyc.rr.com/spamsolution/An%20Effective%20Solution%20for%20Spam.htm
# re: Preventing Comment Spam - Google's Approach
Gravatar Cliff
Feb 02, 2005
CAPTCHA has been defeated:

http://www.brains-n-brawn.com/default.aspx?vDir=aicaptcha

Just thought you might wanna know... :-)
# re: There's gonna be war, like there have never been before!!
# re: Preventing Comment Spam - Google's Approach
Gravatar william lee
Oct 11, 2005
catcha is useless most spam robot now goes after the back ends and sent the form information right to your receiving pages. They never need to visit your form pages the Google method is the only way out like it or not. I no longer leave my URl when leaving comments because these is one sure way of inviting frefferal and comments spam to your own blog.
# re: Preventing Comment Spam - Google's Approach
Gravatar Steve Harman
Oct 11, 2005
I no longer leave my URl when leaving comments because these is one sure way of inviting frefferal and comments spam to your own blog.
William, so you don't like to leave your URL when posting comments on other blogs for fear that it will lead to more comment spam on your blog. OK, fair enough. HOWEVER, this is in direct contradiction to one of the greatest resources that internet communities (be it blogs, forums, etc...) provide. (Please re-read the original post if you're unclear as to what I am referring.) Another, and to some more important, disadvantage of that type of thinking is -- less traffic for your site. Consider this: had you left a URL in your comment, I would most certainly have followed it through just to see what else you might have to say. Who knows, perhaps I even would have found something useful or entertaining there, and possibly even visited your site again later. But, now we'll never know.

Oh, and that last little bit there, the part about the links between sites, is also a major part of what Google does so well. Hell, the entire algorithm was originally designed on the idea of BackLinks.

Also, the Google method isn't a viable one if you wish to have an immediate impact on the ammount of spam (read: have less) hitting your site. It is only a means by which they (Google) and the rest of the internet community can hope to eventually eliminate the payoff of comment/referral spam, rendering it useless to spammers. By which time our evil-spam-overlords will have figured out a new way to flood the net with their garbage.

Ok, I'll get off my soap box now... sorry about the rant.

Steve
# re: Preventing Comment Spam - Google's Approach
Gravatar cathy
Oct 12, 2005
How do we know you are not correcting url for spammer? or a distributor of a spam net work. A refferal spmmer in UK is earning $7000 a month he can easily paid you a commision.
# Very nice site!,Good site
Gravatar Very nice site!,Harry
Jul 23, 2006
Very nice site!
# re: Preventing Comment Spam - Google's Approach
Gravatar Shiva
Feb 21, 2007
I enable CAPATCH and am still getting heavily SPAMed on my subtext 1.9.2.30 blog.

Any ideas on how to fix this ?

Thanks
-Shiva
# re: Preventing Comment Spam - Google's Approach
Gravatar Steve Harman
Feb 21, 2007
@Shiva: The short and sweet answer is: Akismet!

But if you're looking for more information about how Akismet and Subtext work together, check out this post.
# re: Preventing Comment Spam - Google's Approach
Gravatar okubi
Dec 28, 2007
I enable CAPATCH and am still getting heavily SPAMed on my subtext 1.9.2.30 blog.
Comments have been closed on this topic.