The Most Tedious Job Ever

Yesterday, during a lull in my day, I decided to tackle something I’d ignored for about 8 months. My blog SPAM. There were over 25,000 entries in the queue. Most, if not all, bloggers face an almost impossible onslaught of spammers hitting their comments sections. I use an application called WordPress to blog and it’s actually pretty great. It comes with all kinds of bells and whistles and is the reason why I switched over from Live Journal to begin with…more flexibility. With that comes the ability to mass edit/delete your comments. Still, when you’ve got over 25,000, it’s more than daunting.

I decided that I would not only go through all 25,000+ comments, but create a keyword blacklist containing words commonly used by spammers. For example:

Adderall
ambein
Ambien
Amoxicillin
animal sex
atenolol
Avandia
Biaxin
birkoff
Bontril
Carisoprodol
Celebrex
Celexa
Cephalexin
CheapAdobePremiumSuite
Cialis
cipro
Codeine
Cyclobenzaprine
Cymbalta
Diazepam
diethylpropion
Diflucan
Doxycycline
Effexor
Eltroxin
ephedra
Ephedrine
fabiola
fabiola-qz
feechka
Fioricet
Fosamax
hananim
Hydrocodone
ionamin
kistov
klonopin
Larcik
Lexapro
Lipitor
Lortab
meridia
mimaxa
Norvasc
oxycodone
oxycontin
paxil
percocet
Phendimetrazine
Phentermine
Prednisone
Propecia
protonix
provigil
Prozac
Reductil
Ritalin
rohypnol
Sibutramine
Soma
Synthroid
tamadol
Topamax
Toprol
Tramadol
Ultracet
Ultram
Valium
Valtrex
verasova
Viagra
vicodin
Wellbutrin
Xanax
xenical
Zithromax
Zocor
Zoloft.
Zolpidem
Zovirax

As you can see, I listed 82 words. You might also notice that 98% of those words are names of drugs. Of course, this doesn’t even begin to scratch the surface. Spammers (crafty lot that they are) will bombard your comments boxes with links, keywords, and gibberish. And because they hardly use the same IP twice, banning IP addresses is useless. Furthermore, even though I could ban the several thousand IP addresses held within 25,000 comments, it just wouldn’t make sense in the long run. You see, Internet Service Providers sometimes use blocks of IP’s for their users and often rotate them on a regular basis. It’s not like any one person has the same IP over their lifetime…it’s not like a home address. I could ban a block of IP’s, but I might be blocking whole groups of people who aren’t spammers. Which might not be bad or rather, wouldn’t actually matter.

In all the 25,000+ comments that I moderated, there were only 4 that I approved. Two of which were questionable, but not so much that I would block them. It’s sometimes hard to tell. Spammers are sometimes real people who get paid to post links as much as they can, anywhere they can. They get paid via click-thrus. When someone clicks their specific link, they make something like 10 cents or something per click. I’m not sure how it works. It’s a shady business anyway. And because they realize there are people like me out there taking a stand, they’ve learned to become sneaky. One specific comment read:

[...] Cinka is very entertaining. Check out the latest and greatest post on Fly Home Like a Rock Star. See below for a quick excerpt of the entertainment: [...]

I found this example amusing. Made to look as though they were informing me that they’d linked back to a certain post. Which hopefully means I’ll approve the comment allowing them to comment more in the future, like an open door. The grand purpose of all this is to get web-traffic; which in turns means commerce.

So why don’t spammers stop spamming a particular site after they see they’re not making any headway? Because it doesn’t matter. The majority of SPAM is automated. I’m not certain of the logistics, but I do know that the method is computerized and the application doesn’t care if the comment gets approved. It’s job is to simply post, post, post. With hundreds of thousands of comments being sent out every day, if 10% don’t go through, they’re still ahead of the game. Thus, lowly bloggers such as myself simply have to deal with it. And even with bells and whistles, you can’t stop SPAM completely. You can hinder it slightly, but defense is the best offense. Vigilance and determination. Up until yesterday, I’d completely ignored my SPAM, but when you spend 4 hours wading through it, you tend to get a fire lit under your ass.

My 82 keyword blacklist will help, but as I reached the final 2000 comments, I realized that SPAMMERS also use common words you wouldn’t want to blacklist. Words we use everyday. Blacklisting everyday language would mean the two legitimate commenters wouldn’t be able to post anything. I don’t get a lot of comments here. I never have. In fact, with any regularity, there really are only two people who comment: My sister and Nevla. Even though I know more than two people read this blog, they’re the only ones who comment. Thanks guys! Makes me feel special. My blog is only mildly entertaining and not enough to drive in masses of traffic. I don’t know how I feel about that.

I receive an email every time a comment come in. WordPress holds unapproved comments in a queue until I can deal with it. I’ve cleared my inbox, I’ve cleared the queue. I’ve cleared the slate and can really start to get a handle on this. Of course, it’s a daily task. I’ll have to moderate SPAM everyday. I feel like Butch Cassidy and The Sundance Kid…I know I’m licked, but I’m going out in a blaze of glory.