I go through these cycles where I'll spend all my time outside of work completely avoiding anything computer-related... to times when I'll want to spend all my time coding on side projects.

In any case, I seem to be caught in the latter cycle currently. Tons of work being done on Tabulas behind the scenes... while some of it is visible, a big chunk of work from this past week isn't.

Tabulas attracts a lot of spammers. A ton. I've built a series of automated tools to shut down the ones who use scripts to post (identifying those are easy), it's tougher to automatically shut down manually posted entries. Fortunately, it only takes me about 20 minutes a day to police the site, and I'm happy to do so.

But what to do with the thousands of sites? Instead of deleting the content, we now crosspost the site to another domain. It doesn't hurt Tabulas' SEO, and we can monetize the shit out of the content with AdSense (which tends to be high SEO-value keywords). Plus, I guess, the spammers get their content hosted somewhere. If one wanted to be really mean, I'd centralize all the links and let people buy the endpoints. Now that would be cruel.

So now, when your site gets flagged for being commercial, we migrate the user and their content to the commercial sister site of Tabulas.

So far? Nearly 7,000 entries (from just the past three weeks) have been purged from Tabulas and migrated over to the commercial site.

Our next problem: automatic categorization of the entries. For now, we'll take a manual approach and categorize them...

Posted by roy on September 14, 2010 at 12:46 AM in Web Development, Tabulas | 2 Comments

Related Entries

Want to comment with Tabulas?. Please login.

crb (guest)

Comment posted on September 14th, 2010 at 02:19 AM
You filthy spammer. I hope you're using your AdSense revenue to buy ads telling you how awesome you are!
Comment posted on September 14th, 2010 at 09:30 PM
That would be pretty awesome. Unfortunately, the revenue is not for me ... but for the owner of Tabulas :)