social.taurix.net
lang
Also I think it's a good idea to put a Bayesian spam filter in Friendica sooner rather than later. We're much better positioned to deal with unwanted posts than some of our federation partners, but it's much better to have this in place and not be used than not to have it when it's needed.

This isn't difficult, however I'm faced with a choice and wanted to take a show of hands. Whether the spam filter training data should be global for a site, or if it is per-person.

If it is global, it will work instantly for new members without requiring their own corpus of spam decisions. It will also catch a post that several other people on the site flag.

The downside of a global filter is somebody who has legitimate conversations about penis enlargment and Rolex watches could be seriously inconvenienced. Such people do exist. It's also likely that MafiaWars posts won't be visible in anybody's stream.

I'm leaning towards personal so that you can tailor it to your own likes/dislikes, but the obvious benefits of a global filter make it a tough call.
27 comments show more
@David Benfell - it turns out the deleted items were not being physically expunged from disk (though the actual text contents are cleared). I very recently added a fix to expunge the item table after we're certain that downstream sites have been notified of the deletion.

Your table size should now start to shrink or at least flatten.

@Martin Farrent - the biggest problem we've seen is that adding columns to the item table can become a long operation and the longer it runs the more potential there is for bad things to happen.

I think we've finally sorted out an older problem where multiple processes would compete to run the update - and this almost always led to serious issues on large or very active sites.
Phew! After the hours and hours that last change took, I was beginning to wonder if it was feasible to continue running my own site.

-rw-r--r-- 1 benfell benfell 283552462 2012-02-01 03:03 /storage/graton/mysql/2012-02-01-03.00.01/friendika.sql.gz -rw-r--r-- 1 benfell benfell 284194128 2012-02-02 15:04 /storage/graton/mysql/2012-02-02-15.00.01/friendika.sql.gz

I'll hope for an improvement after the next pull.