Peter's Blog

Redefining the Impossible

Living with SpamAssassin


SpamAssassin at work is detecting spam nicely. There have been no false positives so far, although some false negatives are getting through. I don't yet have a big enough spam or ham corpus to get the Bayesian filter running, but when I do it could make a difference.

I have created three IMAP folders for spam:

Possibly: This holds messages with a spam score > 5. Most spam falls into this category. Messages are moved here by a procmail filter.
Definitely: This holds messages with a spam score > 12. Messages are moved here by a procmail filter.
Categorically: I use this for spam that isn't detected. When these messages arrive in the inbox I move them here manually. I also move messages here from the 'Possibly' folder.
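The sorting itself is done by procmail, but the decision it makes can be sketched in a few lines of Python. This is only an illustration of the thresholds described above, not my actual recipe: the function names are made up, and it assumes SpamAssassin has already added an X-Spam-Status header containing score=N (older releases wrote hits=N instead).

```python
import email
import re

# Thresholds taken from the folder descriptions above.
POSSIBLY_THRESHOLD = 5.0
DEFINITELY_THRESHOLD = 12.0

# Assumption: the header looks like "Yes, score=7.3 required=5.0 ...".
# Older SpamAssassin releases used "hits=" in place of "score=".
_SCORE_RE = re.compile(r"\b(?:score|hits)=(-?\d+(?:\.\d+)?)")


def spam_score(raw_message):
    """Return the score from X-Spam-Status, or None if it is absent."""
    msg = email.message_from_string(raw_message)
    status = str(msg.get("X-Spam-Status", ""))
    match = _SCORE_RE.search(status)
    return float(match.group(1)) if match else None


def target_folder(raw_message):
    """Pick a folder name; the higher threshold is checked first."""
    score = spam_score(raw_message)
    if score is None or score <= POSSIBLY_THRESHOLD:
        return "INBOX"
    if score > DEFINITELY_THRESHOLD:
        return "Definitely"
    return "Possibly"
```

Checking the higher threshold first matters: a message scoring 15 also satisfies "> 5", so the order of the tests decides which folder it lands in.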

I then have a cron job that runs each night. It runs sa-learn --spam on the 'Categorically' folder and sa-learn --ham on my inbox. It reads the 'cur' subfolder but not the 'new' subfolder, so it should not pick up spam that arrives while I am not around to check it.
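For anyone wanting to do something similar, here is a rough sketch of what the nightly job amounts to. The paths are hypothetical (I am assuming a Maildir under the home directory with the spam folder stored as .Categorically), and the function names are invented for the example. The sa-learn options --spam and --ham are the real ones, and both accept a directory of messages.

```python
import subprocess
from pathlib import Path

# Hypothetical Maildir layout; adjust to wherever the IMAP server keeps mail.
MAILDIR = Path.home() / "Maildir"
SPAM_FOLDER = MAILDIR / ".Categorically"


def learn_commands(maildir=MAILDIR, spam_folder=SPAM_FOLDER):
    """Build the two sa-learn invocations, pointing only at 'cur'.

    'new' is left out on purpose: mail there has not been looked at yet,
    so unsorted spam in the inbox would otherwise be learned as ham.
    """
    return [
        ["sa-learn", "--spam", str(Path(spam_folder) / "cur")],
        ["sa-learn", "--ham", str(Path(maildir) / "cur")],
    ]


def run_nightly():
    """Run both commands in order, stopping if either one fails."""
    for cmd in learn_commands():
        subprocess.run(cmd, check=True)
```

A cron entry would then call run_nightly() once a night. Building the command lists separately from running them makes it easy to print them first and check the paths before letting the job loose.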

The 'Definitely' spams are not added to the Bayesian filter. I think this is best: feeding the filter only what the existing rules already catch would just reinforce its prejudices, which is not a good thing.

The spam I receive has already been through the company's SurfControl spam filter. Draw your own conclusions.

Something I must do sometime is extract 250 MB of old emails from my Outlook archive and put them on the IMAP server. I could do this by hooking Outlook up to the IMAP server and just moving the messages, but I have a gut feeling that this will take hours and involve a lot of pain.

At home I must figure out a way to get my old email out of the clutches of Intellimail so that I can uninstall it.


Filed under: email imap intellimail outlook
