Re: Spam, bogofilter, etc

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



John Graham-Cumming wrote:
Linus Torvalds <torvalds <at> osdl.org> writes:
I'm sorry, but spam-filtering is simply harder than the bayesian word-count weenies think it is. I even used to _know_ something about bayesian filtering, since it was one of the projects I worked on at uni, and dammit, it's not a good approach, as shown by the fact that it's trivial to get around.

Have you actually followed any of the research into Bayesian (and similar
machine learning based) anti-spam filtering, and attacks on such filters?  Are
you making a claim that these filters are 'trivial to get around' based on a
project you did at University over 10 years ago?

Well the recent spate of spams with technical/jargon keywords in their subjects was enough to make my Seamonkey client start marking all incoming mail as spam. Interesting that recent journals talk about this as an approach to get spam past current filters; instead it had a reverse effect.

So much for email management at our hosting provider. At least on my highlandsun.com domain I've got my own sendmail milter blocking spams before they get into the server. It's basically the equivalent of a sendmail accessdb in LDAP, plus simple rules to reject relays from unregistered IP addresses, or addresses with dynamically generated hostnames. Rejecting with 451 temporary failure is also useful, most bulk mailer programs fail immediately and go away. Real mail servers will retry; by looking at the logs of the envelope FROM and RCPT I can pick out any emails that should have been let thru and add an OK exception to LDAP so the message eventually gets redelivered. I suppose I could put a URL in the reject error message, and let the sender confirm it from there. At this point the only spam that gets thru is from dedicated mass marketers with legitimate DNS registrations and I just manually add their subnets to my blacklist.

(One then is faced with the interesting question - what if someone from one of those companies was actually trying to hire my services? Their loss I guess, sometimes money really is tainted...)
--
  -- Howard Chu
  Chief Architect, Symas Corp.  http://www.symas.com
  Director, Highland Sun        http://highlandsun.com/hyc
  OpenLDAP Core Team            http://www.openldap.org/project/
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

[Index of Archives]     [Kernel Newbies]     [Netfilter]     [Bugtraq]     [Photo]     [Stuff]     [Gimp]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Video 4 Linux]     [Linux for the blind]     [Linux Resources]
  Powered by Linux