John Graham-Cumming wrote:
Linus Torvalds <torvalds <at> osdl.org> writes:
I'm sorry, but spam-filtering is simply harder than the bayesian
word-count weenies think it is. I even used to _know_ something about
bayesian filtering, since it was one of the projects I worked on at uni,
and dammit, it's not a good approach, as shown by the fact that it's
trivial to get around.
Have you actually followed any of the research into Bayesian (and similar
machine learning based) anti-spam filtering, and attacks on such filters? Are
you making a claim that these filters are 'trivial to get around' based on a
project you did at University over 10 years ago?
Well the recent spate of spams with technical/jargon keywords in their
subjects was enough to make my Seamonkey client start marking all
incoming mail as spam. Interesting that recent journals talk about this
as an approach to get spam past current filters; instead it had a
reverse effect.
So much for email management at our hosting provider. At least on my
highlandsun.com domain I've got my own sendmail milter blocking spams
before they get into the server. It's basically the equivalent of a
sendmail accessdb in LDAP, plus simple rules to reject relays from
unregistered IP addresses, or addresses with dynamically generated
hostnames. Rejecting with 451 temporary failure is also useful, most
bulk mailer programs fail immediately and go away. Real mail servers
will retry; by looking at the logs of the envelope FROM and RCPT I can
pick out any emails that should have been let thru and add an OK
exception to LDAP so the message eventually gets redelivered. I suppose
I could put a URL in the reject error message, and let the sender
confirm it from there. At this point the only spam that gets thru is
from dedicated mass marketers with legitimate DNS registrations and I
just manually add their subnets to my blacklist.
(One then is faced with the interesting question - what if someone from
one of those companies was actually trying to hire my services? Their
loss I guess, sometimes money really is tainted...)
--
-- Howard Chu
Chief Architect, Symas Corp. http://www.symas.com
Director, Highland Sun http://highlandsun.com/hyc
OpenLDAP Core Team http://www.openldap.org/project/
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
[Index of Archives]
[Kernel Newbies]
[Netfilter]
[Bugtraq]
[Photo]
[Stuff]
[Gimp]
[Yosemite News]
[MIPS Linux]
[ARM Linux]
[Linux Security]
[Linux RAID]
[Video 4 Linux]
[Linux for the blind]
[Linux Resources]