[vox-tech] bogofilter question

Henry House vox-tech@lists.lugod.org
Tue, 14 Oct 2003 18:40:33 -0700


--C7zPtVaVf+AK4Oqc
Content-Type: text/plain; charset=iso-8859-1
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

P=E5 tisdag, 14 oktober 2003, skrev p@dirac.org:
[...]
> the bogofilter docs recommend that i should do this at about 10,000
> emails.  a bogofilter website (one of the developers) said this number
> should be more like 20,000.

That is rather extreme. I found that matching because really good after 1000
messages.

> that's absurd.  i've only seen a false positive once or twice, when
> i first started to use bogofilter.  false negatives are rare.  maybe one
> or two a week.
>=20
> are there any more experienced bogofilter people out there who thought
> about this issue?  if so, what was your conclusion?  i can't see doing
> this for much past 1000 emails in each ham/spam bin.

I think 1000 mails is plenty, unless you are obsessed with never getting a
false positive. LWN.net did a study on this and came to a similar conclusion
(sorry, i don't have the url handy.)

> lastly, the docs recommend not to share databases with other people
> because the whole point is to tailor bogofilter for the type of spam and
> ham that arrives in YOUR inbox.  not other people's inboxes.  otherwise,
> you might as well use a lexical analyzer like spamcop.  are there any
> experienced bogofilter users here that have thought about this issue?  i
> suspect the docs may overstate this claim.  we all get offered XXX
> videos, penis enlargements and international bank transfers.  but then
> again, i'm still vaguely a bogofilter newbie, so i'd like some guidance
> if anybody has actually thought about this issue.

I used to discount this, but I am starting to agree that sharing is bad sin=
ce
I am now getting degraded matching quality (spams getting through when they
should not) with a shared database.

--=20
Henry House
The unintelligible text that may follow is a digital signature.=20
See <http://hajhouse.org/pgp> for information.  My OpenPGP key:
<http://hajhouse.org/hajhouse.asc>.


--C7zPtVaVf+AK4Oqc
Content-Type: application/pgp-signature
Content-Disposition: inline

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.2 (GNU/Linux)

iD8DBQE/jKWQi3lu92AVGcIRAk8zAJ9FZo2jOY9bZtBdQ5ij10h4dlyICwCfeJ7I
IQ68l2s09ZoVzU5x9VL1th0=
=sJDv
-----END PGP SIGNATURE-----

--C7zPtVaVf+AK4Oqc--