Print Page | Close Window

New beta 2.0.256 question...

Printed From: LogSat Software
Category: Spam Filter ISP
Forum Name: Spam Filter ISP Support
Forum Description: General support for Spam Filter ISP
URL: https://www.logsat.com/spamfilter/forums/forum_posts.asp?TID=2379
Printed Date: 26 December 2024 at 5:59pm


Topic: New beta 2.0.256 question...
Posted By: Guests
Subject: New beta 2.0.256 question...
Date Posted: 11 November 2003 at 4:43pm

Hello,

We installed the new beta on Sunday and have received 1108 spam and 734 good messages (according to the corpus.ini file).  When does the new filter start working, I do not see any messages in the tblquarantine with the reject code of 14 (new bayesian filter)?  Also just FYI our db.dat file is about 500k now.

Thanks,

Kurt




Replies:
Posted By: Desperado
Date Posted: 11 November 2003 at 7:27pm

The previous Beta (255) "Kicked In" at 500 / 500.  I will assume this does also.

Dan S.



Posted By: Guests
Date Posted: 11 November 2003 at 9:52pm

Well, mine hasn't kicked in yet either.  I was just going to ask the same question:

I'm at 1363/1281 and only the old filters are doing anything.



Posted By: LogSat
Date Posted: 11 November 2003 at 11:28pm

Kurt,

The Bayesian filter begins to kick in after 500 good + 500 spam emails. However it's accuracy increases with time and database size. 500K is still quite small, it'll need to reach a few MBs in size to be more effective.

Roberto F.
LogSat Software



Posted By: Desperado
Date Posted: 12 November 2003 at 1:45am

With a "Fresh Corpus,  After an hour and 15 minute, I have 2049 Spam / 644 Good.  My db.dat is at 1268 KB and I have started blocking.  Example below:

Sel # http://spamman.mags.net/VirtAdmin/VirtAdminListSpam.asp?SortBy=EmailTo" CLASS="ASPForums" TITLE="WARNING: URL created by poster. - To http://spamman.mags.net/VirtAdmin/VirtAdminListSpam.asp?SortBy=EmailFrom" CLASS="ASPForums" TITLE="WARNING: URL created by poster. - From Addr    http://spamman.mags.net/VirtAdmin/VirtAdminListSpam.asp?SortBy=Domain" CLASS="ASPForums" TITLE="WARNING: URL created by poster. - From Domain http://spamman.mags.net/VirtAdmin/VirtAdminListSpam.asp?SortBy=Subject" CLASS="ASPForums" TITLE="WARNING: URL created by poster. - Subject http://spamman.mags.net/VirtAdmin/VirtAdminListSpam.asp?SortBy=MsgDate" CLASS="ASPForums" TITLE="WARNING: URL created by poster. - Date http://spamman.mags.net/VirtAdmin/VirtAdminListSpam.asp?SortBy=RejectDesc" CLASS="ASPForums" TITLE="WARNING: URL created by poster. - Rejection http://spamman.mags.net/VirtAdmin/VirtAdminListSpam.asp?SortBy=RejectDetails" CLASS="ASPForums" TITLE="WARNING: URL created by poster. - Rejection Details http://spamman.mags.net/VirtAdmin/VirtAdminListSpam.asp?SortBy=ServerName" CLASS="ASPForums" TITLE="WARNING: URL created by poster. - Server
1   http://spamman.mags.net/VirtAdmin/VirtResolveSpam.asp?QuarID=4849663&MsgID=3646883" CLASS="ASPForums" TITLE="WARNING: URL created by poster. - view    christien@tms.net numsxy@regent.fr Interest Rate Reduction Notice: Form 734-RR so;t:uysw;u?n;x dwhm 11/12/2003 1:36:01 AM Statistical Analysis filter match matches Bayesian filter - rejected - 100% spam mx01

 

Dan S.

 



Posted By: Guests
Date Posted: 12 November 2003 at 10:02am

Thanks.  I did notice this morning that 2 messages have been flagged as matching the bayesian filter.  We are currently at 1516 SPAM, and 952 GOOD... so we just need to give the system more time to build the corpus db.dat file (only 568k now). 

From looking at the db.dat file it appears that each item counts about .4% so we would currently need about 225 matches in the file to get to the default catch rate of 90%.  Is this logic correct, or how does spamfilter use the db.dat file?

Kurt



Posted By: LogSat
Date Posted: 12 November 2003 at 10:52pm

Kurt,

We'd rather not divulge details on the format of the db.dat file and how we calculate the probabilities. We use Bayesian statistics, but can't say more... sorry!

Robero F.
LogSat Software




Print Page | Close Window