Import messages |
Post Reply |
Author | |
Derk
Groupie Joined: 26 January 2005 Location: United States Status: Offline Points: 39 |
Post Options
Thanks(0)
Posted: 26 January 2005 at 4:24pm |
is it, or could it be, possible to have the spam filter application read a text file containing e-mail messages to be analyzed and included in the Bayesian filter database? after they are imported the application could erase the file. I would like to be able to forward messages that get by the filter to a spam@mydomain catch all account. Then I can set up a scheduled task to retrieve the messages and write them to a text file to be imported by the application. |
|
LogSat
Admin Group Joined: 25 January 2005 Location: United States Status: Offline Points: 4104 |
Post Options
Thanks(0)
|
Derk,
That is currently not possible. SpamFilter does scan a folder (\SpamFilter\corpus\queue\*.token) for temporary text files with a .token extension. However the token files must only contain the email's "tokens", not the full email text. Tokens are the keywords that SpamFilter extracts from the email itself. The 1st line of the text file containt the words ".good" or ".spam" or ".falsepositive" to catalog the tokens as belonging to a good or spam email. In theory you could write an application to create tokens out of an email, but we're not going to be able to provide any details on how and using which rules we use to create tokens from an email. We don't want any spammer to find out about the innerworkings of our software... |
|
Web123
Newbie Joined: 26 January 2005 Location: Finland Status: Offline Points: 31 |
Post Options
Thanks(0)
|
I also get quite a lot false positives trough my system, and would
love to "import" them somehow to Spamfilter?
regards,
Kim
|
|
Marrab
Newbie Joined: 27 January 2005 Location: Russian Federation Status: Offline Points: 12 |
Post Options
Thanks(0)
|
Thanks. Now i can although somehow control bayesian filter. One more question: does any quarantined mail be considered bayesian filter as spam? |
|
Marrab
Newbie Joined: 27 January 2005 Location: Russian Federation Status: Offline Points: 12 |
Post Options
Thanks(0)
|
It seems i understand why my bayesian filter working so strange. Token files contains only mail headers. Interestingly why?
Edited by Marrab |
|
Derk
Groupie Joined: 26 January 2005 Location: United States Status: Offline Points: 39 |
Post Options
Thanks(0)
|
what about if I set up a subdomain that I can forward messages to, like spam@junk.mydomain.com? Can I configure the application to consider every single message spam to be added to the Bayesian database? |
|
LogSat
Admin Group Joined: 25 January 2005 Location: United States Status: Offline Points: 4104 |
Post Options
Thanks(0)
|
We recommend strongly against "interfering" with the Bayesian filter by
"forcing" emails or worse spam thru it. The bayesian filter needs to
see the true email received by the mail server to function properly. If
junk mail is fed to it, that junk is not representitative of the true
emails received by SpamFilter, and the bayesian filter will loose its
accuracy.
To answer Marrab, yes, all email that is quarantined is considered to be spam by the statistical filter. |
|
Web123
Newbie Joined: 26 January 2005 Location: Finland Status: Offline Points: 31 |
Post Options
Thanks(0)
|
How about at least Blacklist all emails sent to spam@junk.mydomain.com? Could it be done? |
|
LogSat
Admin Group Joined: 25 January 2005 Location: United States Status: Offline Points: 4104 |
Post Options
Thanks(0)
|
Yes, it can:
|
|
Web123
Newbie Joined: 26 January 2005 Location: Finland Status: Offline Points: 31 |
Post Options
Thanks(0)
|
sorry!,
meant that if I get a spam-message that is not caught by the filter,
I could then forward it to a address(spam@junk.mydomain.com)
and it would parse the message and blacklist all the non local addressees.
Kim
|
|
Guests
Guest Group |
Post Options
Thanks(0)
|
In theory you could, but you would have to forward the original,
unmodified message. This means that the email headers MUST be exactly
as the original ones, and the same applies to the email body. Sipmly
taking the email and forwarding it using an email client will not work
accurately, as email clients, especially Outlook, completely modiffy
the original email's format.
|
|
Blaine
Guest Group |
Post Options
Thanks(0)
|
We host many domains and some clients as that we bypass the filter for
their domain. When we pass their mail though the filter does this
impact our bayesian filter or are these emails not considered in the
filering rules?
It seems the longer we run our system the less effective the bayesian filter becomes we are considering dumping the database and starting over but we have process 500000 plus emails and hate to lose all those statistics? Any help would be really apprecitated. |
|
Post Reply | |
Tweet
|
Forum Jump | Forum Permissions You cannot post new topics in this forum You cannot reply to topics in this forum You cannot delete your posts in this forum You cannot edit your posts in this forum You cannot create polls in this forum You cannot vote in polls in this forum |
This page was generated in 0.344 seconds.