Bayesian Filer Test Not working |
Post Reply | Page <12 |
Author | |
Sundance
Newbie Joined: 18 July 2006 Location: Hungary Status: Offline Points: 10 |
Post Options
Thanks(0)
|
I sent it again. Both the corpus and the logfiles, taht You asked for. (support@logsat.com) Did you get it? Sundance |
|
LogSat
Admin Group Joined: 25 January 2005 Location: United States Status: Offline Points: 4104 |
Post Options
Thanks(0)
|
Got them. Please give us a few hours to analyze them.
|
|
Sundance
Newbie Joined: 18 July 2006 Location: Hungary Status: Offline Points: 10 |
Post Options
Thanks(0)
|
Any results?????
|
|
LogSat
Admin Group Joined: 25 January 2005 Location: United States Status: Offline Points: 4104 |
Post Options
Thanks(0)
|
I'm sorry. Even the 2nd time we placed your corpus files in our own live environment, it was populated correctly. It does seem that your type of email traffic is not able to populate the corpus with meaningful statistical values. It is indeed strange that since most of your clean emails are in hungarian, while the spam is in english, that this is not enough to create valid statistical data. However this is the what the data in your database shows.
As a last resport, could you send us a few (a dozen or so) clean emails (in hungarian) that you received? Please note we'll need the original, unmodified email source, so we can try to see if the foreign language is causing any problems. |
|
LogSat
Admin Group Joined: 25 January 2005 Location: United States Status: Offline Points: 4104 |
Post Options
Thanks(0)
|
I think we may have indeed found something!
The accented words (ex készül) are being incorrectly transformed by the Bayesian tokenization process. The characters with accents are being replaced by spaces, and this is definetly a problem, as will cause incorrect statistics about words. Allow us a few more hours to ensure the fix we have in mind will not cause other issues. We will have a patched version shortly. Thanks for your patience with all of this. |
|
Post Reply | Page <12 |
Tweet
|
Forum Jump | Forum Permissions You cannot post new topics in this forum You cannot reply to topics in this forum You cannot delete your posts in this forum You cannot edit your posts in this forum You cannot create polls in this forum You cannot vote in polls in this forum |
This page was generated in 0.146 seconds.