Print Page | Close Window

RegEx in subject, does it work??

Printed From: LogSat Software
Category: Spam Filter ISP
Forum Name: Spam Filter ISP Support
Forum Description: General support for Spam Filter ISP
URL: https://www.logsat.com/spamfilter/forums/forum_posts.asp?TID=4946
Printed Date: 14 March 2025 at 6:54am


Topic: RegEx in subject, does it work??
Posted By: Guests
Subject: RegEx in subject, does it work??
Date Posted: 05 January 2005 at 5:57pm

Hi,

I'm trying RegEx in subject and the same text is rejected in body and in the subject even if I add a specific line for the subject

Example

Bu%y C^ialis soft`tabs at Unbelievable Prices
is stopped by
(c.{0,2}i.{0,2}a.{0,2}l.{0,2}i.{0,2}s.{1,2}soft.{0,2}tabs)
but not in the subject
even if I add
Subject:(c.{0,2}i.{0,2}a.{0,2}l.{0,2}i.{0,2}s.{1,2}soft.{0,2}tabs)
 
But

 
An idea?
 
Thanks,
 
Gaby



Replies:
Posted By: Desperado
Date Posted: 06 January 2005 at 12:58am

Yes ... it does work.  As an example, The Expression:

((?i)Subject:(([\s]|[\!-\xB4]){0,10}[\|]){2})

will catch 2 or more "Pipes" in the subject but not in the body.  The {0,10} limits the search so that I don't get any |Loop Stack" errors.

Dan S.

 



Posted By: Guests
Date Posted: 06 January 2005 at 1:36am

Thank you Dan,

I didn't see any sample of subject filters...

I see, the ( is before the subject prefix word. That wasn't clear for me!!

It works fine!

Gaby



Posted By: Desperado
Date Posted: 06 January 2005 at 4:08pm

Gaby,

ALL Regular Expressions must be inclosed in "()" or they are not seen as RegEx's by the software but rather as a "literal".  I am glad the example helped.

Regards,

Dan S.



Posted By: Guests
Date Posted: 07 January 2005 at 5:06am

Dan,

Are you saying that in a keyword filter line beginning with the word Subject: you can not mix the normal wildcard matching with RegEx matching, using ( )'s?

I haven't done much with this, but I get these 2 expressions in the keywords file to do the same thing:

Subject:(cialis)

((?i)Subject:.*cialis.*)

The result being that an email with the word cialis anywhere in the subject line is blocked as spam.

Roberto and Dan could you please send me your latest, greatest keyword files also, I would greatly appreciate it.  I'm just trying to get this product to work, and from what I can see, the Bayesian Filter will not block any emails if you have a blank (or no) keyword or ip or attachments blacklist file.  My Bayesian Filter did not seem to learn anything with an "out of the box" install (no blacklist filters active, other than MAPS); all the probabilities seemed the same (400000005960464), and all the word tokens were marked good.  I thought I could just install the package, configure a few ip addresses for my servers and add my domain, and sit back and watch the Bayesian Filter learn and wipe out all my spam. 

Am I wrong in saying that a user must actively put one of the blacklist filters into action in order to get the Bayesian Filter to start learning?  And the more extensive, or complete, the keyword list is, the faster the Bayesian Filter learns?  Does it learn from MAPS rejections alone (or at all)?

If I'm right here, could you please put "you must configure a blacklist filter" into the release notes or instructions on how to set up this whole package.  Providing a good default keywords file would help, but I realize you might not want to publish it to the whole world.  This package works brilliantly, but I was certainly confused (still am?) as to why the Bayesian Filter was not marking all these "clearly spam" tokens as bad in a corpus dump.

Thanks much,

Bob



Posted By: Desperado
Date Posted: 07 January 2005 at 10:15am
Are you saying that in a keyword filter line beginning with the word Subject: you can not mix the normal wildcard matching with RegEx matching, using ( )'s?  Yes, that is what I am saying but I am just a user so LogSat should verify that.
I haven't done much with this, but I get these 2 expressions in the keywords file to do the same thing:
Subject:(cialis)
((?i)Subject:.*cialis.*)
The result being that an email with the word cialis anywhere in the subject line is blocked as spam.

Roberto and Dan could you please send me your latest, greatest keyword files also,
See the following link.  You can download my lists from there: http://www.logsat.com/spamfilter/forums/showmessage.asp?messageID=4950" CLASS="ASPForums" TITLE="WARNING: URL created by poster. - http://www.logsat.com/spamfilter/forums/showmessage.asp?messageID=4950" CLASS="ASPForums" TITLE="WARNING: URL created by poster. - http://www.logsat.com/spamfilter/forums/showmessage.asp?messageID=4950
 
I would greatly appreciate it.  I'm just trying to get this product to work, and from what I can see, the Bayesian Filter will not block any emails if you have a blank (or no) keyword or ip or attachments blacklist file.  My Bayesian Filter did not seem to learn anything with an "out of the box" install (no blacklist filters active, other than MAPS); all the probabilities seemed the same (400000005960464), and all the word tokens were marked good.  I thought I could just install the package, configure a few ip addresses for my servers and add my domain, and sit back and watch the Bayesian Filter learn and wipe out all my spam.
The Bayesian filtering will not work unless there are some active filters for it to learn from.

There is info on the Forum about this but I am running out the door so did not have time to search for them.
 
Regards,
 
Dan S.


Posted By: Guests
Date Posted: 07 March 2005 at 2:14pm

simple question...

I notice Cialis also stops "speCIALIST"  :)  if I only wanted to catch cialis would I enclose it in (cialis) ??

Lee



Posted By: Desperado
Date Posted: 07 March 2005 at 3:23pm

I answered this in another post but ...

Originally posted by Desperado Desperado wrote:

You can use a RegEx as:

((?i)(\bcialis\b))

The \b specifies a "word Boundary"

Dan S.

 



-------------
The Desperado
Dan Seligmann.
Work: http://www.mags.net
Personal: http://www.desperado.com



Posted By: mikek
Date Posted: 09 March 2005 at 6:12am

Dan, I tried registering on your website, but never received the authorization e-mail, because your sender domain has no mx... :-(



Posted By: Desperado
Date Posted: 09 March 2005 at 1:44pm

What website are you referring to?

Dan



-------------
The Desperado
Dan Seligmann.
Work: http://www.mags.net
Personal: http://www.desperado.com



Posted By: mikek
Date Posted: 10 March 2005 at 3:57am
http://spamfilterisp.dyndns.org/ - http://spamfilterisp.dyndns.org/


Posted By: Desperado
Date Posted: 10 March 2005 at 9:17pm

mikek,

This is not "My" website.  I just attempt to maintain the users.  I do not know if Sean (the owner) can do anything about the MX as it is using dyndns service.

Dan



-------------
The Desperado
Dan Seligmann.
Work: http://www.mags.net
Personal: http://www.desperado.com




Print Page | Close Window