In the past, if an email had this content:
------=_NextPart_000_0023_01C7D2C3.DADB76A0
Content-Type: text/plain;
    charset="iso-8859-1"
Content-Transfer-Encoding: 7bit

this is a test for a pdf

------=_NextPart_000_0023_01C7D2C3.DADB76A0
Content-Type: application/pdf;
    name="SCADAWhitepaperfinal1.pdf"
Content-Transfer-Encoding: base64
Content-Disposition: attachment;
    filename="Whitepaperfinal1.pdf"

JVBERi0xLjINJeLjz9MNCjMxIDAgb2JqDTw8IA0vTGluZWFyaXplZCAxIA0vTyAzMyANL0ggWyAx
SpamFilter would normalize the content and search for keywords in this new text:
content-type: text/plain content-transfer-encoding: 7bit content-type: application/pdf content-transfer-encoding: base64 content-disposition: attachment this is a test for a pdf
Prior to this version, a bug in SpamFilter caused the name and filename parameters of the Content-Type and Content-Disposition headers to be left out of the normalized text whenever they appeared on a separate indented line, which is how most mail clients write them.
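To illustrate why an indented line matters: in MIME headers, a line that starts with a space or tab is a continuation of the previous header, so it has to be joined back ("unfolded") before the header is read. The sketch below is only an illustration in Python, not SpamFilter's actual code; the `unfold` helper is a hypothetical name, and it simplifies the unfolding rule by collapsing the line break plus the indentation into a single space.

```python
import re

def unfold(header_block):
    # A line break followed by spaces or tabs continues the previous header
    # line; join it back so each header sits on one logical line.
    return re.sub(r"\r?\n[ \t]+", " ", header_block)

folded = (
    'Content-Type: application/pdf;\n'
    '\tname="SCADAWhitepaperfinal1.pdf"\n'
    'Content-Transfer-Encoding: base64'
)

# Reading line by line without unfolding never sees name= as part of Content-Type.
first_line = folded.split("\n")[0]

# After unfolding, the parameter is on the same logical line as its header.
logical_lines = unfold(folded).split("\n")
```

Here `first_line` holds only `Content-Type: application/pdf;`, while `logical_lines[0]` also carries the `name=` parameter.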
From this version on, we're specifically adding the filename and name to the normalized text:
content-type: text/plain content-transfer-encoding: 7bit content-type: application/pdf content-transfer-encoding: base64 content-disposition: attachment filename="scadawhitepaperfinal1.pdf" name="scadawhitepaperfinal1.pdf" this is a test for a pdf
so they can be searched along with other keywords.
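For anyone who wants to experiment with keyword rules against this kind of text, here is a rough Python sketch that builds a similar normalized string using the standard `email` package. It is an approximation under stated assumptions, not SpamFilter's implementation: the `normalize` function and the `RAW` sample are hypothetical, the sample uses the same file name for both `name` and `filename`, and the output order (headers, then filename/name, then body text) simply mimics the example above.

```python
from email import message_from_string

# Hypothetical sample message modeled on the example in this post.
RAW = (
    'MIME-Version: 1.0\n'
    'Content-Type: multipart/mixed;\n'
    '\tboundary="----=_NextPart_000_0023_01C7D2C3.DADB76A0"\n'
    '\n'
    '------=_NextPart_000_0023_01C7D2C3.DADB76A0\n'
    'Content-Type: text/plain;\n'
    '\tcharset="iso-8859-1"\n'
    'Content-Transfer-Encoding: 7bit\n'
    '\n'
    'this is a test for a pdf\n'
    '------=_NextPart_000_0023_01C7D2C3.DADB76A0\n'
    'Content-Type: application/pdf;\n'
    '\tname="SCADAWhitepaperfinal1.pdf"\n'
    'Content-Transfer-Encoding: base64\n'
    'Content-Disposition: attachment;\n'
    '\tfilename="SCADAWhitepaperfinal1.pdf"\n'
    '\n'
    'JVBERi0xLjINJeLjz9MNCjMx\n'
    '------=_NextPart_000_0023_01C7D2C3.DADB76A0--\n'
)

def normalize(raw):
    """Build a lowercase, single-line text from the MIME part headers and text bodies."""
    msg = message_from_string(raw)
    headers, params, bodies = [], [], []
    for part in msg.walk():
        if part.is_multipart():
            continue  # the container itself contributes nothing here
        headers.append("content-type: " + part.get_content_type())
        cte = part.get("Content-Transfer-Encoding")
        if cte:
            headers.append("content-transfer-encoding: " + cte.strip().lower())
        disposition = part.get_content_disposition()
        if disposition:
            headers.append("content-disposition: " + disposition)
        # The parser unfolds indented continuation lines, so these
        # parameters are found even when they sit on their own line.
        filename = part.get_param("filename", header="content-disposition")
        name = part.get_param("name")
        if filename:
            params.append('filename="%s"' % filename.lower())
        if name:
            params.append('name="%s"' % name.lower())
        if part.get_content_maintype() == "text":
            bodies.append(part.get_payload().strip().lower())
    return " ".join(headers + params + bodies)

normalized = normalize(RAW)
```

A keyword rule looking for an attachment name can then be a plain substring check, for example `'filename="scadawhitepaperfinal1.pdf"' in normalized`.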
------------- Roberto Franceschetti
http://www.logsat.com - LogSat Software
http://www.logsat.com/sfi-spam-filter.asp - Spam Filter ISP