
Too many connections. Disconnecting

Printed From: LogSat Software
Category: Spam Filter ISP
Forum Name: Spam Filter ISP Support
Forum Description: General support for Spam Filter ISP
URL: https://www.logsat.com/spamfilter/forums/forum_posts.asp?TID=5822
Printed Date: 05 February 2025 at 10:48am


Topic: Too many connections. Disconnecting
Posted By: swaber
Subject: Too many connections. Disconnecting
Date Posted: 09 October 2006 at 8:06pm

I updated our system to version 3.1.3.597 last Saturday, 9/30/06, and have been running into an issue ever since. Our server runs fine for a day or two, then drops into a mode where it rejects all connections with “Too many connections”, and only a restart of the service corrects it. While it is in this state, the Connections tab shows no current connections in the table, but the system indicates 25. Our Max connections settings are the same as they were under 2.7.1.515.

Log File Sample

10/06/06 06:52:18:614 -- (8432) Connection from: 218.15.1.19  -  Originating country : China
10/06/06 06:52:18:614 -- (8432) Too many connections. Disconnecting: 218.15.1.19
10/06/06 06:52:18:614 -- (8432) No Data Received
10/06/06 06:52:18:614 -- (8432) Disconnect
10/06/06 06:52:20:630 -- (11832) Connection from: 190.24.129.66  -  Originating country : Colombia
10/06/06 06:52:20:630 -- (11832) Too many connections. Disconnecting: 190.24.129.66
10/06/06 06:52:20:630 -- (11832) No Data Received
10/06/06 06:52:20:630 -- (11832) Disconnect
10/06/06 06:52:31:520 -- (11840) Connection from: 207.44.208.114  -  Originating country : United States
10/06/06 06:52:31:520 -- (11840) Too many connections. Disconnecting: 207.44.208.114
10/06/06 06:52:31:520 -- (11840) No Data Received
10/06/06 06:52:31:520 -- (11840) Disconnect
Etc.....
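
To see when the rejections start, the log can be tallied per hour. The following is only a minimal sketch: the line layout is assumed from the sample above, and the function name `count_rejections_per_hour` is hypothetical, not part of SpamFilter.

```python
import re
from collections import Counter

# Line layout assumed from the sample above:
# MM/DD/YY HH:MM:SS:mmm -- (thread id) message
LINE_RE = re.compile(
    r"^(\d{2}/\d{2}/\d{2}) (\d{2}):\d{2}:\d{2}:\d{3} -- \((\d+)\) (.*)$"
)

def count_rejections_per_hour(lines):
    """Count 'Too many connections' rejections per (date, hour) bucket."""
    counts = Counter()
    for line in lines:
        m = LINE_RE.match(line.strip())
        if m and m.group(4).startswith("Too many connections. Disconnecting"):
            counts[(m.group(1), m.group(2))] += 1
    return counts

# Lines copied from the log sample in this post.
sample = [
    "10/06/06 06:52:18:614 -- (8432) Connection from: 218.15.1.19  -  Originating country : China",
    "10/06/06 06:52:18:614 -- (8432) Too many connections. Disconnecting: 218.15.1.19",
    "10/06/06 06:52:20:630 -- (11832) Too many connections. Disconnecting: 190.24.129.66",
    "10/06/06 06:52:31:520 -- (11840) Too many connections. Disconnecting: 207.44.208.114",
]
rejections = count_rejections_per_hour(sample)
print(rejections)
```

An hour where the count jumps from zero to nearly every connection would mark the point at which the counter reached the configured maximum.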



-------------
Scott Waber, MCSE, CCNP
Systems Administration Specialist
City of Las Vegas



Replies:
Posted By: jerbo128
Date Posted: 09 October 2006 at 9:26pm

Scott, take a look here:

http://www.logsat.com/spamfilter/forums2/5637?TID=5637&PN=1

I think it's the same issue that we are having; ours just does not reach the max.

jerbo128



Posted By: swaber
Date Posted: 10 October 2006 at 8:50pm

Thanks, I had seen your post. Your issue looked similar but seemed to be different. I'll confirm on the next occurrence.



-------------
Scott Waber, MCSE, CCNP
Systems Administration Specialist
City of Las Vegas


Posted By: swaber
Date Posted: 16 October 2006 at 5:37pm

I checked the system after it had been up for about a day and found what appeared to be 10 stuck connections, then checked it the next day and found about 18 stuck connections. So it appears that this is the same issue: we slowly lose all available connections until none are left.



-------------
Scott Waber, MCSE, CCNP
Systems Administration Specialist
City of Las Vegas


Posted By: LogSat
Date Posted: 16 October 2006 at 10:49pm
Scott,

Could you please email us SpamFilter's activity logfile for the day, a screenshot of your "Connections" tab, and the output of the

netstat -n

command from a DOS prompt (screenshot and netstat command should be performed at the same time if possible)?
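
For comparing the netstat output against the Connections tab, the TCP states on the SMTP port can be tallied with a short script. This is a rough sketch that assumes the usual Windows `netstat -n` column layout (Proto, Local Address, Foreign Address, State) and that SpamFilter listens on port 25. The helper name `smtp_states` and the addresses in the sample text are made up for illustration.

```python
from collections import Counter

def smtp_states(netstat_output, port=25):
    """Tally TCP states for connections whose local port matches `port`."""
    states = Counter()
    for line in netstat_output.splitlines():
        parts = line.split()
        # Skip headers, blank lines, and non-TCP rows.
        if len(parts) < 4 or parts[0] != "TCP":
            continue
        local_port = parts[1].rsplit(":", 1)[-1]
        if local_port == str(port):
            states[parts[3]] += 1
    return states

# Illustrative text only; these addresses are documentation ranges, not real data.
sample_output = """
Active Connections

  Proto  Local Address          Foreign Address        State
  TCP    192.0.2.10:25          198.51.100.7:4021      ESTABLISHED
  TCP    192.0.2.10:25          203.0.113.44:1187      ESTABLISHED
  TCP    192.0.2.10:25          203.0.113.90:2250      TIME_WAIT
  TCP    192.0.2.10:3389        198.51.100.20:50112    ESTABLISHED
"""
states = smtp_states(sample_output)
print(states)
```

If the Connections tab reports more sessions than netstat shows as ESTABLISHED on the SMTP port, that would point to a counter problem rather than real open sockets.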



-------------
Roberto Franceschetti

http://www.logsat.com - LogSat Software

http://www.logsat.com/sfi-spam-filter.asp - Spam Filter ISP


Posted By: Stephane
Date Posted: 18 October 2006 at 11:40am

Hi,

I started to get this same error. It happened 2 days ago, and again this morning.

It looks like it is related to a timeout with the SFDB.

10/16/06 03:07:34:460 -- (92564) HTTP Error in SFDBUploadIP check:HTTP/1.1 500 Server Error

10/18/06 10:09:29:159 -- (173708) HTTP Error in DoSFDBCheck:Socket Error # 10060 -- Connection timed out.

As soon as I get a timeout or error with the SFDB, connections stay stuck, and eventually I get the "Too many connections" error and no more email comes through.



Posted By: mikek
Date Posted: 18 October 2006 at 12:01pm
Stephane could have a point here - I reached "max connections" this afternoon as well and after restarting the service I see the number of connections "hanging" at the "RCPT TO" status rising constantly...

Running 3.1.3.601 registered


Posted By: swaber
Date Posted: 18 October 2006 at 12:03pm

Yes, I too have seen SFDB-related errors in my log. I discounted them initially since they seem to occur on a regular basis. In the last couple of days I have been actively watching the server, trying to collect the data tech support asked for, and have developed a new theory. Two days ago I disabled SFDB out of frustration with major ISPs being blocked, and I have not lost one connection since. My guess is the problem is related to SFDB: our previous version (2.7.1.515) did not have SFDB, and we never had this issue.



-------------
Scott Waber, MCSE, CCNP
Systems Administration Specialist
City of Las Vegas


Posted By: WebGuyz
Date Posted: 18 October 2006 at 1:08pm

Starting to suspect SFDB overload myself. Seeing a lot of SFDB hanging as well....

Any chance the bad guyz are attacking the SFDB server(s)?

 



-------------
http://www.webguyz.net


Posted By: Stephane
Date Posted: 18 October 2006 at 2:33pm

Hi,

Again after lunch..

10/18/06 12:26:56:724 -- (102792) HTTP Error in SFDBUploadIP check:Socket Error # 10054 -- Connection reset by peer.



Posted By: Stephane
Date Posted: 18 October 2006 at 2:34pm
Sorry, Forgot to mention .. my version is 3.1.3.598


Posted By: LogSat
Date Posted: 19 October 2006 at 12:22am
We're trying to figure out what the relationship is, but it is indeed a strange coincidence that we're looking into.

This morning, between 8am-1pm EST, we experienced severe slowdowns in our internet connection (it was an internal problem on our ISP, no hackers attacking the SFDB, no worries there), and from all your reports, the issues do seem related. We're looking into HTTP timeouts, and are trying to replicate the scenario in our labs. Hopefully we'll have updates soon.


-------------
Roberto Franceschetti

http://www.logsat.com - LogSat Software

http://www.logsat.com/sfi-spam-filter.asp - Spam Filter ISP


Posted By: LogSat
Date Posted: 19 October 2006 at 1:58am
SpamFilter has failsafes in place so that mail processing continues even if the SFDB service is not available.

We did find a problem when the SFDB web services are *slooow* rather than unavailable. In this case, the HTTP request timeouts are too long when reporting a spammer IP to the SFDB. The reporting occurs upon disconnect, and this can affect the counter that keeps track of the current connections. The counter misses are very rare, and were thus hard to locate.

We were able to replicate this by placing SpamFilter behind a 56K modem and hitting it with 200 concurrent connections. Here we were finally able to reproduce the "stuck connections" problem!

We're testing build 3.1.603 with a fix. So far it looks fine, but it will need to be tested quite a bit more to ensure there are no other issues. If anyone is still suffering from major issues with the "stuck connections", we've made the build available on the website.
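
The failure mode described above (a slow report at disconnect interfering with the connection counter) can be illustrated with a small sketch. This is not SpamFilter's code and not necessarily how the fix in the new build works. It only shows one common pattern: free the slot in a `finally` block before the report runs, and treat the report as best effort. `ConnectionCounter`, `handle_connection`, and `failing_report` are hypothetical names.

```python
import threading

class ConnectionCounter:
    """Thread-safe count of active connections (hypothetical helper)."""

    def __init__(self, limit):
        self.limit = limit
        self._active = 0
        self._lock = threading.Lock()

    def try_acquire(self):
        """Reserve a slot; return False when the limit is reached."""
        with self._lock:
            if self._active >= self.limit:
                return False
            self._active += 1
            return True

    def release(self):
        with self._lock:
            self._active -= 1

    @property
    def active(self):
        with self._lock:
            return self._active

def handle_connection(counter, process, report):
    """Run one session; the slot is freed before the report is attempted."""
    if not counter.try_acquire():
        return "rejected"
    try:
        process()
    finally:
        # Always give the slot back, whatever happened in process().
        counter.release()
    try:
        report()  # best effort; a real report should use a short timeout
    except Exception:
        pass  # a slow or failing report must not affect the counter
    return "handled"

def failing_report():
    # Stand-in for a report call that times out.
    raise TimeoutError("simulated slow SFDB")

counter = ConnectionCounter(limit=2)
results = [handle_connection(counter, lambda: None, failing_report) for _ in range(5)]
print(results, counter.active)
```

In this sketch the report can fail every time without the count drifting, because the decrement never waits on the HTTP call. In real code the report would also carry a bounded timeout, for example the `timeout` argument of `urllib.request.urlopen`.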



-------------
Roberto Franceschetti

http://www.logsat.com - LogSat Software

http://www.logsat.com/sfi-spam-filter.asp - Spam Filter ISP


Posted By: mikek
Date Posted: 19 October 2006 at 5:19am
Build 603 looks very promising so far - connection stats have been  accurate for the last 3 hours...


Posted By: WebGuyz
Date Posted: 19 October 2006 at 12:30pm

We've run 12k msgs thru on .603 so far this morning and it looks great. <fingers crossed>

  



-------------
http://www.webguyz.net


Posted By: kfries
Date Posted: 23 October 2006 at 11:46am
Any updates on the .603 version?  I have been having this issue as well but am hesitant to install .603.  Those of you that have it running, has it been stable so far?


Posted By: jerbo128
Date Posted: 23 October 2006 at 3:54pm
I have had no problems with 603.  We've processed about 75K messages.


Posted By: BigDog
Date Posted: 26 October 2006 at 11:05am

Been having bad problems with SF just as described. It seems that SFDB is creating problems; SF has been down a lot in the last couple of months.

If not noticed, it goes for hours accepting no messages. Turning the SFDB option off appears to make it stable.

Running 3.1.3.597 since it came out. Have not tried any pre-release, as there is a Barracuda sitting on the workbench awaiting implementation.  :(

Here are some excerpts from the logs....

10/25/06 16:50:41:446 -- Starting to process queue directory...
10/25/06 16:50:41:493 -- (24572) Blacklist cache - starting cleanup
10/25/06 16:50:41:524 -- (30004) HTTP Error in GetSFDBStats:Access violation at address 7C81D150 in module 'ntdll.dll'. Read of address FFBE001F
10/25/06 16:50:41:571 -- (24572) Blacklist cache - removed IP 190.45.235.212 from limbo during cleanup
10/25/06 16:50:41:618 -- (29492) Sending email from parmentieuoralie@hydranautics.com to sjw@mydomain.com --
10/25/06 16:50:41:649 -- (24572) Blacklist cache - removed IP 202.96.114.27 from limbo during cleanup
10/25/06 16:50:41:696 -- (29492) Exception - Access Violation Access violation at address 7C81D150 in module 'ntdll.dll'. Read of address FFBE001F
10/25/06 16:50:41:743 -- (31452) Sending email from Hortencia@aallonlineprofits.net to bjbrooks@mydomain.com --
10/25/06 16:50:41:774 -- (24572) Blacklist cache - removed IP 209.16.28.247 from limbo during cleanup
10/25/06 16:50:41:821 -- (31452) Exception - Access Violation Access violation at address 7C81D150 in module 'ntdll.dll'. Read of address FFBE001F
10/25/06 16:50:41:852 -- (24572) Blacklist cache - removed IP 216.150.25.108 from limbo during cleanup
10/25/06 16:50:41:899 -- (32284) Sending email from Hortencia@aallonlineprofits.net to cje@mydomain.com --
10/25/06 16:50:41:946 -- (24572) Blacklist cache - removed IP 217.76.36.51 from limbo during cleanup
10/25/06 16:50:41:977 -- (27712) Sending email from Hortencia@aallonlineprofits.net to jml@mydomain.com --
10/25/06 16:50:42:024 -- (32284) Exception - Access Violation Access violation at address 7C81D150 in module 'ntdll.dll'. Read of address FFBE001F
10/25/06 16:50:42:071 -- (24572) Blacklist cache - removed IP 221.162.107.163 from limbo during cleanup
10/25/06 16:50:42:102 -- (27712) Exception - Access Violation Access violation at address 7C81D150 in module 'ntdll.dll'. Read of address FFBE001F
10/25/06 16:50:42:149 -- (24572) Blacklist cache - removed IP 222.154.30.23 from limbo during cleanup
10/25/06 16:50:42:196 -- (29544) Sending email from Hortencia@aallonlineprofits.net to ljp@mydomain.com --
10/25/06 16:50:42:227 -- (24572) Blacklist cache - removed IP 222.37.134.95 from limbo during cleanup
10/25/06 16:50:42:274 -- (28576) Sending email from Hortencia@aallonlineprofits.net to blt@mydomain.com --
10/25/06 16:50:42:321 -- (29544) Exception - Access Violation Access violation at address 7C81D150 in module 'ntdll.dll'. Read of address FFBE001F
10/25/06 16:50:42:352 -- (24572) Blacklist cache - removed IP 24.123.22.198 from limbo during cleanup
10/25/06 16:50:42:399 -- (28576) Exception - Access Violation Access violation at address 7C81D150 in module 'ntdll.dll'. Read of address FFBE001F
10/25/06 16:50:42:446 -- (24572) Blacklist cache - removed IP 24.123.28.53 from limbo during cleanup
 
10/25/06 16:56:41:539 -- (11696) HTTP Error in GetSFDBStats:Access violation at address 7C81D150 in module 'ntdll.dll'. Read of address FFBE001F
10/25/06 18:07:03:117 -- Bayesian Thread is not running - starting...
10/25/06 18:07:03:149 -- (4524) BayesianThread starting
10/25/06 18:07:03:196 -- (4524) TBayesianThread - Begin LoadFromFile for corpus.db (db.dat)
10/25/06 18:07:03:305 -- (4524) TBayesianThread - LoadFromFile for Corpus.db - copied db.dat -> IndA142.tmp
10/25/06 18:07:03:399 -- (4524) TBayesianThread - LoadFromFile for Corpus.db - copied db.dat.prb -> IndA143.tmp
10/25/06 18:07:03:446 -- (4524) TBayesianThread - LoadFromFile for Corpus.db - setting Buffer size to 20930398
10/25/06 18:07:03:477 -- (4524) TBayesianThread - LoadFromFile for Corpus.db - Reading Buffer in mem
10/25/06 18:07:03:571 -- (4524) TBayesianThread - LoadFromFile for Corpus.db - loaded files in memory - IndA142.tmp
10/25/06 18:07:03:649 -- (4524) TBayesianThread - LoadFromFile for Corpus.db - loaded files in memory - IndA143.tmp
10/25/06 18:07:06:117 -- (4524) TBayesianThread - End LoadFromFile for corpus.db (db.dat) (2670)
10/25/06 18:07:41:446 -- (9384) Blacklist cache - starting cleanup
10/25/06 18:08:41:446 -- Starting to process queue directory...
10/25/06 18:08:41:492 -- (4848) HTTP Error in GetSFDBStats:Cannot allocate socket.
10/25/06 18:08:41:524 -- (8660) Blacklist cache - starting cleanup
10/25/06 18:09:41:446 -- (13180) Blacklist cache - starting cleanup
10/25/06 18:10:41:446 -- (16824) Blacklist cache - starting cleanup
10/25/06 18:10:41:492 -- (18148) HTTP Error in GetSFDBStats:Cannot allocate socket.
10/25/06 18:11:41:446 -- Starting to process queue directory...
10/25/06 18:11:41:477 -- (17376) Blacklist cache - starting cleanup
10/25/06 18:12:41:446 -- (23116) Blacklist cache - starting cleanup
10/25/06 18:12:41:492 -- (22684) HTTP Error in GetSFDBStats:Cannot allocate socket.
10/25/06 18:13:41:446 -- (23664) Blacklist cache - starting cleanup

 



