Spam Filter ISP Support Forum


Too many connections. Disconnecting

swaber (Newbie), posted 09 October 2006 at 8:06pm

I updated our system to version 3.1.3.597 last Saturday, 9/30/06, and have been running into an issue ever since. Our server runs fine for a day or two, then drops into a state where it rejects all connections with “Too many connections” and requires a restart of the service to recover. While it is in this state, the Connections tab shows no current connections in the table, yet the system indicates 25. Our Max connections settings are the same as they were under 2.7.1.515.

Log File Sample

10/06/06 06:52:18:614 -- (8432) Connection from: 218.15.1.19  -  Originating country : China
10/06/06 06:52:18:614 -- (8432) Too many connections. Disconnecting: 218.15.1.19
10/06/06 06:52:18:614 -- (8432) No Data Received
10/06/06 06:52:18:614 -- (8432) Disconnect
10/06/06 06:52:20:630 -- (11832) Connection from: 190.24.129.66  -  Originating country : Colombia
10/06/06 06:52:20:630 -- (11832) Too many connections. Disconnecting: 190.24.129.66
10/06/06 06:52:20:630 -- (11832) No Data Received
10/06/06 06:52:20:630 -- (11832) Disconnect
10/06/06 06:52:31:520 -- (11840) Connection from: 207.44.208.114  -  Originating country : United States
10/06/06 06:52:31:520 -- (11840) Too many connections. Disconnecting: 207.44.208.114
10/06/06 06:52:31:520 -- (11840) No Data Received
10/06/06 06:52:31:520 -- (11840) Disconnect
Etc.....
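
To see when the rejections start and how dense they are, the activity log can be tallied by hour. The following is only a rough sketch: the line format is inferred from the sample above, and the function name `rejections_per_hour` is made up for illustration.

```python
import re
from collections import Counter

# Matches lines such as:
# 10/06/06 06:52:18:614 -- (8432) Too many connections. Disconnecting: 218.15.1.19
REJECT = re.compile(
    r"^(\d{2}/\d{2}/\d{2}) (\d{2}):\d{2}:\d{2}:\d{3} -- \(\d+\) "
    r"Too many connections\. Disconnecting: (\S+)"
)


def rejections_per_hour(lines):
    """Count 'Too many connections' rejections per (date, hour)."""
    counts = Counter()
    for line in lines:
        match = REJECT.match(line.strip())
        if match:
            date, hour, _ip = match.groups()
            counts[(date, hour)] += 1
    return counts


# Lines copied from the log sample in this post.
sample = [
    "10/06/06 06:52:18:614 -- (8432) Connection from: 218.15.1.19  -  Originating country : China",
    "10/06/06 06:52:18:614 -- (8432) Too many connections. Disconnecting: 218.15.1.19",
    "10/06/06 06:52:18:614 -- (8432) No Data Received",
    "10/06/06 06:52:18:614 -- (8432) Disconnect",
    "10/06/06 06:52:20:630 -- (11832) Too many connections. Disconnecting: 190.24.129.66",
    "10/06/06 06:52:31:520 -- (11840) Too many connections. Disconnecting: 207.44.208.114",
]
hourly = rejections_per_hour(sample)
```

Run against a full day's log, the first hour with a non-zero count marks roughly when the server stopped accepting mail.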

Scott Waber, MCSE, CCNP
Systems Administration Specialist
City of Las Vegas

jerbo128 (Senior Member), posted 09 October 2006 at 9:26pm

Scott, take a look here:

http://www.logsat.com/spamfilter/forums2/5637?TID=5637&PN=1

I think it's the same issue that we are having - ours just does not reach the max.

jerbo128

swaber (Newbie), posted 10 October 2006 at 8:50pm

Thanks. I had seen your post; your issue looked similar but seems to be different. I'll confirm on the next occurrence.

Scott Waber, MCSE, CCNP
Systems Administration Specialist
City of Las Vegas

swaber (Newbie), posted 16 October 2006 at 5:37pm

I checked the system after it had been up for about a day and found what appeared to be 10 stuck connections, then checked it the next day and found about 18. So it appears that this is the same issue: we slowly lose available connections until none are left.
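
As a back-of-the-envelope check, those two observations can be extrapolated linearly. This sketch assumes the limit is 25 (the figure the system indicated in the first post), treats "about a day" and "the next day" as uptimes of 1 and 2 days, and assumes the leak rate is constant; none of that is confirmed.

```python
def days_until_limit(samples, limit):
    """Linear estimate of the uptime (in days) at which stuck connections reach limit.

    samples: two (uptime_days, stuck_connections) observations.
    """
    (t0, n0), (t1, n1) = samples
    rate = (n1 - n0) / (t1 - t0)  # stuck connections gained per day
    return t1 + (limit - n1) / rate


# 10 stuck connections after ~1 day, 18 after ~2 days, assumed limit of 25.
estimate = days_until_limit([(1, 10), (2, 18)], limit=25)
```

Under those assumptions the server would run out of connections a little under three days after a restart.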

Scott Waber, MCSE, CCNP
Systems Administration Specialist
City of Las Vegas

LogSat (Admin Group), posted 16 October 2006 at 10:49pm

Scott,

Could you please email us SpamFilter's activity logfile for the day, a screenshot of your "Connections" tab, and the output of the

netstat -n

command from a DOS prompt (screenshot and netstat command should be performed at the same time if possible)?
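
For anyone who wants to capture this repeatedly, here is a minimal sketch that saves the `netstat -n` output with a timestamp and counts connection states on the SMTP port. It assumes SpamFilter listens on port 25 and that the output uses the usual four-column Windows layout; the function names, the output file name, and the sample addresses are all made up for illustration.

```python
import subprocess
from collections import Counter
from datetime import datetime


def summarize_netstat(output, local_port=25):
    """Count TCP connection states for sockets whose local port is local_port."""
    states = Counter()
    for line in output.splitlines():
        fields = line.split()
        # Expected row shape: TCP  <local addr:port>  <remote addr:port>  <STATE>
        if len(fields) == 4 and fields[0] == "TCP":
            if fields[1].rsplit(":", 1)[-1] == str(local_port):
                states[fields[3]] += 1
    return states


def capture(path="netstat_snapshot.txt"):
    """Run `netstat -n`, save the raw output with a timestamp, return a summary."""
    output = subprocess.run(
        ["netstat", "-n"], capture_output=True, text=True, check=True
    ).stdout
    with open(path, "w") as fh:
        fh.write(f"# captured {datetime.now().isoformat()}\n{output}")
    return summarize_netstat(output)


# Made-up sample output (documentation-range addresses), for illustration only.
sample = """
Active Connections

  Proto  Local Address          Foreign Address        State
  TCP    192.0.2.10:25          198.51.100.7:4312      ESTABLISHED
  TCP    192.0.2.10:25          198.51.100.8:1188      CLOSE_WAIT
  TCP    192.0.2.10:25          203.0.113.5:2950       CLOSE_WAIT
  TCP    192.0.2.10:3389        192.0.2.20:51000       ESTABLISHED
"""
summary = summarize_netstat(sample)
```

Calling `capture()` on the server at the same moment the Connections tab screenshot is taken gives two views of the same instant to compare.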

Roberto Franceschetti

LogSat Software

Spam Filter ISP

Stephane (Newbie), posted 18 October 2006 at 11:40am

Hi,

I started getting this same error. It happened two days ago, and again this morning.

It looks like it is related to a timeout with the SFDB.

10/16/06 03:07:34:460 -- (92564) HTTP Error in SFDBUploadIP check:HTTP/1.1 500 Server Error

10/18/06 10:09:29:159 -- (173708) HTTP Error in DoSFDBCheck:Socket Error # 10060 -- Connection timed out.

As soon as I get a timeout or error with the SFDB, connections stay stuck, and eventually we hit "Too many connections" and no more email comes through.
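
To check whether SFDB errors line up with the stuck connections, the "HTTP Error in ..." entries can be pulled out of the log and tallied. This is a rough sketch: the pattern is inferred from the log lines quoted in this thread, and `sfdb_error_counts` is a made-up name.

```python
import re
from collections import Counter

# Matches lines such as:
# 10/18/06 10:09:29:159 -- (173708) HTTP Error in DoSFDBCheck:Socket Error # 10060 -- Connection timed out.
SFDB_ERROR = re.compile(r"-- \(\d+\) HTTP Error in (\w+)(?: check)?:(.*)$")


def sfdb_error_counts(lines):
    """Count HTTP errors per (call name, error detail)."""
    counts = Counter()
    for line in lines:
        match = SFDB_ERROR.search(line.strip())
        if match:
            call, detail = match.groups()
            counts[(call, detail.strip())] += 1
    return counts


# Lines quoted in this thread.
sample = [
    "10/16/06 03:07:34:460 -- (92564) HTTP Error in SFDBUploadIP check:HTTP/1.1 500 Server Error",
    "10/18/06 10:09:29:159 -- (173708) HTTP Error in DoSFDBCheck:Socket Error # 10060 -- Connection timed out.",
    "10/18/06 12:26:56:724 -- (102792) HTTP Error in SFDBUploadIP check:Socket Error # 10054 -- Connection reset by peer.",
]
errors = sfdb_error_counts(sample)
```

Comparing the timestamps of these entries with the moments the connection count starts climbing would support or rule out the correlation.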

mikek (Senior Member), posted 18 October 2006 at 12:01pm

Stephane could have a point here - I reached "max connections" this afternoon as well and after restarting the service I see the number of connections "hanging" at the "RCPT TO" status rising constantly...

Running 3.1.3.601 registered

swaber (Newbie), posted 18 October 2006 at 12:03pm

Yes, I too have seen SFDB-related errors in my log. I discounted them initially since they seem to occur on a regular basis. Over the last couple of days I have been actively watching the server, trying to collect the data tech support asked for, and have developed a new theory. Two days ago I disabled SFDB out of frustration with major ISPs being blocked, and I have not lost one connection since. My guess is that the problem is related to SFDB: our previous version (2.7.1.515) did not have SFDB, and we never had this issue.

Scott Waber, MCSE, CCNP
Systems Administration Specialist
City of Las Vegas

WebGuyz (Senior Member), posted 18 October 2006 at 1:08pm

Starting to suspect SFDB overload myself. Seeing a lot of SFDB hanging as well....

Any chance the bad guyz are attacking the SFDB server(s)?

 



http://www.webguyz.net

Stephane (Newbie), posted 18 October 2006 at 2:33pm

Hi,

Again after lunch..

10/18/06 12:26:56:724 -- (102792) HTTP Error in SFDBUploadIP check:Socket Error # 10054 -- Connection reset by peer.

Stephane (Newbie), posted 18 October 2006 at 2:34pm

Sorry, forgot to mention: my version is 3.1.3.598

LogSat (Admin Group), posted 19 October 2006 at 12:22am

We're trying to figure out what the relationship is, but it is indeed a strange coincidence that we're looking into.

This morning, between 8am and 1pm EST, we experienced severe slowdowns in our internet connection (it was an internal problem at our ISP, no hackers attacking the SFDB, no worries there), and from all your reports, the issues do seem related. We're looking into HTTP timeouts, and are trying to replicate the scenario in our labs. Hopefully we'll have updates soon.
Roberto Franceschetti

LogSat Software

Spam Filter ISP

LogSat (Admin Group), posted 19 October 2006 at 1:58am

SpamFilter has failsafes in place so that mail processing continues even if the SFDB service is not available.

We did find a problem when the SFDB web services are *slooow* rather than unavailable. In this case, the HTTP request timeouts are too long when reporting a spammer IP to the SFDB. The reporting occurs upon disconnect, and this can affect the counter that keeps track of the current connections. The counter misses are very rare and were thus hard to locate.
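
For readers wondering how a slow report on disconnect can leave a connection counter too high, here is a generic sketch of one way to keep such a counter accurate: release the slot in a `finally` block so it happens no matter what the reporting step does. This is not SpamFilter's code and not necessarily how the fix was done; `ConnectionCounter`, `handle_connection`, and `report_spammer` are hypothetical names, and the limit of 25 is taken from the first post.

```python
import threading
from contextlib import contextmanager


class ConnectionCounter:
    """Thread-safe count of open connections with a fixed limit."""

    def __init__(self, limit):
        self.limit = limit
        self.active = 0
        self._lock = threading.Lock()

    @contextmanager
    def slot(self):
        """Reserve a slot and always release it, even if the body raises."""
        with self._lock:
            if self.active >= self.limit:
                raise RuntimeError("Too many connections")
            self.active += 1
        try:
            yield
        finally:
            with self._lock:
                self.active -= 1


def handle_connection(counter, report_spammer):
    """Hypothetical handler; report_spammer stands in for the SFDB upload."""
    with counter.slot():
        # ... the SMTP conversation would happen here ...
        try:
            report_spammer()  # may be slow, time out, or fail
        except OSError:  # TimeoutError is a subclass of OSError
            pass  # a failed report must not break the disconnect path


def slow_report():
    raise TimeoutError("simulated SFDB timeout")


counter = ConnectionCounter(limit=25)
handle_connection(counter, slow_report)
```

After the call, `counter.active` is back at zero even though the simulated report timed out. A short timeout on the HTTP request itself would be the complementary measure, so the slot is not held for long in the first place.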

We were able to replicate this by placing SpamFilter behind a 56K modem and hitting it with 200 concurrent connections. Here we were finally able to reproduce the "stuck connections" problem!

We're testing build 3.1.603 with a fix. So far it looks fine, but it will need to be tested quite a bit more to ensure there are no other issues. If anyone is still suffering from major issues with the "stuck connections", we've made the build available on the website.

Roberto Franceschetti

LogSat Software

Spam Filter ISP

mikek (Senior Member), posted 19 October 2006 at 5:19am

Build 603 looks very promising so far - connection stats have been accurate for the last 3 hours...

WebGuyz (Senior Member), posted 19 October 2006 at 12:30pm

We've run 12k msgs through .603 so far this morning and it looks great. <fingers crossed>

  

http://www.webguyz.net

kfries (Newbie), posted 23 October 2006 at 11:46am

Any updates on the .603 version?  I have been having this issue as well but am hesitant to install .603.  Those of you that have it running, has it been stable so far?

jerbo128 (Senior Member), posted 23 October 2006 at 3:54pm

I have had no problems with 603.  We've processed about 75K messages.

BigDog (Newbie), posted 26 October 2006 at 11:05am

Been having bad problems with SF just as described. It seems that SFDB is creating problems; SF has been down a lot in the last couple of months.

If not noticed, it goes for hours accepting no messages. Turning the SFDB option off appears to make it stable.

Running 3.1.3.597 since it came out; have not tried any pre-release, as there is a Barracuda sitting on the workbench awaiting implementation.  :(

Here are some excerpts from the logs:

10/25/06 16:50:41:446 -- Starting to process queue directory...
10/25/06 16:50:41:493 -- (24572) Blacklist cache - starting cleanup
10/25/06 16:50:41:524 -- (30004) HTTP Error in GetSFDBStats:Access violation at address 7C81D150 in module 'ntdll.dll'. Read of address FFBE001F
10/25/06 16:50:41:571 -- (24572) Blacklist cache - removed IP 190.45.235.212 from limbo during cleanup
10/25/06 16:50:41:618 -- (29492) Sending email from parmentieuoralie@hydranautics.com to sjw@mydomain.com--
10/25/06 16:50:41:649 -- (24572) Blacklist cache - removed IP 202.96.114.27 from limbo during cleanup
10/25/06 16:50:41:696 -- (29492) Exception - Access Violation Access violation at address 7C81D150 in module 'ntdll.dll'. Read of address FFBE001F
10/25/06 16:50:41:743 -- (31452) Sending email from Hortencia@aallonlineprofits.net to bjbrooks@mydomain.com --
10/25/06 16:50:41:774 -- (24572) Blacklist cache - removed IP 209.16.28.247 from limbo during cleanup
10/25/06 16:50:41:821 -- (31452) Exception - Access Violation Access violation at address 7C81D150 in module 'ntdll.dll'. Read of address FFBE001F
10/25/06 16:50:41:852 -- (24572) Blacklist cache - removed IP 216.150.25.108 from limbo during cleanup
10/25/06 16:50:41:899 -- (32284) Sending email from Hortencia@aallonlineprofits.net to cje@mydomain.com --
10/25/06 16:50:41:946 -- (24572) Blacklist cache - removed IP 217.76.36.51 from limbo during cleanup
10/25/06 16:50:41:977 -- (27712) Sending email from Hortencia@aallonlineprofits.net to jml@mydomain.com --
10/25/06 16:50:42:024 -- (32284) Exception - Access Violation Access violation at address 7C81D150 in module 'ntdll.dll'. Read of address FFBE001F
10/25/06 16:50:42:071 -- (24572) Blacklist cache - removed IP 221.162.107.163 from limbo during cleanup
10/25/06 16:50:42:102 -- (27712) Exception - Access Violation Access violation at address 7C81D150 in module 'ntdll.dll'. Read of address FFBE001F
10/25/06 16:50:42:149 -- (24572) Blacklist cache - removed IP 222.154.30.23 from limbo during cleanup
10/25/06 16:50:42:196 -- (29544) Sending email from Hortencia@aallonlineprofits.net to ljp@mydomain.com --
10/25/06 16:50:42:227 -- (24572) Blacklist cache - removed IP 222.37.134.95 from limbo during cleanup
10/25/06 16:50:42:274 -- (28576) Sending email from Hortencia@aallonlineprofits.net to blt@mydomain.com --
10/25/06 16:50:42:321 -- (29544) Exception - Access Violation Access violation at address 7C81D150 in module 'ntdll.dll'. Read of address FFBE001F
10/25/06 16:50:42:352 -- (24572) Blacklist cache - removed IP 24.123.22.198 from limbo during cleanup
10/25/06 16:50:42:399 -- (28576) Exception - Access Violation Access violation at address 7C81D150 in module 'ntdll.dll'. Read of address FFBE001F
10/25/06 16:50:42:446 -- (24572) Blacklist cache - removed IP 24.123.28.53 from limbo during cleanup
 
10/25/06 16:56:41:539 -- (11696) HTTP Error in GetSFDBStats:Access violation at address 7C81D150 in module 'ntdll.dll'. Read of address FFBE001F
10/25/06 18:07:03:117 -- Bayesian Thread is not running - starting...
10/25/06 18:07:03:149 -- (4524) BayesianThread starting
10/25/06 18:07:03:196 -- (4524) TBayesianThread - Begin LoadFromFile for corpus.db (db.dat)
10/25/06 18:07:03:305 -- (4524) TBayesianThread - LoadFromFile for Corpus.db - copied db.dat -> IndA142.tmp
10/25/06 18:07:03:399 -- (4524) TBayesianThread - LoadFromFile for Corpus.db - copied db.dat.prb -> IndA143.tmp
10/25/06 18:07:03:446 -- (4524) TBayesianThread - LoadFromFile for Corpus.db - setting Buffer size to 20930398
10/25/06 18:07:03:477 -- (4524) TBayesianThread - LoadFromFile for Corpus.db - Reading Buffer in mem
10/25/06 18:07:03:571 -- (4524) TBayesianThread - LoadFromFile for Corpus.db - loaded files in memory - IndA142.tmp
10/25/06 18:07:03:649 -- (4524) TBayesianThread - LoadFromFile for Corpus.db - loaded files in memory - IndA143.tmp
10/25/06 18:07:06:117 -- (4524) TBayesianThread - End LoadFromFile for corpus.db (db.dat) (2670)
10/25/06 18:07:41:446 -- (9384) Blacklist cache - starting cleanup
10/25/06 18:08:41:446 -- Starting to process queue directory...
10/25/06 18:08:41:492 -- (4848) HTTP Error in GetSFDBStats:Cannot allocate socket.
10/25/06 18:08:41:524 -- (8660) Blacklist cache - starting cleanup
10/25/06 18:09:41:446 -- (13180) Blacklist cache - starting cleanup
10/25/06 18:10:41:446 -- (16824) Blacklist cache - starting cleanup
10/25/06 18:10:41:492 -- (18148) HTTP Error in GetSFDBStats:Cannot allocate socket.
10/25/06 18:11:41:446 -- Starting to process queue directory...
10/25/06 18:11:41:477 -- (17376) Blacklist cache - starting cleanup
10/25/06 18:12:41:446 -- (23116) Blacklist cache - starting cleanup
10/25/06 18:12:41:492 -- (22684) HTTP Error in GetSFDBStats:Cannot allocate socket.
10/25/06 18:13:41:446 -- (23664) Blacklist cache - starting cleanup
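
Because entries from several threads are interleaved in a log like this, it can help to group them by the thread id in parentheses before reading. The sketch below assumes the number in parentheses is a thread id and that any line not starting with a timestamp is a wrapped continuation of the previous entry (which can happen when logs are pasted into a forum); the function names are made up.

```python
import re
from collections import defaultdict

# A log entry starts with a timestamp; the "(thread id)" part is optional.
ENTRY = re.compile(
    r"^\d{2}/\d{2}/\d{2} \d{2}:\d{2}:\d{2}:\d{3} -- (?:\((\d+)\) )?(.*)$"
)


def group_by_thread(lines):
    """Return {thread_id: [messages]}, joining wrapped lines onto their entry."""
    entries = []  # each item is [thread_id, message]
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        match = ENTRY.match(line)
        if match:
            entries.append([match.group(1), match.group(2)])
        elif entries:
            entries[-1][1] += " " + line  # continuation of the previous entry
    threads = defaultdict(list)
    for thread_id, message in entries:
        threads[thread_id].append(message)
    return dict(threads)


def threads_with_exceptions(threads):
    """Thread ids that logged at least one 'Exception' entry."""
    return sorted(
        tid for tid, msgs in threads.items()
        if tid and any(m.startswith("Exception") for m in msgs)
    )


# Lines copied from the excerpt in this post.
sample = """\
10/25/06 16:50:41:446 -- Starting to process queue directory...
10/25/06 16:50:41:618 -- (29492) Sending email from
parmentieuoralie@hydranautics.com to sjw@mydomain.com--
10/25/06 16:50:41:649 -- (24572) Blacklist cache - removed IP 202.96.114.27 from limbo during cleanup
10/25/06 16:50:41:696 -- (29492) Exception - Access Violation Access violation at address 7C81D150 in module 'ntdll.dll'. Read of address FFBE001F
""".splitlines()
threads = group_by_thread(sample)
failed = threads_with_exceptions(threads)
```

Grouped this way, each "Sending email" entry sits next to the exception logged by the same thread, which makes it easier to see which deliveries failed.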

 


