Apache :: GracefulShutdownTimeout setting is doing nothing

deepwell · Joined: 19 Jul 2025 Posts: 7 Location: AT, Vienna

Hi all!

we have an issue with Apache in connection to long running WebSocket connections and graceful reloads initiated by logrotate.

Whenever logrotate initiates the graceful reload (every morning), all Apache workers currently handling open WebSocket connections go into 'G' state and - as the WebSocket connections stay open (which is their primary purpose!) - stay that way. This puts the whole Apache process that worker belongs to (we are using event MPM) into a state of having some workers in 'G' state and the rest not accepting new connections.

Usually that means that at least one Apache process ends up in that state every day - and after a few days we reach ServerLimit, the scoreboards is full and we get a "AH00485: scoreboard is full, not at MaxRequestWorkers".

Easy fix: Just set `GracefulShutdownTimeout` to something not Zero and Apache should wait that amount of seconds for the 'G' state workers to finish and if they don't, simply kill them and continue the reload.

At least that is what
https://httpd.apache.org/docs/2.4/mod/mpm_common.html#GracefulShutdownTimeout
suggests IMHO.

But in fact it seems that `GracefulShutdownTimeout` does not do anything!

Even if setting this to just a few seconds, the 'G' state workers stay in that state forever and the hole Apache process is useless (and soon the whole Apache server)... Sad

Did I misinterpret the documentation?

Is there any other way to make Apache not wait forever for WebSocket connections to be closed before finishing a graceful reload? Without killing also "normal" connections!

Does this make Apache totally unusable as a reverse proxy for WebSocket connections?

tangent · Moderator Joined: 16 Aug 2020 Posts: 405 Location: UK

Looking into your problem with GracefulShutdownTimeout, I initially thought you might be running on Windows, with mpm_winnt.

I then noticed you mention event mpm.

Could you try switching to worker mpm to see if that makes a difference?

deepwell · Joined: 19 Jul 2025 Posts: 7 Location: AT, Vienna

Many thx for looking into this! I will try to set up a second machine using worker mpm and get back to you.

deepwell · Joined: 19 Jul 2025 Posts: 7 Location: AT, Vienna

OK, so I tried with the worker MPM now.

It behaves differently but IMHO also not correct.

My worker MPM config for testing is:

tangent · Moderator Joined: 16 Aug 2020 Posts: 405 Location: UK

Ok, so in relation to your problem, worker and event MPM's differ somewhat, but neither seem to honour the GracefulShutdownTimeout if there are open WebSocket connections.

You've not detailed your configuration for supporting WebSocket services, but assume you're using mod_proxy_http together with appropriate proxy options, including a defined timeout (ProxyTimeout), and possibly keepalive. To which end, is there also a firewall between your Apache and backend WebSocket service, which could also feel it's responsible for honouring keepalives? I've been bitten by this scenario before, where the firewall didn't close out inactive backend connections.

One other thing that might be worth exploring is trying mod_proxy_wstunnel over mod_proxy_http. The documentation suggests that since Apache 2.4.47, mod_proxy_http is favoured for handling websocket tunnelling, to which end you'd need to set ProxyWebsocketFallbackToProxyHttp to Off if you want to use mod_proxy_wstunnel.

The released code for mod_proxy_wstunnel.c hasn't been updated for some 2 years, but notably the current GitHub code for this module includes a ProxyWebsocketIdleTimeout option, which would hopefully resolve your problem. However, there's presumably a good reason why this code hasn't made it into the current Apache release.

Alternatively, as a last resort, could you script something to parse your server-status after a graceful restart, and kill off the stale/errant process?

deepwell · Joined: 19 Jul 2025 Posts: 7 Location: AT, Vienna

Thx again for looking into this. As to your last resort: That's what I already did. I built a watchdog into the WebSocket server that checks if the web server (the website) is responsive and if not (= Apache has locked up) kills all WebSocket connections. This works for now, but I only consider this a workaround.

For completeness, here's the Apache config regarding the WebSocket connections (yes, I am using mod_proxy_http):

James Blond

deepwell · Joined: 19 Jul 2025 Posts: 7 Location: AT, Vienna

James Blond

Yes, the old_gen is the one with the G state.

Parsing the scoreboard is a good idea.

untested code. It should work.

deepwell · Joined: 19 Jul 2025 Posts: 7 Location: AT, Vienna

Thx! Yes, I guess using the Apache scoreboard is the best way. We already have a system health check that looks at the scoreboard. I guess we'll add checking for 'G' states there - and switch to Nginx in the long run... Sad