[hunchentoot-devel] Server sporadically hanging

Jeff Cunningham jeffrey at cunningham.net
Wed Feb 20 16:09:18 UTC 2008


I am having a problem that I can't replicate on demand but has been 
happening with increasing frequency. I am hoping some of you might 
suggest ways to troubleshoot it. I don't know that it is a Hunchentoot 
problem, per se, but it may be an interaction problem between 
Hunchentoot and Apache2 via mod_lisp2.

The symptom is the server is hung first thing in the morning when I 
check it. The cpu is at 99% activity on the server image (sbcl). When I 
look in the error_log I see dozens of these:

[Wed Feb 20 05:10:10 2008] [error] (70007)The timeout specified has 
expired: error reading from Lisp
[Wed Feb 20 05:12:23 2008] [error] (70007)The timeout specified has 
expired: error reading from Lisp
[Wed Feb 20 05:15:14 2008] [error] (70007)The timeout specified has 
expired: error reading from Lisp
[Wed Feb 20 05:18:57 2008] [error] (70007)The timeout specified has 
expired: error reading from Lisp

These generally correspond to googlebot activity:

66.249.73.165 - - [20/Feb/2008:05:09:10 -0800] "GET 
/intrepid-obstacle-map.html?hunchentoot-session=18%3A15F2776FDB3BD7CA852EBF74A3B40B2B 
HTTP/1.1" 500 673 "-" "Mozilla/5.0 (c
ompatible; Googlebot/2.1; +http://www.google.com/bot.html)"
66.249.73.165 - - [20/Feb/2008:05:11:23 -0800] "GET 
/index.html?hunchentoot-session=17%3A549FDE65186DCCFD1D2552AA79A49871 
HTTP/1.1" 500 673 "-" "Mozilla/5.0 (compatible; Googl
ebot/2.1; +http://www.google.com/bot.html)"
66.249.73.165 - - [20/Feb/2008:05:14:14 -0800] "GET 
/intrepid-obstacle-map.html?hunchentoot-session=17%3A549FDE65186DCCFD1D2552AA79A49871 
HTTP/1.1" 500 673 "-" "Mozilla/5.0 (c
ompatible; Googlebot/2.1; +http://www.google.com/bot.html)"
66.249.73.165 - - [20/Feb/2008:05:17:57 -0800] "GET 
/intrepid-robothon-2007.html?hunchentoot-session=15%3A3C06276A2A5CD0567B6A332AE52B3940 
HTTP/1.1" 500 673 "-" "Mozilla/5.0 (
compatible; Googlebot/2.1; +http://www.google.com/bot.html)"

And finally, netstat shows a whole bunch of 4242 ports open (these are 
just a small subset:)

tcp      680      0 127.0.0.1:4242          127.0.0.1:38121         
CLOSE_WAIT
tcp      517      0 127.0.0.1:4242          127.0.0.1:44273         
CLOSE_WAIT
tcp        0      0 127.0.0.1:42641         127.0.0.1:4242          
ESTABLISHED
tcp        0      0 127.0.0.1:4242          127.0.0.1:51379         
ESTABLISHED
tcp      509      0 127.0.0.1:4242          127.0.0.1:33727         
CLOSE_WAIT
tcp      515      0 127.0.0.1:4242          127.0.0.1:44279         
CLOSE_WAIT
tcp        0      0 127.0.0.1:4242          127.0.0.1:48386         
ESTABLISHED
tcp      561      0 127.0.0.1:4242          127.0.0.1:44283         
CLOSE_WAIT
tcp        0      0 127.0.0.1:4242          127.0.0.1:42646         
ESTABLISHED
tcp      557      0 127.0.0.1:4242          127.0.0.1:43923         
CLOSE_WAIT
tcp        0      0 192.168.1.102:39210     192.168.1.2:993         
ESTABLISHED
tcp        0      0 127.0.0.1:42651         127.0.0.1:4242          
ESTABLISHED
tcp        0      0 127.0.0.1:4242          127.0.0.1:51382         
ESTABLISHED
tcp      589      0 127.0.0.1:4242          127.0.0.1:48377         
CLOSE_WAIT
tcp      589      0 127.0.0.1:4242          127.0.0.1:56928         
CLOSE_WAIT
tcp      589      0 127.0.0.1:4242          127.0.0.1:54195         
CLOSE_WAIT

The only way I can restart the server is to kill the process, restart 
it, and restart apache.

Any ideas?

Thanks.
--Jeff



More information about the Tbnl-devel mailing list