linux-kernel.vger.kernel.org archive mirror
* Regression from 2.6.36
@ 2011-03-15 13:25 azurIt
  2011-03-17  0:15 ` Greg KH
  0 siblings, 1 reply; 45+ messages in thread
From: azurIt @ 2011-03-15 13:25 UTC (permalink / raw)
  To: linux-kernel


Hi,

we are successfully running several very busy web servers on 2.6.32.* and
a few days ago I decided to upgrade to 2.6.37 (mainly because of the blkio
cgroup). I installed 2.6.37.2 on one of the servers and very strange things
started to happen with the Apache web server.

We are using Apache with MPM-ITK ( http://mpm-itk.sesse.net/ ), so it does
lots of 'fork' and lots of 'setuid' calls. I have also noticed that the
problem happens only on very busy servers.

Everything is ok when Apache is started, but as time passes its 'root'
processes (Apache processes running as root) consume more and more CPU.
Eventually the whole server becomes very unstable and Apache must be restarted.
This repeats until the load on the web sites drops (usually around 22:00).
Sometimes a restart is needed after 3 hours, sometimes after only 1 hour (again,
depending on the load on the web sites). Here is a graph of CPU utilization
showing the problem (red color); Apache was restarted at 8:11 and 9:35:
http://watchdog.sk/lkml/cpu-problem.png

Here is how it looks on htop:
http://watchdog.sk/lkml/htop.jpg

And finally, here is how it looks with older kernels (yes, when I install an
older kernel, the problem is gone). Notice also that I/O wait is much lower and
smoother (blue color):
http://watchdog.sk/lkml/cpu-ok.png

I also ran strace on the Apache processes that were causing problems; here is the output:
http://watchdog.sk/lkml/strace.txt

I'm not 100% sure, but I think the CPU was being consumed in the 'futex' calls.
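
One way to check the 'futex' suspicion is to tally the trace per system call. The following is a minimal sketch, not something taken from the original report: it assumes the trace was recorded with `strace -T`, so that each completed call ends with its elapsed time in angle brackets, and the function name `summarize` is made up for illustration. Note that `-T` reports wall-clock time spent inside each call, which for a blocking futex wait is not the same thing as CPU time, so the totals are only a hint. `strace -c` produces a similar built-in summary.

```python
import re
from collections import defaultdict

# Matches completed calls such as:
#   [pid 123] futex(0x7f..., FUTEX_WAIT, 2, NULL) = 0 <0.000123>
# The optional prefix covers "[pid N] " and bare "N " (strace -f output).
# "<unfinished ...>" and "<... resumed>" lines do not match and are skipped.
LINE_RE = re.compile(r"^(?:\[pid\s+\d+\]\s+|\d+\s+)?(\w+)\(.*<(\d+\.\d+)>\s*$")


def summarize(lines):
    """Return {syscall_name: (call_count, total_elapsed_seconds)}."""
    totals = defaultdict(lambda: [0, 0.0])
    for line in lines:
        match = LINE_RE.match(line.strip())
        if not match:
            continue  # signals, exits, split calls, or unrelated text
        name, elapsed = match.group(1), float(match.group(2))
        totals[name][0] += 1
        totals[name][1] += elapsed
    return {name: (count, total) for name, (count, total) in totals.items()}
```

For example, `summarize(open("strace.txt"))` on a locally saved copy of the trace returns the per-call counts and elapsed totals, which can then be sorted to see whether futex dominates.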

I tried several kernel versions and found that everything BEFORE 2.6.36 is
NOT affected and everything from 2.6.36 onward (inclusive) is affected.

Versions which I tried and were NOT affected by this problem:
2.6.32.*
2.6.35.11

Versions which I tried and were affected by this problem:
2.6.36
2.6.36.4
2.6.37.2
2.6.37.3
2.6.38-rc8 (the final version has not been released yet)

All tests were made on vanilla kernels on Debian Lenny with this config:
http://watchdog.sk/lkml/config

Do you need any other information from me? I'm able to try other versions or
patches but, please, take into account that I have to do this on a _production_
server (I failed to reproduce it in a testing environment). Also, I'm able to
try only one kernel per day.
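
With only one kernel test per day, it helps to estimate how long a bisection between the last good release (2.6.35) and the first bad one (2.6.36) would take. The sketch below assumes a plain binary search, as `git bisect` performs between the `v2.6.35` and `v2.6.36` tags. The function name is illustrative and the commit count passed in is a hypothetical placeholder, not a figure from this thread.

```python
import math


def bisect_steps(num_commits):
    """Approximate number of kernels to build and test when bisecting
    a range of num_commits candidate commits (binary search)."""
    if num_commits < 1:
        raise ValueError("num_commits must be at least 1")
    if num_commits == 1:
        return 0  # a single candidate needs no further testing
    return math.ceil(math.log2(num_commits))


# Hypothetical range size; at one kernel per day, steps equal days.
days_needed = bisect_steps(10000)
```

Because the step count grows only logarithmically, even a range of many thousands of commits stays within about two weeks of daily tests.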

Thank you!

azurit


Thread overview: 45+ messages
2011-03-15 13:25 Regression from 2.6.36 azurIt
2011-03-17  0:15 ` Greg KH
2011-03-17  0:53   ` Dave Jones
2011-03-17 13:30     ` azurIt
2011-04-07 10:01   ` azurIt
2011-04-07 10:19     ` Jiri Slaby
2011-04-07 11:21       ` Américo Wang
2011-04-07 11:57         ` Eric Dumazet
2011-04-07 12:13           ` Eric Dumazet
2011-04-07 15:27             ` Changli Gao
2011-04-07 15:36               ` Eric Dumazet
2011-04-12 22:49                 ` Andrew Morton
2011-04-13  1:23                   ` Changli Gao
2011-04-13  1:31                     ` Andrew Morton
2011-04-13  2:37                       ` Eric Dumazet
2011-04-13  6:54                         ` Regarding memory fragmentation using malloc Pintu Agarwal
2011-04-13 11:44                           ` Américo Wang
2011-04-13 13:56                             ` Pintu Agarwal
2011-04-13 15:25                               ` Michal Nazarewicz
2011-04-14  6:44                                 ` Pintu Agarwal
2011-04-14 10:47                                   ` Michal Nazarewicz
2011-04-14 12:24                                     ` Pintu Agarwal
2011-04-14 12:31                                       ` Michal Nazarewicz
2011-04-13 21:16                         ` Regression from 2.6.36 Andrew Morton
2011-04-13 21:24                           ` Andrew Morton
2011-04-19 19:29                             ` azurIt
2011-04-19 19:55                               ` Andrew Morton
2011-04-13 21:44                           ` David Rientjes
2011-04-13 21:54                             ` Andrew Morton
2011-04-14  2:10                           ` Eric Dumazet
2011-04-14  5:28                             ` Andrew Morton
2011-04-14  6:31                               ` Eric Dumazet
2011-04-14  9:08                                 ` azurIt
2011-04-14 10:27                                   ` Eric Dumazet
2011-04-14 10:31                                     ` azurIt
2011-04-14 10:25                           ` Mel Gorman
2011-04-15  9:59                             ` azurIt
2011-04-15 10:47                               ` Mel Gorman
2011-04-15 10:56                                 ` azurIt
2011-04-15 11:17                                   ` Mel Gorman
2011-04-15 11:36                                     ` azurIt
2011-04-15 13:01                                       ` Mel Gorman
2011-04-15 13:21                                         ` azurIt
2011-04-15 14:15                                           ` Mel Gorman
2011-04-08 12:25               ` azurIt
