From: Anton Altaparmakov <aia21@cam.ac.uk>
To: Jens Axboe <axboe@suse.de>
Cc: Andy Chittenden <AChittenden@bluearc.com>,
	Andrew Morton <akpm@osdl.org>,
	davej@redhat.com, linux-kernel@vger.kernel.org,
	lwoodman@redhat.com
Subject: adding swap workarounds oom - was: Re: Out of Memory: Killed process 16498 (java).
Date: Fri, 03 Feb 2006 09:20:09 +0000
Message-ID: <1138958409.3828.9.camel@imp.csi.cam.ac.uk>
In-Reply-To: <1138372797.22112.44.camel@imp.csi.cam.ac.uk>

Hi,

On Fri, 2006-01-27 at 14:39 +0000, Anton Altaparmakov wrote:
> A colleague has a server (which does backups) that is incapable of doing
> a backup due to the backup process being killed due to OOM after
> anywhere between 30s and a few minutes of running...  And the backup
> process is just a simple program that does the equivalent of "dd with
> one source but two destinations" where the source is an lvm/dm snapshot
> and the two destinations are two different tape drives attached via
> scsi.  That is pretty critical, admittedly only to us and that system...

We found a workaround for the OOM problems on the above server yesterday.

Add a 1MiB swap file:

dd if=/dev/zero of=/var/swapfile bs=1024 count=1024
mkswap /var/swapfile
swapon /var/swapfile

Run the backup script and there are no problems!
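
To confirm that the swap file is actually active before running the backup, one option is to read /proc/swaps. The following is only a minimal sketch: the function name swap_total_kib is made up for illustration, and the optional file argument exists only so the parsing can be tried against a saved copy.

```shell
#!/bin/sh
# Hypothetical helper (not from the original message): sum the Size
# column (reported in KiB) of every active swap area.
# Reads /proc/swaps by default; another file can be passed as $1.
swap_total_kib() {
    # Skip the header line, add up column 3, print 0 if no swap is active.
    awk 'NR > 1 { total += $3 } END { print total + 0 }' "${1:-/proc/swaps}"
}
```

Calling swap_total_kib with no argument after swapon should print a non-zero value if the swap file was enabled, and 0 on a machine with no swap at all.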

Note: This is a SUSE SLES9 system.  The problem is not present on
kernel-smp-2.6.5-7.193.i586.rpm and all earlier kernels, and it is
present on kernel-smp-2.6.5-7.201.i586.rpm and all later kernels,
including the latest kernel (2.6.5-7.244).

Seems like a definite VM bug...  Interestingly on the .244 kernel the
OOM conditions print out a lot of debug information to dmesg about the
memory use in the system and AFAICS none of the memory is exhausted!  So
it seems the system goes OOM without it actually being OOM because it
detects that "free swap == 0" or something along those lines...
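
One way to watch this from user space while the backup runs is to sample a few fields of /proc/meminfo. This is a sketch under assumptions, not a diagnosis of the bug: the helper name meminfo_kb is hypothetical, and the choice of fields (MemFree, SwapTotal, SwapFree) is just a guess at what is relevant.

```shell
#!/bin/sh
# Hypothetical helper (not from the original message): print the value,
# in kB, of a single /proc/meminfo field such as MemFree or SwapFree.
# $1 is the field name without the colon; $2 optionally names another
# file in the same format (defaults to /proc/meminfo).
meminfo_kb() {
    awk -v key="$1:" '$1 == key { print $2 }' "${2:-/proc/meminfo}"
}
```

For example, running `echo "$(meminfo_kb MemFree) $(meminfo_kb SwapTotal) $(meminfo_kb SwapFree)"` in a loop with a short sleep would record whether free memory really approaches zero before the OOM killer fires.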

Or do we nowadays require swap to be present?

The machine has 6GiB RAM so swap was turned off on it.  (In our
experience, if a machine with a lot of concurrent connections starts
swapping, the system goes down the drain because it becomes too slow,
so swap is not something we want on servers with 40000+ users...)

If the above is not enough information to find/fix the problem please
let me know what more you would like to know...

Best regards,

        Anton
-- 
Anton Altaparmakov <aia21 at cam.ac.uk> (replace at with @)
Unix Support, Computing Service, University of Cambridge, CB2 3QH, UK
Linux NTFS maintainer / IRC: #ntfs on irc.freenode.net
WWW: http://linux-ntfs.sf.net/ & http://www-stu.christs.cam.ac.uk/~aia21/

