* bo problem, file-max limit reached within two days @ 2012-04-02 8:36 Knut Petersen 2012-04-02 8:44 ` Chris Wilson 0 siblings, 1 reply; 6+ messages in thread From: Knut Petersen @ 2012-04-02 8:36 UTC (permalink / raw) To: Chris Wilson, intel-gfx After a few unattended hours Xorg was still running, but only a terminal window had survived the night :-( Xorg: git, a few days old, kernel: 3.3. Xorg.0.log: Nothing unusual dmesg: [156859.078080] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [156869.197310] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [156870.008424] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [156871.268130] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [156872.988325] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [156879.187113] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [156880.046793] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [156898.079853] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [170448.897871] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [170449.982378] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [170450.399949] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [170451.999312] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [170458.007902] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [170458.795343] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [170756.249755] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [170756.572389] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [170769.156889] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [170769.392065] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [170770.703797] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [170772.407662] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [170779.460247] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [170788.713283] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [170795.466570] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [170796.491909] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [170798.477484] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [170804.722188] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [170805.534740] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 [179417.374016] VFS: file-max limit 204863 reached [179418.373995] VFS: file-max limit 204863 reached [179419.373969] VFS: file-max limit 204863 reached [179420.373982] VFS: file-max limit 204863 reached [179422.373995] VFS: file-max limit 204863 reached [179423.374002] VFS: file-max limit 204863 reached [179425.373970] VFS: file-max limit 204863 reached [179427.404335] VFS: file-max limit 204863 reached [179428.373994] VFS: file-max limit 204863 reached [179429.373992] VFS: file-max limit 204863 reached [179430.381375] VFS: file-max limit 204863 reached [179431.374017] VFS: file-max limit 204863 reached [179432.380678] VFS: file-max limit 204863 reached [179433.373972] VFS: file-max limit 204863 reached [179434.385198] VFS: file-max limit 204863 reached [179435.374010] VFS: file-max limit 204863 reached [179437.373992] VFS: file-max limit 204863 reached [179438.373992] VFS: file-max limit 204863 reached [179440.374007] VFS: file-max limit 204863 reached [179442.420661] VFS: file-max limit 204863 reached [179443.373989] VFS: file-max limit 204863 reached [179444.373998] VFS: file-max limit 204863 reached [179445.394205] VFS: file-max limit 204863 reached [179446.373974] VFS: file-max limit 204863 reached [179447.395051] VFS: file-max limit 204863 reached [179448.373964] VFS: file-max limit 204863 reached [179449.402791] VFS: file-max limit 204863 reached [179450.373994] VFS: file-max limit 204863 reached [179452.374004] VFS: file-max limit 204863 reached [179453.374010] VFS: file-max limit 204863 reached [179455.374067] VFS: file-max limit 204863 reached [179455.797582] VFS: file-max limit 204863 reached [179472.450203] VFS: file-max limit 204863 reached [179473.373970] VFS: file-max limit 204863 reached [179474.374087] VFS: file-max limit 204863 reached [179475.424852] VFS: file-max limit 204863 reached [179476.401695] VFS: file-max limit 204863 reached [179477.427067] VFS: file-max limit 204863 reached [179478.398170] VFS: file-max limit 204863 reached [179479.425794] VFS: file-max limit 204863 reached [179480.373985] VFS: file-max limit 204863 reached [179482.374007] VFS: file-max limit 204863 reached [179483.373988] VFS: file-max limit 204863 reached [179485.373987] VFS: file-max limit 204863 reached I built a new X server and tried kernel 3.2.12 - that does not seem to help as both the number of open files and the number of gem objects still grow without obvious reasons. Any idea? cu, Knut ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: bo problem, file-max limit reached within two days 2012-04-02 8:36 bo problem, file-max limit reached within two days Knut Petersen @ 2012-04-02 8:44 ` Chris Wilson 2012-04-12 7:55 ` Knut Petersen 0 siblings, 1 reply; 6+ messages in thread From: Chris Wilson @ 2012-04-02 8:44 UTC (permalink / raw) To: Knut Petersen, intel-gfx On Mon, 02 Apr 2012 10:36:44 +0200, Knut Petersen <Knut_Petersen@t-online.de> wrote: > After a few unattended hours Xorg was still running, > but only a terminal window had survived the night :-( > > Xorg: git, a few days old, kernel: 3.3. > > Xorg.0.log: Nothing unusual > > dmesg: > > [156859.078080] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 > [179417.374016] VFS: file-max limit 204863 reached > > I built a new X server and tried kernel 3.2.12 - that does not seem to help as > both the number of open files and the number of gem objects still grow without > obvious reasons. It's the EFILE that is truly worrying. We have a patch in the queue to help ease the ENOSPC issue, but the EFILE implies a bo reference leak. And that I have not found yet. Happy hunting, -Chris -- Chris Wilson, Intel Open Source Technology Centre ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: bo problem, file-max limit reached within two days 2012-04-02 8:44 ` Chris Wilson @ 2012-04-12 7:55 ` Knut Petersen 2012-04-12 8:03 ` Daniel Vetter 2012-04-12 8:06 ` Chris Wilson 0 siblings, 2 replies; 6+ messages in thread From: Knut Petersen @ 2012-04-12 7:55 UTC (permalink / raw) To: Chris Wilson; +Cc: intel-gfx Am 02.04.2012 10:44, schrieb Chris Wilson: > On Mon, 02 Apr 2012 10:36:44 +0200, Knut Petersen<Knut_Petersen@t-online.de> wrote: >> After a few unattended hours Xorg was still running, >> but only a terminal window had survived the night :-( >> >> Xorg: git, a few days old, kernel: 3.3. >> >> Xorg.0.log: Nothing unusual >> >> dmesg: >> >> [156859.078080] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 >> [179417.374016] VFS: file-max limit 204863 reached >> >> I built a new X server and tried kernel 3.2.12 - that does not seem to help as >> both the number of open files and the number of gem objects still grow without >> obvious reasons. > It's the EFILE that is truly worrying. We have a patch in the queue to > help ease the ENOSPC issue, but the EFILE implies a bo reference leak. > And that I have not found yet. > > Happy hunting, > -Chris > Well, it takes more than two days to trigger the EFILE limit here, but yesterday the ENOSPC bo problem appeared 6.5 hours after booting the system with current Xorg git and kernel 3.3.1. I wonder if this is one problem or if we face two independent bugs. cu, Knut ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: bo problem, file-max limit reached within two days 2012-04-12 7:55 ` Knut Petersen @ 2012-04-12 8:03 ` Daniel Vetter 2012-04-12 12:50 ` Knut Petersen 2012-04-12 8:06 ` Chris Wilson 1 sibling, 1 reply; 6+ messages in thread From: Daniel Vetter @ 2012-04-12 8:03 UTC (permalink / raw) To: Knut Petersen; +Cc: intel-gfx On Thu, Apr 12, 2012 at 09:55:11AM +0200, Knut Petersen wrote: > Am 02.04.2012 10:44, schrieb Chris Wilson: > >On Mon, 02 Apr 2012 10:36:44 +0200, Knut Petersen<Knut_Petersen@t-online.de> wrote: > >>After a few unattended hours Xorg was still running, > >>but only a terminal window had survived the night :-( > >> > >>Xorg: git, a few days old, kernel: 3.3. > >> > >>Xorg.0.log: Nothing unusual > >> > >>dmesg: > >> > >>[156859.078080] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 > >>[179417.374016] VFS: file-max limit 204863 reached > >> > >>I built a new X server and tried kernel 3.2.12 - that does not seem to help as > >>both the number of open files and the number of gem objects still grow without > >>obvious reasons. > >It's the EFILE that is truly worrying. We have a patch in the queue to > >help ease the ENOSPC issue, but the EFILE implies a bo reference leak. > >And that I have not found yet. > > > >Happy hunting, > >-Chris > > > > Well, it takes more than two days to trigger the EFILE limit here, but yesterday > the ENOSPC bo problem appeared 6.5 hours after booting the system with current > Xorg git and kernel 3.3.1. I wonder if this is one problem or if we face two independent bugs. Can you try the drm-intel-next-queued branch from my git repo at http://cgit.freedesktop.org/~danvet/drm-intel/ That contains a patch from Chris Wilson to mitigate mmio offset exhaustion, one possible reason for a -ENOSPC failure. -Daniel -- Daniel Vetter Mail: daniel@ffwll.ch Mobile: +41 (0)79 365 57 48 ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: bo problem, file-max limit reached within two days 2012-04-12 8:03 ` Daniel Vetter @ 2012-04-12 12:50 ` Knut Petersen 0 siblings, 0 replies; 6+ messages in thread From: Knut Petersen @ 2012-04-12 12:50 UTC (permalink / raw) To: Daniel Vetter; +Cc: intel-gfx Am 12.04.2012 10:03, schrieb Daniel Vetter: > Can you try the drm-intel-next-queued branch from my git repo at http://cgit.freedesktop.org/~danvet/drm-intel/ That contains a patch from Chris Wilson to mitigate mmio offset exhaustion, one possible reason for a -ENOSPC failure. -Daniel It´s a short time for a test, but drm-intel-next-queued does not look better than other kernels: date; cat /proc/sys/fs/file-nr; cat /sys/kernel/debug/dri/0/i915_gem_objects Do 12. Apr 13:21:50 CEST 2012 9323 0 204854 1275 objects, 81797120 bytes 1251 [1251] objects, 84410368 [84410368] bytes in gtt 19 [19] active objects, 11051008 [11051008] bytes 2 [2] pinned objects, 5373952 [5373952] bytes 1230 [1230] inactive objects, 67985408 [67985408] bytes 0 [0] freed objects, 0 [0] bytes 3 pinned mappable objects, 13762560 bytes 786 fault mappable objects, 16318464 bytes 268435456 [268435456] gtt total Do 12. Apr 14:09:41 CEST 2012 12387 0 204854 4317 objects, 82264064 bytes 4282 [4282] objects, 84713472 [84713472] bytes in gtt 15 [15] active objects, 11304960 [11304960] bytes 2 [2] pinned objects, 5373952 [5373952] bytes 4265 [4265] inactive objects, 68034560 [68034560] bytes 0 [0] freed objects, 0 [0] bytes 3 pinned mappable objects, 13762560 bytes 3809 fault mappable objects, 27975680 bytes 268435456 [268435456] gtt total Do 12. Apr 14:47:18 CEST 2012 15616 0 204854 7015 objects, 148389888 bytes 905 [905] objects, 73834496 [73834496] bytes in gtt 15 [15] active objects, 11304960 [11304960] bytes 2 [2] pinned objects, 5373952 [5373952] bytes 888 [888] inactive objects, 57155584 [57155584] bytes 0 [0] freed objects, 0 [0] bytes 3 pinned mappable objects, 13762560 bytes 699 fault mappable objects, 13668352 bytes 268435456 [268435456] gtt total Files, total number of objects, inactive objects and fault mappable objects raise at a rate comparable to kernels 3.3.1, 3.2.* and 3.1*. I searched about 15 months of log files - both error messages appeared for the first time about 14 days ago. Kernel 3.2 is much older, and the system survived _much_ longer uptimes previously. Either it´s not a kernel problem, or that kernel problem is exposed by some recent (max 6 weeks) change in Xorg, KDE, ... etc. Unfortunately there were a lot of changes in that time. cu, Knut ^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: bo problem, file-max limit reached within two days 2012-04-12 7:55 ` Knut Petersen 2012-04-12 8:03 ` Daniel Vetter @ 2012-04-12 8:06 ` Chris Wilson 1 sibling, 0 replies; 6+ messages in thread From: Chris Wilson @ 2012-04-12 8:06 UTC (permalink / raw) To: Knut Petersen; +Cc: intel-gfx On Thu, 12 Apr 2012 09:55:11 +0200, Knut Petersen <Knut_Petersen@t-online.de> wrote: > Am 02.04.2012 10:44, schrieb Chris Wilson: > > On Mon, 02 Apr 2012 10:36:44 +0200, Knut Petersen<Knut_Petersen@t-online.de> wrote: > >> After a few unattended hours Xorg was still running, > >> but only a terminal window had survived the night :-( > >> > >> Xorg: git, a few days old, kernel: 3.3. > >> > >> Xorg.0.log: Nothing unusual > >> > >> dmesg: > >> > >> [156859.078080] [drm:drm_gem_create_mmap_offset] *ERROR* failed to allocate offset for bo 0 > >> [179417.374016] VFS: file-max limit 204863 reached > >> > >> I built a new X server and tried kernel 3.2.12 - that does not seem to help as > >> both the number of open files and the number of gem objects still grow without > >> obvious reasons. > > It's the EFILE that is truly worrying. We have a patch in the queue to > > help ease the ENOSPC issue, but the EFILE implies a bo reference leak. > > And that I have not found yet. > > > > Happy hunting, > > -Chris > > > > Well, it takes more than two days to trigger the EFILE limit here, but yesterday > the ENOSPC bo problem appeared 6.5 hours after booting the system with current > Xorg git and kernel 3.3.1. I wonder if this is one problem or if we face two independent bugs. Where there is one, there is likely to be a cluster. However both ENOSPC and EFILE can be explained by a bo leak, just differing by how they were used prior to the leak. Eventually we run out of fd handles on the system (EFILE). If we mapped those into our process space before the leak then we hit the much smaller address space limit (ENOSPC). -Chris -- Chris Wilson, Intel Open Source Technology Centre ^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2012-04-12 12:50 UTC | newest] Thread overview: 6+ messages (download: mbox.gz / follow: Atom feed) -- links below jump to the message on this page -- 2012-04-02 8:36 bo problem, file-max limit reached within two days Knut Petersen 2012-04-02 8:44 ` Chris Wilson 2012-04-12 7:55 ` Knut Petersen 2012-04-12 8:03 ` Daniel Vetter 2012-04-12 12:50 ` Knut Petersen 2012-04-12 8:06 ` Chris Wilson
This is an external index of several public inboxes, see mirroring instructions on how to clone and mirror all data and code used by this external index.