linux-mips.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Ralf Baechle <ralf@linux-mips.org>
To: Joshua Kinard <kumba@gentoo.org>
Cc: linux-mips@linux-mips.org
Subject: Re: ARCS can't load CONFIG_DEBUG_LOCK_ALLOC kernel
Date: Thu, 16 Mar 2017 21:50:06 +0100	[thread overview]
Message-ID: <20170316205006.GE10914@linux-mips.org> (raw)
In-Reply-To: <13b0221d-c17a-7e5b-6bb5-702ee7d896c1@gentoo.org>

On Thu, Mar 16, 2017 at 04:02:48PM -0400, Joshua Kinard wrote:

> On 03/16/2017 15:06, Ralf Baechle wrote:
> > On Thu, Mar 16, 2017 at 01:50:42PM -0400, Joshua Kinard wrote:
> > 
> >> On 03/16/2017 10:09, Ralf Baechle wrote:
> >>> On Wed, Mar 15, 2017 at 11:50:44PM -0400, Joshua Kinard wrote:
> >>>
> >>>> On 03/15/2017 16:11, Joshua Kinard wrote:
> >>>>> I've reported in the past that turning on CONFIG_DEBUG_LOCK_ALLOC produces a
> >>>>> kernel that can't boot on several SGI platforms.  It turns out that using
> >>>>> arcload (Stan's bootloader originally written for IP30), I can get some
> >>>>> debugging out on why.  I am still puzzled, but maybe this information can be
> >>>>> interpreted by someone else into something meaningful?
> >>>>>
> >>>>> All addresses printed out of arcload are physical address.
> >>>>>
> >>>>> ARCS Memory Map as printed by some debugging I added to the arcload binary:
> >>>>>
> >>>>> 0x00000000 - 0x00001000 ExceptionBlock
> >>>>> 0x00001000 - 0x00002000 SystemParameterBlock
> >>>>> 0x00002000 - 0x00004000 FirmwarePermanent
> >>>>> 0x20004000 - 0x20f00000 FreeMemory***
> >>>>> 0x20f00000 - 0x21000000 FirmwareTemporary
> >>>>> 0x21000000 - 0x5fff0000 FreeMemory
> >>>>> 0x5fff0000 - 0x5ffff000 LoadedProgram
> >>>>> 0x5ffff000 - 0x60000000 FreeMemory
> >>>>> 0x60000000 - 0xa0000000 FirmwarePermanent
> >>>>
> >>>> So it turns out I can get away, on Octane at least, by changing the load
> >>>> address from 0x20004000 to an arbitrary value in the other FreeMemory segment
> >>>> from 0x21000000 - 0x5fff0000.  Specifically, using 0x21004000 appears to work
> >>>> without any ill effects.
> >>>>
> >>>> The 0x20004000 value is the address used by IRIX to load (with symon, it
> >>>> becomes 0x200800000 instead).  I'll have to try this on the IP27 later on as
> >>>> well.  On Octane, CONFIG_DEBUG_LOCK_ALLOC didn't toss up any major locking
> >>>> issues yet.  Probably need to hammer the disks with bonnie++ or such.  At least
> >>>> I can get back to the BRIDGE/PCI mess now...
> >>>
> >>> I'm wondering where the ARC stack is on kernel entry if maybe the
> >>> ARC stack has corrupted the kernel?  If possible, can you get your
> >>> kernel or a test program to compute a checksum over itself to see
> >>> if it has been corrupted?
> >>
> >> As far as I can tell, it really does seem that it is a sizing issue.  I don't
> >> have the time to dive into what CONFIG_DEBUG_LOCK_ALLOC is exactly doing, but I
> >> found one hit on LKML (lost the URL) that indicates it fluffs up a particular
> >> struct that is very common and so introduces a fair bit of bloat, and it seems
> >> possible that the 0x20004000-0x20f00000 really is too small.  I wouldn't rule
> >> out the possibility that SGI designed ARCS on the Octane to allow only IRIX to
> >> load at this particular address and Linux has just gotten lucky thus far.
> >>
> >> As for whether loading at the next FreeMemory segment in 0x21000000-0x5fff0000
> >> smashes any ARCS segments, that I am not sure about.  A kernel booting in that
> >> segment does boot, and seems to behave no differently than a kernel booting in
> >> the other segment, including exhibiting the same bugs.  Like IP27, Octane
> >> doesn't have a need for ARCS after the kernel boots, as resetting the system
> >> can be done by flipping a bit in HEART, and power down is handled by the RTC
> >> driver (this feature broke, though, and I haven't chased down why yet).  So if
> >> we're clobbering ARCS using this load address...well, it can't be all that bad
> >> </famous-last-words>
> >>
> >> I'll see what IP27 does, assuming it even has a large enough FreeMemory segment
> >> to work with.
> >>
> >>
> >>> Let me repeat my ARC(S) mantra again, ARC(S) is broken, ARC(S) lies.
> >>> Trust is futile.  Even if ARC(S) claims something is free I'd rather
> >>> not rely on it.
> >>
> >> Apparently, and only on Octane, ARCS detects and maps out only the first 1GB of
> >> RAM.  All remaining RAM installed in the system is marked as FirmwarePermanent
> >> and mapped into 0x60000000 on up.
> > 
> > I think on IP27 it was only the first 32MB that are somehow used by
> > ARCS.  Everything else is entirely ignored and the OS is supposed to
> > use klconfig to query the hardware configuration.  That said, klconfig
> > is an infinitely better than ARCS, it actually works and is easy to
> > use.  What it does not provide is information on how firmware or
> > other loaded programs are using memory - it's really just a hardware
> > inventory.
> 
> IIRC, the first 32MB is reserved for use as directory memory on systems with
> less than 32 CPUs.  For 32 or more CPUs, I believe you have to start populating
> the special directory memory DIMM slots.

Completly wrong.  IP27's special memory modules contain the directory for
each 128 byte S-cache line.  This is similar to how other memory controllers
include an ECC for each line of memory.  The directory size and format of
standard memory modules is sufficient for up to 16 nodes.  Note the limit
is about nodes, not processors.

For larger systems IP27 node boards need to be populated with with
``premium directory DIMMs'' which extend the number of directory bits to
cover systems up to 64 nodes (which would be 128 CPUs).  For the few
systems that exceed even that size (we're talking about > 9 full size
racks!) each of the 64 directory bits represents a node in a particular
128 part of the system or coarse mode where each bit represents eight
nodes, thus allowing for 8 * 64 = 512 nodes = 1024 CPUs.

> This does remind me, though, when I installed a router board I found for cheap,
> the kernel, regardless of configuration, wouldn't load at the address defined
> in IP27's Platform file, as ARCS said it was too large.  If I can find a larger
> ARCS segment to load into, I'll have to test that again as well...

The load address used for the IP27 vmlinux was mindlessly copied from
either sash or vmunix itself.  I'd not call that a scientific method
and I never had access to ARC(S) source.

Your system is large enough to require a router board?  I hope you
got cheap power :)

  Ralf

  reply	other threads:[~2017-03-16 20:50 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2017-03-15 20:11 ARCS can't load CONFIG_DEBUG_LOCK_ALLOC kernel Joshua Kinard
2017-03-16  3:50 ` Joshua Kinard
2017-03-16 14:09   ` Ralf Baechle
2017-03-16 17:50     ` Joshua Kinard
2017-03-16 19:06       ` Ralf Baechle
2017-03-16 20:02         ` Joshua Kinard
2017-03-16 20:50           ` Ralf Baechle [this message]
2017-03-17  3:01             ` Joshua Kinard
2017-03-18 23:42               ` Joshua Kinard
2017-03-19  7:23                 ` Joshua Kinard
2017-03-19  8:55                   ` Ralf Baechle
2017-03-21 21:52                     ` Joshua Kinard

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20170316205006.GE10914@linux-mips.org \
    --to=ralf@linux-mips.org \
    --cc=kumba@gentoo.org \
    --cc=linux-mips@linux-mips.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).