From mboxrd@z Thu Jan 1 00:00:00 1970 From: Albert ARIBAUD Date: Sat, 11 May 2013 09:40:02 +0200 Subject: [U-Boot] [PATCH 4/4] arm: factorize relocate_code routine In-Reply-To: <1470173111.739335.1368237849736.JavaMail.root@advansee.com> References: <1368223012-17609-1-git-send-email-albert.u.boot@aribaud.net> <1368223012-17609-2-git-send-email-albert.u.boot@aribaud.net> <1368223012-17609-3-git-send-email-albert.u.boot@aribaud.net> <1368223012-17609-4-git-send-email-albert.u.boot@aribaud.net> <1368223012-17609-5-git-send-email-albert.u.boot@aribaud.net> <1470173111.739335.1368237849736.JavaMail.root@advansee.com> Message-ID: <20130511094002.30f2c36d@lilith> List-Id: MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit To: u-boot@lists.denx.de Hi Beno?t, On Sat, 11 May 2013 04:04:09 +0200 (CEST), Beno?t Th?baudeau wrote: > Hi Albert, > > On Friday, May 10, 2013 11:56:52 PM, Albert ARIBAUD wrote: > > Signed-off-by: Albert ARIBAUD > > diff --git a/arch/arm/cpu/pxa/start.S b/arch/arm/cpu/pxa/start.S > > index ada91a6..3078bec 100644 > > --- a/arch/arm/cpu/pxa/start.S > > +++ b/arch/arm/cpu/pxa/start.S > > @@ -170,90 +170,6 @@ reset: > > > > bl _main > > > > -/*------------------------------------------------------------------------------*/ > > -#ifndef CONFIG_SPL_BUILD > > -/* > > - * void relocate_code(addr_moni) > > - * > > - * This function relocates the monitor code. > > - */ > > - .globl relocate_code > > -relocate_code: > > - mov r6, r0 /* save addr of destination */ > > - > > -/* Disable the Dcache RAM lock for stack now */ > > -#ifdef CONFIG_CPU_PXA25X > > - mov r12, lr > > - bl cpu_init_crit > > - mov lr, r12 > > -#endif > > What about this thing that you silently drop? Overlook on my side, and PXA is not one of my test vehicles so I missed it while testing. Will 'undrop' in V2. > > diff --git a/arch/arm/lib/relocate.S b/arch/arm/lib/relocate.S > > new file mode 100644 > > index 0000000..ce533ca > > --- /dev/null > > +++ b/arch/arm/lib/relocate.S > > @@ -0,0 +1,100 @@ > > +/* > > + * relocate - common relocation function for ARM U-Boot > > + * > > + * Copyright (c) 2013 Albert ARIBAUD > > + * > > + * See file CREDITS for list of people who contributed to this > > + * project. > > + * > > + * This program is free software; you can redistribute it and/or > > + * modify it under the terms of the GNU General Public License as > > + * published by the Free Software Foundation; either version 2 of > > + * the License, or (at your option) any later version. > > + * > > + * This program is distributed in the hope that it will be useful, > > + * but WITHOUT ANY WARRANTY; without even the implied warranty of > > + * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the > > + * GNU General Public License for more details. > > + * > > + * You should have received a copy of the GNU General Public License > > + * along with this program; if not, write to the Free Software > > + * Foundation, Inc., 59 Temple Place, Suite 330, Boston, > > + * MA 02111-1307 USA > > + */ > > + > > +/* > > + * These are defined in the board-specific linker script. > > + * Subtracting _start from them lets the linker put their > > + * relative position in the executable instead of leaving > > + * them null. > > + */ > > + > > + .globl __image_copy_start > > + .globl __image_copy_end > > + .globl __rel_dyn_start > > + .globl __rel_dyn_end > > + .globl __dynsym_start > > .globl is for exporting symbols to the linker, not importing, so the lines above > should be removed. correct. Will drop in V2. > > + > > +/*------------------------------------------------------------------------------*/ > > + > > +/* > > + * void relocate_code(addr_moni) > > + * > > + * This function relocates the monitor code. > > + */ > > + .globl relocate_code > > +relocate_code: > > Or "ENTRY(relocate_code)" instead of the two lines above. I'd gone for just moving code around rather than cosmetic, but I can do the cosmetic change at the end of the patch too. > > + mov r6, r0 /* save addr of destination */ > > + > > + ldr r0, =_start > > This is wrong. Here, r0 will get _start link-time address, while with 'adr' in > the original code, r0 was getting _start runtime address, which is not the same > for all boards. relocate_code() is and must stay position-independent code, at > least for SPL. > > E.g., for mx31pdk or tx25, the SPL is first loaded by the boot ROM in the NAND > Flash controller buffer, then executed from there. The SPL then has to use > relocate_code() to relocate itself to CONFIG_SPL_TEXT_BASE in order to free the > NFC buffer to load U-Boot from NAND Flash. This means that 'ldr r0, =_start' > would set r0 to CONFIG_SPL_TEXT_BASE, while 'adr r0, _start' sets r0 to the > address of the NFC buffer. Since the SPL calls relocate_code() with > CONFIG_SPL_TEXT_BASE, the 'ldr' choice would just result in a branch to > relocate_done below, which would abort the relocation and break the boot on > those boards. > > The issue is that 'adr' requires that _start be defined in the same file and > section. That's why my patch 31/31 was using a macro to define relocate_code() > in the start.S files. Perhaps some other constructs like > 'sub r0, pc, . - _start' would work, but I doubt it if _start is not in the same > file and section since this is exactly how 'adr' is translated. Alright, so -- formally, there is no assumption that code running before relocation must be position-independent, as far as I remember. The assumption is that it is designed to run at the link-time location, and that it is responsible for making sure that the rest of the code is properly relocated and thus will run at whatever location it was moved to. In the case you mention, SPL does not relocate itself: it only *moves* itself, skipping the actual relocation (reference fixing part) -- it could havdoe so with a short ad hoc loop instead of relocate_code, and thus not impose a specific requirement on relocate_code that was not initially there. I'd be happier with a preprocessor conditional at the crt0.S stage to choose between relocate_code and a new, short, move_code routine depending on whether we are building SPL or not. That being said, I am fine with trying to make the code before relocation as position-independent as possible anyway; let me see how this can be done. > > + subs r9, r6, r0 /* r9 <- relocation offset */ > > + beq relocate_done /* skip relocation */ > > + mov r1, r6 /* r1 <- scratch for copy_loop */ > > + ldr r2, =__image_copy_end > > + > > +copy_loop: > > + ldmia r0!, {r10-r11} /* copy from source address [r0] */ > > + stmia r1!, {r10-r11} /* copy to target address [r1] */ > > + cmp r0, r2 /* until source end address [r2] */ > > + blo copy_loop > > + > > +#ifndef CONFIG_SPL_BUILD > > + /* > > + * fix .rel.dyn relocations > > + */ > > + ldr r0, =_TEXT_BASE /* r0 <- Text base */ > > The value you load above into r0 then gets overwritten prior to having been > used, so you can drop this line, all the more it is wrong, loading the address > of _TEXT_BASE instead of its value. Correct. Will fix in V2. > > + ldr r10, =__dynsym_start /* r10 <- sym table ofs */ > > + ldr r2, =__rel_dyn_start /* r2 <- rel dyn start ofs */ > > + ldr r3, =__rel_dyn_end /* r3 <- rel dyn end ofs */ > > 'ofs' should be replaced with 'in FLASH' in the comments above. > > Contrary to above, this construct works here because this can't be SPL code. Actually, this would not work either if used by non-SPL code as SPL code expects to use it, e.g. by loading U-Boot at another address -- the issue is in the use scenario, not in the requirements for SPL vs. U-Boot. > > +relocate_done: > > + > > + bx lr > > This instruction is not supported on ARMv4, even if GCC does not complain about > it (it looks like this is not even fixed by the linker, except if the > instruction was issued by GCC itself), but it is required for more recent ARM > versions for Thumb inter-working, which is enabled by some boards in U-Boot. > Hence, shouldn't the line above be replaced with: > > #ifdef __ARM_ARCH_4__ > mov pc, lr > #else > bx lr > #endif > > Or using CONFIG_SYS_THUMB_BUILD? I would have preferred adding --fix-v4bx to the linker flags for ARMv4 targets, but it seems to have no effect at least with some toolchains. I'll fix the code above with the ARMv4 conditional, as it is always correct regardless of U-boot configuration options semantics. > "ENDPROC(relocate_code)" should be added here if you go for ENTRY() at the > beginning. Ok. > Best regards, > Beno?t Thanks for your review! Amicalement, -- Albert.