* [PATCH v3 0/2] Fix binfmt_flat loader for RISC-V @ 2021-04-15 6:15 Damien Le Moal 2021-04-15 6:15 ` [PATCH v3 1/2] binfmt_flat: allow not offsetting data start Damien Le Moal 2021-04-15 6:15 ` [PATCH v3 2/2] riscv: Disable text-data gap in flat binaries Damien Le Moal 0 siblings, 2 replies; 7+ messages in thread From: Damien Le Moal @ 2021-04-15 6:15 UTC (permalink / raw) To: uclinux-dev, ugerg, Palmer Dabbelt, linux-riscv, Alexander Viro, linux-kernel Cc: Max Filippov, Greg Ungerer, Anup Patel, Christoph Hellwig RISC-V NOMMU flat binaries cannot tolerate a gap between the text and data section as the toolchain fully resolves at compile time the PC relative global pointer (__global_pointer$ value loaded in gp register). Without a relocation entry provided, the flat bin loader cannot fix the value if a gap is introduced and executables fail to run. This series fixes this problem by allowing an architecture to request the flat loader to suppress the gap between the text and data sections. The first patch fixes binfmt_flat flat_load_file() using the new configuration option CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP. The second patch enables this option for RISCV NOMMU builds. These patches do not change the binfmt_flat loader behavior for other architectures. Changes from v2: * Updated distribution list * Added Palmer ack-by tag Changes from v1: * Replace FLAT_TEXT_DATA_NO_GAP macro with CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP config option (patch 1). * Remove the addition of riscv/include/asm/flat.h and set CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP for RISCV and !MMU Damien Le Moal (2): binfmt_flat: allow not offsetting data start riscv: Disable text-data gap in flat binaries arch/riscv/Kconfig | 1 + fs/Kconfig.binfmt | 3 +++ fs/binfmt_flat.c | 21 +++++++++++++++------ 3 files changed, 19 insertions(+), 6 deletions(-) -- 2.30.2 _______________________________________________ linux-riscv mailing list linux-riscv@lists.infradead.org http://lists.infradead.org/mailman/listinfo/linux-riscv ^ permalink raw reply [flat|nested] 7+ messages in thread
* [PATCH v3 1/2] binfmt_flat: allow not offsetting data start 2021-04-15 6:15 [PATCH v3 0/2] Fix binfmt_flat loader for RISC-V Damien Le Moal @ 2021-04-15 6:15 ` Damien Le Moal 2021-04-15 14:04 ` Greg Ungerer 2021-04-15 6:15 ` [PATCH v3 2/2] riscv: Disable text-data gap in flat binaries Damien Le Moal 1 sibling, 1 reply; 7+ messages in thread From: Damien Le Moal @ 2021-04-15 6:15 UTC (permalink / raw) To: uclinux-dev, ugerg, Palmer Dabbelt, linux-riscv, Alexander Viro, linux-kernel Cc: Max Filippov, Greg Ungerer, Anup Patel, Christoph Hellwig Commit 2217b9826246 ("binfmt_flat: revert "binfmt_flat: don't offset the data start"") restored offsetting the start of the data section by a number of words defined by MAX_SHARED_LIBS. As a result, since MAX_SHARED_LIBS is never 0, a gap between the text and data sections always exists. For architectures which cannot support a such gap between the text and data sections (e.g. riscv nommu), flat binary programs cannot be executed. To allow an architecture to request contiguous text and data sections, introduce the config option CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP. Using this new option, the macro DATA_GAP_WORDS is conditionally defined in binfmt_flat.c to MAX_SHARED_LIBS for architectures tolerating the text-to-data gap (CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP disabled case) and to 0 when CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP is enabled. DATA_GAP_WORDS is used in load_flat_file() to calculate the data section length and start position. An architecture enabling CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP also prevents the use of the separate text/data load case (when the flat file header flags FLAT_FLAG_RAM and FLAT_FLAG_GZIP are not set with NOMMU kernels) and forces the use of a single RAM region for loading (equivalent to FLAT_FLAG_RAM being set). Signed-off-by: Damien Le Moal <damien.lemoal@wdc.com> Acked-by: Palmer Dabbelt <palmerdabbelt@google.com> --- fs/Kconfig.binfmt | 3 +++ fs/binfmt_flat.c | 21 +++++++++++++++------ 2 files changed, 18 insertions(+), 6 deletions(-) diff --git a/fs/Kconfig.binfmt b/fs/Kconfig.binfmt index c6f1c8c1934e..c6df931d5d45 100644 --- a/fs/Kconfig.binfmt +++ b/fs/Kconfig.binfmt @@ -112,6 +112,9 @@ config BINFMT_FLAT_ARGVP_ENVP_ON_STACK config BINFMT_FLAT_OLD_ALWAYS_RAM bool +config BINFMT_FLAT_NO_TEXT_DATA_GAP + bool + config BINFMT_FLAT_OLD bool "Enable support for very old legacy flat binaries" depends on BINFMT_FLAT diff --git a/fs/binfmt_flat.c b/fs/binfmt_flat.c index b9c658e0548e..2be29bb964b8 100644 --- a/fs/binfmt_flat.c +++ b/fs/binfmt_flat.c @@ -74,6 +74,12 @@ #define MAX_SHARED_LIBS (1) #endif +#ifdef CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP +#define DATA_GAP_WORDS (0) +#else +#define DATA_GAP_WORDS (MAX_SHARED_LIBS) +#endif + struct lib_info { struct { unsigned long start_code; /* Start of text segment */ @@ -559,7 +565,10 @@ static int load_flat_file(struct linux_binprm *bprm, * case, and then the fully copied to RAM case which lumps * it all together. */ - if (!IS_ENABLED(CONFIG_MMU) && !(flags & (FLAT_FLAG_RAM|FLAT_FLAG_GZIP))) { + if (!IS_ENABLED(CONFIG_MMU) && + !IS_ENABLED(CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP) && + !(flags & (FLAT_FLAG_RAM|FLAT_FLAG_GZIP))) { + /* * this should give us a ROM ptr, but if it doesn't we don't * really care @@ -576,7 +585,7 @@ static int load_flat_file(struct linux_binprm *bprm, goto err; } - len = data_len + extra + MAX_SHARED_LIBS * sizeof(unsigned long); + len = data_len + extra + DATA_GAP_WORDS * sizeof(unsigned long); len = PAGE_ALIGN(len); realdatastart = vm_mmap(NULL, 0, len, PROT_READ|PROT_WRITE|PROT_EXEC, MAP_PRIVATE, 0); @@ -591,7 +600,7 @@ static int load_flat_file(struct linux_binprm *bprm, goto err; } datapos = ALIGN(realdatastart + - MAX_SHARED_LIBS * sizeof(unsigned long), + DATA_GAP_WORDS * sizeof(unsigned long), FLAT_DATA_ALIGN); pr_debug("Allocated data+bss+stack (%u bytes): %lx\n", @@ -622,7 +631,7 @@ static int load_flat_file(struct linux_binprm *bprm, memp_size = len; } else { - len = text_len + data_len + extra + MAX_SHARED_LIBS * sizeof(u32); + len = text_len + data_len + extra + DATA_GAP_WORDS * sizeof(u32); len = PAGE_ALIGN(len); textpos = vm_mmap(NULL, 0, len, PROT_READ | PROT_EXEC | PROT_WRITE, MAP_PRIVATE, 0); @@ -638,7 +647,7 @@ static int load_flat_file(struct linux_binprm *bprm, realdatastart = textpos + ntohl(hdr->data_start); datapos = ALIGN(realdatastart + - MAX_SHARED_LIBS * sizeof(u32), + DATA_GAP_WORDS * sizeof(u32), FLAT_DATA_ALIGN); reloc = (__be32 __user *) @@ -714,7 +723,7 @@ static int load_flat_file(struct linux_binprm *bprm, ret = result; pr_err("Unable to read code+data+bss, errno %d\n", ret); vm_munmap(textpos, text_len + data_len + extra + - MAX_SHARED_LIBS * sizeof(u32)); + DATA_GAP_WORDS * sizeof(u32)); goto err; } } -- 2.30.2 _______________________________________________ linux-riscv mailing list linux-riscv@lists.infradead.org http://lists.infradead.org/mailman/listinfo/linux-riscv ^ permalink raw reply related [flat|nested] 7+ messages in thread
* Re: [PATCH v3 1/2] binfmt_flat: allow not offsetting data start 2021-04-15 6:15 ` [PATCH v3 1/2] binfmt_flat: allow not offsetting data start Damien Le Moal @ 2021-04-15 14:04 ` Greg Ungerer 2021-04-15 23:22 ` Damien Le Moal 0 siblings, 1 reply; 7+ messages in thread From: Greg Ungerer @ 2021-04-15 14:04 UTC (permalink / raw) To: Damien Le Moal, uclinux-dev, ugerg, Palmer Dabbelt, linux-riscv, Alexander Viro, linux-kernel Cc: Max Filippov, Anup Patel, Christoph Hellwig Hi Damien, On 15/4/21 4:15 pm, Damien Le Moal wrote: > Commit 2217b9826246 ("binfmt_flat: revert "binfmt_flat: don't offset > the data start"") restored offsetting the start of the data section by > a number of words defined by MAX_SHARED_LIBS. As a result, since > MAX_SHARED_LIBS is never 0, a gap between the text and data sections > always exists. For architectures which cannot support a such gap > between the text and data sections (e.g. riscv nommu), flat binary > programs cannot be executed. > > To allow an architecture to request contiguous text and data sections, > introduce the config option CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP. > Using this new option, the macro DATA_GAP_WORDS is conditionally > defined in binfmt_flat.c to MAX_SHARED_LIBS for architectures > tolerating the text-to-data gap (CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP > disabled case) and to 0 when CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP is > enabled. DATA_GAP_WORDS is used in load_flat_file() to calculate the > data section length and start position. > > An architecture enabling CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP also > prevents the use of the separate text/data load case (when the flat file > header flags FLAT_FLAG_RAM and FLAT_FLAG_GZIP are not set with NOMMU > kernels) and forces the use of a single RAM region for loading > (equivalent to FLAT_FLAG_RAM being set). So is it the case that a flat format file on RISC-V will never have relocations? > Signed-off-by: Damien Le Moal <damien.lemoal@wdc.com> > Acked-by: Palmer Dabbelt <palmerdabbelt@google.com> > --- > fs/Kconfig.binfmt | 3 +++ > fs/binfmt_flat.c | 21 +++++++++++++++------ > 2 files changed, 18 insertions(+), 6 deletions(-) > > diff --git a/fs/Kconfig.binfmt b/fs/Kconfig.binfmt > index c6f1c8c1934e..c6df931d5d45 100644 > --- a/fs/Kconfig.binfmt > +++ b/fs/Kconfig.binfmt > @@ -112,6 +112,9 @@ config BINFMT_FLAT_ARGVP_ENVP_ON_STACK > config BINFMT_FLAT_OLD_ALWAYS_RAM > bool > > +config BINFMT_FLAT_NO_TEXT_DATA_GAP > + bool > + > config BINFMT_FLAT_OLD > bool "Enable support for very old legacy flat binaries" > depends on BINFMT_FLAT > diff --git a/fs/binfmt_flat.c b/fs/binfmt_flat.c > index b9c658e0548e..2be29bb964b8 100644 > --- a/fs/binfmt_flat.c > +++ b/fs/binfmt_flat.c > @@ -74,6 +74,12 @@ > #define MAX_SHARED_LIBS (1) > #endif > > +#ifdef CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP > +#define DATA_GAP_WORDS (0) > +#else > +#define DATA_GAP_WORDS (MAX_SHARED_LIBS) > +#endif > +> struct lib_info { > struct { > unsigned long start_code; /* Start of text segment */ > @@ -559,7 +565,10 @@ static int load_flat_file(struct linux_binprm *bprm, > * case, and then the fully copied to RAM case which lumps > * it all together. > */ > - if (!IS_ENABLED(CONFIG_MMU) && !(flags & (FLAT_FLAG_RAM|FLAT_FLAG_GZIP))) { > + if (!IS_ENABLED(CONFIG_MMU) && > + !IS_ENABLED(CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP) && If RISC-V flat format files must always be loaded to RAM then why don't they set the FLAT_FLAG_RAM when compiled/generated? Regards Greg > + !(flags & (FLAT_FLAG_RAM|FLAT_FLAG_GZIP))) { > + > /* > * this should give us a ROM ptr, but if it doesn't we don't > * really care > @@ -576,7 +585,7 @@ static int load_flat_file(struct linux_binprm *bprm, > goto err; > } > > - len = data_len + extra + MAX_SHARED_LIBS * sizeof(unsigned long); > + len = data_len + extra + DATA_GAP_WORDS * sizeof(unsigned long); > len = PAGE_ALIGN(len); > realdatastart = vm_mmap(NULL, 0, len, > PROT_READ|PROT_WRITE|PROT_EXEC, MAP_PRIVATE, 0); > @@ -591,7 +600,7 @@ static int load_flat_file(struct linux_binprm *bprm, > goto err; > } > datapos = ALIGN(realdatastart + > - MAX_SHARED_LIBS * sizeof(unsigned long), > + DATA_GAP_WORDS * sizeof(unsigned long), > FLAT_DATA_ALIGN); > > pr_debug("Allocated data+bss+stack (%u bytes): %lx\n", > @@ -622,7 +631,7 @@ static int load_flat_file(struct linux_binprm *bprm, > memp_size = len; > } else { > > - len = text_len + data_len + extra + MAX_SHARED_LIBS * sizeof(u32); > + len = text_len + data_len + extra + DATA_GAP_WORDS * sizeof(u32); > len = PAGE_ALIGN(len); > textpos = vm_mmap(NULL, 0, len, > PROT_READ | PROT_EXEC | PROT_WRITE, MAP_PRIVATE, 0); > @@ -638,7 +647,7 @@ static int load_flat_file(struct linux_binprm *bprm, > > realdatastart = textpos + ntohl(hdr->data_start); > datapos = ALIGN(realdatastart + > - MAX_SHARED_LIBS * sizeof(u32), > + DATA_GAP_WORDS * sizeof(u32), > FLAT_DATA_ALIGN); > > reloc = (__be32 __user *) > @@ -714,7 +723,7 @@ static int load_flat_file(struct linux_binprm *bprm, > ret = result; > pr_err("Unable to read code+data+bss, errno %d\n", ret); > vm_munmap(textpos, text_len + data_len + extra + > - MAX_SHARED_LIBS * sizeof(u32)); > + DATA_GAP_WORDS * sizeof(u32)); > goto err; > } > } > _______________________________________________ linux-riscv mailing list linux-riscv@lists.infradead.org http://lists.infradead.org/mailman/listinfo/linux-riscv ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH v3 1/2] binfmt_flat: allow not offsetting data start 2021-04-15 14:04 ` Greg Ungerer @ 2021-04-15 23:22 ` Damien Le Moal 2021-04-16 7:24 ` Greg Ungerer 0 siblings, 1 reply; 7+ messages in thread From: Damien Le Moal @ 2021-04-15 23:22 UTC (permalink / raw) To: Greg Ungerer, uclinux-dev, ugerg, Palmer Dabbelt, linux-riscv, Alexander Viro, linux-kernel Cc: Max Filippov, Anup Patel, Christoph Hellwig On 2021/04/15 23:04, Greg Ungerer wrote: > Hi Damien, > > On 15/4/21 4:15 pm, Damien Le Moal wrote: >> Commit 2217b9826246 ("binfmt_flat: revert "binfmt_flat: don't offset >> the data start"") restored offsetting the start of the data section by >> a number of words defined by MAX_SHARED_LIBS. As a result, since >> MAX_SHARED_LIBS is never 0, a gap between the text and data sections >> always exists. For architectures which cannot support a such gap >> between the text and data sections (e.g. riscv nommu), flat binary >> programs cannot be executed. >> >> To allow an architecture to request contiguous text and data sections, >> introduce the config option CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP. >> Using this new option, the macro DATA_GAP_WORDS is conditionally >> defined in binfmt_flat.c to MAX_SHARED_LIBS for architectures >> tolerating the text-to-data gap (CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP >> disabled case) and to 0 when CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP is >> enabled. DATA_GAP_WORDS is used in load_flat_file() to calculate the >> data section length and start position. >> >> An architecture enabling CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP also >> prevents the use of the separate text/data load case (when the flat file >> header flags FLAT_FLAG_RAM and FLAT_FLAG_GZIP are not set with NOMMU >> kernels) and forces the use of a single RAM region for loading >> (equivalent to FLAT_FLAG_RAM being set). > > So is it the case that a flat format file on RISC-V will never have > relocations? No, it does have relocations. But there is no entry for the global pointer (__global_pointer$) location. This is because the loading of that value in the gp register in the C-library crt1.S is done using a PC-relative instruction. The value for it is resolved at compile time and does not get a relocation table entry. Other functions calls and symbol references do have relocation table entries, so the binary can be loaded anywhere. The missing relocation for the global pointer mandates that text and data be loaded at the same positions relative to each other that the linker file defines. Otherwise, loading of __global_pointer$ into the gp register (first thing that C libraries crt1.S do) result in a garbage value being loaded. I tried some tricks with the linker file and changing uclibc crt1.S to have the gp loading done using a symbol address instead of a PC-relative offset. I could then see a relocation table entry for that symbol. That still did not work as I was probably doing something wrong. Anyway, such solution requires changing a lot of things in C libraries loading assembler that is common between NOMMU and MMU code. Changing it would break MMU enabled programs. >> Signed-off-by: Damien Le Moal <damien.lemoal@wdc.com> >> Acked-by: Palmer Dabbelt <palmerdabbelt@google.com> >> --- >> fs/Kconfig.binfmt | 3 +++ >> fs/binfmt_flat.c | 21 +++++++++++++++------ >> 2 files changed, 18 insertions(+), 6 deletions(-) >> >> diff --git a/fs/Kconfig.binfmt b/fs/Kconfig.binfmt >> index c6f1c8c1934e..c6df931d5d45 100644 >> --- a/fs/Kconfig.binfmt >> +++ b/fs/Kconfig.binfmt >> @@ -112,6 +112,9 @@ config BINFMT_FLAT_ARGVP_ENVP_ON_STACK >> config BINFMT_FLAT_OLD_ALWAYS_RAM >> bool >> >> +config BINFMT_FLAT_NO_TEXT_DATA_GAP >> + bool >> + >> config BINFMT_FLAT_OLD >> bool "Enable support for very old legacy flat binaries" >> depends on BINFMT_FLAT >> diff --git a/fs/binfmt_flat.c b/fs/binfmt_flat.c >> index b9c658e0548e..2be29bb964b8 100644 >> --- a/fs/binfmt_flat.c >> +++ b/fs/binfmt_flat.c >> @@ -74,6 +74,12 @@ >> #define MAX_SHARED_LIBS (1) >> #endif >> >> +#ifdef CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP >> +#define DATA_GAP_WORDS (0) >> +#else >> +#define DATA_GAP_WORDS (MAX_SHARED_LIBS) >> +#endif >> +> struct lib_info { >> struct { >> unsigned long start_code; /* Start of text segment */ >> @@ -559,7 +565,10 @@ static int load_flat_file(struct linux_binprm *bprm, >> * case, and then the fully copied to RAM case which lumps >> * it all together. >> */ >> - if (!IS_ENABLED(CONFIG_MMU) && !(flags & (FLAT_FLAG_RAM|FLAT_FLAG_GZIP))) { >> + if (!IS_ENABLED(CONFIG_MMU) && >> + !IS_ENABLED(CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP) && > > If RISC-V flat format files must always be loaded to RAM then why don't > they set the FLAT_FLAG_RAM when compiled/generated? That is done. The patch I have for elf2flt sets it. Coding it like this here is I think safer (whatever the userspace toolchain did, the kernel assumes FLAT_FLAG_RAM). And it also has the nice side effect to suppress the first part of the if () in the final binary. Smaller code size :) > > Regards > Greg > > >> + !(flags & (FLAT_FLAG_RAM|FLAT_FLAG_GZIP))) { >> + >> /* >> * this should give us a ROM ptr, but if it doesn't we don't >> * really care >> @@ -576,7 +585,7 @@ static int load_flat_file(struct linux_binprm *bprm, >> goto err; >> } >> >> - len = data_len + extra + MAX_SHARED_LIBS * sizeof(unsigned long); >> + len = data_len + extra + DATA_GAP_WORDS * sizeof(unsigned long); >> len = PAGE_ALIGN(len); >> realdatastart = vm_mmap(NULL, 0, len, >> PROT_READ|PROT_WRITE|PROT_EXEC, MAP_PRIVATE, 0); >> @@ -591,7 +600,7 @@ static int load_flat_file(struct linux_binprm *bprm, >> goto err; >> } >> datapos = ALIGN(realdatastart + >> - MAX_SHARED_LIBS * sizeof(unsigned long), >> + DATA_GAP_WORDS * sizeof(unsigned long), >> FLAT_DATA_ALIGN); >> >> pr_debug("Allocated data+bss+stack (%u bytes): %lx\n", >> @@ -622,7 +631,7 @@ static int load_flat_file(struct linux_binprm *bprm, >> memp_size = len; >> } else { >> >> - len = text_len + data_len + extra + MAX_SHARED_LIBS * sizeof(u32); >> + len = text_len + data_len + extra + DATA_GAP_WORDS * sizeof(u32); >> len = PAGE_ALIGN(len); >> textpos = vm_mmap(NULL, 0, len, >> PROT_READ | PROT_EXEC | PROT_WRITE, MAP_PRIVATE, 0); >> @@ -638,7 +647,7 @@ static int load_flat_file(struct linux_binprm *bprm, >> >> realdatastart = textpos + ntohl(hdr->data_start); >> datapos = ALIGN(realdatastart + >> - MAX_SHARED_LIBS * sizeof(u32), >> + DATA_GAP_WORDS * sizeof(u32), >> FLAT_DATA_ALIGN); >> >> reloc = (__be32 __user *) >> @@ -714,7 +723,7 @@ static int load_flat_file(struct linux_binprm *bprm, >> ret = result; >> pr_err("Unable to read code+data+bss, errno %d\n", ret); >> vm_munmap(textpos, text_len + data_len + extra + >> - MAX_SHARED_LIBS * sizeof(u32)); >> + DATA_GAP_WORDS * sizeof(u32)); >> goto err; >> } >> } >> > -- Damien Le Moal Western Digital Research _______________________________________________ linux-riscv mailing list linux-riscv@lists.infradead.org http://lists.infradead.org/mailman/listinfo/linux-riscv ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH v3 1/2] binfmt_flat: allow not offsetting data start 2021-04-15 23:22 ` Damien Le Moal @ 2021-04-16 7:24 ` Greg Ungerer 2021-04-16 7:35 ` Damien Le Moal 0 siblings, 1 reply; 7+ messages in thread From: Greg Ungerer @ 2021-04-16 7:24 UTC (permalink / raw) To: Damien Le Moal, uclinux-dev, ugerg, Palmer Dabbelt, linux-riscv, Alexander Viro, linux-kernel Cc: Max Filippov, Anup Patel, Christoph Hellwig On 16/4/21 9:22 am, Damien Le Moal wrote: > On 2021/04/15 23:04, Greg Ungerer wrote: >> Hi Damien, >> >> On 15/4/21 4:15 pm, Damien Le Moal wrote: >>> Commit 2217b9826246 ("binfmt_flat: revert "binfmt_flat: don't offset >>> the data start"") restored offsetting the start of the data section by >>> a number of words defined by MAX_SHARED_LIBS. As a result, since >>> MAX_SHARED_LIBS is never 0, a gap between the text and data sections >>> always exists. For architectures which cannot support a such gap >>> between the text and data sections (e.g. riscv nommu), flat binary >>> programs cannot be executed. >>> >>> To allow an architecture to request contiguous text and data sections, >>> introduce the config option CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP. >>> Using this new option, the macro DATA_GAP_WORDS is conditionally >>> defined in binfmt_flat.c to MAX_SHARED_LIBS for architectures >>> tolerating the text-to-data gap (CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP >>> disabled case) and to 0 when CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP is >>> enabled. DATA_GAP_WORDS is used in load_flat_file() to calculate the >>> data section length and start position. >>> >>> An architecture enabling CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP also >>> prevents the use of the separate text/data load case (when the flat file >>> header flags FLAT_FLAG_RAM and FLAT_FLAG_GZIP are not set with NOMMU >>> kernels) and forces the use of a single RAM region for loading >>> (equivalent to FLAT_FLAG_RAM being set). >> >> So is it the case that a flat format file on RISC-V will never have >> relocations? > > No, it does have relocations. But there is no entry for the global pointer > (__global_pointer$) location. This is because the loading of that value in the > gp register in the C-library crt1.S is done using a PC-relative instruction. The > value for it is resolved at compile time and does not get a relocation table > entry. Other functions calls and symbol references do have relocation table > entries, so the binary can be loaded anywhere. The missing relocation for the > global pointer mandates that text and data be loaded at the same positions > relative to each other that the linker file defines. Otherwise, loading of > __global_pointer$ into the gp register (first thing that C libraries crt1.S do) > result in a garbage value being loaded. > > I tried some tricks with the linker file and changing uclibc crt1.S to have the > gp loading done using a symbol address instead of a PC-relative offset. I could > then see a relocation table entry for that symbol. That still did not work as I > was probably doing something wrong. Anyway, such solution requires changing a > lot of things in C libraries loading assembler that is common between NOMMU and > MMU code. Changing it would break MMU enabled programs. > > >>> Signed-off-by: Damien Le Moal <damien.lemoal@wdc.com> >>> Acked-by: Palmer Dabbelt <palmerdabbelt@google.com> >>> --- >>> fs/Kconfig.binfmt | 3 +++ >>> fs/binfmt_flat.c | 21 +++++++++++++++------ >>> 2 files changed, 18 insertions(+), 6 deletions(-) >>> >>> diff --git a/fs/Kconfig.binfmt b/fs/Kconfig.binfmt >>> index c6f1c8c1934e..c6df931d5d45 100644 >>> --- a/fs/Kconfig.binfmt >>> +++ b/fs/Kconfig.binfmt >>> @@ -112,6 +112,9 @@ config BINFMT_FLAT_ARGVP_ENVP_ON_STACK >>> config BINFMT_FLAT_OLD_ALWAYS_RAM >>> bool >>> >>> +config BINFMT_FLAT_NO_TEXT_DATA_GAP >>> + bool >>> + >>> config BINFMT_FLAT_OLD >>> bool "Enable support for very old legacy flat binaries" >>> depends on BINFMT_FLAT >>> diff --git a/fs/binfmt_flat.c b/fs/binfmt_flat.c >>> index b9c658e0548e..2be29bb964b8 100644 >>> --- a/fs/binfmt_flat.c >>> +++ b/fs/binfmt_flat.c >>> @@ -74,6 +74,12 @@ >>> #define MAX_SHARED_LIBS (1) >>> #endif >>> >>> +#ifdef CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP >>> +#define DATA_GAP_WORDS (0) >>> +#else >>> +#define DATA_GAP_WORDS (MAX_SHARED_LIBS) >>> +#endif >>> +> struct lib_info { >>> struct { >>> unsigned long start_code; /* Start of text segment */ >>> @@ -559,7 +565,10 @@ static int load_flat_file(struct linux_binprm *bprm, >>> * case, and then the fully copied to RAM case which lumps >>> * it all together. >>> */ >>> - if (!IS_ENABLED(CONFIG_MMU) && !(flags & (FLAT_FLAG_RAM|FLAT_FLAG_GZIP))) { >>> + if (!IS_ENABLED(CONFIG_MMU) && >>> + !IS_ENABLED(CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP) && >> >> If RISC-V flat format files must always be loaded to RAM then why don't >> they set the FLAT_FLAG_RAM when compiled/generated? > > That is done. The patch I have for elf2flt sets it. Coding it like this here is > I think safer (whatever the userspace toolchain did, the kernel assumes > FLAT_FLAG_RAM). And it also has the nice side effect to suppress the first part > of the if () in the final binary. Smaller code size :) My concern here is that CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GA being enabled doesn't just in itself mean you need to force a RAM load. It is just in the RISC-V case it currently does. And it may change in the future. The considerable RAM savings you get from supporting a separate data segment to code segment means there is motivation to create tooling and code generation to support it. I don't feel that strongly about it, but this code is obtuse enough already. No need to make it worse if we don't have too. Regards Greg >>> + !(flags & (FLAT_FLAG_RAM|FLAT_FLAG_GZIP))) { >>> + >>> /* >>> * this should give us a ROM ptr, but if it doesn't we don't >>> * really care >>> @@ -576,7 +585,7 @@ static int load_flat_file(struct linux_binprm *bprm, >>> goto err; >>> } >>> >>> - len = data_len + extra + MAX_SHARED_LIBS * sizeof(unsigned long); >>> + len = data_len + extra + DATA_GAP_WORDS * sizeof(unsigned long); >>> len = PAGE_ALIGN(len); >>> realdatastart = vm_mmap(NULL, 0, len, >>> PROT_READ|PROT_WRITE|PROT_EXEC, MAP_PRIVATE, 0); >>> @@ -591,7 +600,7 @@ static int load_flat_file(struct linux_binprm *bprm, >>> goto err; >>> } >>> datapos = ALIGN(realdatastart + >>> - MAX_SHARED_LIBS * sizeof(unsigned long), >>> + DATA_GAP_WORDS * sizeof(unsigned long), >>> FLAT_DATA_ALIGN); >>> >>> pr_debug("Allocated data+bss+stack (%u bytes): %lx\n", >>> @@ -622,7 +631,7 @@ static int load_flat_file(struct linux_binprm *bprm, >>> memp_size = len; >>> } else { >>> >>> - len = text_len + data_len + extra + MAX_SHARED_LIBS * sizeof(u32); >>> + len = text_len + data_len + extra + DATA_GAP_WORDS * sizeof(u32); >>> len = PAGE_ALIGN(len); >>> textpos = vm_mmap(NULL, 0, len, >>> PROT_READ | PROT_EXEC | PROT_WRITE, MAP_PRIVATE, 0); >>> @@ -638,7 +647,7 @@ static int load_flat_file(struct linux_binprm *bprm, >>> >>> realdatastart = textpos + ntohl(hdr->data_start); >>> datapos = ALIGN(realdatastart + >>> - MAX_SHARED_LIBS * sizeof(u32), >>> + DATA_GAP_WORDS * sizeof(u32), >>> FLAT_DATA_ALIGN); >>> >>> reloc = (__be32 __user *) >>> @@ -714,7 +723,7 @@ static int load_flat_file(struct linux_binprm *bprm, >>> ret = result; >>> pr_err("Unable to read code+data+bss, errno %d\n", ret); >>> vm_munmap(textpos, text_len + data_len + extra + >>> - MAX_SHARED_LIBS * sizeof(u32)); >>> + DATA_GAP_WORDS * sizeof(u32)); >>> goto err; >>> } >>> } >>> >> > > _______________________________________________ linux-riscv mailing list linux-riscv@lists.infradead.org http://lists.infradead.org/mailman/listinfo/linux-riscv ^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [PATCH v3 1/2] binfmt_flat: allow not offsetting data start 2021-04-16 7:24 ` Greg Ungerer @ 2021-04-16 7:35 ` Damien Le Moal 0 siblings, 0 replies; 7+ messages in thread From: Damien Le Moal @ 2021-04-16 7:35 UTC (permalink / raw) To: Greg Ungerer, uclinux-dev, ugerg, Palmer Dabbelt, linux-riscv, Alexander Viro, linux-kernel Cc: Max Filippov, Anup Patel, Christoph Hellwig On 2021/04/16 16:24, Greg Ungerer wrote: > > On 16/4/21 9:22 am, Damien Le Moal wrote: >> On 2021/04/15 23:04, Greg Ungerer wrote: >>> Hi Damien, >>> >>> On 15/4/21 4:15 pm, Damien Le Moal wrote: >>>> Commit 2217b9826246 ("binfmt_flat: revert "binfmt_flat: don't offset >>>> the data start"") restored offsetting the start of the data section by >>>> a number of words defined by MAX_SHARED_LIBS. As a result, since >>>> MAX_SHARED_LIBS is never 0, a gap between the text and data sections >>>> always exists. For architectures which cannot support a such gap >>>> between the text and data sections (e.g. riscv nommu), flat binary >>>> programs cannot be executed. >>>> >>>> To allow an architecture to request contiguous text and data sections, >>>> introduce the config option CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP. >>>> Using this new option, the macro DATA_GAP_WORDS is conditionally >>>> defined in binfmt_flat.c to MAX_SHARED_LIBS for architectures >>>> tolerating the text-to-data gap (CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP >>>> disabled case) and to 0 when CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP is >>>> enabled. DATA_GAP_WORDS is used in load_flat_file() to calculate the >>>> data section length and start position. >>>> >>>> An architecture enabling CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP also >>>> prevents the use of the separate text/data load case (when the flat file >>>> header flags FLAT_FLAG_RAM and FLAT_FLAG_GZIP are not set with NOMMU >>>> kernels) and forces the use of a single RAM region for loading >>>> (equivalent to FLAT_FLAG_RAM being set). >>> >>> So is it the case that a flat format file on RISC-V will never have >>> relocations? >> >> No, it does have relocations. But there is no entry for the global pointer >> (__global_pointer$) location. This is because the loading of that value in the >> gp register in the C-library crt1.S is done using a PC-relative instruction. The >> value for it is resolved at compile time and does not get a relocation table >> entry. Other functions calls and symbol references do have relocation table >> entries, so the binary can be loaded anywhere. The missing relocation for the >> global pointer mandates that text and data be loaded at the same positions >> relative to each other that the linker file defines. Otherwise, loading of >> __global_pointer$ into the gp register (first thing that C libraries crt1.S do) >> result in a garbage value being loaded. >> >> I tried some tricks with the linker file and changing uclibc crt1.S to have the >> gp loading done using a symbol address instead of a PC-relative offset. I could >> then see a relocation table entry for that symbol. That still did not work as I >> was probably doing something wrong. Anyway, such solution requires changing a >> lot of things in C libraries loading assembler that is common between NOMMU and >> MMU code. Changing it would break MMU enabled programs. >> >> >>>> Signed-off-by: Damien Le Moal <damien.lemoal@wdc.com> >>>> Acked-by: Palmer Dabbelt <palmerdabbelt@google.com> >>>> --- >>>> fs/Kconfig.binfmt | 3 +++ >>>> fs/binfmt_flat.c | 21 +++++++++++++++------ >>>> 2 files changed, 18 insertions(+), 6 deletions(-) >>>> >>>> diff --git a/fs/Kconfig.binfmt b/fs/Kconfig.binfmt >>>> index c6f1c8c1934e..c6df931d5d45 100644 >>>> --- a/fs/Kconfig.binfmt >>>> +++ b/fs/Kconfig.binfmt >>>> @@ -112,6 +112,9 @@ config BINFMT_FLAT_ARGVP_ENVP_ON_STACK >>>> config BINFMT_FLAT_OLD_ALWAYS_RAM >>>> bool >>>> >>>> +config BINFMT_FLAT_NO_TEXT_DATA_GAP >>>> + bool >>>> + >>>> config BINFMT_FLAT_OLD >>>> bool "Enable support for very old legacy flat binaries" >>>> depends on BINFMT_FLAT >>>> diff --git a/fs/binfmt_flat.c b/fs/binfmt_flat.c >>>> index b9c658e0548e..2be29bb964b8 100644 >>>> --- a/fs/binfmt_flat.c >>>> +++ b/fs/binfmt_flat.c >>>> @@ -74,6 +74,12 @@ >>>> #define MAX_SHARED_LIBS (1) >>>> #endif >>>> >>>> +#ifdef CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP >>>> +#define DATA_GAP_WORDS (0) >>>> +#else >>>> +#define DATA_GAP_WORDS (MAX_SHARED_LIBS) >>>> +#endif >>>> +> struct lib_info { >>>> struct { >>>> unsigned long start_code; /* Start of text segment */ >>>> @@ -559,7 +565,10 @@ static int load_flat_file(struct linux_binprm *bprm, >>>> * case, and then the fully copied to RAM case which lumps >>>> * it all together. >>>> */ >>>> - if (!IS_ENABLED(CONFIG_MMU) && !(flags & (FLAT_FLAG_RAM|FLAT_FLAG_GZIP))) { >>>> + if (!IS_ENABLED(CONFIG_MMU) && >>>> + !IS_ENABLED(CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP) && >>> >>> If RISC-V flat format files must always be loaded to RAM then why don't >>> they set the FLAT_FLAG_RAM when compiled/generated? >> >> That is done. The patch I have for elf2flt sets it. Coding it like this here is >> I think safer (whatever the userspace toolchain did, the kernel assumes >> FLAT_FLAG_RAM). And it also has the nice side effect to suppress the first part >> of the if () in the final binary. Smaller code size :) > > My concern here is that CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GA being > enabled doesn't just in itself mean you need to force a RAM load. > It is just in the RISC-V case it currently does. Good point. > > And it may change in the future. The considerable RAM savings > you get from supporting a separate data segment to code segment > means there is motivation to create tooling and code generation > to support it. Totally agree here. And I did try hard to get it to work... > > I don't feel that strongly about it, but this code is obtuse enough already. > No need to make it worse if we don't have too. I see your point. I will remove that !IS_ENABLED(CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP) from the top if() condition. Without it, fixing the toolchain etc will indeed not require patching again the kernel. Sending a v4. > > Regards > Greg > > > >>>> + !(flags & (FLAT_FLAG_RAM|FLAT_FLAG_GZIP))) { >>>> + >>>> /* >>>> * this should give us a ROM ptr, but if it doesn't we don't >>>> * really care >>>> @@ -576,7 +585,7 @@ static int load_flat_file(struct linux_binprm *bprm, >>>> goto err; >>>> } >>>> >>>> - len = data_len + extra + MAX_SHARED_LIBS * sizeof(unsigned long); >>>> + len = data_len + extra + DATA_GAP_WORDS * sizeof(unsigned long); >>>> len = PAGE_ALIGN(len); >>>> realdatastart = vm_mmap(NULL, 0, len, >>>> PROT_READ|PROT_WRITE|PROT_EXEC, MAP_PRIVATE, 0); >>>> @@ -591,7 +600,7 @@ static int load_flat_file(struct linux_binprm *bprm, >>>> goto err; >>>> } >>>> datapos = ALIGN(realdatastart + >>>> - MAX_SHARED_LIBS * sizeof(unsigned long), >>>> + DATA_GAP_WORDS * sizeof(unsigned long), >>>> FLAT_DATA_ALIGN); >>>> >>>> pr_debug("Allocated data+bss+stack (%u bytes): %lx\n", >>>> @@ -622,7 +631,7 @@ static int load_flat_file(struct linux_binprm *bprm, >>>> memp_size = len; >>>> } else { >>>> >>>> - len = text_len + data_len + extra + MAX_SHARED_LIBS * sizeof(u32); >>>> + len = text_len + data_len + extra + DATA_GAP_WORDS * sizeof(u32); >>>> len = PAGE_ALIGN(len); >>>> textpos = vm_mmap(NULL, 0, len, >>>> PROT_READ | PROT_EXEC | PROT_WRITE, MAP_PRIVATE, 0); >>>> @@ -638,7 +647,7 @@ static int load_flat_file(struct linux_binprm *bprm, >>>> >>>> realdatastart = textpos + ntohl(hdr->data_start); >>>> datapos = ALIGN(realdatastart + >>>> - MAX_SHARED_LIBS * sizeof(u32), >>>> + DATA_GAP_WORDS * sizeof(u32), >>>> FLAT_DATA_ALIGN); >>>> >>>> reloc = (__be32 __user *) >>>> @@ -714,7 +723,7 @@ static int load_flat_file(struct linux_binprm *bprm, >>>> ret = result; >>>> pr_err("Unable to read code+data+bss, errno %d\n", ret); >>>> vm_munmap(textpos, text_len + data_len + extra + >>>> - MAX_SHARED_LIBS * sizeof(u32)); >>>> + DATA_GAP_WORDS * sizeof(u32)); >>>> goto err; >>>> } >>>> } >>>> >>> >> >> > -- Damien Le Moal Western Digital Research _______________________________________________ linux-riscv mailing list linux-riscv@lists.infradead.org http://lists.infradead.org/mailman/listinfo/linux-riscv ^ permalink raw reply [flat|nested] 7+ messages in thread
* [PATCH v3 2/2] riscv: Disable text-data gap in flat binaries 2021-04-15 6:15 [PATCH v3 0/2] Fix binfmt_flat loader for RISC-V Damien Le Moal 2021-04-15 6:15 ` [PATCH v3 1/2] binfmt_flat: allow not offsetting data start Damien Le Moal @ 2021-04-15 6:15 ` Damien Le Moal 1 sibling, 0 replies; 7+ messages in thread From: Damien Le Moal @ 2021-04-15 6:15 UTC (permalink / raw) To: uclinux-dev, ugerg, Palmer Dabbelt, linux-riscv, Alexander Viro, linux-kernel Cc: Max Filippov, Greg Ungerer, Anup Patel, Christoph Hellwig uclibc/gcc combined with elf2flt riscv linker file fully resolve the PC relative __global_pointer$ value at compile time and do not generate a relocation entry to set a runtime gp value. As a result, if the flatbin loader introduces a gap between the text and data sections, the gp value becomes incorrect and prevent correct execution of a flatbin executable. Avoid this problem by enabling CONFIG_BINFMT_FLAT_NO_TEXT_DATA_GAP automatically when CONFIG_RISCV is enabled and CONFIG_MMU disabled. Signed-off-by: Damien Le Moal <damien.lemoal@wdc.com> Acked-by: Palmer Dabbelt <palmerdabbelt@google.com> --- arch/riscv/Kconfig | 1 + 1 file changed, 1 insertion(+) diff --git a/arch/riscv/Kconfig b/arch/riscv/Kconfig index 0d0cf67359cb..6a85fbbd056e 100644 --- a/arch/riscv/Kconfig +++ b/arch/riscv/Kconfig @@ -33,6 +33,7 @@ config RISCV select ARCH_WANT_DEFAULT_TOPDOWN_MMAP_LAYOUT if MMU select ARCH_WANT_FRAME_POINTERS select ARCH_WANT_HUGE_PMD_SHARE if 64BIT + select BINFMT_FLAT_NO_TEXT_DATA_GAP if !MMU select CLONE_BACKWARDS select CLINT_TIMER if !MMU select COMMON_CLK -- 2.30.2 _______________________________________________ linux-riscv mailing list linux-riscv@lists.infradead.org http://lists.infradead.org/mailman/listinfo/linux-riscv ^ permalink raw reply related [flat|nested] 7+ messages in thread
end of thread, other threads:[~2021-04-16 8:02 UTC | newest] Thread overview: 7+ messages (download: mbox.gz / follow: Atom feed) -- links below jump to the message on this page -- 2021-04-15 6:15 [PATCH v3 0/2] Fix binfmt_flat loader for RISC-V Damien Le Moal 2021-04-15 6:15 ` [PATCH v3 1/2] binfmt_flat: allow not offsetting data start Damien Le Moal 2021-04-15 14:04 ` Greg Ungerer 2021-04-15 23:22 ` Damien Le Moal 2021-04-16 7:24 ` Greg Ungerer 2021-04-16 7:35 ` Damien Le Moal 2021-04-15 6:15 ` [PATCH v3 2/2] riscv: Disable text-data gap in flat binaries Damien Le Moal
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).