Re: git: e85eaa930862 - main - Have rtld query the page size from the kernel
Date: Wed, 13 Nov 2024 20:10:53 UTC
On 13 Nov 2024, at 19:44, John Baldwin <jhb@FreeBSD.org> wrote: > > On 4/7/22 07:38, Andrew Turner wrote: >> The branch main has been updated by andrew: >> URL: https://cgit.FreeBSD.org/src/commit/?id=e85eaa930862d5b4dc917bc31e8d7254a693635d >> commit e85eaa930862d5b4dc917bc31e8d7254a693635d >> Author: Andrew Turner <andrew@FreeBSD.org> >> AuthorDate: 2022-04-04 15:05:40 +0000 >> Commit: Andrew Turner <andrew@FreeBSD.org> >> CommitDate: 2022-04-07 14:37:37 +0000 >> Have rtld query the page size from the kernel >> To allow for a dynamic page size on arm64 have the runtime linker >> query the kernel for the currentl page size. >> Reviewed by: kib >> Sponsored by: The FreeBSD Foundation >> Differential Revision: https://reviews.freebsd.org/D34765 > > This broke relro handling for rtld. The reason is that init_pagesizes() is > called after parsing the program headers for rltd in init_rtld(). As a result, > page_size is 0 when rtld_round_page() is called so the relro_size is 0. The > RTLD_INIT_EARLY_PAGESIZES case was for ia64, and in the early case it's probably > not safe to call sysctl? If it is safe to call sysctl, we could just always > init pagesizes early? It looks like there are a few things going on: 1. relocate_object calls obj_enforce_relro if !obj->mainprog, so will try to enforce RELRO for RTLD itself whilst page_size is 0 2. init_rtld later calls obj_enforce_relro for obj_rtld, after page_size has been initialised 3. init_rtld is careful to avoid using global variables until it’s called relocate_objects for RTLD itself, but by hiding accesses to page_size away in rtld_*_page that’s no longer true (definitely not true in the case of text relocations, for example, though whether it also occurs for other cases we care more about I don’t know) So I think there are a couple of things to fix: 1. Stop accessing page_size prior to relocate_objects returning for RTLD itself 2. Stop enforcing RELRO twice for RTLD (e.g. add && obj != rtldobj to relocate_object’s case) At least, that’s what I’ve inferred from reading the code. Though, to be honest, things might be rather nicer if we just made .rtld_start responsible for relocating RTLD itself prior to calling init_rtld, that’s what we have to do for CHERI, as do arm, powerpc and powerpc64, and it means you can use globals from the start in init_rtld. Jess