How the Loader Maps a DLL into the Process Address Space

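This article describes in detail how a dynamic-link library (DLL) is loaded into a process's address space, including how DLLs are compiled, the memory-mapping mechanism used during loading, the update of the import address table (IAT), and the DLL sharing mechanism.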

http://stackoverflow.com/questions/336759/how-loader-maps-dll-in-to-process-address-space

What level of detail are you looking for? On the basic level, all dynamic linkers work pretty much the same way:

1. Dynamic libraries are compiled to relocatable code (using relative jumps instead of absolute, for example).
2. The linker finds an appropriately-sized empty space in the memory map of the application, and reads the DLL's code and any static data into that space.
3. The dynamic library contains a table of offsets to the start of each exported function, and calls to the DLL's functions in the client program are patched at load-time with a new destination address, based on where the library was loaded.
4. Most dynamic linker systems have some system for setting a preferred base address for a particular library. If a library is loaded at its preferred address, then the relocation in steps 2 and 3 can be skipped.
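
The relocation step can be illustrated with a small Python sketch. It is a toy model, not a real PE parser: the 8-byte image, the list of relocation offsets, and the name `rebase_image` are all assumptions made up for this example. The idea it shows is that every absolute address stored in the image is adjusted by the difference between the actual and the preferred base, and that nothing needs patching when the two are equal.

```python
import struct


def rebase_image(image, reloc_offsets, preferred_base, actual_base):
    """Add the load delta to each 32-bit absolute address in the image.

    reloc_offsets lists the offsets (hypothetical here) at which the image
    stores an absolute address that assumed the preferred base.
    """
    delta = actual_base - preferred_base
    if delta == 0:
        return  # loaded at the preferred base: relocation can be skipped
    for off in reloc_offsets:
        (value,) = struct.unpack_from("<I", image, off)
        struct.pack_into("<I", image, off, (value + delta) & 0xFFFFFFFF)


# Made-up image: one absolute pointer at offset 4 that targets
# offset 0x10 of the image when it sits at its preferred base.
PREFERRED_BASE = 0x10000000
image = bytearray(8)
struct.pack_into("<I", image, 4, PREFERRED_BASE + 0x10)

# The preferred range is assumed to be taken, so load elsewhere.
rebase_image(image, [4], PREFERRED_BASE, 0x20000000)
patched = struct.unpack_from("<I", image, 4)[0]
print(hex(patched))  # 0x20000010
```

Code that uses only relative jumps has few such stored addresses, which is why a relocatable library usually needs only a short fix-up list.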

Okay, I'm assuming the Windows side of things here. What happens when you load a PE file is that the loader (contained in NTDLL) will do the following:

1. Locate each of the DLLs using the DLL search semantics (system and patch-level specific); well-known DLLs are more or less exempt from this.
2. Map the file into memory as a memory-mapped file (MMF), where pages are copy-on-write (CoW).
3. Traverse the import directory and, for each import, start again (recursively) at point 1.
4. Resolve relocations, which most of the time affect only a very limited number of entities, since the code itself is position-independent code (PIC).
5. (IIRC) Patch the EAT (export address table) from RVA (relative virtual address) to VA (virtual address within the current process memory space).
6. Patch the IAT (import address table) to reference the imports with their actual addresses within the process memory space.
7. For a DLL, call DllMain(); for an EXE, create a thread whose start address is the entry point of the PE file (this is also oversimplified, because the actual start address is inside kernel32.dll for Win32 processes).
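
The recursive walk over imports and the IAT patching can be sketched as a toy model in Python. Everything here is hypothetical: the `MODULES` table stands in for files on disk, the base addresses are assigned by a made-up rule, and `load_module` is not a real loader API. It only shows the shape of the recursion, the RVA-to-VA arithmetic (base plus RVA), and that a dependency is finished before the module that imports it.

```python
# Hypothetical "on-disk" modules: exports are RVAs, imports are (dll, symbol).
MODULES = {
    "app.exe": {"exports": {}, "imports": [("a.dll", "foo")]},
    "a.dll": {"exports": {"foo": 0x1000}, "imports": [("b.dll", "bar")]},
    "b.dll": {"exports": {"bar": 0x2000}, "imports": []},
}


def load_module(name, loaded, init_order):
    """Map a module, load its imports recursively, then fill its IAT."""
    if name in loaded:
        return loaded[name]  # already mapped in this process
    # Made-up address assignment: one 64 KiB slot per loaded module.
    base = 0x10000000 + 0x10000 * len(loaded)
    module = {"base": base, "iat": {}}
    loaded[name] = module  # register first so import cycles terminate
    for dll, symbol in MODULES[name]["imports"]:
        dep = load_module(dll, loaded, init_order)
        rva = MODULES[dll]["exports"][symbol]
        module["iat"][(dll, symbol)] = dep["base"] + rva  # RVA -> VA
    init_order.append(name)  # stands in for the DllMain / entry point step
    return module


loaded, init_order = {}, []
load_module("app.exe", loaded, init_order)
print(init_order)
print(hex(loaded["app.exe"]["iat"][("a.dll", "foo")]))
```

In this model `b.dll` is initialized first and `app.exe` last, and the IAT slot of `app.exe` for `foo` ends up holding the base of `a.dll` plus the exported RVA.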

Now, when you compile code, how an external function is referenced depends on the linker. Some linkers create stubs, so that, in theory, checking the function address against NULL will always report that it is not NULL. It's a quirk you have to be aware of if and when your linker is affected. Others reference the IAT entry directly, in which case the address of a not-yet-resolved function (think delay-loaded DLLs) can be NULL; the SEH handler will then invoke the delay-load helper and (attempt to) resolve the function address before resuming execution at the point where it failed.
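
The idea of resolving an import only on first use can be modelled in a few lines of Python. This is a simplified sketch and not the Windows mechanism: it checks for an empty slot on every call instead of going through a stub or an exception handler, and `LazyImportTable`, `resolver`, and `resolve_count` are hypothetical names introduced for illustration.

```python
class LazyImportTable:
    """Toy import table whose slots stay empty until first use."""

    def __init__(self, resolver):
        self._resolver = resolver  # hypothetical stand-in for the delay-load helper
        self._slots = {}           # symbol -> resolved callable
        self.resolve_count = 0

    def call(self, symbol, *args):
        target = self._slots.get(symbol)
        if target is None:
            # Slot is still empty: resolve once and patch the slot.
            target = self._resolver(symbol)
            self._slots[symbol] = target
            self.resolve_count += 1
        return target(*args)


def resolver(symbol):
    # Made-up "library" holding a single exported function.
    exports = {"add": lambda a, b: a + b}
    return exports[symbol]


table = LazyImportTable(resolver)
first = table.call("add", 2, 3)
second = table.call("add", 4, 5)
print(first, second, table.resolve_count)  # 5 9 1
```

The second call finds the slot already patched, so the resolver runs only once per symbol.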

There is a lot of red tape involved in the above process, which I have oversimplified.

The gist of what you wanted to know is that the mapping into the process happens as an MMF, though you can artificially mimic the behavior with heap space. However, if you remember the point about CoW, that is the crux of the idea of DLLs: the same copy of (most of) the pages of the DLL is shared among all processes that load a particular DLL. The pages that are not shared are the ones that were written to, for example when resolving relocations and similar things. In that case each process has its own, now modified, copy of the original page.
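
The copy-on-write behavior of a file mapping can be observed with Python's `mmap` module. This sketch uses two private views inside one process instead of two processes loading the same DLL, and the temporary file is only a stand-in for a DLL image, so treat it as an analogy under those assumptions. `mmap.ACCESS_COPY` requests a copy-on-write mapping: writes affect the writing view only and never reach the file.

```python
import mmap
import os
import tempfile

# Stand-in for a DLL on disk: one page filled with b"A".
with tempfile.NamedTemporaryFile(delete=False) as f:
    f.write(b"A" * mmap.PAGESIZE)
    path = f.name

with open(path, "r+b") as fh:
    # Two copy-on-write views of the same file.
    view_a = mmap.mmap(fh.fileno(), 0, access=mmap.ACCESS_COPY)
    view_b = mmap.mmap(fh.fileno(), 0, access=mmap.ACCESS_COPY)

view_a[0:1] = b"B"  # e.g. a relocation fix-up touching this page

first_a = view_a[0:1]  # the writer sees its private, modified copy
first_b = view_b[0:1]  # the other view still sees the original byte
with open(path, "rb") as fh:
    on_disk = fh.read(1)  # the file itself is untouched

view_a.close()
view_b.close()
os.remove(path)
print(first_a, first_b, on_disk)  # b'B' b'A' b'A'
```

Only the page that was written gets a private copy; pages that are never written can keep being backed by the same shared file contents.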

And a word of warning concerning EXE packers used on DLLs: they defeat exactly the CoW mechanism I described, in that they allocate space for the unpacked contents of the DLL on the heap of the process into which the DLL is loaded. So while the actual file contents are still mapped as an MMF and shared, the unpacked contents occupy the same amount of memory again in each process that loads the DLL instead of being shared.