堆概述_学逆向论坛|免费的CTF在线练习平台|ctf攻防训练靶场|网安夺旗竞赛系统|软件破解|病毒分析|游戏外挂|视频教程|xuenixiang.com

roger 发表于 2020-9-2 09:45:41

堆概述

堆的基本操作malloc在glibc的malloc.c中：/*
　　malloc(size_t n)
　　Returns a pointer to a newly allocated chunk of at least n bytes, or null
　　if no space is available. Additionally, on failure, errno is
　　set to ENOMEM on ANSI C systems.
　　If n is zero, malloc returns a minumum-sized chunk. (The minimum
　　size is 16 bytes on most 32bit systems, and 24 or 32 bytes on 64bit
　　systems.)On most systems, size_t is an unsigned type, so calls
　　with negative arguments are interpreted as requests for huge amounts
　　of space, which will often fail. The maximum supported value of n
　　differs across systems, but is in all cases less than the maximum
　　representable value of a size_t.
　　*/
malloc 函数返回对应大小字节的内存块的指针。
[*]当 n=0 时，返回当前系统允许的堆的最小内存块。
[*]当 n 为负数时，由于在大多数系统上，size_t 是无符号数（这一点非常重要），所以程序就会申请很大的内存空间，但通常来说都会失败，因为系统没有那么多的内存可以分配。
free/*
　　free(void* p)
　　Releases the chunk of memory pointed to by p, that had been previously
　　allocated using malloc or a related routine such as realloc.
　　It has no effect if p is null. It can have arbitrary (i.e., bad!)
　　effects if p has already been freed.
　　Unless disabled (using mallopt), freeing very large spaces will
　　when possible, automatically trigger operations that give
　　back unused memory to the system, thus reducing program footprint.
　　*/
free 函数会释放由 p 所指向的内存块。这个内存块有可能是通过 malloc 函数或者realloc函数得到的
[*]当 p 为空指针时，函数不执行任何操作。
[*]当 p 已经被释放之后，再次释放会出现乱七八糟的效果，这其实就是 double free。
[*]除了被禁用 (mallopt) 的情况下，当释放很大的内存空间时，程序会将这些内存空间还给系统，以便于减小程序所使用的内存空间。
内存分配后的系统调用无论是 malloc 函数还是 free 函数，背后的系统调用主要是 (s)brk 函数以及 mmap, munmap 函数。(s)brk对于堆的操作，操作系统提供了 brk 函数，glibc 库提供了 sbrk 函数们可以通过增加 brk 的大小来向操作系统申请内存。初始时，堆的起始地址 start_brk 以及堆的当前末尾 brk 指向同一地址。根据是否开启 ASLR，两者的具体位置会有所不同
[*]不开启 ASLR 保护时，start_brk 以及 brk 会指向 data/bss 段的结尾。
[*]开启 ASLR 保护时，start_brk 以及 brk 也会指向同一位置，只是这个位置是在 data/bss 段结尾后的随机偏移处。
具体效果如下图:例子/* sbrk and brk example */
　　#include
　　#include
　　#include
　　int main()
　　{
　　void *curr_brk, *tmp_brk = NULL;
　　printf("Welcome to sbrk example:%d\n", getpid());
　　/* sbrk(0) gives current program break location */
　　tmp_brk = curr_brk = sbrk(0);
　　printf("Program Break Location1:%p\n", curr_brk);
　　getchar();
　　/* brk(addr) increments/decrements program break location */
　　brk(curr_brk+4096);
　　curr_brk = sbrk(0);
　　printf("Program break Location2:%p\n", curr_brk);
　　getchar();
　　brk(tmp_brk);
　　curr_brk = sbrk(0);
　　printf("Program Break Location3:%p\n", curr_brk);
　　getchar();
　　return 0;
　　}
在第一次调用brk前在第一次调用brk后
[*]0x080ed000 是相应堆的起始地址
[*]rw-p 表明堆具有可读可写权限，并且属于隐私数据。
[*]00000000 表明文件偏移，由于这部分内容并不是从文件中映射得到的，所以为 0。
[*]00:00 是主从 (Major/mirror) 的设备号，这部分内容也不是从文件中映射得到的，所以也都为 0。
[*]0 表示着 Inode 号。由于这部分内容并不是从文件中映射得到的，所以为 0。
mmapmalloc 会使用 mmap 来创建独立的匿名映射段。匿名映射的目的主要是可以申请以 0 填充的内存，并且这块内存仅被调用进程所使用。例子：/* Private anonymous mapping example using mmap syscall */
　　#include
　　#include
　　#include
　　#include
　　#include
　　#include
　　#include
　　void static inline errExit(const char* msg)
　　{
　　printf("%s failed. Exiting the process\n", msg);
　　exit(-1);
　　}
　　int main()
　　{
　　int ret = -1;
　　printf("Welcome to private anonymous mapping example::PID:%d\n", getpid());
　　printf("Before mmap\n");
　　getchar();
　　char* addr = NULL;
　　addr = mmap(NULL, (size_t)132*1024, PROT_READ|PROT_WRITE, MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
　　if (addr == MAP_FAILED)
　　errExit("mmap");
　　printf("After mmap\n");
　　getchar();
　　/* Unmap mapped region. */
　　ret = munmap(addr, (size_t)132*1024);
　　if(ret == -1)
　　errExit("munmap");
　　printf("After munmap\n");
　　getchar();
　　return 0;
　　}
执行mmap之前：mmap后：
好像没区别？？？？正常来说，应该会多一块munmap后：还是没区别，正常来说是刚刚的多的一块又没了多线程支持在原来的 dlmalloc 实现中，当两个线程同时要申请内存时，只有一个线程可以进入临界区申请内存，而另外一个线程则必须等待直到临界区中不再有线程。这是因为所有的线程共享一个堆。在 glibc 的 ptmalloc 实现中，比较好的一点就是支持了多线程的快速访问。在新的实现中，所有的线程共享多个堆。/* Per thread arena example. */
　　#include
　　#include
　　#include 　　#include
　　#include
　　void* threadFunc(void* arg) {
　　printf("Before malloc in thread 1\n");
　　getchar();
　　char* addr = (char*) malloc(1000);
　　printf("After malloc and before free in thread 1\n");
　　getchar();
　　free(addr);
　　printf("After free in thread 1\n");
　　getchar();
　　}
　　int main() {
　　pthread_t t1;
　　void* s;
　　int ret;
　　char* addr;
　　printf("Welcome to per thread arena example::%d\n",getpid());
　　printf("Before malloc in main thread\n");
　　getchar();
　　addr = (char*) malloc(1000);
　　printf("After malloc and before free in main thread\n");
　　getchar();
　　free(addr);
　　printf("After free in main thread\n");
　　getchar();
　　ret = pthread_create(&t1, NULL, threadFunc, NULL);
　　if(ret)
　　{
　　printf("Thread creation error\n");
　　return -1;
　　}
　　ret = pthread_join(t1, &s);
　　if(ret)
　　{
　　printf("Thread join error\n");
　　return -1;
　　}
　　return 0;
　　}
第一次申请之前第一次申请之后：、堆段被建立了，并且它就紧邻着数据段，这说明 malloc 的背后是用 brk 函数来实现的。同时，需要注意的是，我们虽然只是申请了 1000 个字节，但是我们却得到了 0x00623000-0x00602000=0x21000 个字节的堆。这说明虽然程序可能只是向操作系统申请很小的内存，但是为了方便，操作系统会把很大的内存分配给程序。这样的话，就避免了多次内核态与用户态的切换，提高了程序的效率。我们称这一块连续的内存区域为 arena。此外，我们称由主线程申请的内存为 main_arena。后续的申请的内存会一直从这个 arena 中获取，直到空间不足。当 arena 空间不足时，它可以通过增加 brk 的方式来增加堆的空间。类似地，arena 也可以通过减小 brk 来缩小自己的空间。在主线程free后：其对应的 arena 并没有进行回收，而是交由 glibc 来进行管理。当后面程序再次申请内存时，在 glibc 中管理的内存充足的情况下，glibc 就会根据堆分配的算法来给程序分配相应的内存。在第一个线程malloc后：线程 1 的堆段被建立了。而且它所在的位置为内存映射段区域，同样大小也是 132KB同时，我们可以看出实际真的分配给程序的内存为 1M。而且，只有 132KB 的部分具有可读可写权限，这一块连续的区域成为 thread arena。在第一个线程free后：这样释放内存同样不会把内存重新给系统。

页: [1]

学逆向论坛's Archiver

堆概述