pwn.college

In the previous module, you wrote assembly programs and built them into executables. But what if someone gives you a program and you want to understand what it does? This is where disassembly comes in: the process of converting the binary machine code in an executable back into human-readable assembly instructions.

Though you will learn to use vastly more powerful tooling later in your journey, we will start with one of the most common tools for disassembly: objdump. Given a binary, objdump -d will disassemble the executable sections and show you the assembly instructions:

hacker@dojo:~$ objdump -d -M intel /tmp/your-program

/tmp/your-program:     file format elf64-x86-64


Disassembly of section .text:

0000000000401000 <_start>:
  401000:	48 c7 c7 39 05 00 00 	mov    rdi,0x539
  401007:	48 c7 c7 00 00 00 00 	mov    rdi,0x0
  40100e:	48 c7 c0 3c 00 00 00 	mov    rax,0x3c
  401015:	0f 05                	syscall

There are a few things to note here. First, by default, objdump prints x86 assembly in AT&T syntax rather than the Intel syntax we have been writing, which is why we pass the -M intel option. Don't forget this option! Viewing assembly in non-Intel syntax can be confusing and harmful for your health.

Second, objdump displays the raw bytes of each instruction (e.g., the hexadecimal bytes 0f 05 encode the syscall instruction) alongside the human-readable assembly. These are the actual values that are stored in computer memory to represent the instructions. Because every byte fits into exactly two hexadecimal digits, these are written in "base 16" (hexadecimal) rather than the "base 10" (decimal) that we are used to counting with. If that does not make sense, please run through the first half or so of the Dealing with Data module and then come back here!
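
To see how the raw bytes relate to the assembly, here is a small Python sketch (Python is only used for illustration here; it is not part of the challenge). It takes the bytes of the first instruction from the listing above and decodes the last four of them as a little-endian 32-bit number, which is how this mov instruction stores its immediate operand.

```python
# Raw bytes of the first instruction, copied from the objdump listing above.
raw = bytes.fromhex("48 c7 c7 39 05 00 00")

# In this encoding, the last four bytes hold the 32-bit immediate operand,
# stored least-significant byte first (little-endian).
immediate = int.from_bytes(raw[-4:], byteorder="little")

print(hex(immediate))  # 0x539
```

The bytes 39 05 00 00 read "backwards" as 0x00000539, which is the value objdump shows next to mov rdi.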

Third, the values that are being moved into registers are also represented as hexadecimal. This can make it slightly tricky to understand what the program is doing. Above, we can see that it is setting rax to the hexadecimal value 0x3c, which is 60 in decimal and, thus, is our familiar syscall number of exit! Right before that, it sets rdi to 0, which will be the exit code of the program.
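
If you would rather not convert hexadecimal by hand, a Python prompt works as a quick calculator. This is just a convenience sketch that converts the two values from the listing above in both directions:

```python
# Values as they appear in the disassembly (hexadecimal literals).
syscall_number = 0x3c
secret = 0x539

print(syscall_number)    # 60, the exit syscall number
print(secret)            # 1337
print(int("0x539", 16))  # 1337, parsing hex text copied from a listing
print(hex(1337))         # 0x539, going the other way
```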

But interestingly, right before that, it sets rdi to 0x539, which we can't really observe from the outside because it's overwritten to 0 immediately. While this "secret" is benign, by reading the code of software, we can extract many different such secrets, some of which are security relevant!

We'll practice this secret extraction in this challenge, using a binary at /challenge/disassemble-me. Use objdump to disassemble it and find the number being loaded into rdi before it's wiped out. Then, submit that number using /challenge/submit-number. The number will be displayed in hexadecimal in the disassembly, but /challenge/submit-number accepts both hexadecimal (e.g., 0x539) and decimal (e.g., 1337) values. Good luck!


The next tool is pretty simple: the syscall tracer, strace.

Given a program to run, strace will use functionality of the Linux operating system to introspect and record every system call that the program invokes, and its result. For example, let's look at our program from the previous challenge:

hacker@dojo:~$ strace /tmp/your-program
execve("/tmp/your-program", ["/tmp/your-program"], 0x7ffd48ae28b0 /* 53 vars */) = 0
exit(42)                                 = ?
+++ exited with 42 +++
hacker@dojo:~$

As you can see, strace reports what system calls are triggered, what parameters were passed to them, and what data they returned. The syntax used here for output is system_call(parameter, parameter, parameter, ...). This syntax is borrowed from a programming language called C, but we don't have to worry about that yet. Just keep in mind how to read this specific syntax.

In this example, strace reports two system calls: the second is the exit system call that your program uses to request its own termination, and you can see the parameter you passed to it (42). The first is an execve system call. We'll learn about this system call later, but it's somewhat of a yin to exit's yang: it starts a new program (in this case, your-program). It's not actually invoked by your-program in this case: its detection by strace is a weird artifact of how strace works, which we'll investigate later.

In the final line, you can see the result of exit(42), which is that the program exits with an exit code of 42!
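
To make the output syntax concrete, here is a minimal Python sketch that splits one line of strace output into its parts. The helper parse_strace_line is hypothetical (it is not something strace provides), and the pattern only handles simple lines like the ones shown above.

```python
import re

# Hypothetical helper (not part of strace): split one line of strace output
# into the syscall name, its parameter text, and its reported result.
STRACE_LINE = re.compile(r"^(?P<name>\w+)\((?P<params>.*)\)\s+=\s+(?P<result>.+)$")

def parse_strace_line(line):
    match = STRACE_LINE.match(line.strip())
    if match is None:
        return None  # e.g. the "+++ exited with 42 +++" status line
    return match.group("name"), match.group("params"), match.group("result")

print(parse_strace_line("exit(42)                                 = ?"))
# ('exit', '42', '?')
```

The name comes before the parentheses, the parameters sit inside them, and whatever follows the = sign is the result.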

Now, the exit syscall is easy to introspect without using strace: after all, part of the point of exit is to give you an exit code that you can access. But other system calls are less visible. For example, the alarm system call (syscall number 37!) takes a number of seconds and sets a timer in the operating system; when that many seconds pass, Linux sends the program a SIGALRM signal, which terminates it by default. The point of alarm is to, e.g., kill the program when it's frozen, but in this case, we'll use alarm to practice our strace snooping!
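
If you want to see alarm in action before tracing the challenge binary, here is a small Python sketch that sets an alarm and cancels it again. The value 5 is an arbitrary example, and this assumes a Linux system where Python's signal.alarm is available.

```python
import signal

# Ask the kernel for an alarm 5 seconds from now (5 is an arbitrary example).
signal.alarm(5)

# Calling alarm with 0 cancels the pending alarm and returns the seconds
# that were left on it, so this sketch is never actually terminated.
remaining = signal.alarm(0)
print(remaining)
```

Running a script like this under strace is a low-stakes way to practice spotting alarm calls and their parameters in the trace.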

In this challenge, you must strace the /challenge/trace-me program to figure out what value it passes as a parameter to the alarm system call, then call /challenge/submit-number with the number you've retrieved as the argument. Good luck!


Next, let's move on to GDB. GDB stands for the GNU Debugger, and it is typically used to hunt down and understand bugs. More specifically, a debugger is a tool that enables the close monitoring and introspection of another process. There are many famous debuggers, and in the Linux space, gdb is by far the most common.

We'll learn gdb step by step through a series of challenges. In this one, we'll focus on simply launching it. You do that like so:

hacker@dojo:~$ gdb /path/to/binary/file

In this challenge, the binary that holds the secret is /challenge/debug-me. Once you load it in gdb, the rest will happen magically: we'll handle the analysis and give you the secret number. In later levels, you'll learn how to get that number on your own!

Again, once you have the number, exchange it for the flag with /challenge/submit-number.


In the previous level, GDB automatically quit for you. Now it's your turn!

When you're done working in GDB, you exit it with the quit command (or just q):

(gdb) quit

In this level, we'll still handle the analysis for you. All you need to do is launch GDB, let the magic happen, and then type quit to exit.


Debuggers, including gdb, observe the debugged program as it runs to expose information about its runtime behavior. In the previous level, we automatically launched the program for you. Here, we will tone down the magic somewhat: you must start the execution of the program, and we'll do the rest (e.g., recover the secret value from it).

When you launch gdb now, it will eventually bring up a command prompt that looks like this:

(gdb)

You start a program with the starti command:

(gdb) starti

starti starts the program at the very first instruction. Once the program is running, you can use other gdb commands to inspect its actual runtime state. We'll start with the code that's running, which you can disassemble using the disassemble command! For example:

(gdb) disassemble
Dump of assembler code for function main:
=> 0x0000000000401000 <+0>:     mov    rdi,0x539
   0x0000000000401007 <+7>:     mov    rdi,0x0
   0x000000000040100e <+14>:    mov    rax,0x3c
   0x0000000000401015 <+21>:    syscall
End of assembler dump.

This is the same program from the objdump challenge, now running in gdb. Like before, you can glean its secrets by reading the disassembly, though later we'll dig even deeper! For now, run starti after loading the binary in gdb, and we'll take care of the rest.


In the previous level, we ran the disassemble command for you after you started the program. Now it's your turn!

After starting the program with starti, you will need to run the disassemble command yourself:

(gdb) starti
...
(gdb) disassemble
Dump of assembler code for function main:
=> 0x0000000000401000 <+0>:     mov    rdi,0x539
   0x0000000000401007 <+7>:     mov    rdi,0x0
   0x000000000040100e <+14>:    mov    rax,0x3c
   0x0000000000401015 <+21>:    syscall
End of assembler dump.

Read the output to find the secret number, then submit it with /challenge/submit-number.


So far, you've been reading the secret from the program's disassembly. But what if the secret is hidden?

In this level, the disassembly is censored: the secret value is replaced with CENSORED. However, even though you can't read the value from the code, you can still execute the code! When the CPU executes mov rdi, CENSORED, it loads the actual secret value into the rdi register.

To execute a single instruction in GDB, use the stepi command (step one instruction, also abbreviated si):

(gdb) stepi

Once you step past the mov instruction, we'll read the rdi register for you and show the secret value. Submit it with /challenge/submit-number!


In the previous level, we automatically read the register value for you after you stepped. Now it's your turn!

The disassembly is still censored, so you'll need to:

Start the program with starti
Step one instruction with stepi (or si)
Read the register yourself with print $rdi

The print command displays the value of an expression. Register names in GDB are prefixed with $, so you can read rdi like this:

(gdb) print $rdi
$1 = 1337

Then submit the value with /challenge/submit-number.


In the previous level, you used print to read a register's value. GDB can also change a register while the program is stopped.

The set command assigns a new value to a register:

(gdb) set $rax = 42

As with print, prefix the register name with $.

In this level, you will need to:

Start the program with starti and step past its first instruction.
Set rdi to 1337 and step once more.
Print the resulting secret number, then submit it with /challenge/submit-number.

Run gdb /challenge/debug-me and change that register!


In previous levels, the secret was hidden in the program's code (a hardcoded mov instruction). This time, the secret comes from the program's runtime state: it's the argument count (argc), which lives on the stack.

The program pops this value off the stack with pop rdi, but then immediately overwrites rdi with 0 before exiting:

pop    rdi          <- reads argc from the stack into rdi
mov    rdi,0x0      <- overwrites rdi with 0!
mov    rax,0x3c
syscall             <- exit(0) --- the secret is gone!

The code is fully visible, and nothing is censored, but you can't determine the secret just by reading the disassembly because argc depends on how many arguments the program was launched with. In this level, GDB handles that for you, but in the future, we'll show you how to set the program's arguments in gdb as well!

For now, you'll need to:

Start the program.
Step one instruction to execute just pop rdi.
Print the resulting value in rdi before it gets overwritten.
Quit gdb and then submit the value with /challenge/submit-number.
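
If it helps to picture what pop rdi does, here is a toy Python model. It is only a sketch: the stack is a plain list, the pointer values are made up, and one list slot stands in for 8 bytes of real stack memory.

```python
# Toy model (not real gdb state): the stack as a list of 8-byte slots,
# with index 0 standing in for the slot that rsp points at.
stack = [2, 0x1234000, 0x1234560]  # argc, then made-up argument pointers
registers = {"rsp": 0, "rdi": 0}

def pop(reg):
    """Mimic `pop reg`: copy the top slot into reg, then move rsp up one slot."""
    registers[reg] = stack[registers["rsp"]]
    registers["rsp"] += 1  # one slot here stands for 8 bytes in the real program

pop("rdi")
print(registers["rdi"])  # 2: argc is visible right after the pop

registers["rdi"] = 0     # mov rdi, 0x0 wipes it out
print(registers["rdi"])  # 0
```

The window between the two prints is exactly where you want to stop in gdb and look at rdi.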


In the last level, you could stepi to execute pop rdi and then print $rdi to read the secret. This time, there's no pop at all --- the program just exits immediately:

mov    rdi,0x0
mov    rax,0x3c
syscall             <- exit(0) --- the secret was never read!

The secret is still argc, and it's sitting right on top of the stack, but the program never loads it into a register. You'll need to examine memory directly!

GDB's x (examine) command lets you look at the contents of memory. As you learned earlier, the stack pointer ($rsp) starts out pointing right at argc, so you can read it with:

x $rsp

Go and do that!

Start the program
Examine the top of the stack
Quit gdb and submit the value with /challenge/submit-number

NOTE: x displays values in hexadecimal by default. You can change the display format by appending / and a format letter to the command. For example, if you'd rather see decimal, use x/d $rsp. Either way, /challenge/submit-number accepts both hex (e.g., 0x2a) and decimal (e.g., 42).
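
Hexadecimal and decimal are two views of the same bytes in memory. The following Python sketch uses made-up bytes (standing in for the memory at $rsp when argc happens to be 42) and prints the same value both ways:

```python
import struct

# Made-up bytes standing in for the memory at $rsp when argc is 42.
memory_at_rsp = bytes([0x2a, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00])

# Interpret the 8 bytes as one little-endian unsigned 64-bit integer.
(argc,) = struct.unpack("<Q", memory_at_rsp)

print(hex(argc))  # 0x2a, the hexadecimal view
print(argc)       # 42, the decimal view
```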


In the last level, you used x to read argc from the top of the stack. But the stack holds more than just argc!

Right after the argument count, the stack stores pointers to each program argument. These are addresses stored in memory: $rsp+16 doesn't contain the argument text directly --- it contains the address where that text lives.

For example, if your program is run as /challenge/debug-me Hi:

     Address    │ Contents
   +────────────────────────────+
   │  rsp + 0   │ 2             │◀── argc
   +────────────────────────────+
   │  rsp + 8   │ 0x1234000     │──────┐
   +────────────────────────────+      │
   │  rsp + 16  │ 0x1234560     │────┐ │
   +────────────────────────────+    │ │
                                     │ │
                                     │ │
     Address    │ Contents           │ │
   +──────────────────────────────+  │ │
   │ 0x1234000  │ "/challenge/..."│◀─│─┘ the program name
   +──────────────────────────────+  │
   │ ...        │ ...                │
   +──────────────────────────────+  │
   │ 0x1234560  │ "Hi"            │◀─┘   the first argument
   +──────────────────────────────+

To get the actual argument data, you need two dereferences: one to get the pointer from the stack, and one to follow it to the string.

In this level, THE FLAG ITSELF is passed as the first argument! The program doesn't use it --- it just exits --- but the flag is right there in memory.

To find it, you'll need two x commands, with two different display modes:

First: You'll need the pointer to the first argument. You've done this before, but now you're doing it in gdb.

x/a $rsp+16

/a tells x to display the value as a memory address. You'll see a very large hexadecimal number, something like 0x7ffc001c4750.

Second: Read the text of the first argument at that address:

x/s 0x7ffc001c4750

/s tells x to display the value as a string. Replace the address with whatever you got from step 1. This will show you the flag!

Go and do that!

Start the program
x/a $rsp+16 to get the address of the first argument
x/s <address> to read the flag string
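
The two-step lookup can be sketched in Python with a toy memory image. Everything here is made up for illustration (the addresses, the helper names read_u64 and read_cstring, and the "Hi" argument from the diagram); it is a model of the idea, not of how gdb is implemented.

```python
import struct

# Toy memory image; every address and value here is made up for illustration.
BASE = 0x1000
memory = bytearray(0x100)

def write(addr, data):
    memory[addr - BASE:addr - BASE + len(data)] = data

def read_u64(addr):
    """Like x/a: read one 8-byte little-endian value stored at addr."""
    return struct.unpack_from("<Q", memory, addr - BASE)[0]

def read_cstring(addr):
    """Like x/s: read bytes starting at addr up to the terminating zero byte."""
    start = addr - BASE
    end = memory.index(0, start)
    return memory[start:end].decode()

rsp = 0x1000
write(0x1040, b"/challenge/debug-me\x00")
write(0x1060, b"Hi\x00")
write(rsp, struct.pack("<QQQ", 2, 0x1040, 0x1060))  # argc, argv[0], argv[1]

pointer = read_u64(rsp + 16)   # first dereference: fetch the pointer
print(hex(pointer))            # 0x1060
print(read_cstring(pointer))   # second dereference: follow it to the text, Hi
```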


So far, the debugging you've done has been preemptive: you (the debugger) started the program with starti, which immediately forces it to stop and lets you debug it, without the program necessarily being aware of it. In this challenge, we'll learn another model, where the program itself decides when the debugger stops it. We'll call this cooperative debugging.

On our now-familiar x86 architecture, the program can signal a desire to be debugged by using the int3 instruction. If a debugger is attached when int3 is executed, it stops the program. This is called a program breakpoint.

Later, we'll learn how to set breakpoints from the debugger itself, going back to the preemptive model. But in this challenge, the checker will run your program under gdb and expect your program to trigger its own breakpoint. To do this, rather than using starti to start your program and immediately stop it, we'll use gdb's run command, which will simply run it until a breakpoint is hit!

When your program executes int3, gdb will break and the checker script will inspect $rdi. If $rdi is 1337 at that point, you get the flag!

Go and write a program that:

Moves 1337 into rdi
Executes int3 to cooperatively hand control to the debugger

Assemble and link your code into an ELF executable, then submit that executable:

hacker@dojo:~$ /challenge/check /tmp/your-program

NOTE: When an int3 is executed by a program not running under a debugger, you will see:

hacker@dojo:~$ /tmp/your-program
Trace/breakpoint trap
hacker@dojo:~$

And the program will terminate... If you want the program to run outside a debugger, take out that int3!
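
The "Trace/breakpoint trap" message is the shell reporting that the program was killed by the SIGTRAP signal. As a rough illustration, the Python sketch below uses a stand-in child process that sends SIGTRAP to itself with os.kill instead of executing int3; this assumes Linux, and it only mimics the outcome, not the instruction.

```python
import signal
import subprocess
import sys

# Stand-in for a program that hits int3 with no debugger attached: a child
# process sends itself SIGTRAP, the signal Linux delivers for that breakpoint.
child_code = "import os, signal; os.kill(os.getpid(), signal.SIGTRAP)"
result = subprocess.run([sys.executable, "-c", child_code])

# subprocess reports death-by-signal as a negative return code.
print(result.returncode == -signal.SIGTRAP)
```

With a debugger attached, that same signal is what gets intercepted, so the program pauses instead of dying.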


In the previous level, you used gdb's run command for the first time: run starts the program and lets it execute freely. But what if the program needs command-line arguments to work?

Outside gdb, you've been passing arguments by just typing them after the program name:

hacker@dojo:~$ /challenge/debug-me hello

Inside gdb, the analog is to pass them to run:

(gdb) run hello

Whatever you put after run becomes argv[1], argv[2], and so on of the inferior (gdb's name for the program being debugged), exactly as if you'd typed those arguments on the shell command line. GDB also accepts the short form r:

(gdb) r hello

(Anywhere you see run in gdb's docs, r works too.)

In this challenge, /challenge/debug-me only prints your flag when you give it the string pwn as argv[1]. Do it!
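
To see what "becomes argv[1]" means from the program's side, here is a small Python sketch. It launches a stand-in child (a one-line Python program, not /challenge/debug-me) with pwn as its first argument and has the child print what it received.

```python
import subprocess
import sys

# Stand-in program (not /challenge/debug-me): a one-line child that prints
# its own argv[1], launched with "pwn" as its first argument.
child_code = "import sys; print(sys.argv[1])"
result = subprocess.run(
    [sys.executable, "-c", child_code, "pwn"],
    capture_output=True,
    text=True,
)
print(result.stdout.strip())  # pwn
```

Whether the argument comes from the shell, from gdb's run, or from a launcher like this one, the program sees it in the same place.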


In the previous level, you passed command-line arguments through gdb's run. Programs can also read from stdin, and gdb lets you redirect stdin when you run the inferior. The syntax is the same redirection you've seen in the shell, but it goes after run inside gdb:

(gdb) run < /path/to/input

In this challenge, /challenge/debug-me reads its required input from /challenge/secret. Run it under gdb with /challenge/secret redirected into stdin, read the secret number it prints, and submit that number with /challenge/submit-number.
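
Redirection simply means the program's stdin is connected to a file instead of your keyboard. The Python sketch below shows the same idea with a stand-in child program and a temporary file; the file contents and the child are both made up for illustration.

```python
import os
import subprocess
import sys
import tempfile

# Hypothetical input file standing in for /path/to/input.
with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False) as f:
    f.write("hello from a file\n")
    input_path = f.name

# Stand-in child that echoes whatever arrives on its stdin.
child_code = "import sys; print(sys.stdin.read().strip())"

# Opening the file and passing it as stdin mirrors `program < input_path`.
with open(input_path) as stdin_file:
    result = subprocess.run(
        [sys.executable, "-c", child_code],
        stdin=stdin_file,
        capture_output=True,
        text=True,
    )
os.remove(input_path)

print(result.stdout.strip())  # hello from a file
```

The child never learns the file's name; it just reads stdin, which is why the same program works with typed input and with redirected input.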


Software Introspection

Computing 101.

Disassembling Programs
Tracing Syscalls
Starting GDB
Quitting GDB
Starting Programs in GDB
Disassembling in GDB
Stepping Through Instructions
Reading Register Values
Setting Register Values
Popping Stack Values
Examining Memory
Examining Stack Pointers
Cooperative Debugging
Running with Arguments
Redirecting Input in GDB
