Overhaul writing style of DMA Hijacking guide (#41)

ISSOtm · web-flow · commit fa52aaa35042 · 2022-01-08T23:43:11.000+01:00
diff --git a/list/guides/dma_hijacking.md b/list/guides/dma_hijacking.md
@@ -2,97 +2,188 @@
 
 Written by [ISSOtm](https://github.com/ISSOtm).
 
+::: tip TARGET AUDIENCE
+
+Unlike most resources here, this guide is not very useful to developers or even ROM hackers, but rather to glitch-hunters and exploit developers.
+
+:::
+
 ---
 
-## What is this ?
-It's a simple technique that allows you to run custom code in most GB/SGB/CGB games, provided you have an ACE exploit.
+## What is it?
+
+*OAM DMA hijacking* is a simple technique that allows you to run custom code in most GB/SGB/CGB games, provided you have an ACE exploit.
 
-What's the point, then? It's that code ran through DMA Hijacking will be run on every game frame (for most games, at least).
+One would be quick to point out that if you have an ACE exploit, you can already execute custom code.
+So then, what is the point?
+It's that code run through DMA Hijacking will be executed *on every game frame* (for most games, at least).
 
-## How is it done ?
-If you are familiar enough with OAM, you know about that feature called *OAM DMA* that requires a small routine to be ran in HRAM ?
+## How is it done?
 
-Well, most games copy the routine when starting up and run it on every frame. I encountered some games which don't transfer OAM unless a specific flag is set ; I believe that it is always possible to override this limitation. more on that later.
+If you are familiar enough with [OAM](https://gbdev.io/pandocs/OAM), you may know about a feature called *OAM DMA*.
 
-But if the routine is modified while the game is running - assuming you modify it fully in-between to VBlanks to prevent a crash, or you temporarily put a RET while modifying - then the game will happily run your custom routine.
+[OAM DMA](https://gbdev.io/pandocs/OAM_DMA_Transfer) is a convenient feature that allows quickly updating the on-screen ["objects"](https://gbdev.io/pandocs/Rendering#objects) (often known as "sprites"), which is especially useful since such updates typically need to occur on every frame.
+However, using OAM DMA requires a small routine to be copied to HRAM and then run from there.
 
-Here is the standard routine, given by Nintendo in the GB programming manual :
+Interestingly, most games only copy the routine when starting up, and then execute it on every subsequent frame.
+But, *if we modify that routine while the game is running*, then the game will happily run the customized routine!
+
+### Patching the code
+
+Here is the standard routine, given by Nintendo in the GB programming manual (using [RGBASM syntax](https://rgbds.gbdev.io/docs/rgbasm.5) and a symbol from [`hardware.inc`](https://github.com/gbdev/hardware.inc)):
 
 ```asm
-ld a, OAMBuffer >> 8
-ldh [$FF46], a
-ld a, $28
+    ld a, HIGH(OAMBuffer)
+    ldh [rDMA], a  ; $FF46
+    ld a, 40
 DMALoop:
-dec a
-jr nz, DMALoop
-ret
+    dec a
+    jr nz, DMALoop
+    ret
 ```
 
-It's usually placed right at `$FF80`, but this isn't true for every game.
-Now, overwriting the routine to place custom code would yield us 10 bytes to perform custom operations, at the cost of sprites.
-But we can do better.
+The simplest way to get custom code (let's call it `DMAHook`) executed would be to overwrite the first few bytes with a jump to `DMAHook`:
 
-```asm
-call DMAHook
-ldh [$FF00+c], a
-ld a, $28
+```asm{1}
+    jp DMAHook
+    db $46    ; Leftover operand byte of `ldh [rDMA], a`
+    ld a, 40  ; None of this is executed
 DMALoop:
-dec a
-jr nz, DMALoop
-ret
+    dec a
+    jr nz, DMALoop
+    ret
 ```
 
-Allows us to make the perfect compromise !
+Now, overwriting the routine like this works for our purposes, but comes with a large drawback: the routine no longer does what it was intended to do, and so the game's objects won't update (unless you copy OAM manually, but beware of [the OAM corruption bug](https://gbdev.io/pandocs/OAM_Corruption_Bug)).
+Further, it's not possible to write to `rDMA` from `DMAHook`, as the write and subsequent wait loop **must** be executed from HRAM.
+
+But, there is a solution.
+
+```asm{1-2}
+    call DMAHook
+    ldh [c], a  ; A write to `rDMA`, set up by DMAHook
+    ld a, 40
+DMALoop:
+    dec a
+    jr nz, DMALoop
+    ret
+```
+
+Provided that `DMAHook` returns with properly set registers, this allows writing to `rDMA` in the single HRAM byte left by the `call` instruction.
 Here is a pattern for DMAHook :
 
 ```asm
 DMAHook:
-[ custom code, do whatever you want, it's VBlank time ! ]
-ld c, $46
-ld a, OAMBuffer >> 8
-ret
+    ;;  Custom code, do whatever you want, it's VBlank time!
+    ; ...
+    ld c, LOW(rDMA)  ; $46
+    ld a, HIGH(OAMBuffer)
+    ret
 ```
 
-DMAHook can be anywhere (in WRAM, mostly). It will be executed in the context of the VBlank interrupt, so for most games interrupts will be disabled, etc.
-An alert reader will notice the new DMA handler modifies C (whereas the original simply zeroes A). I don't know any game whose behavior is altered by this.
+`DMAHook` can live anywhere in memory, but typically it will be in WRAM.
+It will be executed in the context of the VBlank interrupt, so for most games interrupts will be disabled, etc.
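+
+As a rough sketch of how the patch itself might be applied from the ACE payload, the four patched bytes can be written while interrupts are disabled, so that the VBlank handler never runs a half-patched routine.
+The `hOAMDMA` label is an assumption standing in for wherever the target game keeps its routine, and the code assumes that interrupts were enabled on entry and that `DMAHook` has already been copied to its final location.
+
+```asm
+InstallHook:
+    di  ; Prevent VBlank from running a half-patched routine
+    ld hl, hOAMDMA  ; Hypothetical label: start of the game's HRAM routine
+    ld a, $CD  ; Opcode of `call`
+    ld [hli], a
+    ld a, LOW(DMAHook)
+    ld [hli], a
+    ld a, HIGH(DMAHook)
+    ld [hli], a
+    ld [hl], $E2  ; Opcode of `ldh [c], a`
+    ei  ; Assumes interrupts were enabled before `InstallHook` was called
+    ret
+```
+
+The rest of the routine (`ld a, 40` and the wait loop) is left untouched.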
 
-DMA hijacking is also useful when combined with [cartswap](https://gist.github.com/ISSOtm/3008fd73ec66cb56f1caecfcc8b6fb6f) (swapping carts without shutting the console down, concept found by furrtek, developed by Cryo and me on the GCL forums), because it allows porting ACE to other games.
+## With Cartswap
 
-General procedure :
+DMA Hijacking is also useful when combined with [cartswap](https://gist.github.com/ISSOtm/3008fd73ec66cb56f1caecfcc8b6fb6f) (swapping carts without shutting the console down, concept found by furrtek, developed by Cryo and me on the GCL forums), because it allows "transporting" ACE to other games.
 
-- Acquire ACE in the donor game
-- Perform cartswap, insert the recipient game
-- Pseudo-initialize the recipient (clear enough memory to avoid crashing, while keeping our custom code in an unused region of memory we don't clear)
-- Place the modified DMA handler in HRAM
-- Transfer control back to the recipient's ROM
-- ????
-- Profit.
+General procedure:
 
-[Video demonstration, performed by Torchickens/ChickasaurusGL in BGB](http://youtu.be/BNyDmZlbsNI)
+1. Acquire ACE in the "source" game
+1. Perform cartswap, insert the "victim" game
+1. "Pseudo-initialize" the victim
+1. Place the modified DMA handler in HRAM
+1. Transfer control back to the victim's ROM
+1. ????
+1. Profit!
 
 Possible applications are checking for a button combo to trigger specific code (for example, credits warp), checking one or multiple memory addresses to detect a certain game state, etc.
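+
+For instance, a button combo check could be sketched as follows.
+Everything game-specific here is an assumption: `hJoypadHeld` stands for whatever variable the victim game stores its held buttons in (use `ld a, [...]` instead of `ldh` if it lives outside of HRAM), `COMBO` must match that game's own bit layout, and `ComboPayload` is a placeholder for the code to trigger.
+
+```asm
+DEF COMBO EQU %00001100  ; Hypothetical: Select + Start in the game's own bit layout
+
+DMAHook:
+    ldh a, [hJoypadHeld]  ; Hypothetical: the game's "held buttons" variable
+    cp COMBO  ; Only trigger when exactly this combo is held
+    call z, ComboPayload  ; Hypothetical: the code to run (e.g. a credits warp)
+    ld c, LOW(rDMA)  ; $46
+    ld a, HIGH(OAMBuffer)
+    ret
+```
+
+Since the hook runs on every frame, `ComboPayload` will run repeatedly for as long as the combo is held, unless it disarms itself.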
 
-Possible "attack vectors", ie ways of affecting the recipient game, are setting certain memory addresses (like GameShark), or even better : manipulating the stack.
+Possible "attack vectors", i.e. ways of affecting the victim game, are setting certain memory addresses (like a GameShark), or even better: manipulating the stack.
 
-Manipulating the stack with this technique can not crash if the triggering game state is specific enough. I achieved text pointer manipulation in Pokémon Red this way.
+Here is a video demonstration:
+<iframe width="560" height="315" src="https://www.youtube-nocookie.com/embed/BNyDmZlbsNI" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
 
+Manipulating the stack with this technique can be done without crashing, provided that the triggering game state is specific enough.
+I achieved text pointer manipulation in Pokémon Red this way.
+(This is not a ROM hack!)
+<iframe width="560" height="315" src="https://www.youtube-nocookie.com/embed/yXy5sYZR9mk" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
 
 ### Details
-Here are some details on how to combine DMA hijacking and cartswap to pwn any game.
 
-First thing you will need is to find some RAM to store the DMA hook code. We'll call it "HookRAM". I recommend checking how much memory is allocated to the stack.
+This new technique hinges on breaking one of any game's core assumptions: its entry point.
+You see, normally, [the console transfers control to the game at address $0100](https://gbdev.io/pandocs/The_Cartridge_Header#0100-0103---entry-point), so any code placed there is designed to initialize all of the game's systems, in particular their memory.
 
-Then :
-- Clear as much RAM as needed for the game to run properly
-- Copy the DMA hook code to HookRAM
-- Copy the hijacked DMA routine to HRAM
-- Emulate all game initialization up to right before DMA routine copy / HookRAM clearing
-- Jump back to ROM
+However, since we have control of the CPU, we can jump to any location in the game's ROM, which allows bypassing some of said initialization.
+Doing so without any precautions is very likely to go haywire, though: it is important to initialize *enough* that the game runs, but not *so much* that the game ends up overwriting the code we are trying to inject.
+This is what I call "**pseudo-initialization**".
 
+Another important part is finding some free space to store the hook code in.
+The stack area can work surprisingly well for this, as many games appear to over-allocate (e.g. 256 bytes when the typical usage doesn't go beyond 32).
 
-## Trivia
-DMA hijacking works similarly to the GameShark : it detected when the GB tried reading from the VBlank interrupt vector, and responded with instructions that applied the codes.
+None of this has a silver bullet: the game's init code must be analyzed, and its memory usage carefully scrutinized in order to dig up enough free space for your hook.
 
-And yep, it is possible to use DMA hijacking to emulate GameShark codes. I have a PoC in Pokémon Red (a BGB save state), if anyone's interested.
+## Trivia
 
-[Demo video](http://gbdev.gg8.se/forums/viewtopic.php?id=430).
+DMA hijacking works similarly to the GameShark: that device intercepts accesses to the ROM, and when it detects that the VBlank handler is being run, it "overlays" different instructions that apply the stored codes and then jump back to the actual handler.
+
+And, why yes, it is possible to use DMA hijacking to emulate GameShark codes!
+[Here is a proof-of-concept in Pokémon Red](http://gbdev.gg8.se/forums/viewtopic.php?id=430).
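+
+As a minimal sketch, assuming the usual 8-digit code layout (type, value, address low byte, address high byte), a made-up code such as `01FF16D0` would amount to writing `$FF` to `$D016` on every frame, which a hook can do directly:
+
+```asm
+DMAHook:
+    ; Equivalent of the made-up code `01FF16D0` (an assumption, not a real code)
+    ld a, $FF  ; Value digits of the code
+    ld [$D016], a  ; Address digits of the code, stored low byte first
+    ld c, LOW(rDMA)  ; $46
+    ld a, HIGH(OAMBuffer)
+    ret
+```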
+
+## Notes
+
+- I encountered some games that don't transfer OAM unless a specific flag is set; I believe that it is always possible to override this limitation, by setting the flag back in the hook.
+- The OAM DMA routine is often placed at $FF80 in commercial games.
+- The patched OAM DMA routine with our hook may be modifying registers that the game expects to be preserved.
+  This is all dependent on the target game, so no general advice can be given.
+
+  Additionally, if the hook takes too long, it may cause code expecting to run in VBlank to break.
+  This might be solved for example by manipulating the stack and injecting an additional return address; here is an example, with the patched HRAM routine first and the hook second.
+  ```asm
+      jp DMAHook
+  PostDMAHook:
+      ldh [c], a
+      ld a, 40
+  DMALoop:
+      dec a
+      jr nz, DMALoop
+      jp hl
+  ```
+  ```asm
+  DMAHook:
+      pop hl  ; Get original return address
+      ld bc, PostHandlerHook  ; Address of code that will be executed once the VBlank handler finishes
+      push bc  ; Inject return address for VBlank handler
+      ld c, LOW(rDMA)
+      ld a, HIGH(OAMBuffer)
+      jp PostDMAHook
+  ```
+  (Since the handler almost certainly performs some `pop`s before returning, you will almost certainly need more complex stack manipulation, but that's the gist of it.)
+- Some games have a slightly more clever routine in HRAM that omits the initial `ld a, HIGH(OAMBuffer)`, saving 2 bytes of HRAM; the caller passes the buffer's high byte in `a` instead.
+  ```asm
+      ldh [rDMA], a
+      ld a, 40
+  DMALoop:
+      dec a
+      jr nz, DMALoop
+      ret
+  ```
+  They can still be patched by overwriting the `ld a, 40` instead, and using e.g. the `b` register for the loop:
+  ```asm{1-2,4}
+      call DMAHook
+      ldh [c], a  ; Write to rDMA
+  DMALoop:
+      dec b
+      jr nz, DMALoop
+      ret
+  ```
+  Then `DMAHook` needs to return with `b` additionally set to 40:
+  ```asm{4-5}
+  DMAHook:
+      ;;  Custom code, do whatever you want, it's VBlank time!
+      ; ...
+      ld bc, 40 << 8 | LOW(rDMA)  ; 40 in B, $46 in C
+      ld a, HIGH(OAMBuffer)
+      ret
+  ```
+  However, if the OAM buffer address passed to the function (in `a`) is not static, `push af` and `pop af` will have to be used instead of `ld a, HIGH(OAMBuffer)`.