Fuzzing Linux Kernel

What is Fuzzing ?

Feeding random inputs untill program crashes.

fuzzing

for Fuzzing we need to answer these questions

except for #3 all others depend on the program that we are Fuzzing.

Just generating random data does not always work,

for example: if we are fuzzing an xml parser, the just to generate header <xml it will take ~2^32 guesses.

So random data does not always work

So there are 3 approaches to generate better inputs

Structured inputs (structure-aware-fuzzing)
- We build a grammar for inputs and fuzz them.
Guided generation (coverage-guided-fuzzing)
- We use an existing pool of corpus input or a random input
- We mutate (change) it
- We use it as an input and execute the program
- We check if covers new code ?
  - If yes then we add it to Corpus inputs pool
  - else we start again from random input.
Collecting corpus samples and mutating them
- We can scrape the internet and collect inputs.
- These inputs can be mutated and fed into the program.

These approaches can be combined with each other to create new inputs for fuzzing.

To inject inputs we need to understand what inputs does kernel have.

Kernel does not accept data as inputs it accepts syscalls.

Most syscalls are used as API i.e

sequence of calls in the input to the kernel

API-aware fuzzing

External inputs are also similar to API's.

So most common input structures are

There are other tools but most common are

Building kernel code as userspace program and fuzzing that
- Works for code that is separable from kernel, but some kernel code cannot be separated.
Reusing a userspace fuzzer
- Works for fuzzing blob-like inputs, but most kernel inputs are not blobs
Using syzkaller
- Good for fuzzing kernel API
Writing a fuzzer from scratch
- Only benefits when the interface is not API-based.

Don't just fuzz mainline with the default config
- fuzz with different configs
- fuzz a small number of related syscalls i.e fuzz 3 or 4 syscall related to networking
- Fuzz distro kernels
Build your fuzzer on top of syzkaller, extend syzkaller rather than writing your own fuzzer.
Reuse parts of the syzkaller for your fuzzer.