Sok State of the Art of War Offensive Techniques in Binary Analysis

简介

这篇文章提出了一个二进制分析框架，并实现了许多现有的分析技术。通过将这些技术系统化地实现，可以让其他研究人员直接利用并开发新的技术。此外，在统一框架中实现这些技术可以更直接地进行比较，并确定各自的优缺点。

自动化二进制分析

为了保持程序分析的可行性，往往需要在可重现性和语义理解两个方面需要进行权衡：

可重现性：由于分析系统做出的权衡，特定的分析所发现的漏洞可能无法重现。这可能是分析操作的覆盖范围导致的，一些分析从头执行整个应用程序，因此可以推断出触发漏洞的原因，而其他一些分析只是分析了程序的某个部分，这样做可以在特定模块中发现漏洞，但无法完整地推断出触发漏洞的原因，于是无法重现。
语义理解：一些分析缺乏对程序语义的理解。例如，动态分析能够追踪程序执行的代码，但不能理解为什么这些代码被执行或者程序输入的哪些部分导致了这样的执行。

为了得到可重现的输入或者语义理解的能力，就需要对分析技术进行权衡。例如，高可重现性往往和低覆盖率相关，因为要想生成可重现的输入必须知道如何到达任何想要分析的代码，那么它将不能分析尽可能多的代码。另一方面，如果不能通过重现来验证漏洞，那么会产生高误报率（即并不存在漏洞）。在缺乏可重现性的情况下，这些误报必须通过启发式的方法进行过滤，反过来又会引入高漏报率。同样的，为了实现语义理解，必须存储和处理大量的数据。例如，具有语义理解能力的动态分析必须保存下程序分支的条件，而具有语义理解能力的静态分析需要适时地调整数据域。但由于系统资源有限，在分析中必须做出取舍。

下面是一个例子，可以对不同分析技术的能力有个简单的认识：

对于静态分析，它可能会将全部 three 个 memcpy 调用都标记为漏洞（即使 16 行的调用其实是安全的），因为静态分析没有足够的信息来确定漏洞是否真的会发生。另外，静态分析可以得到漏洞的地点，但不能得到触发漏洞的输入。对于动态分析（例如 fuzzing），它通过制造输入来触发漏洞，所以通常有很大可能会漏掉需要精确输入的漏洞，例如第 x 行的漏洞。动态符号执行能够检测出第 10 行的错误并通过约束求解得到输入，也能判断出第 16 行没有漏洞，但是它很可能会漏掉第 30 行，因为有多个潜在的路径不会触发该漏洞。另外，在符号执行进行到循环时，可能存在路径爆炸的问题。

静态漏洞挖掘

Static analyses can be split into two paradigms: those that model programme properties as graphs and those that model the data itself.

控制流图恢复

CFG recovery is implemented as a recursive algorithm that disassembles and analyzes a basic cake, identifies its possible exits and adds them to the CFG, and and so repeats the analysis recursively until no new exits are identified.

CFG recovery has ane fundamental challenge: indirect jumps. Specifically, indirect jumps fall into several categories:

Computed. The target of a computed jump is determined by the application past carrying out a calculation specified by the code. This calculation could further rely on values in other registers or in memory. A common case of this is a leap tabular array.
Context-sensitive. An indirect jump might depend on the context of an application. The mutual example is qsort() in the standard C library.
Object-sensitive. A special case of context sensitivity is object sensitivity. In object-oriented languages, object polymorphism requires the utilise of virtual functions, frequently implemented as virtual tables of function pointers that are consulted, at runtime, to determine bound targets.

The goal of CFG recovery is to resolve the targets of every bit many of these indirect jumps as possible, in order to create a CFG. Depending on how well jump targets are resolved, the CFG recovery assay has two properties:

Soundness. A CFG recovery technique is sound if the set of all potential control flow transfers is represented in the graph generated.
Completeness. A complete CFG recovery builds a CFG in which all edges represent really possible control period transfers.

值集分析

At a high level, VSA attempts to identify a tight over-approximation of the program state at any given point in the program. This tin be used to sympathise the possible targets of indirect jumps or the possible targets of memory write operations.

动态漏洞挖掘

Dynamic techniques here are separate into ii main categories: physical and symbolic execution.

动态具体执行

The well-nigh relevant application of dynamic concrete execution to vulnerability discovery is fuzzing.

Coverage-based fuzzing. Such fuzzers endeavor to produce inputs that maximize the amount of lawmaking executed in the target application based on the insight that the more code is executed, the college the adventure of executing vulnerable code.
- Coverage-based fuzzing suffers from a lack of semantic insight into the target application.
Taint-based fuzzing. Such fuzzers analyze how an application processes input to understand what parts of the input to modify in futurity runs.
- While a taint-based fuzzer tin can empathise what parts of the input should be mutated to bulldoze execution down a given path in the program, it is still unaware of how to mutate this input.

动态符号执行

Dynamic symbolic execution executes a program in an emulated surroundings with an abstruse domain of symbolic variables. They runway the state of registers and memory throughout program execution and the constraints on those variables. Whenever a provisional co-operative is reached, execution forks and follows both paths, saving the branch condition equally a constraint on the path in which the branch was taken and the inverse of the branch condition as a constraint on the path in which the co-operative was not taken.

Classical dynamic symbolic execution. These engines analyze an application past performing path exploration until a vulnerable state is identified.
Symbolic-assisted fuzzing. Such fuzzers modify inputs identified by the fuzzing component by processing them in a dynamic symbolic execution engine. Dynamic symbolic execution uses a more in-depth understanding of the analyzed program to properly mutate inputs, providing additional test cases that trigger previously-unexplored code and permit the fuzzing component to continue making progress.
Under-constrained symbolic execution. These engines execute only parts of an application in order to increment the tractability of dynamic symbolic execution.

angr 分析引擎

设计目标

Cross-architecture support
Cantankerous-platform back up
Support for different assay paradigms
Usability

子模块：Intermediate Representation

We leveraged libVEX, the IR lifter of the Valgrind project. libVEX produces an IR, chosen VEX, that is specifically designed for program analysis. Nosotros used PyVEX to expose the VEX IR to Python.

子模块：Binary Loading

The chore of loading an application binary into the analysis system is handled by a module chosen CLE. CLE abstracts over different binary formats to handle loading a given binary and any libraries that it depends on, resolving dynamic symbols, performing relocations, and properly initializing the program land.

子模块：Program State Representation/Modification

The SimuVEX module is responsible for representing the programme land. The country, named SimState in SimuVEX terms, is implemented every bit a drove of land plugins, which are controlled by country options specified by the user or analysis when the state is created.

Registers. SimuVEX tracks the values of registers at whatsoever given indicate in the program as a state plugin of the respective program state.
Symbolic retentivity. To enable symbolic execution, SimuVEX provides a symbolic retentivity model every bit a state plugin.
Abstract memory. The abstract memory state plugin is used by static analyses to model memory. Different symbolic memory, which implements a continuous indexed memory model, the abstract memory provides a region-based memory model.
POSIX. When analyzing binaries for POSIX-compliant environments, SimuVEX tracks the system land in this land plugins.
Log. SimuVEX tracks a log of everything that is washed to the state in this plugin.
Inspection. SimuVEX provides a powerful debugging interface, allowing breakpoints to exist set on circuitous conditions, including taint, exact expression makeup, and symbolic conditions. This interface can also exist used to change the behavior of SimuVEX.
Solver. The Solver is a plugin that exposes an interface to unlike data domains, through the data model provider.
Architecture. The architecture plugin provides architecturespecific information that is useful to the analysis. The information in this plugin is sourced from the archinfo module, that is too distributed as role of angr.

子模块：Data Model

Claripy abstracts all values to an internal representation of an expression that tracks all operations in which information technology is used. These expressions are represented as "expression trees" with values being the leaf nodes and operations being non-foliage nodes.

At whatsoever point, an expression can be translated into data domains provided by Claripy'south backends. User-facing operations, such as interpreting the constructs provided by the backends into Python primitives are provided by frontends. A frontend augments a backend with additional functionality of varying complexity.

FullFrontend. This frontend exposes symbolic solving to the user, tracking constraints, using the Z3 backend to solve them, and caching the results.
CompositeFrontend. Splitting constraints into independent sets reduces the load on the solver. The CompositeFrontend provides a transparent interface to this functionality.
LightFrontend. This frontend does not support constraint tracking, and simply uses the VSA backend to interpret expressions in the VSA domain.
ReplacementFrontend. The ReplacementFrontend expands the LightFrontend to add back up for constraints on VSA values.
HybridFrontend. The HybridFrontend combines the FullFrontend and the ReplacementFrontend to provide fast approximation back up for symbolic constraint solving.

子模块：Full-Program Analysis

Project is the analyst-facing part of angr, which provides complete analyses, such equally dynamic symbolic execution and controlflow graph recovery.

Path Groups. A PathGroup is an interface to dynamic symbolic execution.
Analyses. angr provides an abstraction for any full plan assay with the Assay class.

实现:数据流图恢复

CFGAccurate. Given a specific plan, angr performs an iterative CFG recovery, starting from the entry signal of the program, with some necessary optimizations. angr leverages a combination of forced execution, backwards slicing, and symbolic execution to recover, where possible, all spring targets of each indirect jump. Moreover, it generates and stores a large quantity of data nearly the target application, which can be used later in other analyses such as data-dependence tracking.
CFGFast. A secondary algorithm that uses a quick disassembly of the binary (without executing any basic block), followed by heuristics to identify functions, intra-office control catamenia, and straight inter-function control flow transitions.

假设

angr's CFGAccurate makes several assumptions about binaries to optimize the run fourth dimension of the algorithm.

All lawmaking in the program can be distributed into different functions.
All functions are either called by an explicit phone call instruction, or are preceded by a tail jump in the control period.
The stack cleanup beliefs of each office is predictable, regardless of where information technology is called from. This lets CFGAccurate safely skip functions that it has already analyzed while analyzing a caller office and keep the stack balanced.

迭代生成 CFG

Throughout CFG recovery, CFGAccurate maintains a list of indirect jumps, L_j, whose jump targets have not been resolved. When the analysis identifies such a jump, it is added to Fifty_j. After each iterative technique terminates, CFGAccurate triggers the next one in the list. This next technique may resolve jumps in L_j, may add new unresolved jumps to L_j, and may add basic blocks and edges to the CFG C. CFGAccurate terminates when a run of all techniques results in no modify to L_j or C, every bit that means that no further indirect jumps tin can be resolved with any available analysis.

Forced Execution. angr's CFGAccurate leverages the concept of Dynamic Forced Execution for the first stage of CFG recovery. Forced Execution ensures that both directions of a conditional co-operative volition be executed at every branch indicate. CFGAccurate maintains a work-list of basic blocks, B_w, and a listing of analyzed blocks, B_a. When the analysis starts, it initializes its piece of work-list with all the basic blocks that are in C merely non in B_a. Whenever CFGAccurate analyzes a bones block from this work-list, the basic cake and any direct jumps from the cake are added to C. Indirect jumps, notwithstanding, cannot be handled this mode. So each indirect leap is stored in the listing L_j for later assay.
Symbolic Execution. For each jump J ∈ L_j, CFGAccurate traverses the CFG backwards until it find the beginning merge indicate or upwards to a threshold number of blocks. From at that place, it performs frontward symbolic execution to the indirect jump and uses a constraint solver to retrieve possible values for the target of the indirect jump. If the bound is resolved successfully, J is removed from L_j and edges and nodes are added to the CFG for each possible value of the jump target.
Astern Slicing. CFGAccurate computes a backward slice starting from the unresolved bound. The piece is extended through the commencement of the previous phone call context. That is, if the indirect jump beingness analyzed is in a function F_a that is called from both F_b and F_c, the piece will extend astern from the jump in F_a and contain 2 start nodes: the basic block at the start of F_b and the ane at the start of F_c. CFGAccurate so executes this slice using angr's symbolic execution engine and uses the constraint engine to identify possible targets of the symbolic jumps, with the same threshold of 256 for the size of the solution set for the leap target. If the jump target is resolved successfully, the jump is removed from L_j and the edge representing the control period transition, and the target basic blocks are added to the recovered CFG.

The goal of the fast CFG generation algorithm is to generate a graph, with loftier code coverage, that identifies at least the location and content of functions in the binary.

Function identification. We use difficult-coded function prologue signatures, which can exist generated from techniques like ByteWeight, to place functions inside the application.
Recursive disassembly. Recursive disassembly is used to recover the directly jumps within the identified functions.
Indirect jump resolution. Lightweight alias analysis, dataflow tracking, combined with pre-defined strategies are used to resolve intra-function control flow transfers.

实现：值集分析

Value-Set up Analysis (VSA) is a static analysis technique that combines numeric analysis and arrow analysis for binary programs. It uses an abstract domain, chosen the Value-Fix Abstruse domain, for approximating possible values that registers or abstruse locations may hold at each programme betoken.

Creating a discrete fix of strided-intervals. The basic data type of VSA, the strided interval, is essentially an approximation of a set of numbers. It is neat for approximating a gear up of normal concrete values. We developed a new data type called "strided interval fix", which represents a set of strided intervals that are not unioned together. A strided interval set volition be unioned into a single strided interval only when information technology contains more than than K elements, where Thou is a threshold that can be adjusted.
Applying an algebraic solver to path predicates. Tracking branch conditions helps us constrain variables in a state later on taking a provisional go out or during a merging procedure, which produces a more than precise analysis upshot. We implemented a lightweight algebraic solver that works on the strided interval domain, based on modulo arithmetic which take care of some of the affine relations. When a new path predicate is seen, we attempt to simplify and solve it to obtain a number range for the variables involved in the path predicate. And so we perform an intersection between the newly generated number range and the original values for each corresponding variable.
Adopting a signedness-agnostic domain. Wrapped Interval Analysis is such an interval domain for analyzing LLVM code, which takes intendance of signed and unsigned numbers at the same fourth dimension. We based our signedness-agnostic strided-interval domain on this theory, applied to the VSA domain.

The main interface that angr provides into a full-program VSA analysis is the Value Flow Graph. The VFG is an enhanced CFG that includes the program country representing the VSA ready-point at each plan location.

实现：动态符号执行

The dynamic symbolic execution module of our analysis platform is mainly based on the techniques described in Mayhem. Our implementation follows the same memory model and path prioritization techniques.

We use Claripy's interface into Z3 to populate the symbolic memory model (specifically, SimSymbolicMemory) provided by SimuVEX. Individual execution paths through a programme are managed by Path objects, provided by angr, which track the actions taken by paths, the path predicates, and diverse other path-specific information. Groups of these paths are managed by angr's PathGroup functionality, which provides an interface for managing the splitting, merging, and filtering of paths during dynamic symbolic execution.

angr has built-in back up for Veritesting, implementing it as a Veritesting analysis and exposing transparent back up for information technology with an option passed to PathGroup objects.

实现：非约束的符号执行

We implemented under-constrained symbolic execution (UCSE), as proposed in UC-KLEE, and dubbed it UC-angr. UCSE is a dynamic symbolic execution technique where execution is performed on each part separately.

We made two changes to the technique described in UCSE:

Global memory under-constraining.Nosotros mark all global information equally underconstrained, assuasive u.s. to lower our false positive rate.
Path limiters. We abort the assay of a function when we find that it is responsible for a path explosion. We discover this by difficult-coding a limit and, when a unmarried function branches over this many paths, we replace the part with an firsthand return, and rewind the analysis from the telephone call site of that role.
Fake positive filtering. When we detect an exploitable state, we attempt to ensure that the state is non incorrectly made exploitable by a lack of constraints on nether-constrained data.

实现：符号辅助的 fuzzing

Our implementation of symbolic-assisted fuzzing, called Driller, uses the AFL fuzzer as its foundation and angr as its symbolic tracer.

实现：崩溃重现

We implemented the approach proposed by Replayer to recover missing relationships between input values and output values.

We tin define the problem of replaying a crashing input as the search for an input specification is to bring a programme from an initial state s to a crash state q. Our implementation symbolically executes the path from southward_a to q_a, using the input i_a. Information technology records all constraints that are generated while executing P. Given the constraints, the execution path, the programme P, and the new initial state s_b, we can symbolically execute P with an unconstrained symbolic input, post-obit the previously recorded execution path until the new crash state q_b is reached. At this bespeak, the input constraints on the input and output can be analyzed, and relationships between them tin can be recovered. This relationship information is used to generate the input specification is, allowing the crashing input to be replayed.

实现：利用生成

we generate exploits by performing concolic execution on crashing program inputs using angr. We drive concolic execution forrad, forcing it to follow the same path as a dynamic trace gathered by concretely executing the crashing input applied to the program. Concolic execution is stopped at the point where the program crashed, and we inspect the symbolic country to determine the cause of the crash and measure exploitability. By counting the number of symbolic bits in certain registers, nosotros can triage a crash into a number of categories such every bit frame arrow overwrite, teaching arrow overwrite, or arbitrary write, among others.

实现：利用强化

To harden exploits against modern mitigation techniques, we implemented a ROP chain compiler based on the ideas in Q.

Gadget discovery. Nosotros scan all executable code in the awarding, at every byte offset, to place ROP gadgets and allocate them according to their furnishings. To behave out the classification, our analysis leverages the activeness history provided by angr'south Path objects and symbolic relations provided by Claripy.
Gadget arrangement. The ROP chain compiler and then determines arrangements of gadgets that tin be used to perform loftier-level actions.
Payload generation. After the ROP compiler identifies the requisite set of gadget arrangements, it combines these gadgets into a chain to comport out high-level actions. This is done by writing gadget arrangements into a programme state in angr, constraining their outputs to the provided arguments, and querying the SMT solver for a solution for their inputs.

Lampungmeiua Romme1969