Architectural Overview

Plume is divided into two major parts: an extractor and a driver. Each part can then be subdivided further but this separation of concerns allows for each part of the CPG analysis lifecycle to be distributed between different parts of a program.

Driver

The driver provides the interface via which one can communicate to a chosen graph database. This interface is defined by IDriver and is what is implemented by the database drivers such as the TinkerGraph driver or TigerGraph driver.

Driver Architectural Overview

The extractor and analyser make use of the driver but one can use the driver independently to perform one's own CPG construction or analysis. Currently there are only a handful of graph databases supported by the driver but we are continually looking at supporting more graph databases.

Extractor

The extractor is in charge of constructing the code property graph from bytecode. The entrypoint for this component is the Jimple2Cpg class where one can load and project the CPG using the given driver and classpath.

Either class or JAR files can be loaded. The extractor makes use of Soot to convert the class files into Jimple from which the call graph is constructed. Soot's SootMethod is used in order to construct the method bodies from AST information. AST ordering and control targets are then used to derive a CFG from which the rest of the dependencies and subgraphs are derived from.

Extractor Architectural Overview

Due to the fact that Plume constructs the CPG from bytecode, the graph produced is not completely interchangeable with the source code (and even less after the transformation to Jimple!) but line number and dataflow information is accurately preserved.