public abstract class

AbstractConverter

extends Object
implements IEConverter<InsnType extends IInstruction>
java.lang.Object
   ↳ com.pnfsoftware.jeb.core.units.code.asm.decompiler.AbstractConverter<InsnType extends com.pnfsoftware.jeb.core.units.code.IInstruction>

Class Overview

A skeleton for code converters. TODO: documentation + examples.

Summary

Fields
protected IERoutineContext ctx Per-routine data - works because convert() is not re-entrant (same found in X86ConvXxx helper classes).
protected boolean doNotGenerateNops
protected IEGlobalContext gCtx
protected int methodConversionCountFailure
protected int methodConversionCountSuccess
protected Set<Integer> parameterRegistersForAllCC
protected IProcessor<InsnType extends IInstruction> proc
protected int regNormalBitsize
protected Set<Integer> spoiledRegistersForAllCC
Protected Constructors
AbstractConverter(IProcessor<InsnType> proc, int regNormalBitsize)
Public Methods
IWildcardPrototype buildFailsafePrototype(IERoutineContext ctx, IEStatement stm)
Create a simple, failsafe prototype for the given routine.
IERoutineContext convert(INativeMethodItem routine)
Convert a routine.
IERoutineContext convert(INativeMethodItem routine, boolean delayBuild)
Convert the blocks of the native routine's CFG after the other.
final List<IEStatement> convertBlockForTest(BasicBlock<InsnType> b)
Reserved for testing.
List<IEGeneric> convertParameterExpressions(IERoutineContext ctx, IWildcardPrototype prototype, INativeMethodItem targetRoutine, int addSlotCount)
Given a prototype, determine the list of IR expressions representing the input expressions (would also contain implicitly read registers).
List<IEGeneric> convertReturnExpressions(IERoutineContext ctx, IWildcardPrototype prototype)
Given a prototype, determine the list of IR expressions representing the output expressions (would also contain spoiled registers, which are implicitly written).
IEGeneric convertReturnLocation(IERoutineContext ctx, IWildcardPrototype prototype)
Given a prototype, determine the IR expression representing the return address location of a call to a method of said prototype.
IEMem createStackMemoryAccess(IEGeneric address, int bitsize)
The default implementation assumes a segment-less memory model.
int defaultPCConversion(IERoutineContext ctx)
Convert PC-assignment to EJumpFar statements.
Integer determineStackPointerDeltaAfterIRCall(IWildcardPrototype prototype, int addSlotCount)
Given a prototype, determine the stack pointer delta (in bytes) after the ECall executed and returned.
Integer determineStackPointerDeltaFromSimulation(SimulationPointInformation simuinfo)
The default implementation returns null.
IEImm evaluateUntranslatedIR(IEUntranslatedInstruction insn, IERoutineContext ectx, IEState state)
Evaluate untranslated IR instructions.
String formatStatistics()
The default implementation formats basic details about the number of converted methods.
ICStatement generateASTForUntranslatedIR(IEUntranslatedInstruction insn, IERoutineContext ectx, ICMethod cctx)
CURRENTLY RESERVED FOR INTERNAL USE.
int getAddressBitsize()
The default implementation returns the size of the program counter.
INativeDecompilerUnit<InsnType> getDecompiler()
IEBranchDetails getDefaultBranchToRoutineSideEffects(INativeMethodItem optionalRoutine)
The default implementation returns a basic branch detail object specifying nothing
IEGlobalContext getGlobalContext()
Retrieve the global program context.
IEVar getInputVariableByIndex(IERoutineContext ctx, int i)
The default implementation returns null.
long getNativeRegisterIdFromRegisterVariable(IEVar regVar, boolean shortForm)
The default implementation throws.
int getProcessorMode(IEState state)
Retrieve the current processor mode relative to this EState.
IEPrototypeHandler getPrototypeHandler(IERoutineContext ctx)
Create an instance of a prototype handler.
int getRegisterBitsize()
The default implementation returns the address bitsize.
IEVar getRegisterVariableFromNativeRegisterId(long nativeRegId)
The default implementation throws.
IEVar getReturnAddressRegister()
The default implementation considers that there is no specific Return Address Register.
int getStackSlotSize()
Get the size of standard slot on the stack, in bytes.
void initialize()
This method is called by the owner decompiler after the principal components of the lower-level code unit have been initialized.
void initializeStateRegisters(IEState state, Long optionalNativeProgramCounter)
The default implementation initializes and sets all physical registers declared by this converter to 0, except for the ones possibly used as arguments that are kept undefined (see #isPossibleParameterRegisterForProcessorCallingConventions(IEGeneric)).
int insertReturns(IERoutineContext _ctx)
This method introduces EReturn statements into the IR.
boolean isDoNotGenerateNops()
boolean isPossibleParameterRegisterForProcessorCallingConventions(long nativeRegId)
Check if a native register id corresponds to a register that can possibly be used as a routine parameter on this processor.
boolean isPossibleSpoiledRegistersForProcessorCallingConventions(long nativeRegId)
Check if a native register id corresponds to a physical register that can possibly be spoiled by a routine call on this processor.
Boolean isSegmentEMemReferencingPrimaryMemory(IEMem e)
The default implementation returns true: by default, all EMem are assumed to reference bytes in the primary VM.
boolean resolveCustomCalls(INativeCodeUnit<?> pbcu, IERoutineContext ctx)
Implementations should be aggressive and fast: calls to this method should always be enclosed in a try-catch.
void setDecompiler(INativeDecompilerUnit<InsnType> decompiler)
Reserved.
void setDoNotGenerateNops(boolean doNotGenerateNops)
Protected Methods
boolean attemptConversionByExtension(ConverterInstructionEntry<InsnType> e)
abstract void convertBlock(BasicBlock<InsnType> b, List<IEStatement> interlist)
Convert a block.
boolean insertOptionalEntryPointTrampoline(IERoutineContext ctx, List<IEStatement> IRStatementList)
For native routines that have their entry point in-the-middle (ie, not at their lowest address), insert a synthetic IEJump as the first instruction of the converted method.
boolean isPCRightValueCompatibleReturnValue(BasicBlock<IEStatement> b, IEGeneric PCRightVal, IEGeneric expectedReturnAddress)
Can be overridden by implementors when complex return expression matching is needed.
void postBlockConversion(CFG<InsnType> cfg, BasicBlock<InsnType> b, List<IEStatement> interlist, int cnt)
This method is called after convertBlock(BasicBlock, List) is called.
void postRoutineConversion(INativeMethodItem routine, IERoutineContext ctx)
Executed by convert after converting the blocks.
BasicBlock<InsnType> preBlockConversion(CFG<InsnType> cfg, BasicBlock<InsnType> b, List<IEStatement> interlist)
This method is called before convertBlock(BasicBlock, List) is called.
void preRoutineConversion(INativeMethodItem routine, IERoutineContext ctx)
Executed by convert before converting the blocks.
void setCurrentContext(IERoutineContext ctx)
setup the helper converter classes
[Expand]
Inherited Methods
From class java.lang.Object
From interface com.pnfsoftware.jeb.core.units.code.asm.decompiler.IEConverter

Fields

protected IERoutineContext ctx

Per-routine data - works because convert() is not re-entrant (same found in X86ConvXxx helper classes).
Should be set before converting a routine, and reset to null after it's done.

protected boolean doNotGenerateNops

protected IEGlobalContext gCtx

protected int methodConversionCountFailure

protected int methodConversionCountSuccess

protected Set<Integer> parameterRegistersForAllCC

protected IProcessor<InsnType extends IInstruction> proc

protected int regNormalBitsize

protected Set<Integer> spoiledRegistersForAllCC

Protected Constructors

protected AbstractConverter (IProcessor<InsnType> proc, int regNormalBitsize)

Public Methods

public IWildcardPrototype buildFailsafePrototype (IERoutineContext ctx, IEStatement stm)

Create a simple, failsafe prototype for the given routine.

Parameters
ctx routine for which a failsafe prototype is to be generated
stm optional statement calling into the routine
Returns
  • a prototype object, should never be null

public IERoutineContext convert (INativeMethodItem routine)

Convert a routine. Convenience method for convert(routine, false).

public IERoutineContext convert (INativeMethodItem routine, boolean delayBuild)

Convert the blocks of the native routine's CFG after the other. The blocks' edges are not considered.

Parameters
routine the routine to be converted; must have a data definition
delayBuild if true, the IR validation and CFG building will not take place (in that case, getCfg() would return false)
Returns
  • the converted IR routine context

public final List<IEStatement> convertBlockForTest (BasicBlock<InsnType> b)

Reserved for testing.

public List<IEGeneric> convertParameterExpressions (IERoutineContext ctx, IWildcardPrototype prototype, INativeMethodItem targetRoutine, int addSlotCount)

Given a prototype, determine the list of IR expressions representing the input expressions (would also contain implicitly read registers).

Used by ECall.

Parameters
prototype mandatory prototype
targetRoutine (optional) may be null; if provided, can be used with addSlotCount (if non 0)
addSlotCount (optional) may be non-zero for a var-arg routine; the implementation should do its best to determine what are the additional (not specified in the prototype) parameters

public List<IEGeneric> convertReturnExpressions (IERoutineContext ctx, IWildcardPrototype prototype)

Given a prototype, determine the list of IR expressions representing the output expressions (would also contain spoiled registers, which are implicitly written).

Used by ECall.

public IEGeneric convertReturnLocation (IERoutineContext ctx, IWildcardPrototype prototype)

Given a prototype, determine the IR expression representing the return address location of a call to a method of said prototype.

Used by ECall.

public IEMem createStackMemoryAccess (IEGeneric address, int bitsize)

The default implementation assumes a segment-less memory model. (The EMem segment component is set to null.)

public int defaultPCConversion (IERoutineContext ctx)

Convert PC-assignment to EJumpFar statements. This method is a fail-safe converter for PC-assignments, and should be called as late as possible in the IR conversion phases.

Returns
  • the number of PC-conversions performed

public Integer determineStackPointerDeltaAfterIRCall (IWildcardPrototype prototype, int addSlotCount)

Given a prototype, determine the stack pointer delta (in bytes) after the ECall executed and returned.
Careful: "IR-Call" and the "native-call" may not have the same structure; it depends on how the conversion of "native-call" is implemented, and therefore is converter-dependent. The SP delta returned by this method is the one after an execution over ECall.

Used by ECall.

Returns
  • null on error

public Integer determineStackPointerDeltaFromSimulation (SimulationPointInformation simuinfo)

The default implementation returns null.

Parameters
simuinfo state of IR execution; PC is assumed to be on the target routine entry-point
Returns
  • null on error, otherwise, the SP delta after the routine has executed and returned to the caller

public IEImm evaluateUntranslatedIR (IEUntranslatedInstruction insn, IERoutineContext ectx, IEState state)

Evaluate untranslated IR instructions. This method is preferred to the internal evaluator, which currently throws by default when attempting to evaluate an IEUntranslatedInstruction.

The default implementation returns null.

public String formatStatistics ()

The default implementation formats basic details about the number of converted methods.

public ICStatement generateASTForUntranslatedIR (IEUntranslatedInstruction insn, IERoutineContext ectx, ICMethod cctx)

CURRENTLY RESERVED FOR INTERNAL USE.

Generate C statement from an untranslated IR instruction. This method is preferred to the internal generator, which produces ICNativeStatement wrapping IEUntranslatedInstruction.

The default implementation returns null.

public int getAddressBitsize ()

The default implementation returns the size of the program counter.

public INativeDecompilerUnit<InsnType> getDecompiler ()

public IEBranchDetails getDefaultBranchToRoutineSideEffects (INativeMethodItem optionalRoutine)

The default implementation returns a basic branch detail object specifying nothing

Parameters
optionalRoutine the optional target routine; if provided, the implementation may use that information to provide more accurate results
Returns
  • a branch details object (never null)

public IEGlobalContext getGlobalContext ()

Retrieve the global program context. There is a single global context per decompiler, common to all ERoutineContext routine contexts.

Returns
  • the global context

public IEVar getInputVariableByIndex (IERoutineContext ctx, int i)

The default implementation returns null.

public long getNativeRegisterIdFromRegisterVariable (IEVar regVar, boolean shortForm)

The default implementation throws.

Parameters
shortForm if true the value provided does not contain the converter-specific information present in the long form (used in particular by IInstructionOperandRegisterBased#getRegisterName(long)), otherwise it is the long form
Returns
  • a native register id

public int getProcessorMode (IEState state)

Retrieve the current processor mode relative to this EState. A bit can be set in the EState to indicate the current mode (for example ARM Thumb Bit).

Parameters
state current state
Returns
  • processor mode: most often 16, 32 or 64 Bit

public IEPrototypeHandler getPrototypeHandler (IERoutineContext ctx)

Create an instance of a prototype handler.

public int getRegisterBitsize ()

The default implementation returns the address bitsize.

public IEVar getRegisterVariableFromNativeRegisterId (long nativeRegId)

The default implementation throws.

Parameters
nativeRegId a native register id, used by the IProcessor that this converter is relying on; the id may be a full id, that is, one supported by IInstructionOperandRegisterBased#getRegisterName(long), or a short-form of it (eg, the default X86 Processor module uses short-form 0 for the eAX register, that is the first GP register whose size matches the current processor mode).
Returns
  • a global EVar representing the register

public IEVar getReturnAddressRegister ()

The default implementation considers that there is no specific Return Address Register. Override if one exists.

Returns
  • can return null value if Processor does not use Return Address Register

public int getStackSlotSize ()

Get the size of standard slot on the stack, in bytes.

public void initialize ()

This method is called by the owner decompiler after the principal components of the lower-level code unit have been initialized. Initialization requiring, e.g. access to the type manager, should be performed here instead of within the constructor.

public void initializeStateRegisters (IEState state, Long optionalNativeProgramCounter)

The default implementation initializes and sets all physical registers declared by this converter to 0, except for the ones possibly used as arguments that are kept undefined (see #isPossibleParameterRegisterForProcessorCallingConventions(IEGeneric)).

Parameters
state input state to be initialized
optionalNativeProgramCounter optional current native PC

public int insertReturns (IERoutineContext _ctx)

This method introduces EReturn statements into the IR. Must be performed only after prototype discovery is completed.

Data chains: not used, may be invalidated; in the latter case, this method is responsible for recalculating them.

Parameters
_ctx the routine context
Returns
  • the number of insertions performed

public boolean isDoNotGenerateNops ()

public boolean isPossibleParameterRegisterForProcessorCallingConventions (long nativeRegId)

Check if a native register id corresponds to a register that can possibly be used as a routine parameter on this processor.

To do so, the given id is compared against all input registers for all known calling conventions for the current processor. In particular, this method does not rely on the routine prototype and calling convention being defined, and can therefore be used early on during decompilation.

Returns
  • true if the provided register id corresponds to a register that can be used as a parameter, false otherwise

public boolean isPossibleSpoiledRegistersForProcessorCallingConventions (long nativeRegId)

Check if a native register id corresponds to a physical register that can possibly be spoiled by a routine call on this processor.

To do so, the given id is compared against all spoiled registers for all known calling conventions for the current processor. In particular, this method does not rely on the routine prototype and calling convention being defined, and can therefore be used early on during decompilation.

Returns
  • true if the provided a native register id is a register that can be spoiled by a routine call, false otherwise

public Boolean isSegmentEMemReferencingPrimaryMemory (IEMem e)

The default implementation returns true: by default, all EMem are assumed to reference bytes in the primary VM.

Returns
  • 3-state result: true/false, null= cannot determine

public boolean resolveCustomCalls (INativeCodeUnit<?> pbcu, IERoutineContext ctx)

Implementations should be aggressive and fast: calls to this method should always be enclosed in a try-catch.

TODO: MOVE to the CBDU

public void setDecompiler (INativeDecompilerUnit<InsnType> decompiler)

Reserved.

public void setDoNotGenerateNops (boolean doNotGenerateNops)

Protected Methods

protected boolean attemptConversionByExtension (ConverterInstructionEntry<InsnType> e)

protected abstract void convertBlock (BasicBlock<InsnType> b, List<IEStatement> interlist)

Convert a block.

Parameters
b block to be converted
interlist output list of converted IR statements

protected boolean insertOptionalEntryPointTrampoline (IERoutineContext ctx, List<IEStatement> IRStatementList)

For native routines that have their entry point in-the-middle (ie, not at their lowest address), insert a synthetic IEJump as the first instruction of the converted method.

By using this trick, we ensure that native routines that have their entry-point in the middle of their code can be converted to IR routines with an entry-point always set to offset 0 (the beginning of the IR code). The reason is that this helps with graph manipulation passes that come later on, in the AST phases.

Parameters
ctx translation context
IRStatementList global list of converted IR-statements, should be empty when this method is called
Returns
  • true if a trampoline was inserted, false if it was not needed (and therefore, not inserted)

protected boolean isPCRightValueCompatibleReturnValue (BasicBlock<IEStatement> b, IEGeneric PCRightVal, IEGeneric expectedReturnAddress)

Can be overridden by implementors when complex return expression matching is needed.

Parameters
b BasicBlock containing the PC assign instruction
PCRightVal right Value of the PC assign instruction
expectedReturnAddress Expected IEVar as per current Calling Convention
Returns
  • true if right value is equivalent to a return instruction

protected void postBlockConversion (CFG<InsnType> cfg, BasicBlock<InsnType> b, List<IEStatement> interlist, int cnt)

This method is called after convertBlock(BasicBlock, List) is called. The default implementation does nothing.

Parameters
cfg the native CFG of the method being converted
b the native basic block that was just converted
interlist the current list of IR statements for the routine (not only for the provided block!); the last IR statements in that list are those of the routine that was just converted
cnt the number of IR statements that correspond to the conversion of the provided native block

protected void postRoutineConversion (INativeMethodItem routine, IERoutineContext ctx)

Executed by convert after converting the blocks. The default implementation does nothing.

protected BasicBlock<InsnType> preBlockConversion (CFG<InsnType> cfg, BasicBlock<InsnType> b, List<IEStatement> interlist)

This method is called before convertBlock(BasicBlock, List) is called. It can be overridden. Sub-classes do not need to call the default implementation. The default implementation returns the provided input block.

Parameters
cfg the CFG
b block about to be converted
Returns
  • the provided input block or a new block

protected void preRoutineConversion (INativeMethodItem routine, IERoutineContext ctx)

Executed by convert before converting the blocks. The default implementation does nothing.

protected void setCurrentContext (IERoutineContext ctx)

setup the helper converter classes