Skip to content

Latest commit

 

History

History
154 lines (95 loc) · 3.36 KB

File metadata and controls

154 lines (95 loc) · 3.36 KB

EXECUTABLES

Malware detection algorithms need to extract features from executable files (PE, ELF, MachO, OAT, DEX, VDEX, and ART formats). The LIEF package (https://github.com/lief-project/LIEF) has tools to parse these files and create features that can be used in training.

PE Features

Features for PE files are based on EMBER (https://arxiv.org/abs/1804.04637) (https://github.com/endgameinc/ember).

PEGeneralFeatures

➡️ Code

➡️ Description

Extracts general features from PE files such as size, import/export counts, and other basic features.

➡️ Inputs

  • Single text column which contains full paths to PE files on the same machine running DAI

➡️ Outputs

  • Multiple numerical columns

➡️ Environment expectation

No limitations

➡️ Dependenencies

  • lief

PEHeaderFeatures

➡️ Code

➡️ Description

Features derived from the PE file header and option header.

➡️ Inputs

  • Single text column which contains full paths to PE files on the same machine running DAI

➡️ Outputs

  • 63 numerical columns

➡️ Environment expectation

No limitations

➡️ Dependenencies

  • lief

PESectionCharacteristics

➡️ Code

➡️ Description

Extracts section characteristics from PE files.

➡️ Inputs

  • Single text column which contains full paths to PE files on the same machine running DAI

➡️ Outputs

  • Multiple numerical columns

➡️ Environment expectation

No limitations

➡️ Dependenencies

  • lief

PENormalizedByteCount

➡️ Code

➡️ Description

The counts for each byte value in a PE file. These counts are then normalized.

➡️ Inputs

  • Single text column which contains full paths to PE files on the same machine running DAI

➡️ Outputs

  • 256 numerical columns

➡️ Environment expectation

No limitations

➡️ Dependenencies

  • lief

PEDataDirectoryFeatures

➡️ Code

➡️ Description

Features derived from the PE file data directory

➡️ Inputs

  • Single text column which contains full paths to PE files on the same machine running DAI

➡️ Outputs

  • 30 numerical columns

➡️ Environment expectation

No limitations

➡️ Dependenencies

  • lief

PEImportsFeatures

➡️ Code

➡️ Description

Features derived from the PE file imports

➡️ Inputs

  • Single text column which contains full paths to PE files on the same machine running DAI

➡️ Outputs

  • 1280 numerical columns

➡️ Environment expectation

No limitations

➡️ Dependenencies

  • lief

PEExportsFeatures

➡️ Code

➡️ Description

Features derived from the export data section of the PE file.

➡️ Inputs

  • Single text column which contains full paths to PE files on the same machine running DAI

➡️ Outputs

  • 128 numerical columns

➡️ Environment expectation

No limitations

➡️ Dependenencies

  • lief