Train.7z
: Disassembled code metadata (assembly language) produced by the IDA Pro disassembler .
: 50,000 color images (32x32 pixels) across 10 categories.
: Researchers often use n-grams of opcodes, file size, and image-based intensity features derived from these raw files. 🖼️ Image Classification (CIFAR-10) For the CIFAR-10 competition , train.7z typically contains: train.7z
Depending on the specific competition, this file contains different types of "features" or raw inputs: 🛡️ Malware Classification (Microsoft BIG 2015)
: Hexadecimal representations of the file's binary content. : Disassembled code metadata (assembly language) produced by
: The raw RGB values of these images serve as the base features for convolutional neural networks (CNNs). 🎙️ Speech Recognition
In the TensorFlow Speech Recognition Challenge , train.7z contains: rohan-paul/Microsoft_Malware_Prediction_BIG_2015 - GitHub train.7z
In the Microsoft Malware Classification Challenge, the train.7z file contains raw malware samples in two formats:
Ver 4 comentarios