Repository logo
 
No Thumbnail Available
Publication

Explainable machine learning for malware detection on Android applications

Use this identifier to reference this record.
Name:Description:Size:Format: 
Explainable.pdf1.92 MBAdobe PDF Download

Advisor(s)

Abstract(s)

The presence of malicious software (malware), for example, in Android applications (apps), has harmful or irreparable consequences to the user and/or the device. Despite the protections app stores provide to avoid malware, it keeps growing in sophistication and diffusion. In this paper, we explore the use of machine learning (ML) techniques to detect malware in Android apps. The focus is on the study of different data pre-processing, dimensionality reduction, and classification techniques, assessing the generalization ability of the learned models using public domain datasets and specifically developed apps. We find that the classifiers that achieve better performance for this task are support vector machines (SVM) and random forests (RF). We emphasize the use of feature selection (FS) techniques to reduce the data dimensionality and to identify the most relevant features in Android malware classification, leading to explainability on this task. Our approach can identify the most relevant features to classify an app as malware. Namely, we conclude that permissions play a prominent role in Android malware detection. The proposed approach reduces the data dimensionality while achieving high accuracy in identifying malware in Android apps.

Description

Keywords

android applications datasets explainability feature selection machine learning malware detection numerosity balancing; security soft computing supervised learning

Citation

Palma C, Ferreira A, Figueiredo M. Explainable Machine Learning for Malware Detection on Android Applications. Information. 2024; 15(1):25. https://doi.org/10.3390/info15010025

Organizational Units

Journal Issue