|
Learning autoencoder ensembles for detecting malware hidden communications in IoT ecosystems
Nunziato Cassavia,
Luca Caviglione,
Massimo Guarascio,
Angelica Liguori ,
Marco Zuppelli
Journal of Intelligent Information Systems
Modern IoT ecosystems are the preferred target of threat actors wanting to incorporate
resource-constrained devices within a botnet or leak sensitive information. A major research effort is then devoted to
create countermeasures for mitigating attacks, for instance, hardware-level verification mechanisms or effective
network intrusion detection frameworks. Unfortunately, advanced malware is often endowed with the ability of
cloaking communications within network traffic, e.g., to orchestrate compromised IoT nodes or exfiltrate data
without being noticed. Therefore, this paper showcases how different autoencoder-based architectures can spot
the presence of malicious communications hidden in conversations, especially in the TTL of IPv4 traffic.
To conduct tests, this work considers IoT traffic traces gathered in a real setting and the presence of an attacker
deploying two hiding schemes (i.e., naive and “elusive” approaches). Collected results showcase the effectiveness
of our method as well as the feasibility of deploying autoencoders in production-quality IoT settings.
|
|
A federated approach for detecting data hidden in icons of mobile applications delivered via web and multiple stores
Nunziato Cassavia,
Luca Caviglione,
Massimo Guarascio,
Angelica Liguori ,
Giuseppe Manco,
Marco Zuppelli
Social Network Analysis and Mining
An increasing volume of malicious software exploits information hiding techniques
to cloak additional attack stages or bypass frameworks enforcing security. This trend has intensified with
the growing diffusion of mobile ecosystems, and many threat actors now conceal scripts or configuration data
within high-resolution icons. Even if machine learning has proven to be effective in detecting various hidden payloads,
modern mobile scenarios pose further challenges in terms of scalability and privacy. In fact, applications can be retrieved
from multiple stores or directly from the Web or social media. Therefore, this paper introduces an approach based
on federated learning to reveal information hidden in high-resolution icons bundled with mobile applications.
Specifically, multiple nodes are used to mitigate the impact of different privacy regulations, the lack of
comprehensive datasets, or the computational burden arising from distributed stores and unofficial repositories.
Results collected through simulations indicate that our approach achieves performances similar to those of
centralized blueprints. Moreover, federated learning demonstrated its effectiveness in coping with simple “obfuscation”
schemes like Base64 encoding and zip compression used by attackers to avoid detection.
|
|
Using AI to face covert attacks in IoT and softwarized scenarios: challenges and opportunities
Angelica Liguori ,
Simone Mungari,
Marco Zuppelli,
Carmela Comito,
Enrico Cambiaso,
Matteo Repetto,
Massimo Guarascio,
Luca Caviglione,
Giuseppe Manco
Ital-IA 2023: 3rd National Conference on Artificial Intelligence
Recently, the number of attacks aiming at breaching networked and softwarized environments has been growing exponentially.
In particular, information hiding methods and covert attacks have been proven to be able to elude traditional
detection systems and exfiltrate sensitive data without producing visible network flows or data exchanges.
In this context, Artificial Intelligence techniques can play a key role in detecting these new emerging attacks,
owing to their capability of quickly processing huge amounts of data without the necessity of expert intervention.
In this work, we discuss the main challenges to face covert attacks in IoT and softwarized environments and
we describe some preliminary results obtained by adopting Deep Learning architectures.
[ Paper
|
|
Towards Self-Supervised Cross-Domain Fake News Detection
Carmela Comito,
Francesco Sergio Pisani,
Erica Coppolillo,
Angelica Liguori ,
Massimo Guarascio,
Giuseppe Manco
ITASEC 2023: The Italian Conference on CyberSecurity
Twitter, Facebook, and Instagram are just some examples of social media currently used by people to share news with other users worldwide.
However, the information widespread through these channels is typically unverified and/or interpreted according to the user’s point of view.
Accordingly, those means represent the perfect tool to hack user opinions with misleading or false news and make fake news viral.
Identifying this malicious information is a crucial but challenging task since fake news can concern different topics.
Indeed, the detection models learned against a specific domain will exhibit poor performances when tested on a different one.
In this work, we propose a novel deep learning-based architecture able to mitigate this problem by yielding cross-domain high-level features for addressing this task.
Preliminary experimentation conducted on two benchmarks demonstrated the validity of the proposed solution.
[ Paper ]
|
|
Siamese Network for Fake Item Detection
Erica Coppolillo,
Daniela Gallo,
Angelica Liguori ,
Simone Mungari,
Ettore Ritacco,
Giuseppe Manco
SEBD 2023: 31st Symposium on Advanced Database System
Currently, most multimedia users choose to purchase items through e-commerce.
Nevertheless, one of the main concerns of online shopping is the possibility of obtaining counterfeit products.
Therefore, it is crucial to monitor the authenticity of the product, thus adopting an automatic mechanism
to validate the similarity between the purchased item and the delivered one.
To overcome this issue, we propose a Siamese Network model for detecting forged items.
Preliminary experimentation on a publicly available dataset proves the effectiveness of our solution.
[
Paper, Slides
]
|
|
Neuro-Symbolic techniques for Predictive Maintenance
Angelica Liguori ,
Simone Mungari,
Ettore Ritacco,
Francesco Ricca,
Giuseppe Manco,
Salvatore Iiritano
SEBD 2023: 31st Symposium on Advanced Database System
Predictive maintenance plays a key role in the core business of the industry due to its potential in reducing unexpected machine downtime and related cost.
To avoid such issues, it is crucial to devise artificial intelligence models that can effectively predict failures.
Predictive maintenance current approaches have several limitations that can be overcome by exploiting hybrid approaches such as Neuro-Symbolic techniques.
Neuro-symbolic models combine neural methods with symbolic ones leading to improvements in efficiency, robustness, and explainability.
In this work, we propose to exploit hybrid approaches by investigating their advantage over classic predictive maintenance approaches.
[
Paper, Poster
]
|
|
Modeling Events and Interactions through Temporal Processes -- A Survey
Angelica Liguori ,
Luciano Caroprese,
Marco Minici,
Bruno Veloso,
Francesco Spinnato,
Mirco Nanni,
Giuseppe Manco,
Joao Gama
arXiv
In real-world scenario, many phenomena produce a collection of events that occur in continuous time.
Point Processes provide a natural mathematical framework for modeling these sequences of events. In this survey, we investigate probabilistic models
for modeling event sequences through temporal processes. We revise the notion of event modeling and provide the mathematical foundations that characterize
the literature on the topic. We define an ontology to categorize the existing approaches in terms of three families: simple, marked,
and spatio-temporal point processes. For each family, we systematically review the existing approaches based based on deep learning.
Finally, we analyze the scenarios where the proposed techniques can be used for addressing prediction and modeling aspects.
|
|
Generative Methods for Out-of-distribution Prediction and Applications for Threat Detection and Analysis: A Short Review
Erica Coppolillo,
Angelica Liguori ,
Massimo Guarascio,
Francesco Sergio Pisani,
Giuseppe Manco
Digital Sovereignty in Cyber Security: New Challenges in Future Vision
Part of the Communications in Computer and Information Science book series (CCIS,volume 1807)
In recent times, Machine Learning has played an important role in developing novel advanced tools for threat detection and mitigation.
Intrusion Detection, Misinformation, Malware, and Fraud Detection are just some examples of cybersecurity fields in which Machine Learning techniques are used to reveal the presence of malicious behaviors.
However, Out-of-Distribution, i.e., the potential distribution gap between training and test set, can heavily affect the performances of the traditional Machine Learning based methods.
Indeed, they could fail in identifying out-of-samples as possible threats, therefore devising robust approaches to cope with this issue is a crucial and relevant challenge to mitigate the risk of undetected attacks.
Moreover, a recent emerging line proposes to use generative models to yield synthetic likely examples to feed the learning algorithms.
In this work, we first survey recent Machine Learning and Deep Learning based solutions to face both the problems, i.e., outlier detection and generation;
then we illustrate the main cybersecurity application scenarios in which these approaches have been adopted successfully.
|
|
Federated Learning for the Efficient Detection of Steganographic Threats Hidden in Image Icons
Nunziato Cassavia,
Luca Caviglione,
Massimo Guarascio,
Angelica Liguori ,
Giuseppe Surace,
Marco Zuppelli
EAI PerSoM 2022 - EAI International Conference on Pervasive knowledge and collective intelligence on Web and Social Media
Best Paper Award!
An increasing number of threat actors takes advantage of information hiding techniques to prevent detection or
to drop payloads containing attack routines. With the ubiquitous diffusion of mobile appli- cations, high-resolution icons should be considered
a very attractive carrier for cloaking malicious information via steganographic mechanisms.
Despite machine learning approaches proven to be effective to detect hidden payloads, the mobile scenario could challenge their deployment
in realistic use cases, for instance due to scalability constraints. Therefore, this paper introduces an approach based on federated learning
able to prevent hazards characterizing production-quality scenarios, including different privacy regulations and lack of comprehensive datasets.
Numerical results indicate that our approach achieves performances similar to those of centralized solutions.
|
|
Ensembling Sparse Autoencoders for Network Covert Channel Detection in IoT Ecosystems
Nunziato Cassavia,
Luca Caviglione,
Massimo Guarascio,
Angelica Liguori ,
Marco Zuppelli
International Symposium on Methodologies for Intelligent Systems, ISMIS 2022
Foundations of Intelligent Systems
Part of the Lecture Notes in Computer Science book series (LNAI,volume 13515)
Network covert channels are becoming exploited by a wide-range of threats to avoid detection.
Such offensive schemes are expected to be also used against IoT deployments, for instance to exfiltrate data or to covertly orchestrate botnets composed of simple devices.
Therefore, we illustrate a solution based on Deep Learning for the detection of covert channels targeting the TTL field of IPv4 datagrams.
To this aim, we take advantage of an Autoencoder ensemble to reveal anomalous traffic behaviors.
An experimentation on realistic traffic traces demonstrates the effectiveness of our approach.
|
|
Adversarial Regularized Reconstruction for Anomaly Detection and Generation
Angelica Liguori ,
Giuseppe Manco,
Francesco Sergio Pisani,
Ettore Ritacco
2021 IEEE International Conference on Data Mining (ICDM)
We propose ARN, a semisupervised anomaly detection and generation method based on adversarial reconstruction.
ARN exploits a regularized autoencoder to optimize the reconstruction of variants of normal examples with minimal differences,
that are recognized as outliers. The combination of regularization and adversarial reconstruction helps to stabilize the learning process,
which results in both realistic outlier generation and substantial detection capability.
Experiments on several benchmark datasets show that our model improves the current state-of-the-art by valuable margins
because of its ability to model the true boundaries of the data manifold.
[
Slides
]
|
|
A Deep Learning Approach for Unsupervised Failure Detection in Smart Industry
(Discussion Paper)
Angelica Liguori ,
Giuseppe Manco,
Ettore Ritacco,
Massimilano Ruffolo,
Salvatore Iiritano
SEBD 2021: The 29th Italian Symposium on Advanced Database Systems
We propose an unsupervised anomaly detection model that is able to identify abnormal behavior
by analysing streaming data coming from IoT sensors installed on critical devices.
The proposed model is based on a Siamese neural network which embeds time series windows
in a latent space, thus generating distance-based clusters of normal behavior
[
Paper
]
|
|
Exploiting Temporal Convolution for Activity Prediction in Process Analytics
Francesco Folino,
Massimo Guarascio,
Angelica Liguori ,
Giuseppe Manco,
Luigi Pontieri,
Ettore Ritacco
ECML PKDD: Joint European Conference on Machine Learning and Knowledge Discovery in Databases, 2020
Part of the Communications in Computer and Information Science book series (CCIS, volume 1323),
Springer International Publishing
We propose a novel convolution-based deep learning approach to the prediction of the next activity
which relies on: (i) extracting high-level features (at dif- ferent levels of abstraction) through the computation
of time-oriented dilated convolutions over traces, and (ii) exploiting residual-like con- nections to make
the training of the predictive model more robust and faster.
[
Paper ]
|
|
Deep Autoencoder Ensembles for Anomaly Detection on Blockchain
Francesco Scicchitano,
Angelica Liguori ,
Massimo Guarascio,
Ettore Ritacco,
Giuseppe Manco
25th International Symposium on Methodologies for Intelligent Systems, ISMIS 2020
Foundations of Intelligent Systems
Part of the Lecture Notes in Computer Science book series (LNCS, volume 12117),
Springer International Publishing
We propose an Ensemble Deep Learning approach to detect deviant behaviors on Blockchain
where the base learner, an encoder-decoder model,
is strengthened by iteratively learning and aggregating multiple instances,
to compute an outlier score for each observation.
[
Paper,
Slides
]
|
|
A Deep Learning Approach for Detecting Security Attacks on Blockchain
Francesco Scicchitano,
Angelica Liguori ,
Massimo Guarascio,
Ettore Ritacco,
Giuseppe Manco
Italian Conference on CyberSecurity, ITASEC 2020
We define an anomaly detection system based on a encoder-decoder deep learning model, that is trained exploiting aggregate information extracted by monitoring blockchain activities.
[
Paper ]
|
|