Semisupervised learning with data augmentation for raw network traffic detection

Robin C. Bhoo; Nathaniel D. Bastian

doi:10.1117/12.3013183

7 June 2024 Semisupervised learning with data augmentation for raw network traffic detection

Robin C. Bhoo, Nathaniel D. Bastian

Proceedings Volume 13051, Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications VI; 130511E (2024) https://doi.org/10.1117/12.3013183
Event: SPIE Defense + Commercial Sensing, 2024, National Harbor, Maryland, United States

Abstract

Deep learning (DL) has revolutionized machine learning tasks in various domains, but conventional DL methods often demand substantial amounts of labeled data. Semi-supervised learning (SSL) provides an effective solution by incorporating unlabeled data, offering significant advantages in terms of cost and data accessibility. While DL has shown promise with its integration as a component of modern network intrusion detection systems (NIDS), the majority of research in this field focuses on fully supervised learning. However, more recent SSL algorithms leveraging data augmentations do not perform optimally “out of the box” due to the absence of suitable augmentation schemes for packet-level network traffic data. Through the introduction of a novel data augmentation scheme tailored to packet-level network traffic datasets, this paper presents a comprehensive analysis of multiple SSL algorithms for multi-class network traffic detection in a few-shot learning scenario. We find that even relatively simple approaches like vanilla pseudo-labeling can achieve an F1-Score that is within 5% of fully supervised learning methods while utilizing less than 2% of the labeled data.

Conference Presentation

(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.

Citation Download Citation

Robin C. Bhoo and Nathaniel D. Bastian "Semisupervised learning with data augmentation for raw network traffic detection", Proc. SPIE 13051, Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications VI, 130511E (7 June 2024); https://doi.org/10.1117/12.3013183

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available

Members: $17.00

Non-members: $21.00 ADD TO CART

PROCEEDINGS
11 PAGES + PRESENTATION

DOWNLOAD PAPER SAVE TO MY LIBRARY

WATCH
PRESENTATION

GET CITATION

RIGHTS & PERMISSIONS

Get copyright permission Get copyright permission on Copyright Marketplace

KEYWORDS

Machine learning

Data modeling

Deep learning

Image classification

Computer intrusion detection

Network security

RELATED CONTENT

Impact study on MuSES generated EO IR synthetic imagery for...
Proceedings of SPIE (October 17 2023)

Research on anomalous behavior detection of federated deep learning network...
Proceedings of SPIE (June 26 2023)

Comparative analysis of classification algorithm to authenticate user based on...
Proceedings of SPIE (January 05 2024)

Convolutional Siamese network based few shot learning for monkeypox detection...
Proceedings of SPIE (March 24 2023)

A semi-supervised deep learning method in network intrusion detection
Proceedings of SPIE (June 01 2023)

Optimization of Python sorting algorithm for CIFAR 10 image classification...
Proceedings of SPIE (December 07 2023)

Topic modeling for analysis of big data tensor decompositions
Proceedings of SPIE (May 09 2018)

Subscribe to Digital Library

Receive Erratum Email Alert

Keywords/Phrases

Search In:

Publication Years