Feature Extraction for Novelty Detection in Network Traffic

Yang, Kun; Kpotufe, Samory; Feamster, Nick

(Submitted on 30 Jun 2020 (v1), last revised 10 Jun 2021 (this version, v2))

Abstract: Data representation plays a critical role in the performance of novelty detection (or "anomaly detection") methods in machine learning. For network traffic, the data representation often determines the effectiveness of these methods as much as the model itself. The wide range of novel events that network operators need to detect (e.g., attacks, malware, new applications, changes in traffic demands) admits a broad range of possible models and data representations. In each scenario, practitioners must spend significant effort extracting and engineering the features that are most predictive for that situation or application. While anomaly detection is well studied in computer networking, much existing work develops specific models that presume a particular representation, often IPFIX/NetFlow. Yet other representations may result in higher model accuracy, and the rise of programmable networks now makes it more practical to explore a broader range of representations. To facilitate such exploration, we develop a systematic framework, open-source toolkit, and public Python library that make it both possible and easy to extract and generate features from network traffic and to perform an end-to-end evaluation of these representations across the most prevalent modern novelty detection models. First, we develop and publicly release an open-source tool, an accompanying Python library (NetML), and an end-to-end pipeline for novelty detection in network traffic. Second, we apply this tool to five different novelty detection problems in networking, across a range of scenarios from attack detection to novel device detection. Our findings provide general insights and guidelines concerning which features appear to be more appropriate for particular situations.
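
To make the idea of "representation plus novelty detector" concrete, here is a minimal sketch using only the Python standard library. It is not NetML's API and not the authors' pipeline. The function `flow_features`, the class `ZScoreNoveltyDetector`, the five chosen statistics, and the synthetic flows are all hypothetical names and assumptions introduced purely for illustration. The detector is a simple standardized-distance stand-in for the novelty detection models the abstract refers to.

```python
import random
import statistics
from math import sqrt


def flow_features(packets):
    """Map one flow to a fixed-length feature vector.

    `packets` is a time-ordered list of (timestamp_seconds, size_bytes)
    tuples. The five statistics below are an illustrative choice only.
    """
    times = [t for t, _ in packets]
    sizes = [s for _, s in packets]
    # Inter-arrival times; a single-packet flow gets one zero gap.
    iats = [b - a for a, b in zip(times, times[1:])] or [0.0]
    return [
        times[-1] - times[0],      # flow duration
        float(len(packets)),       # packet count
        statistics.fmean(sizes),   # mean packet size
        statistics.pstdev(sizes),  # std. dev. of packet size
        statistics.fmean(iats),    # mean inter-arrival time
    ]


class ZScoreNoveltyDetector:
    """Hypothetical stand-in detector: distance from the training mean
    in standardized feature space, thresholded at a training quantile."""

    def fit(self, X, quantile=0.95):
        cols = list(zip(*X))
        self.mean = [statistics.fmean(c) for c in cols]
        # Guard against zero variance in a feature column.
        self.std = [statistics.pstdev(c) or 1.0 for c in cols]
        scores = sorted(self.score(x) for x in X)
        self.threshold = scores[min(len(scores) - 1, int(quantile * len(scores)))]
        return self

    def score(self, x):
        return sqrt(sum(((v - m) / s) ** 2
                        for v, m, s in zip(x, self.mean, self.std)))

    def is_novel(self, x):
        return self.score(x) > self.threshold


def synthetic_flow(rng, n_packets, mean_size, mean_gap):
    """Generate a synthetic flow (assumed data, not a real trace)."""
    t, packets = 0.0, []
    for _ in range(n_packets):
        packets.append((t, max(40.0, rng.gauss(mean_size, 50.0))))
        t += rng.expovariate(1.0 / mean_gap)
    return packets


rng = random.Random(0)
# "Normal" traffic: short flows of mid-sized packets.
train = [flow_features(synthetic_flow(rng, rng.randint(8, 12), 500.0, 0.1))
         for _ in range(50)]
detector = ZScoreNoveltyDetector().fit(train)

# A flow unlike the training data: many large packets sent back to back.
novel_flow = synthetic_flow(rng, 200, 1400.0, 0.001)
print("novel flow flagged:", detector.is_novel(flow_features(novel_flow)))
```

Swapping the body of `flow_features` for a different representation (for example, packet-size sequences instead of summary statistics) while leaving the detector fixed is the kind of comparison the abstract describes.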

Subjects:	Networking and Internet Architecture (cs.NI); Machine Learning (cs.LG)
ACM classes:	C.2.3; I.2.6
Cite as:	arXiv:2006.16993 [cs.NI]
	(or arXiv:2006.16993v2 [cs.NI] for this version)

Submission history

From: Nick Feamster
[v1] Tue, 30 Jun 2020 17:53:59 GMT (227kb,D)
[v2] Thu, 10 Jun 2021 15:58:34 GMT (301kb,D)
