An Efficient Technique for Network Traffic Summarization using Multiview Clustering and Statistical Sampling

Authors

DOI:

https://doi.org/10.4108/sis.2.5.e4

Keywords:

Scalable Data Mining, Network Traffic Summarization, Multiview Clustering

Abstract

There is significant interest in the data mining and network management communities to efficiently analyse huge amounts of network traffic, given the amount of network traffic generated even in small networks. Summarization is a primary data mining task for generating a concise yet informative summary of the given data and it is a research challenge to create summary from network traffic data. Existing clustering based summarization techniques lack the ability to create a suitable summary for further data mining tasks such as anomaly detection and require the summary size as an external input. Additionally, for complex and high dimensional network traffic datasets, there is often no single clustering solution that explains the structure of the given data. In this paper, we investigate the use of multiview clustering to create a meaningful summary using original data instances from network traffic data in an efficient manner. We develop a mathematically sound approach to select the summary size using a sampling technique. We compare our proposed approach with regular clustering based summarization incorporating the summary size calculation method and random approach. We validate our proposed approach using the benchmark network traffic dataset and state-of-theart summary evaluation metrics.

Downloads

Published

02-07-2015

How to Cite

1.
Ahmed M, Mahmood AN, Maher MJ. An Efficient Technique for Network Traffic Summarization using Multiview Clustering and Statistical Sampling. EAI Endorsed Scal Inf Syst [Internet]. 2015 Jul. 2 [cited 2024 May 5];2(5):e4. Available from: https://publications.eai.eu/index.php/sis/article/view/2303