Data representation for CNN based internet traffic classification: a comparative study

dc.contributor.authorSalman, Ola
dc.contributor.authorElhajj, Imad H.
dc.contributor.authorKayssi, Ayman I.
dc.contributor.authorChehab, Ali
dc.contributor.departmentDepartment of Electrical and Computer Engineering
dc.contributor.facultyMaroun Semaan Faculty of Engineering and Architecture (MSFEA)
dc.contributor.institutionAmerican University of Beirut
dc.date.accessioned2025-01-24T11:30:32Z
dc.date.available2025-01-24T11:30:32Z
dc.date.issued2021
dc.description.abstractIt has been well established that the Internet of Things will bring an expansion in traffic volume and types. This will bring new challenges in terms of Quality of Service (QoS) and security, requiring innovative traffic management techniques. Traffic classification is a main network function that helps in managing both QoS and security. Different machine learning based methods have been applied for this aim. However, traditional machine learning methods rely on hand crafted features, limiting the model ability to learn. Deep Learning (DL), a branch of machine learning, is characterized by its representation learning ability. In this paper, we analyse two methods of data representation for DL-based classification: a raw packet-based representation and a quasi-raw flow-based representation. Different tests are performed to evaluate the robustness of these data representation methods. The tests include features’ importance, model robustness, and anonymization tests. The results show that raw data representation suffers from traffic anonymization and the fact that many packet fields are data-dependent. On the other hand, the flow-based representation is sensitive to the number of packets used for classification and to traffic obfuscation. © 2020, Springer Science+Business Media, LLC, part of Springer Nature.
dc.identifier.doihttps://doi.org/10.1007/s11042-020-09459-4
dc.identifier.eid2-s2.0-85089599565
dc.identifier.urihttp://hdl.handle.net/10938/27447
dc.language.isoen
dc.publisherSpringer
dc.relation.ispartofMultimedia Tools and Applications
dc.sourceScopus
dc.subjectData representation
dc.subjectDeep learning
dc.subjectInternet of things
dc.subjectTraffic classification
dc.subjectFlow visualization
dc.subjectQuality of service
dc.subjectComparative studies
dc.subjectData representations
dc.subjectInternet traffic classifications
dc.subjectLearning abilities
dc.subjectMachine learning methods
dc.subjectTraffic management
dc.subjectTraffic obfuscations
dc.subjectLearning systems
dc.titleData representation for CNN based internet traffic classification: a comparative study
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2021-6048.pdf
Size:
3.63 MB
Format:
Adobe Portable Document Format